CRISPRimmunity

Overview of predicted results

Contig_ID	Contig_def	CRISPR array number	Contig Signature genes	Target MGE spacer number	Prophage number
NC_022545	Rhizobium pusense, complete genome	1	DEDDh,csa3	0	0
NC_022536	Rhizobium pusense plasmid IRBL74_p, complete sequence	1	csa3	1	52
NC_022535	Rhizobium pusense, complete genome	0	cas3,WYL,csa3,DEDDh	0	6

Results visualization

CRISPR-Cas detection and classification

Crispr_ID: NC_022545_1

CRISPR_ID	CRISPR_location	CRISPR_type	Repeat_type	Spacer_info
NC_022545_1	1596210-1596331	Orphan	Consensus_repeat: TCGCTGTACGACTTGCTTTCATAGCTCCAGTC; Method: CRISPRCasFinder	1 spacer

The CRISPR arrays of NC_022545_1

>merge|NC_022545|1|1596210-1596331|CRISPRCasFinder
TCGCTGTACGACTTGCTTTCATAGCTCCAGTCGTAATCGGTCTTGATTTCGGTGCTGGTTTCGGTGCTCGTCTTGGTCGTCGTGTCGTTATCGCTGTACGACTTGCTGTCATAGCTCCAGTC

>NC_022545|1|1|1596210-1596331|CRISPRCasFinder
TCGCTGTACGACTTGCTTTCATAGCTCCAGTC	GTAATCGGTCTTGATTTCGGTGCTGGTTTCGGTGCTCGTCTTGGTCGTCGTGTCGTTA
TCGCTGTACGACTTGCTGTCATAGCTCCAGTC
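
As a quick consistency check on the listing above, the repeat and spacer rows can be concatenated and compared with the reported array span. The Python sketch below is illustrative only: the function names (`rebuild_array`, `span_length`, `mismatches`) are hypothetical and not part of CRISPRimmunity or CRISPRCasFinder, and it assumes the location is given as 1-based inclusive coordinates.

```python
# Repeat/spacer rows copied from the NC_022545_1 listing.
# Each row is (repeat, spacer); the final repeat has no trailing spacer.
rows = [
    ("TCGCTGTACGACTTGCTTTCATAGCTCCAGTC",
     "GTAATCGGTCTTGATTTCGGTGCTGGTTTCGGTGCTCGTCTTGGTCGTCGTGTCGTTA"),
    ("TCGCTGTACGACTTGCTGTCATAGCTCCAGTC", ""),
]


def rebuild_array(rows):
    """Concatenate repeat and spacer rows into one array sequence."""
    return "".join(repeat + spacer for repeat, spacer in rows)


def span_length(location):
    """Length of a 'start-end' span, assuming 1-based inclusive coordinates."""
    start, end = (int(x) for x in location.split("-"))
    return end - start + 1


def mismatches(a, b):
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


array = rebuild_array(rows)
print(len(array), span_length("1596210-1596331"))
print(mismatches(rows[0][0], rows[1][0]))
```

If the two lengths agree, the rows account for the whole reported region; the mismatch count shows how far the second repeat drifts from the first.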

Protein	Signature genes	Signature genes Name	Protein_function
NC_022545.1|WP_004438593.1|1589205_1589700_-|hypothetical-protein	unknown	unknown	unknown
NC_022545.1|WP_022563649.1|1586676_1587933_+|polysaccharide-biosynthesis/export-family-protein	unknown	unknown	gnl|CDD|224512
NC_022545.1|WP_022563657.1|1597478_1598249_+|response-regulator-transcription-factor	unknown	unknown	gnl|CDD|225107
NC_022545.1|WP_022563650.1|1587940_1589209_+|O-antigen-ligase-family-protein	unknown	unknown	gnl|CDD|225844
NC_022545.1|WP_022563658.1|1600055_1601039_+|UDP-glucose-4-epimerase-GalE	unknown	unknown	gnl|CDD|224012
NC_022545.1|WP_173402664.1|1604514_1605690_-|AGE-family-epimerase/isomerase	unknown	unknown	gnl|CDD|225493
NC_022545.1|WP_022563661.1|1603498_1604461_+|glycoside-hydrolase-family-26-protein	unknown	unknown	gnl|CDD|226609
NC_022545.1|WP_004438583.1|1585811_1586492_+|sugar-transferase	unknown	unknown	gnl|CDD|274395
NC_022545.1|WP_022563664.1|1607090_1609280_+|UDP-forming-cellulose-synthase-catalytic-subunit	unknown	unknown	gnl|CDD|274400
NC_022545.1|WP_022563651.1|1589785_1591144_-|HlyD-family-type-I-secretion-periplasmic-adaptor-subunit	unknown	unknown	gnl|CDD|130902
NC_022545.1|WP_022563652.1|1591152_1593360_-|type-I-secretion-system-permease/ATPase	unknown	unknown	gnl|CDD|226969
NC_022545.1|WP_022563653.1|1593428_1595450_-|hypothetical-protein	unknown	unknown	unknown
NC_022545.1|WP_006698573.1|1599031_1600054_+|SDR-family-oxidoreductase	unknown	unknown	gnl|CDD|187541
NC_022545.1|WP_022563659.1|1601035_1602967_+|glycosyltransferase	unknown	unknown	gnl|CDD|133043
NC_022545.1|WP_022563648.1|1583165_1584500_-|aspartate-aminotransferase-family-protein	unknown	unknown	gnl|CDD|181707
NC_022545.1|WP_034498547.1|1602981_1603485_+|DUF995-domain-containing-protein	unknown	unknown	gnl|CDD|336337
NC_022545.1|WP_006698574.1|1598291_1598639_-|hypothetical-protein	unknown	unknown	unknown
NC_022545.1|WP_162472002.1|1606033_1607023_+|cellulose-biosynthesis-protein-BcsN	unknown	unknown	gnl|CDD|374958
NC_022545.1|WP_004438580.1|1584646_1585195_+|cupin-domain-containing-protein	unknown	unknown	gnl|CDD|182158
NC_022545.1|WP_022563647.1|1582029_1582956_+|homocysteine-S-methyltransferase-family-protein	unknown	unknown	gnl|CDD|224951

Protein	Function_ID	Function_description	E-value
NC_022545.1|WP_022563649.1|1586676_1587933_+|polysaccharide-biosynthesis/export-family-protein	gnl|CDD|224512	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis].	4.84422e-35
NC_022545.1|WP_022563657.1|1597478_1598249_+|response-regulator-transcription-factor	gnl|CDD|225107	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription].	1.55888e-39
NC_022545.1|WP_022563650.1|1587940_1589209_+|O-antigen-ligase-family-protein	gnl|CDD|225844	COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes [Cell envelope biogenesis, outer membrane].	2.1487e-43
NC_022545.1|WP_022563658.1|1600055_1601039_+|UDP-glucose-4-epimerase-GalE	gnl|CDD|224012	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane].	0
NC_022545.1|WP_173402664.1|1604514_1605690_-|AGE-family-epimerase/isomerase	gnl|CDD|225493	COG2942, COG2942, N-acyl-D-glucosamine 2-epimerase [Carbohydrate transport and metabolism].	2.84535e-157
NC_022545.1|WP_022563661.1|1603498_1604461_+|glycoside-hydrolase-family-26-protein	gnl|CDD|226609	COG4124, ManB, Beta-mannanase [Carbohydrate transport and metabolism].	2.43609e-133
NC_022545.1|WP_004438583.1|1585811_1586492_+|sugar-transferase	gnl|CDD|274395	TIGR03022, WbaP_sugtrans, Undecaprenyl-phosphate galactose phosphotransferase, WbaP. The WbaP (formerly RfbP) protein has been characterized as the first enzyme in O-antigen biosynthesis in Salmonella typhimurium. The enzyme transfers galactose from UDP-galactose to a polyprenyl carrier (utilizing the highly conserved C-terminal sugar transferase domain, pfam02397) a reaction which takes place at the cytoplasmic face of the inner membrane. The N-terminal hydrophobic domain is then believed to facilitate the "flippase" function of transferring the liposaccharide unit from the cytoplasmic face to the periplasmic face of the inner membrane. This model includes the enterobacterial enzymes, where the function is presumed to be identical to the S. typhimurium enzyme as well as a somewhat broader group which are likely to catalyze the same or highly similar reactions based on a phylogenetic tree-building analysis of the broader sugar transferase family. Most of these genes are found within large operons dedicated to the production of complex exopolysaccharides such as the enterobacterial O-antigen. The most likely heterogeneity would be in the precise nature of the sugar molecule transferred.	7.19492e-98
NC_022545.1|WP_022563664.1|1607090_1609280_+|UDP-forming-cellulose-synthase-catalytic-subunit	gnl|CDD|274400	TIGR03030, Cellulose_synthase_UDP-forming, cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.	0
NC_022545.1|WP_022563651.1|1589785_1591144_-|HlyD-family-type-I-secretion-periplasmic-adaptor-subunit	gnl|CDD|130902	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins. [Protein fate, Protein and peptide secretion and trafficking].	1.251e-132
NC_022545.1|WP_022563652.1|1591152_1593360_-|type-I-secretion-system-permease/ATPase	gnl|CDD|226969	COG4618, ArpD, ABC-type protease/lipase transport system, ATPase and permease components [General function prediction only].	0
NC_022545.1|WP_006698573.1|1599031_1600054_+|SDR-family-oxidoreductase	gnl|CDD|187541	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs. UGD catalyzes the formation of UDP-xylose from UDP-glucuronate; it is an extended-SDR, and has the characteristic glycine-rich NAD-binding pattern, TGXXGXXG, and active site tetrad. Extended SDRs are distinct from classical SDRs. In addition to the Rossmann fold (alpha/beta folding pattern with a central beta-sheet) core region typical of all SDRs, extended SDRs have a less conserved C-terminal extension of approximately 100 amino acids. Extended SDRs are a diverse collection of proteins, and include isomerases, epimerases, oxidoreductases, and lyases; they typically have a TGXXGXXG cofactor binding motif. SDRs are a functionally diverse family of oxidoreductases that have a single domain with a structurally conserved Rossmann fold, an NAD(P)(H)-binding region, and a structurally diverse C-terminal region. Sequence identity between different SDR enzymes is typically in the 15-30% range; they catalyze a wide range of activities including the metabolism of steroids, cofactors, carbohydrates, lipids, aromatic compounds, and amino acids, and act in redox sensing. Classical SDRs have a TGXXX[AG]XG cofactor binding motif and a YXXXK active site motif, with the Tyr residue of the active site motif serving as a critical catalytic residue (Tyr-151, human 15-hydroxyprostaglandin dehydrogenase numbering). In addition to the Tyr and Lys, there is often an upstream Ser and/or an Asn, contributing to the active site; while substrate binding is in the C-terminal region, which determines specificity. The standard reaction mechanism is a 4-pro-S hydride transfer and proton relay involving the conserved Tyr and Lys, a water molecule stabilized by Asn, and nicotinamide. Atypical SDRs generally lack the catalytic residues characteristic of the SDRs, and their glycine-rich NAD(P)-binding motif is often different from the forms normally seen in classical or extended SDRs. Complex (multidomain) SDRs such as ketoreductase domains of fatty acid synthase have a GGXGXXG NAD(P)-binding motif and an altered active site motif (YXXXN). Fungal type ketoacyl reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.	0
NC_022545.1|WP_022563659.1|1601035_1602967_+|glycosyltransferase	gnl|CDD|133043	cd06421, CESA_CelA_like, CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.	2.73488e-69
NC_022545.1|WP_022563648.1|1583165_1584500_-|aspartate-aminotransferase-family-protein	gnl|CDD|181707	PRK09221, PRK09221, beta alanine--pyruvate transaminase; Provisional.	0
NC_022545.1|WP_034498547.1|1602981_1603485_+|DUF995-domain-containing-protein	gnl|CDD|336337	pfam06191, DUF995, Protein of unknown function (DUF995). Family of uncharacterized Proteobacteria proteins.	1.19927e-66
NC_022545.1|WP_162472002.1|1606033_1607023_+|cellulose-biosynthesis-protein-BcsN	gnl|CDD|374958	pfam17038, CBP_BcsN, Cellulose biosynthesis protein BcsN. This is a family of bacterial cellulose biosynthesis proteins. Cellulose is necessary for biofilm formation in bacteria. (Roemling U. and Galperin M.Y. "Bacterial cellulose biosynthesis. Diversity of operons and subunits" (manuscript in preparation)).	1.34662e-86
NC_022545.1|WP_004438580.1|1584646_1585195_+|cupin-domain-containing-protein	gnl|CDD|182158	PRK09943, PRK09943, HTH-type transcriptional regulator PuuR.	2.14578e-37
NC_022545.1|WP_022563647.1|1582029_1582956_+|homocysteine-S-methyltransferase-family-protein	gnl|CDD|224951	COG2040, MHT1, Homocysteine/selenocysteine methylase (S-methylmethionine-dependent) [Amino acid transport and metabolism].	7.40227e-100

>NC_022545.1|WP_022563653.1|1593428_1595450_-|hypothetical-protein
MHIDKISDMIAHFIGLFDTMAEEARLRNNYSEGPARSGPEQLPEEEAARLLDKNYDVALQDYDPGVKYHAGYYDFDYLPPHFARTVEHDMQQFANSIPVDISAANFRFPGRLSFEDERELVIHTGPGSVVAHIAQVNILQDDDYLNMTDGPNVARDTSFVTERTVEFYNEASVFTPFSGFQRTDSYDALQALAKTAHDYIEHARDNDVTSLGTGADQDFVLAGNDINGLYINGVAAAEKPALDDYMPDRGIAKPPEEPERSDVALHEESPAGNSLDVAAGANVVANVATLVNTAVMTSVTAVMGDYHQIDAITQAYIYSDRDEIASVFTRSEDQADTAAYNIANFERSIFPGAENTAAESPETGEMPIFPTAWRVSVLEGDVSFVHWIEQYQFVSDNDTMTITTSGASVSLLTGGNAALNIANFLGIGMQYDLIIVGGNVLDMNLISQIAVLYDNDWARANPDAPGGATIQSGNNLLWNDASIYNVGSNDRFETMPDYMAQTVSAINERDPNMPDALAHDSNFAGYQGLNVLYITGNIYDVSIIKQVSVLGDSDDVTQAAAKVLENNENADVHIDTGSNAVINLAQIIDYDSFGSTTYVAGGVYSDAILIQGGIIENDTSQPAQPGQLANEVIAFLHDDPATIENESDGVINAGHDLSWSNAHSSDVMQAVTA
>NC_022545.1|WP_022563652.1|1591152_1593360_-|type-I-secretion-system-permease/ATPase
MDHISRRTIADGRSLDLPLDQDAEDSKLTDACISAINEAVENLRQLSGAEQIARATPPSPEVAPPPSPTTPQSGTLRDMPAGNPAPQQEPHPASRPAPKAKIDNAELKSAPDLPFVKTIDNNDGPIRETERRPATGGGGDGKEPTGNGGGGGGGGSSQSGFHKRSEPINFAASLAKGIAAVRRNMIVVMLFTVAINVLLLAIPLYLFQISDRVLTSRSMDTLVMLTVAVLGAVLLQAFMDAIRRFILMRTAVELEVQLGAPILSAAARASLHGSGKDYQILQDLQQLRSFLTSGTLIAFLDAPLMPLFIVVVYLVHPHLGIIIMVCCAVLFTIAWLNQRFTARQFSEASGYLSRANFHLDSMSRNSQIINALAMIPEAVKMWGRETAGSLKSHVAAQDRNIMFSGVSKAARMITQVALLGWGAHLSLSGELTGGMVIAASIVSGRALAPIEGAIEGWHQFNKSAASYGRIKQLLISSPLNFPRLRLPNPEGRLDVERILFVPPPQKKVILNGISFSLKKGESLAIIGNSGSGKTTLGKMLVGSILPTSGNVRLDLMDLRNWDQRQFGESIGYLPQDVQLFPGTIKANICRMRDDIEDRQIYEAAVLADVHELIAGFPQGYETVVAADGAPLSGGQKQRIALARAFFGDPKFVVLDEPNSNLDTQGEQALAKALLHAKRQGITTVTITQRPALLQCVDKIMVLKDGSVAMFGERMDVLKALSGNGRQASQSPQIEG
>NC_022545.1|WP_022563651.1|1589785_1591144_-|HlyD-family-type-I-secretion-periplasmic-adaptor-subunit
MFRKKNAIAEVKPQGQLEWYSEVPRSIRLHSSIGLAVLLASFGGFGYWAGTAPLSSAIIAQGSFVATGNNKVVQHLEGGIIKEMMVSEGDTVKVGDVLLTLDKTTALANERMLQLRRLRLETIVTRLRAEAQGAKSFKVPDIVMKEAGDPDINAIIQSQNIVFHSKLIKLEEQLNLIEKNIRSLEFRYAGYDGQKQSFDRQLALLTQERDSKDRLAKDGVIRKTDMLALERAIADAMGDIARLSGEMNQSEAEIAKFKQEAIIAVNANKQAALDALETAESDLDSVREQVRGAAEVLERTVIRSPVNGTVVRAYYHTPGGVITTGKPIMEILPAHVPLILEAQVLRTSIDQLHEGQTAAIRLSALNRRTTPVLNGKVFYVSADSIEENAGLQVKDVYIVRVQVPDEEIAKVHNFHPVPGMPADVLIQTSERTFFEYLTKPIADSMSRAFKER
>NC_022545.1|WP_004438593.1|1589205_1589700_-|hypothetical-protein
MAAMSDVLLLVGRLNYVWTNTESLMIYLIVHLLKVDKEAAIVVFLTLNTTRARMDLIERLAKLPSTDAKDRKTVLSIMARLKREAKTRNKYNHCIYSFDEKGDIASTQLMRLVEDDSQVRYGKVERMDEKEIGQLEKSIADIVEVSKDMWSFIHASPHVSADYL
>NC_022545.1|WP_022563650.1|1587940_1589209_+|O-antigen-ligase-family-protein
MRIAKSALIDPERNEIYGMTAVALSFFVFAYSSRFGQVSVLAYYGMWLPLVAVDYRRVLGNYPRYLWIFAFSILTVLSSFWSEAVSVTMRASIQYLTHIVCALVAMRVISIRTLTRGALIGIGVVLLYSLLFGIYLFDALDGTYSFVGAFSSKNQLGFYASLGVIFAASSVLVLKQRGIWLPIAGVTGLLSAYSLIASQSATSAITTAAVVALIIGFIPIGMLSPANRKMMFFALGGLGALLAVASLQFGLLDAILGVFGKDSTLTGRTYLWQQGIEAAKQAPILGVGYQGFWVAGFADAERLWNDFFITGRSGFHFHNTYIETVVENGFVGMILLGMVLYGTLLGHLRSVLMLRSDPQGVILFAICALFVVRSFVEIDIIFPYQIGSFLLYFAAGKLCLPVKAARNGETHPAIGMRLQTRP
>NC_022545.1|WP_022563649.1|1586676_1587933_+|polysaccharide-biosynthesis/export-family-protein
MNGFSAAHRPFVALVLAATVAFSAPLSAMAADGAQYKLGTADKLRIRVAEWQPADGSIRNWDVINGDYSVGPSGTLSLPFIGQLDVAGKTPSEVSDQIGAQLQSKFALRNLPSASVEIAQFRPIFLSGDVQTPGEYPYAPNITVLKAVSLAGGLRRSDAGQRFARDFINARGDAAVYDNQRARLLARQARLIAEVKGDQTITKTPEMEKIAEIDTLLASESALMKSRTERYTLQLKALTDLHALLQSEVESLKKKSETQNRQLQLANEDRDRVNRLNEQGLALSQRRISAEERAAEVESTLLDIDTQSLRAKQDINKATQDEINLRNDWVAQRSKELQDTEAELDKLNLQLTTSRELMSEALAQSAEAIRFDPSGKSATISYVVVREENGKPKELKVDENALLQPGDVIKVSSEILMQ
>NC_022545.1|WP_004438583.1|1585811_1586492_+|sugar-transferase
MKSATQSAEQTLSSSEDFDVSFPIGGIAKRSFDMTSAALALLIFSPIFLLIAVLVKMSDPGPIFYGHRRVGHNGRYFHCLKFRTMAMNGDEILRQYLAANPEAAEEWRATRKLKNDPRVTAVGAVLRKLSLDELPQLINILRGEMSVVGPRPVVDEELSYYESAAAYYLSTRPGLTGLWQISGRNDVSYKTRVAFDTQYVQNWSMRQDVFIIVKTIPAVCLSRGSY
>NC_022545.1|WP_004438580.1|1584646_1585195_+|cupin-domain-containing-protein
MSVDIGGRLRHLRLRHNISQRELARRAGVTNSTISLIESNTSNPSVGALKRILDGIPIGLAEFFAFEPETSRKAFYRADELVEIGKGPISFRQVGENVFGRSLQILKECYQPGADTGKVPLVHDGEEGGIILSGRLEVTVDEERRVLGPGDAYYFESRRPHRFRCVGPVACEVISACTPPTF
>NC_022545.1|WP_022563648.1|1583165_1584500_-|aspartate-aminotransferase-family-protein
MDNPSRSNSTSLDSYWMPFTANRQFKANPRLLASAEGMYYTSNDGRQVLDGTAGLWCVNAGHGRQQIASAVKHQLSTMDYAPSFQMGHPVAFEFAERLAEIAPGPEGGKLDRVFYTGSGSESVDTALKIAIAYQRAIGQGTRSRLIGRERGYHGVGFGGISVGGLVNNRRVFPQIPADHLRHTHDLTKNSFVKGQPEHGAELADDLERLVALHGAETIAACIVEPVAGSTGVLVPPKGYLERLRTICDKHGILLIFDEVITGFGRMGSSFASNYFGVTPDIVTTAKGLTNGAIPMGAVFTSREVHDALMHGPESQIELFHGYTYSGHPVACAAGIATLDIYRDEGLFTRASELQDAWHDAIHSLKGSPNVIDIRTIGFIAGIELQPRDGAIGARAYDVFVDCFERGLLIRVTGDIIALSPPLIAEKSHFDDIVSILGDALKRAE
>NC_022545.1|WP_022563647.1|1582029_1582956_+|homocysteine-S-methyltransferase-family-protein
MSDIRILDGGMSRELQRLGAELKQPEWSALALINAPDIVRQVHAEFIEAGADVVTTNSYALVPFHIGEYRFDKEGASLIALSGRLAREAAEASKRNVTVAGSLPPIFGSYEPENFDPSRVQDYLKVLVENLQPHVDVWLGETLSLIAEGEAVRQAVAETGKPFWISFTLNDEPAQVNGAEPKLRSGETVRSAAEWAAGSGAAALLFNCSKPEVMRAAVETASAVFKEKGVALDIGVYANAFEGEQGDSAANEGLHGTRADLTDDVYSRFACSWADAGATLIGGCCGIGAAHIHTVADTLRRRGTSRTI
>NC_022545.1|WP_022563657.1|1597478_1598249_+|response-regulator-transcription-factor
MFMGTSDYAAKQKNAVTHINGTLLIVADPDLFSECLMEALGKKFPTFSVVSVTSSATIDDDYGADVRLVLPYRLAGERLNSVLSAIREKHPEAPIALVVETIDKIEEPLKRLVGMRIIDGVLPLNLRLDVFMAAVDLLMKGGEHFPAALLGKLTPYPTAVGGKSVRNSPVIANRADALAESRSDMATLTTREVQILDLLCKGTQNKIIADRLHLSENTVKVHVRNIYKKMNVRNRTEAASRFFSKDEGATFSGWKN
>NC_022545.1|WP_006698574.1|1598291_1598639_-|hypothetical-protein
MMNEMPYFRGGKTVRQCRLALVAGALLTFTFATASCSVVEDAVLTTASASPTTIKSRVTPAKAAYGYQKTGNAAVTLVADASDTPAVSRPSYSGSSPYICSPSGFGQKSRCFLRP
>NC_022545.1|WP_006698573.1|1599031_1600054_+|SDR-family-oxidoreductase
MRNFVPNEHDNGVTIYSDWKPGQRVLVNGGAGFLGSHLCERLLSSGHEVICLDDLSTGRTANVEHLRNNKRFLMVEHDVRKPYDIDVSLIFNFASPASPPDYQRDPVGTLLTNVLGAVNVLEVARRCGATVVQSSTSEVYGDPHVNPQPETYFGNVNTIGPRACYDEGKRSAETLFFDYHRTFGVDIKVGRIFNTYGPRMRPDDGRVVSNFIVQALKGDDITIYGDGSQTRSFCYVDDLIDGFLRFSAKPKDCTGPINLGNPTEIPVRQLADIVIRMTGSRSRIVHLPAAIDDPQQRRPDISRANELLKWQPRVPLEIGLERTIVYFDALLAGRKVAEAV
>NC_022545.1|WP_022563658.1|1600055_1601039_+|UDP-glucose-4-epimerase-GalE
MPRTILVTGGAGFIGSHICKALAQSGFKPIAYDNLSTGHADSVRWGPFIEGDILDRGLLKATLQEFSPAFVIHCAANAYVGESVEDPRKYYRNNVGGSLSLLDACLDQNIGGLVFSSSCATYGVPPQLPISEETAQTPVNPYGRTKLIFEMALDDYAAAYGLRFVALRYFNAAGADPDGELCERHEPETHLIPRALMAAAARLPQLDVFGADYDTSDGTCIRDYIHVSDLADAHVAAVNYLADGGETLRVNLGSGHGTSVGDIIRAIHRVTGQEVPVHFGARRAGDPPALFADIERARQTLGFAPRRSDIDTIIRTAGPGFGLEVLS
>NC_022545.1|WP_022563659.1|1601035_1602967_+|glycosyltransferase
MTMRSTPSVASPRAAAGVLRMLPIFTGWNRLAYLLGIGGWLVTLAYFWIWWLDRDRVIDWPYYSVVTLALAWITLLPSYFIFIFLNARVVDRRSPLPGGRVAMVVTKAPSEPFAVVEKTLLAMLEQKGLEFDVWLADEAPDAETLKWCGAHGVFVSTRQGIAEYHRKTWPRRTRCKEGNLAYFYDRYGYERYDFVAQFDADHVPEPDYLSEVIRPFADPRIGYVSAPSICDANANESWAARGRLYAEASLHGALQLGYNNGWAPLCIGSHYAVRTKALREVGGLGPELAEDHSTTLVMNAGGWRGVHAVNAIAHGDGPVTFADLIVQEFQWSRSLMTILLQYSRHYVPKLPARLKFQFLFSQLWYPLFSGFMALMFVLPAVALVRGHVLVNASYPAFLAHFLPVSLIMIVFAFFWRATGAFRPHDAKLFSWEAMVFLFLRWPWSLMGVLAAVRDTIRGDFVDFRITPKGTQAKPPLPLRVVAPYMVLAALSLLAMVLAPRQNGAEGFFIFAAINVAIYAGLSVFLLIRHAVENGLPKLPALRGGASAAACSLLLVAGSAAELSSHAIGGLEALSHGQPYISFTETRFTVAGAGAEGARSVRLKLRIALPGLRGPQEMQAEPIVAPPPAVATGEIMLADNRVGQ
>NC_022545.1|WP_034498547.1|1602981_1603485_+|DUF995-domain-containing-protein
MKTTISTCGFSAFVLCLAVVAPSVVHAAGGTKTPKPLRTSEVVEMYFDKTWKWDTGGGRFIADGRKFIAATEEKGKKSIGEGRWTVDANGTLCMRATWKAEAGSGKADTCFDHGRIGKVLYQRKQGGQWYVFRHNPPRPGDEFLKLVRKDDVTPQIAAYDKAMTATR
>NC_022545.1|WP_022563661.1|1603498_1604461_+|glycoside-hydrolase-family-26-protein
MQITRRTLLFASGAAAAFTAGMYPVLKLDAQGVAPMTSTGMKTLADKRPTLHADGIRFGAYDPHGDFTGQSGVATEHLFLPWEDVDLDSLALADAYALERKRNVMITVEPWSWDVNWRLSSDELRRKVLSGDYDKNMQAIAARMSAMKSPLILRWGQEMEDTTGRFSWSGWNPRDYITAYKRVVDMTRKAVPGVKVMWSPKGLDGLRAYYPGDNYADLVGLSVFGLEDYDKIEYGAPKTFTDLLRKGYGLVETFDKPVWVAELGYEGSDSYVRPWMNDVTLKQADFPKLEEVVYFNDKDVHPWPHNLGRPDWRVVRPAKV
>NC_022545.1|WP_173402664.1|1604514_1605690_-|AGE-family-epimerase/isomerase
MQFPSIAQTLAEEIGTLRKWLDEDALPLWWEAGSARPDGGFYERLGQDAKPVFSDDRRARVQPRQAYCYAAAGQHGWHGPWKDAVLHALSWFEKVYRLENGLYGNLADQTGKLIDPSFDLYNQAFALFAAAQTAAILPERRNEMRSRALEILAILERDYRHPIAGFEEANPPRTPLCSNPHMHLFEAMLAWEEQDRDGPWSALADEIAGLALSRFIDDGNGGLREFFAHDWTPYEGEKGRIMEPGHQFEWAWLLVRWGSLRGNEEAIRKAKRLFEIGEAYGICPRRKVAVMSLYDDFSMRDGLARLWPQTEWLKAAVRLASVTDGEERQRYLACGLSAIGALQPFLDTPVKGLWFDKWPADRPMLDEPAPASTFYHIVCAIYEAEAVLAAG
>NC_022545.1|WP_162472002.1|1606033_1607023_+|cellulose-biosynthesis-protein-BcsN
MKFSAYTSLLFSIAVLSGCNTPAGVRSFGGTQLLSPSEALIFPPPGGPEIVTVVSRTYSNAVAQQVILRSEAATPGQNYLKAEFFGPQQAGDTDFDSLAFTGFGASSLAREIRAEFPGETIAMSANYLQNSYGAFSYAAGKGRGEDTCLYGWQDIRSPESMRQDFRNLGRIKVRLRLCQSGASVERLLAVMYNYTITGTYASPSWNPYGTPQAVDKNLGRPGNPVYPIKSEEVPMRPGGEVTASVPVRPVRRAAATAPVQPEQQPLPPVAAVNIPSPVSAGASGQPAVTAPRAAGGGAVTGQQQSSGVPQVSIPSPSCLSGSGAGQGCR
>NC_022545.1|WP_022563664.1|1607090_1609280_+|UDP-forming-cellulose-synthase-catalytic-subunit
MNKAITIIVWLLVSLCVLAIITMPVSLQTHLVATAISLILLATIKGFNGQGVWRLVALGFGTAIVLRYVYWRTTSTLPPVNQLENFIPGFLLYLAEMYSVVMLALSLVIVSMPLPSRKTRPGSPTYRPTVDVFVPSYNEDAELLANTLAAAKNMDYPADRFTVWLLDDGGSVQKRNAANIVEAQAAQRRHEELKKLCEDLDVRYLTRERNVHAKAGNLNNGLAHSTGELVTVFDADHAPARDFLLETVGYFEEDPRLFLVQTPHFFVNPDPIERNLRTFETMPSENEMFYGIIQRGLDKWNGAFFCGSAAVLRREALQDTEGFSGVSITEDCETALALHSRGWNSIYVDKPLIAGLQPATFASFIGQRSRWAQGMMQILIFRQPLFKRGLTFTQRLCYMSSTLFWLFPFPRTIFLFAPLFYLFFDLQIFVASGGEFLAYTAAYMLVNLMMQNYLYGSFRWPWISELYEYVQTVHLLPAVVSVIFNPGKPTFKVTAKDESIAEARLSEISRPFFVIFGLLVVAMIFAVWRIYSEPYKADVTLVVGGWNLLNLIFAGCALGVVSERGDKSASRRITVKRRCEVKLEGSDTWVPASIDNVSVHGLLINLFDNATTVQKGETAIVRVKPHSEGVPETMPLNIVRTVRGEGFISIGCTFSPQRAVDHRLIADLIFANSEQWSEFQRVRRRNPGLIRGTATFLAISLFQTQRGLFYLARALRPGSKAVKPAGAVK
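
The FASTA headers above pack several fields into one pipe-delimited string (contig, protein accession, coordinates with strand, and product name). A minimal Python sketch of splitting such a header follows; the `ProteinHeader` type and `parse_header` function are hypothetical names introduced here for illustration, and the field layout is inferred from the headers shown rather than from any documented specification.

```python
from typing import NamedTuple


class ProteinHeader(NamedTuple):
    contig: str      # e.g. "NC_022545.1"
    accession: str   # e.g. "WP_022563653.1"
    start: int       # first coordinate as written in the header
    end: int         # second coordinate as written in the header
    strand: str      # "+" or "-"
    name: str        # hyphen-joined product name, kept as written


def parse_header(line):
    """Split a '>contig|accession|start_end_strand|name' header into fields."""
    body = line.strip().lstrip(">")
    # Limit the split so any later pipes would stay inside the name field.
    contig, accession, coords, name = body.split("|", 3)
    start, end, strand = coords.split("_")
    return ProteinHeader(contig, accession, int(start), int(end), strand, name)


header = parse_header(
    ">NC_022545.1|WP_022563653.1|1593428_1595450_-|hypothetical-protein"
)
print(header.accession, header.start, header.end, header.strand)
```

The same function applies to the first column of the annotation tables, since those rows use the identical identifier without the leading `>`.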

Self-targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_ID	Protospacer_location	Mismatch	Identity

MGE targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_phage_ID	Hit_phage_def	Protospacer_location	Mismatch	Identity

Prophage detection

Region	Region Position	Protein_number	Hit_taxonomy	Key_proteins	Att_site	Prophage annotation

Anti-CRISPR protein detection

Acr ID	Acr position	Acr size	Homology with known anti	Neighbor HTH/AcRanker	Neighbor Aca	In prophage	Protospacer in prophage

CRISPR-Cas detection and classification

Crispr_ID: NC_022536_1

CRISPR_ID	CRISPR_location	CRISPR_type	Repeat_type	Spacer_info
NC_022536_1	270634-270709	Orphan	Consensus_repeat: TGGCCCAAAATCGGGGAGCACTTCA; Method: CRISPRCasFinder	1 spacer

The CRISPR arrays of NC_022536_1

>merge|NC_022536|1|270634-270709|CRISPRCasFinder
TGGCCCAAAATCGGGGAGCACTTCATAACCCCGGAAAATTCCAGCTCCCGCTGGCCCAAAATCGGGGAGCACTTCA

>NC_022536|1|1|270634-270709|CRISPRCasFinder
TGGCCCAAAATCGGGGAGCACTTCA	TAACCCCGGAAAATTCCAGCTCCCGC
TGGCCCAAAATCGGGGAGCACTTCA

Protein	Signature genes	Signature genes Name	Protein_function
NC_022536.1|WP_022557354.1|261420_262335_-|nodulation-factor-ABC-transporter-ATP-binding-protein-NodI	unknown	unknown	gnl|CDD|237419
NC_022536.1|WP_022557364.1|270737_271943_-|hypothetical-protein	unknown	unknown	unknown
NC_022536.1|WP_022557357.1|264798_266052_-|chitooligosaccharide-synthase-NodC	unknown	unknown	gnl|CDD|275076
NC_022536.1|WP_022557355.1|262331_264047_-|Nodulation-protein-U	unknown	unknown	gnl|CDD|280673
NC_022536.1|WP_048903080.1|278208_279132_-|LysR-family-transcriptional-regulator	unknown	unknown	gnl|CDD|176151
NC_022536.1|WP_022557361.1|268867_269149_-|acyl-carrier-protein	unknown	unknown	gnl|CDD|223314
NC_022536.1|WP_022557358.1|266065_266713_-|chitooligosaccharide-deacetylase-NodB	unknown	unknown	gnl|CDD|211966
NC_022536.1|WP_022557291.1|280092_281328_-|aminotransferase-class-I/II-fold-pyridoxal-phosphate-dependent-enzyme	unknown	unknown	gnl|CDD|99738
NC_022536.1|WP_022557359.1|266709_267300_-|NodA-family-N-acyltransferase	unknown	unknown	gnl|CDD|179111
NC_022536.1|WP_022557365.1|271939_272674_-|SDR-family-oxidoreductase	unknown	unknown	gnl|CDD|180838
NC_022536.1|WP_022557371.1|279107_280037_-|dihydrodipicolinate-synthase-family-protein	unknown	unknown	gnl|CDD|223406
NC_022536.1|WP_048903029.1|276131_278117_-|acyltransferase	unknown	unknown	gnl|CDD|224748
NC_022536.1|WP_144115371.1|269503_270603_+|IS3-family-transposase	unknown	unknown	gnl|CDD|372671
NC_022536.1|WP_022557353.1|260628_261417_-|ABC-transporter-permease	unknown	unknown	gnl|CDD|130358
NC_022536.1|WP_022557366.1|272673_273999_-|FAD-binding-oxidoreductase	unknown	unknown	gnl|CDD|223354
NC_022536.1|WP_022557368.1|274414_275578_-|UbiA-family-prenyltransferase	unknown	unknown	gnl|CDD|236195
NC_022536.1|WP_048903028.1|264077_264761_-|methyltransferase-domain-containing-protein	unknown	unknown	gnl|CDD|283141
NC_022536.1|WP_022557372.1|281409_282897_+|PLP-dependent-aminotransferase-family-protein	unknown	unknown	gnl|CDD|224089
NC_022536.1|WP_022557367.1|273995_274463_-|GtrA-family-protein	unknown	unknown	gnl|CDD|377229
NC_022536.1|WP_022557360.1|267658_268867_-|beta-ketoacyl-[acyl-carrier-protein]-synthase-family-protein	unknown	unknown	gnl|CDD|238430

Protein	Function_ID	Function_description	E-value
NC_022536.1|WP_022557354.1|261420_262335_-|nodulation-factor-ABC-transporter-ATP-binding-protein-NodI	gnl|CDD|237419	PRK13536, PRK13536, nodulation factor ABC transporter ATP-binding protein NodI.	0
NC_022536.1|WP_022557357.1|264798_266052_-|chitooligosaccharide-synthase-NodC	gnl|CDD|275076	TIGR04242, glycosyl_transferase_family_2, chitooligosaccharide synthase NodC. Members of this family are NodC, an N-acetylglucosaminyltransferase involved in the production of nodulation factors through which rhizobia establish symbioses with leguminous plants.	0
NC_022536.1|WP_022557355.1|262331_264047_-|Nodulation-protein-U	gnl|CDD|280673	pfam02543, Carbam_trans_N, Carbamoyltransferase N-terminus. This domain is found in NodU from Rhizobium, CmcH from Nocardia lactamdurans and the bifunctional carbamoyltransferase TobZ from Streptoalloteichus tenebrarius. NodU a Rhizobium nodulation protein involved in the synthesis of nodulation factors has 6-O-carbamoyltransferase-like activity. CmcH is involved in cephamycin (antibiotic) biosynthesis and has 3-hydroxymethylcephem carbamoyltransferase activity, EC:2.1.3.7 catalyzing the reaction: Carbamoyl phosphate + 3-hydroxymethylceph-3-EM-4-carboxylate <=> phosphate + 3-carbamoyloxymethylcephem. TobZ functions as an ATP carbamoyltransferase and tobramycin carbamoyltransferase. These proteins contain two domains, this is the larger, N-terminal, domain.	1.54339e-123
NC_022536.1|WP_048903080.1|278208_279132_-|LysR-family-transcriptional-regulator	gnl|CDD|176151	cd08462, PBP2_NodD, The C-terminal substrate binding domain of NodD family of LysR-type transcriptional regulators that regulates the expression of nodulation (nod) genes; contains the type 2 periplasmic binding fold. The nodulation (nod) genes in soil bacteria play important roles in the development of nodules. nod genes are involved in synthesis of Nod factors that are required for bacterial entry into root hairs. Thirteen nod genes have been identified and are classified into five transcription units: nodD, nodABCIJ, nodFEL, nodMNT, and nodO. NodD negatively auto-regulates expression of its own nodD gene, while other nod genes are inducible and positively regulated by NodD in the presence of flavonoids released by plant roots. This substrate-binding domain has significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they can interact with a cognate membrane transport complex comprised of two integral membrane domains and two cytoplasmically located ATPase domains. This interaction triggers the ligand translocation across the cytoplasmic membrane energized by ATP hydrolysis.	3.22791e-101
NC_022536.1|WP_022557361.1|268867_269149_-|acyl-carrier-protein	gnl|CDD|223314	COG0236, AcpP, Acyl carrier protein [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism].	0.00207383
NC_022536.1|WP_022557358.1|266065_266713_-|chitooligosaccharide-deacetylase-NodB	gnl|CDD|211966	TIGR04243, polysaccharide_deacetylase, chitooligosaccharide deacetylase NodB. Nodulation factors are lipooligosaccharide signalling molecules produced by rhizobia, the symbiotic nitrogen-fixing bacteria that form nodules in plants. These Nod factor systems have the NodABC genes in common but differ subtly in what they produce, which affects host range. NodB is a chitooligosaccharide deacetylase.	1.08037e-115
NC_022536.1|WP_022557291.1|280092_281328_-|aminotransferase-class-I/II-fold-pyridoxal-phosphate-dependent-enzyme	gnl|CDD|99738	cd00614, CGS_like, CGS_like: Cystathionine gamma-synthase is a PLP dependent enzyme and catalyzes the committed step of methionine biosynthesis. This pathway is unique to microorganisms and plants, rendering the enzyme an attractive target for the development of antimicrobials and herbicides. This subgroup also includes cystathionine gamma-lyases (CGL), O-acetylhomoserine sulfhydrylases and O-acetylhomoserine thiol lyases. CGL's are very similar to CGS's. Members of this group are widely distributed among all three forms of life.	4.43531e-153
NC_022536.1|WP_022557359.1|266709_267300_-|NodA-family-N-acyltransferase	gnl|CDD|179111	PRK00756, PRK00756, acyltransferase NodA; Provisional.	2.17386e-134
NC_022536.1|WP_022557365.1|271939_272674_-|SDR-family-oxidoreductase	gnl|CDD|180838	PRK07102, PRK07102, SDR family oxidoreductase.	6.0608e-128
NC_022536.1|WP_022557371.1|279107_280037_-|dihydrodipicolinate-synthase-family-protein	gnl|CDD|223406	COG0329, DapA, Dihydrodipicolinate synthase/N-acetylneuraminate lyase [Amino acid transport and metabolism / Cell envelope biogenesis, outer membrane].	4.92742e-46
NC_022536.1|WP_048903029.1|276131_278117_-|acyltransferase	gnl|CDD|224748	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].	2.59177e-66
NC_022536.1|WP_144115371.1|269503_270603_+|IS3-family-transposase	gnl|CDD|372671	pfam13683, rve_3, Integrase core domain.	1.68248e-30
NC_022536.1|WP_022557353.1|260628_261417_-|ABC-transporter-permease	gnl|CDD|130358	TIGR01291, Nodulation_protein_J, ABC-2 type transporter, NodJ family. Nearly all members of this subfamily are NodJ which, together with NodI (TIGR01288), acts to export a variety of modified carbohydrate molecules as signals to plant hosts to establish root nodules. The seed alignment includes a highly divergent member from Azorhizobium caulinodans that is, nonetheless, associated with nodulation. This model is designated as subfamily in part because not all sequences derived from the last common ancestral sequence of Rhizobium sp. and Azorhizobium caulinodans NodJ are necessarily nodulation proteins. [Cellular processes, Other, Transport and binding proteins, Other].	1.17269e-118
NC_022536.1|WP_022557366.1|272673_273999_-|FAD-binding-oxidoreductase	gnl|CDD|223354	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion].	6.57919e-28
NC_022536.1|WP_022557368.1|274414_275578_-|UbiA-family-prenyltransferase	gnl|CDD|236195	PRK08238, PRK08238, UbiA family prenyltransferase.	4.62376e-136
NC_022536.1|WP_048903028.1|264077_264761_-|methyltransferase-domain-containing-protein	gnl|CDD|283141	pfam05401, NodS, Nodulation protein S (NodS). This family consists of nodulation S (NodS) proteins. The products of the rhizobial nodulation genes are involved in the biosynthesis of lipochitin oligosaccharides (LCOs), which are host-specific signal molecules required for nodule formation. NodS is an S-adenosyl-L-methionine (SAM)-dependent methyltransferase involved in N methylation of LCOs. NodS uses N-deacetylated chitooligosaccharides, the products of the NodBC proteins, as its methyl acceptors.	3.01185e-102
NC_022536.1|WP_022557372.1|281409_282897_+|PLP-dependent-aminotransferase-family-protein	gnl|CDD|224089	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism].	2.37238e-131
NC_022536.1\|WP_022557367.1\|273995_274463_-\|GtrA-family-protein	gnl\|CDD\|377229	pfam04138, GtrA, GtrA-like protein. Members of this family are predicted to be integral membrane proteins with three or four transmembrane spans. They are involved in the synthesis of cell surface polysaccharides. The GtrA family are a subset of this family. GtrA is predicted to be an integral membrane protein with 4 transmembrane spans. It is involved in O antigen modification by Shigella flexneri bacteriophage X (SfX), but does not determine the specificity of glucosylation. Its function remains unknown, but it may play a role in translocation of undecaprenyl phosphate linked glucose (UndP-Glc) across the cytoplasmic membrane. Another member of this family is a DTDP-glucose-4-keto-6-deoxy-D-glucose reductase, which catalyzes the conversion of dTDP-4-keto-6-deoxy-D-glucose to dTDP-D-fucose, which is involved in the biosynthesis of the serotype-specific polysaccharide antigen of Actinobacillus actinomycetemcomitans Y4 (serotype b). This family also includes the teichoic acid glycosylation protein, GtcA, which is a serotype-specific protein in some Listeria innocua and monocytogenes strains. Its exact function is not known, but it is essential for decoration of cell wall teichoic acids with glucose and galactose.	6.52911e-06
NC_022536.1\|WP_022557360.1\|267658_268867_-\|beta-ketoacyl-[acyl-carrier-protein]-synthase-family-protein	gnl\|CDD\|238430	cd00834, KAS_I_II, Beta-ketoacyl-acyl carrier protein (ACP) synthase (KAS), type I and II. KASs are responsible for the elongation steps in fatty acid biosynthesis. KASIII catalyses the initial condensation and KAS I and II catalyze further elongation steps by Claisen condensation of malonyl-acyl carrier protein (ACP) with acyl-ACP.	1.78237e-171

>NC_022536.1|WP_144115371.1|269503_270603_+|IS3-family-transposase
MKASKFSEAQIAFVLKQAEDGTPIGEVCRKAGISDATFYNWRKKYAGLMPSEMKRLRQLEEENAKLKRIVADLSLDKAMLQDVLFKKALRPARKRKLVDTIKADWKVSIRRACSVLKVDRSLYVYKSRRGEQAELKLKIKDICQTRIRYGYRRVHILIKREGWSVNPKRIYRLYKEMDLQLRNKVPKRRVKAKLRADRTEPSHSNHVWAMDFVHDQLATGRKIRVLTVVDTFSRFSPVVDARFSYKGEDVVQTLERVCRQIGYPATIRVDNGSEFISRDLDLWAYHKGVVLDFSRPGKPTDNSYIESFNGKFRAECLNAHWFMSLDDARAKMEDWRRDYNEFRPHSAIGNKVPISLMSGSSASPPT
>NC_022536.1|WP_022557361.1|268867_269149_-|acyl-carrier-protein
MSDQLAKEVIATINNRALAERGEPTTATPSAEITLSTELSSLDLDSLALADILWDLEQANNIKIEMNTADAWSNLQTVGDVVMAVRSLLVKEA
>NC_022536.1|WP_022557360.1|267658_268867_-|beta-ketoacyl-[acyl-carrier-protein]-synthase-family-protein
MDGRVVITGIGGLCGLGTDVAPIWRGMCTGVSAIGPIANPELHELAGVIGCEIKTLPEHDITRRQLVSMDRFSLLAVLAAREAMQQAGLSSEEGNPYRFGAAVGVGVCGWDAIEENYRALLLNGAKRAEVLTAPRVMPCAAAGQVSMHFGLRGPVFGASSACASANHAIALAVDQIRLGRADVMLAGGSDAPLVWGVLKSWEALRILAPDTCRPFSADRKGVVLGEGAGIAVLESYEHASRRGASVLAEIAGIGLSADAFDIVAPAVEGPEAAMRACLEDARLNVEDVDYLNAHGTGTKANDQVETAAIKRVFSEHAYSMSISSTKSVHAHCLGAASALEMIACVMAIREGIIPPTANYNERDPNCDLDVTPNVPRERKIRVALSNAFAMGGLNAVLAFRQV
>NC_022536.1|WP_022557359.1|266709_267300_-|NodA-family-N-acyltransferase
MCSDVRWKICWETELQVDDHAELSAFFRNTYGPTGAFNAQPFEGGRSWAGARPEMRVIAYDSRGVAAHMGLLRRFIKVGEVDLLVGELGLWGVRADLEGLGLSHSMFTMYPELQRLGVPFAFGTVRHALYKHVERLCRGGIATILPGVRVRSTLPEVYLDLPATRVEDPLAVVFPIARSMDEWPSGTLIDRNGPEL
>NC_022536.1|WP_022557358.1|266065_266713_-|chitooligosaccharide-deacetylase-NodB
MKQLNYTCKVSSNSADRSVYLTFDDGPNPFCTPDILDVLAERRVPATFFVIGAYAADQPALIQRMVAEGHAVGNHTMTHPDLTTCRLEEIEYQITEASSAIKAASPQAAPKHMRAPYGLWNEDVLSISERAGLTPVHWSVDPRDWSRPGVNVIVDAVLNSVQPGAIVLLHDGCPPSELVGQSLSGLRDQTLMALSRVITGLHERGFFIRLLSQHN
>NC_022536.1|WP_022557357.1|264798_266052_-|chitooligosaccharide-synthase-NodC
MDLFGTAGTVAISLYALLSGVYKGMQVLYAPPASFFSASPGSSPFHALASVDVIVPCFNEDPDTLSACLESIANQDYAGRLQVFVVDDGSTNREDLAPVHNKYARDPRFNFILLSTNAGKRKAQIAAIRRSSGDLLLNVDSDTMLAPDVVTKLVHRMRDPSIGAVMGQLVARNRSDTWLTRLIDMEYWLACNEERAAQARFGAVMCCCGPCAMYRRSALLLLLDQYETQLFRGKPSDFGEDRHLTILMLKAGFRTEYVADAIAATVVPARLAPYLRQQLRWARSTFRDTWLALRLLPSLDRYLTLDVIGQNLGPLLLAVSVLMGLTQIAVAATVPWSTIILIAFMTIVRSSVAALRARQVRFLAFAAHTPINLFLILPLKAYALCTLSNSAWLSRTSILQSPPRGAETSSLKSLPMD
>NC_022536.1|WP_048903028.1|264077_264761_-|methyltransferase-domain-containing-protein
MLQLTARPRAGKSNIVCRISHKRCRKLSQKKHYQLLHDELAEDDPWRLDSNPFEQERHRQTLRLALAQQSITHALEVGCAAGAFTEKLAPHCEQLTVIDVVPHALARTRRRLMDPPNISWISCDILQFATQQFFDLIVVAEVLYYLESVAEMRTVVRNLAQMLAPSGHLIFGSAGDASCQRWGHVAGAETVITILDEELVQIDRVRCVGQTTNEDCLLTRYRHPVSQ
>NC_022536.1|WP_022557355.1|262331_264047_-|Nodulation-protein-U
MRICGIKLTHDGAVALIENGKLIFCIEQEKRNNNSRYQAIDNLDAIVEALNDHGLSVGDVDQFVIDGWDGELESEFQVFSEGAPITLSGAPYVERWPESPLKPHDSSGLALSGSILPYKSFPHVISHVVSAYCTSPFAKAGDPSFCLVWDGCIFPRLYYAEPKGVRLVKCLFPMIGHAYAVAGHHFGPFRNADPQSWDLGVAGKLMAYIALGAAQENILNVFRELYEEHFAGETLIAVNYRENIHNADALLACVNDYFDASASRLQGEKPQDVLASFHVFLERLLVGEMTNALQMQSQFESRNLCVVGGCGLNIKWNSALRASGLFESVWVPPFPNDSGSAIGAACCALAVDRGLAPLDWSVYSGPKLKSSAIPQGWKAAPCTLAELATILASNEPVVFLAGRAELGPRALGGRSILAAGTSPQMKDHLNKVKFREHFRPVAPICLEDRARDIFDPGTPDPYMLFDHTTRPEWRERIPAVVHLDGTARLQTISRDSEHEVAKLLVEYESLTGIPLLCNTSANYNGRGFFPDAAAACEWGQIGHVWCDGLLLTKASEVDGSPAGCADFSASA
>NC_022536.1|WP_022557354.1|261420_262335_-|nodulation-factor-ABC-transporter-ATP-binding-protein-NodI
MTTIAISFIEVTKTYIDRTVVDRFSFAVKKGECFGLLGPNGAGKSTIARMVLGMTPPDEGKITVLGAPVPAQARLARASIGVVPQFDDLDQEFTVRENLLVFGRYFNMSTRQIEAAIPSLLEFARLESKADARVVGLSGGMKRRLMLARALINDPQLLVLDEPTTGLDPHARHLIWERLRSLLARGKTILLTTHFMEEAERLCDRLCVLEEGQKIAEGRPQGLIDEQIGCQVIEINGGNPHELRALIKTCAQRIEVSGETLFCYSSTPEQVRIKLREHTNLRLLQRPPNLEDVFLRLTGREMKD
>NC_022536.1|WP_022557353.1|260628_261417_-|ABC-transporter-permease
MWENYAAVLPANGWNWTAVWRRNYLAWKKAALVSILGNLADPMIYLFGLGTGLGLMVGRVEGMSYIAFLATGMVAASAMTASTFETIYAAFARMRDQRSWEAILYTQITLGDIVLGEVIWAATKALLAGTAIAVVAAILGYSVWSSIPYVVPVIALTGIAFASLAMIVAALAPSYDYFIFYQTLILTPMLFLSGAVYPVTQLPGKVQQMATFLPLAHSIDLIRPAMFGRPAADVIFHLGMLFVFGVLPFFVSTALLRRRLMS
>NC_022536.1|WP_022557364.1|270737_271943_-|hypothetical-protein
MIRFVPTTKQIEYFIAVCFIIAGMFGLIWIFIAQEPFRITNGSEWDAVYYDRLLRLLAAEGGLQLRIPFPYCARVGTPWILVNIFHNRSSFYEFNLVVSGLFAATLLFATRSLWHGSIKGLTAVIGASSFLYFAPVKFTNFYPAYMDPPFLLVLSLCLIFIIKKNYLLASIICIAGIPFREASFYLLPLLIGFYIKNAQISIGVWVISISIIICGFLLKELMLFVSDCDSQSQLITAIFWFYRFLSEPAHVLGSIAAISLTLGPLYVVLDKQTLTGIKSDDTVIFSIIASVYSGFLSIVGGSDVTRIFYSFLPFYMPLLIKCFKVSSLTSFVLSCFGWLLTNHMLQKYEQPISEGPNKDILGFFAQFPDYGHPTIALVVLGIWFVLAMSRTLIEPLEGYLE
>NC_022536.1|WP_022557365.1|271939_272674_-|SDR-family-oxidoreductase
MKPSVLILGARSDIGNAVAHKFAAQGHPIQLAARQSETLDAEKTNLQLRYGVPVTLHEFDALLTETHAQFLAMLPELPEVAVSVVGLMESQERSERDHLLARCIMRSNYEGPANLLALLANRFEERGSGTLVGLSSVAGERGRATNYVYGSAKAGFTAFLSGLRSRFAKSDVHVVTVLPGYVATKMTEGMNLPAWLTAQPSEVAESIVVAVERKKNVIYVRPVWRMIMLIIRLIPERLFKRVRM
>NC_022536.1|WP_022557366.1|272673_273999_-|FAD-binding-oxidoreductase
MKLSGWGRSPLVDAQVYMPRDLEALQKLLASRPSMIARGWGRAYGDSAINSSATIDMRHLNRMLAFDPKTGQLIAEAGVVLYDIIAAFLPRGWFPMVTPGTKFVTLGGMIAADVHGKNHRKHGSFRGCVDWIDVMGPDGSIQRCSSNSHVELYEHTLGGMGLTGIIIRAAVRLRTVETGWIRRTTIPAPNLRSAMTALEGAQDSTYSVAWIDCLGTGKNLGRSLVFLGDHANTSDLPIYRSAHPFATPARRKLSVPFNFPCFALNQLSLRAFNALYYRIGLWNRGQQLIDWDSYFYPLDAVTDWNRIYGRKGFAQFQCVIPIKNSEEGLSALLKTVAKAGAGSFLAVLKRFGPQESCFSFPMEGYSLALDFPITTKTSRLLANLDRVTIEHGGRFYLAKDSRMSAETLRASDGRVASFVRVRAKNGWKSSFQSAQAERLVL
>NC_022536.1|WP_022557367.1|273995_274463_-|GtrA-family-protein
MPSNHSGLRYRRSAEVKRQGLALRYAAFALIAMVVNVTGQHVVLHFGNTSAIFALAMCAGTIAGLMIKYLLDKFWIFGDREIGLINDGWKFSLYTAVGALTTAIFWSAEAAAWWIWKTELMHDLGAAMGLTIGYLVKYQLDKRFVFAGHRRRISS
>NC_022536.1|WP_022557368.1|274414_275578_-|UbiA-family-prenyltransferase
MSGGSKDEKAYVHPSPGDTGTIGFEHLVTHHPFDSTVFKRTVRPRVFTVAPYGLENARANDAAATLDGKSDGCDGFADIRCGGGQTMSETPRIKARAFKHYVNAFRPHQWLKNILVFLPALAAHKLDWPTLLSSLEAFVCFSLVASSVYVMNDLLDVCADRAHPRKRYRPFASHSIPTAHGTWMVVGLVLPGVLIAIFIGWSFFLVVAVYFLVTTAYSLHLKRRIVIDLCILAGLYTIRIVAGGIATSTPLSVLLIAFSVFFFLSLAAVKRQSELVDGAERGSLQATGRGYHVNDLPIISMIAVGAGYVSVLVMTYYVNSPVVMELYPHPQMLWGVCAVLLYWITRTVMVSHRGNMHDDPVIYAAQDRTSQVCLAIILVFVTGGVLR
>NC_022536.1|WP_048903029.1|276131_278117_-|acyltransferase
MARTEPHFFRMDIEGLRALAVSGVIAFHFGMTSVPGGFVGVDIFFVISGYLITRHIQLEIERTGSLDLLRFYARRARRLLPASCFVILATLFFGYFILSPPEQQLYSKGSFYASAYMINIWLISWAADYFAPDAFNNPFIHFWSLSVEEQFYLVWPALLLLFARLRPGRYGLFLPVVLMGVISFAFCWYYTAISQPWAFYFSPFRAWEFACGGLALMISEEAAKRFRLTPVFGWTGIGLIMTAYLGMSEDVPFPGLTALVPVAGTVMVLLSGTRPGPAGPQVLLSLPPLQWLGRISYSLYLWHWPVIVYSGILKPELTTFERFLCLALILGLSVFSYSFIENPMRRNPWLLARTSRSLGFAALLTACGAAAAYGSARVANHNIDLQQNLILRSAERDSSARQFDDGCLLNAQQVQPKPCEFGAIPPGKTIVLFGDSHADHWSTPLISIAKSNGWQLVTYLKSSCPAADVTIWNSMLMRNYEECDRWRQLAMREIATRKPDMVIISEYSSAYVKNDINVVSIHQIDATTWAQGLRRTVDALESAVTKIAVLRDGPVHKTYLDKCVARALWQKRGAETCDTPRSGAMEETIPDAERKAVSDFGNASYVDITDVFCNATTCPAMIGGKLTFRDRHHIATPFAATLATPLQRALFEMMNAGTPTN
>NC_022536.1|WP_048903080.1|278208_279132_-|LysR-family-transcriptional-regulator
MRFKGLDLNLLVALDALMTERNLTVAARSINLSQPAMSAAVGRLREYFRDDLFTMNGRELCLTPRAEGLAPVVRDVLLKIRCSIISWEPFNPSQSERRFRILLSDFMTLVFFDRVIARVAREAPAVSFELLPLDNDPNELLRRGEVDFLLLPDLYMANSHPTAKLFDEKLVCVSCPTNNAVPRELTFEQYMSMGHVAVMFGRLLRPSIEEWYLLEHGIKRRLEVVVQGFSFIPQMLSGTNRIATMPLRLVEYFEKTIPLRIINLPLPLPAFTEAVQWPALHNGDPASIWMRQILIEEASRMAVSHNA
>NC_022536.1|WP_022557371.1|279107_280037_-|dihydrodipicolinate-synthase-family-protein
MAFLSGLSAFPITPSDLNGRVDTAALKRLVARLCIAGVDSIGLLGSTGTYMYLCREERRRALDAAIQEANGVPVVAGVGALRTDEAVRLAQDAKAAGAAAGLLAAVSYAPLTEDEVFEHFSTVTRKSGLLIVIYDNPGTTHFRFTAALVERMAHRAGVGLRQSNGWHRRITKALLYIVWALPRADRHRPACKRSCLHARCRHHRTYRGNSGDHGSLQDRPSSNSRGDVDDRTSVGVGHRLAVRAGITMPLEREPWSSRRLFWPVRSAGQQRQCCQWPLHELRSQSLSGMITENDPQHGWFQTCASKALI
>NC_022536.1|WP_022557291.1|280092_281328_-|aminotransferase-class-I/II-fold-pyridoxal-phosphate-dependent-enzyme
MTGGSEHDNRQRQSDHLHAASFGFDTRAIHHAFSPVDFKRAVQPPVFLTSTYGFESVEANDAAAALGGRLYAREYNPTTEILEQRLANLERAEAGLVVSTGMAAFGTLILSLLSQGDELVVHKTLYSNSVAMVEQGLPRFGIKVIPVDLSDPSNLDAAITERTKLVYFETPVNPLSSILDIAAISERARARGVKVAVDSTFASPALQRPIEHGADIVLHSLTKYINGHGDTLGGALLGDAETLHRLHETGLRYITGATLAPHSAFLIMRGLKTLSLRMDRHSASALAIARMLEAHPAVSWVSYPFLESHPDQAIARKQMTQGSGMLAFGLHAGFDGARNMLDRLQLMTRAVSLGDTDTLIYHPASITRARQSIRKDAHMVSGVGDDLIRLSVGLEDVTDLIGDLRQALATL
>NC_022536.1|WP_022557372.1|281409_282897_+|PLP-dependent-aminotransferase-family-protein
MVQSENQAAAVHAPRMGARRIYEALKSQILSRVYEAGSQLPSSRSLANELHVSRTTVTVAYEQLAAEGFVELRQGARPRVTALELRQRPRESDTTLEAFGPLSAYGERLRALSPWLDYLPTNLAVDFRYGDLAPSDFPALAWKRAINSVLTQRQGRLSYEDPRGSRRLRQALQGYLWRARTLQCDLEQIIVVNGSQQGLDLCARILLDANSAFVMENPGYRMARQIFSSTGASAVAVDADAGGLKTLDLSGIDARMAYVTPSHQFPLGGVMPISRRHQLLAWARDRDAYVVEDDYDSEYRYDISPVPPLQSLAEGRNVIYLGTVSKTLSPMMRIGYLVVPKQLQEVFATAKQLTDRHTPMTEQEALAFLIESGAYESHVRRVRRLNRERRETLLSALETAFGDRITIEGADAGLHVVVWFNELPGSAEIALMDAARQRGVGLYGISLLYDSAPWASEAPRERLGLVMGYSALTPRQIEKGIQLVAPAVDAVKGAG
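
The FASTA headers above pack four pipe-separated fields into one line: contig accession, protein accession, a `start_end_strand` location, and a hyphen-joined product name. A minimal Python sketch for splitting such a header is given here; the `ProteinHeader` type and `parse_header` function are hypothetical names chosen for illustration, and the field layout is inferred from the records in this report rather than from any documented format.

```python
from typing import NamedTuple


class ProteinHeader(NamedTuple):
    """Fields inferred from headers such as
    >NC_022536.1|WP_144115371.1|269503_270603_+|IS3-family-transposase
    """
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    description: str


def parse_header(line: str) -> ProteinHeader:
    # Drop the leading '>' and split into at most four fields, so a
    # description that happened to contain '|' would stay in one piece.
    contig, protein_id, location, description = (
        line.strip().lstrip(">").split("|", 3)
    )
    # Location is assumed to be start_end_strand, e.g. 269503_270603_+
    start, end, strand = location.split("_")
    return ProteinHeader(contig, protein_id, int(start), int(end), strand, description)


if __name__ == "__main__":
    header = ">NC_022536.1|WP_144115371.1|269503_270603_+|IS3-family-transposase"
    print(parse_header(header))
```

The description is left hyphen-joined on purpose, because some product names (for example `beta-ketoacyl-[acyl-carrier-protein]-synthase-family-protein`) contain hyphens of their own that cannot be told apart from the separators.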

Self-targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_ID	Protospacer_location	Mismatch	Identity

MGE targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_phage_ID	Hit_phage_def	Protospacer_location	Mismatch	Identity
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP020910	Rhizobium etli strain NXC12 plasmid pRetNXC12d, complete sequence	87739-87764	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP032928	Agrobacterium tumefaciens strain 1D1460 plasmid pAt1D1460, complete sequence	500785-500810	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP030762	Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed2, complete sequence	404459-404484	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NC_011981	Agrobacterium vitis S4 plasmid pAtS4e, complete sequence	454408-454433	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NC_011981	Agrobacterium vitis S4 plasmid pAtS4e, complete sequence	628807-628832	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NC_011981	Agrobacterium vitis S4 plasmid pAtS4e, complete sequence	430718-430743	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	CP007645	Rhizobium etli bv. phaseoli str. IE4803 plasmid pRetIE4803d, complete sequence	226557-226582	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP039909	Agrobacterium tumefaciens strain CFBP6624 plasmid pAtCFBP6624, complete sequence	339965-339990	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NC_022536	Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence	270659-270684	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP030763	Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence	301633-301658	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP049249	Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence	876349-876374	0	1.0
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NC_022536	Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence	270608-270633	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP039694	Agrobacterium larrymoorei strain CFBP5473 plasmid pTiCFBP5473, complete sequence	121030-121055	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP053858	Rhizobium pusense strain 76 plasmid pR76, complete sequence	285789-285814	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP053209	Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence	287296-287321	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP050084	Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence	447643-447668	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP048425	Rhizobium daejeonense strain KACC 13094 plasmid unnamed2, complete sequence	109324-109349	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP032697	Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525a, complete sequence	142583-142608	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP017148	Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence	96971-96996	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP017148	Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence	98932-98957	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP048284	Rhizobium leguminosarum bv. viciae 248 plasmid pRle248b, complete sequence	113370-113395	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP016287	Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence	966649-966674	1	0.962
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP017943	Phyllobacterium zundukense strain Tri-48 plasmid unnamed2, complete sequence	367864-367889	2	0.923
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	LR606149	Rhizobium sp. Q54 genome assembly, plasmid: 6	209586-209611	2	0.923
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP025505	Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvC, complete sequence	234493-234518	2	0.923
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP017105	Rhizobium gallicum strain IE4872 plasmid pRgalIE4872d, complete sequence	1719677-1719702	2	0.923
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP053209	Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence	424848-424873	3	0.885
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP050084	Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence	308224-308249	3	0.885
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP032696	Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525b, complete sequence	352346-352371	3	0.885
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	LR606149	Rhizobium sp. Q54 genome assembly, plasmid: 6	14950-14975	4	0.846
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP007796	Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence	274362-274387	4	0.846
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP018781	Ochrobactrum pituitosum strain AA2 plasmid pOAAA2, complete sequence	291106-291131	4	0.846
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP020900	Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence	120244-120269	4	0.846
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP033321	Azospirillum brasilense strain Cd plasmid p3, complete sequence	244428-244453	5	0.808
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP033315	Azospirillum brasilense strain Sp 7 plasmid p3, complete sequence	214007-214032	5	0.808
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP032349	Azospirillum brasilense strain MTCC4039 plasmid p3, complete sequence	617405-617430	5	0.808
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP032342	Azospirillum brasilense strain MTCC4038 plasmid p3, complete sequence	520429-520454	5	0.808
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP033323	Azospirillum brasilense strain Cd plasmid p5, complete sequence	108219-108244	5	0.808
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP050083	Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence	224557-224582	5	0.808
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP049733	Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence	215510-215535	5	0.808
NC_022536_1	1.1\|270659\|26\|NC_022536\|CRISPRCasFinder	270659-270684	26	NZ_CP053207	Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence	246322-246347	5	0.808
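
The Identity values in the table above are consistent with (Spacer_length - Mismatch) / Spacer_length rounded to three decimals, and each Protospacer_location spans the same number of bases as the spacer. The Python sketch below reproduces that arithmetic; the formula is inferred from the rows in this report and is an assumption about how the values were derived, and the function names are hypothetical.

```python
def identity(spacer_length: int, mismatch: int) -> float:
    """Fraction of spacer positions that match, rounded to three decimals."""
    return round((spacer_length - mismatch) / spacer_length, 3)


def span_length(location: str) -> int:
    """Length of an inclusive 'start-end' range such as '87739-87764'."""
    start, end = (int(part) for part in location.split("-"))
    return end - start + 1


if __name__ == "__main__":
    # Recompute the Identity column for the mismatch counts seen in the table.
    for mismatch in range(6):
        print(mismatch, identity(26, mismatch))
    print(span_length("270659-270684"))
```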

1. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP020910 (Rhizobium etli strain NXC12 plasmid pRetNXC12d, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

2. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032928 (Agrobacterium tumefaciens strain 1D1460 plasmid pAt1D1460, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

3. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP030762 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed2, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

4. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_011981 (Agrobacterium vitis S4 plasmid pAtS4e, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

5. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_011981 (Agrobacterium vitis S4 plasmid pAtS4e, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

6. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_011981 (Agrobacterium vitis S4 plasmid pAtS4e, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

7. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to CP007645 (Rhizobium etli bv. phaseoli str. IE4803 plasmid pRetIE4803d, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

8. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP039909 (Agrobacterium tumefaciens strain CFBP6624 plasmid pAtCFBP6624, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

9. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_022536 (Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

10. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP030763 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

11. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP049249 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence) position: , mismatch: 0, identity: 1.0

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgc	Protospacer
**************************

12. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_022536 (Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgt	Protospacer
*************************.

13. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP039694 (Agrobacterium larrymoorei strain CFBP5473 plasmid pTiCFBP5473, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
aaaccccggaaaattccagctcccgc	Protospacer
 *************************

14. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053858 (Rhizobium pusense strain 76 plasmid pR76, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcccgt	Protospacer
*************************.

15. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053209 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
caaccccggaaaattccagctcccgc	Protospacer
.*************************

16. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP050084 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
caaccccggaaaattccagctcccgc	Protospacer
.*************************

17. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP048425 (Rhizobium daejeonense strain KACC 13094 plasmid unnamed2, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcacgc	Protospacer
********************** ***

18. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032697 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525a, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcctgc	Protospacer
***********************.**

19. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017148 (Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcgcgc	Protospacer
********************** ***

20. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017148 (Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence) position: , mismatch: 1, identity: 0.962

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctcgcgc	Protospacer
********************** ***

21. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP048284 (Rhizobium leguminosarum bv. viciae 248 plasmid pRle248b, complete sequence) position: , mismatch: 1, identity: 0.962

-taaccccggaaaattccagctcccgc	CRISPR spacer
ataacccc-gaaaattccagctcccgc	Protospacer
 ******* ******************

22. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP016287 (Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence) position: , mismatch: 1, identity: 0.962

-taaccccggaaaattccagctcccgc	CRISPR spacer
ataacccc-gaaaattccagctcccgc	Protospacer
 ******* ******************

23. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017943 (Phyllobacterium zundukense strain Tri-48 plasmid unnamed2, complete sequence) position: , mismatch: 2, identity: 0.923

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagctccggg	Protospacer
*********************** *

24. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to LR606149 (Rhizobium sp. Q54 genome assembly, plasmid: 6) position: , mismatch: 2, identity: 0.923

taaccccggaaaattccagctcccgc	CRISPR spacer
taactccggaaaactccagctcccgc	Protospacer
****.********.************

25. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP025505 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvC, complete sequence) position: , mismatch: 2, identity: 0.923

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggaaaattccagcttctgc	Protospacer
*********************.*.**

26. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017105 (Rhizobium gallicum strain IE4872 plasmid pRgalIE4872d, complete sequence) position: , mismatch: 2, identity: 0.923

taaccccggaaaattccagctcccgc	CRISPR spacer
taaacccggaaaattccagttcccgc	Protospacer
*** ***************.******

27. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053209 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence) position: , mismatch: 3, identity: 0.885

taaccccggaaaattccagctcccgc	CRISPR spacer
aaaaccccgaaaattccagctcccgc	Protospacer
 ** *** ******************

28. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP050084 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence) position: , mismatch: 3, identity: 0.885

taaccccggaaaattccagctcccgc	CRISPR spacer
aaaaccccgaaaattccagctcccgc	Protospacer
 ** *** ******************

29. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032696 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525b, complete sequence) position: , mismatch: 3, identity: 0.885

taaccccggaaaattccagctcccgc	CRISPR spacer
caaaaccggaaaattccagctcccgc	Protospacer
.**  *********************

30. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to LR606149 (Rhizobium sp. Q54 genome assembly, plasmid: 6) position: , mismatch: 4, identity: 0.846

taaccccggaaaattccagctcccgc	CRISPR spacer
atgccccggaaaattccagctctcgc	Protospacer
  .*******************.***

31. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP007796 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence) position: , mismatch: 4, identity: 0.846

taaccccggaaaattccagctcccgc	CRISPR spacer
ccgccccggaaaattccagctcgcgc	Protospacer
. .******************* ***

32. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP018781 (Ochrobactrum pituitosum strain AA2 plasmid pOAAA2, complete sequence) position: , mismatch: 4, identity: 0.846

taaccccggaaaattccagctcccgc	CRISPR spacer
taaacccggaaaattccagctctccg	Protospacer
*** ******************.*

33. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP020900 (Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence) position: , mismatch: 4, identity: 0.846

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggagaattccaggtccggg	Protospacer
**********.******** *** *

34. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP033321 (Azospirillum brasilense strain Cd plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
gcggcccggaaaattccagctcgcgc	Protospacer
  . ****************** ***

35. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP033315 (Azospirillum brasilense strain Sp 7 plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
gcggcccggaaaattccagctcgcgc	Protospacer
  . ****************** ***

36. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032349 (Azospirillum brasilense strain MTCC4039 plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
gcggcccggaaaattccagctcgcgc	Protospacer
  . ****************** ***

37. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032342 (Azospirillum brasilense strain MTCC4038 plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
gcggcccggaaaattccagctcgcgc	Protospacer
  . ****************** ***

38. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP033323 (Azospirillum brasilense strain Cd plasmid p5, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
gcggcccggaaaattccagctcgcgc	Protospacer
  . ****************** ***

39. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP050083 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggagaattccaggtccgtg	Protospacer
**********.******** ***

40. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP049733 (Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggagaattccaggtccgtg	Protospacer
**********.******** ***

41. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053207 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence) position: , mismatch: 5, identity: 0.808

taaccccggaaaattccagctcccgc	CRISPR spacer
taaccccggagaattccaggtccgtg	Protospacer
**********.******** ***
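
In the alignments above, the third line marks each column: `*` where spacer and protospacer agree, `.` where the two bases differ but are both purines (a/g) or both pyrimidines (c/t), and a blank for any other difference, including gap columns. This reading is inferred from the alignments in this report, not taken from the tool's documentation. A short Python sketch under that assumption, with the hypothetical function name `match_line`:

```python
PURINES = {"a", "g"}
PYRIMIDINES = {"c", "t"}


def match_line(spacer: str, protospacer: str) -> str:
    """Build the per-column marker line for two aligned, equal-length strings."""
    if len(spacer) != len(protospacer):
        raise ValueError("aligned sequences must have the same length")
    marks = []
    for s, p in zip(spacer.lower(), protospacer.lower()):
        if s == p and s != "-":
            marks.append("*")   # identical bases
        elif {s, p} <= PURINES or {s, p} <= PYRIMIDINES:
            marks.append(".")   # purine<->purine or pyrimidine<->pyrimidine
        else:
            marks.append(" ")   # any other substitution, or a gap column
    return "".join(marks)


if __name__ == "__main__":
    spacer = "taaccccggaaaattccagctcccgc"
    protospacer = "taaccccggaaaattccagctcccgt"
    line = match_line(spacer, protospacer)
    print(spacer)
    print(protospacer)
    print(line)
    # Columns not marked '*' are the differing positions.
    print(len(line) - line.count("*"))
```

Marker lines that end in a blank column appear shorter in this report because trailing whitespace is not preserved; the function returns the full-length line.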

Prophage detection

Region

Region Position

Protein_number

Hit_taxonomy

Key_proteins

Att_site

Prophage annotation

DBSCAN-SWA_1

0 : 2246

Ochrobactrum_phage(100.0%)

Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
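
A keyword check of this kind can be sketched in Python as follows. The keyword list is copied from the sentence above, which presents it as examples rather than a complete list; the case-insensitive substring matching and the function name `phage_keywords_in` are assumptions made for illustration, not the tool's documented behavior.

```python
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(definition: str) -> list[str]:
    """Return the keywords found in a protein definition.

    Matching is a case-insensitive substring test (an assumption), so a
    keyword embedded in a longer word also counts as a match.
    """
    text = definition.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    for definition in ("IS3 family transposase",
                       "plasmid partitioning protein RepB"):
        print(definition, "->", phage_keywords_in(definition))
```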

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048902998.1\|1220_2246_+	plasmid partitioning protein RepB	A0A240F4U0	Ochrobactrum_phage	1.0e-29	33.5

DBSCAN-SWA_2

7014 : 10159

uncultured_Caudovirales_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557053.1\|7014_7740_-	arsenical resistance protein ArsH	A0A2H4J5V6	uncultured_Caudovirales_phage	1.1e-86	72.5
WP_022557054.1\|7714_8137_-	arsenate reductase (glutaredoxin)	A0A2H4J8T1	uncultured_Caudovirales_phage	1.2e-45	63.5
WP_022557055.1\|8142_9183_-	ACR3 family arsenite efflux transporter	NA	NA	NA	NA
WP_022557057.1\|9628_10159_-	arsenate reductase ArsC	A0A2H4J8A6	uncultured_Caudovirales_phage	6.3e-28	43.3

DBSCAN-SWA_3

15131 : 15425

Moumouvirus(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_173402666.1\|15131_15425_-	methylated-DNA--[protein]-cysteine S-methyltransferase	M1PC92	Moumouvirus	5.6e-10	54.5

DBSCAN-SWA_4

20804 : 28711

Aureococcus_anophage(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557080.1\|20804_23813_+	DEAD/DEAH box helicase	A0A076FHE1	Aureococcus_anophage	1.5e-25	25.7
WP_022557081.1\|23878_24736_+	conserved exported protein of unknown function	NA	NA	NA	NA
WP_022557082.1\|24732_25308_+	hypothetical protein	NA	NA	NA	NA
WP_022557083.1\|25309_28711_+	UvrD-helicase domain-containing protein	A7KV33	Bacillus_phage	3.4e-42	25.5

DBSCAN-SWA_5

39414 : 42406

Acanthamoeba_polyphaga_mimivirus(33.33%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557097.1\|39414_40437_+	bifunctional DNA-binding transcriptional regulator/O6-methylguanine-DNA methyltransferase Ada	A0A0G2Y1B6	Acanthamoeba_polyphaga_mimivirus	2.4e-15	49.4
WP_022557098.1\|40520_41294_-	SOS response-associated peptidase	A0A291AUP1	Sinorhizobium_phage	1.1e-41	38.5
WP_022557099.1\|41353_42406_-	ATP-dependent DNA ligase	A0A068CDF3	Rhizobium_phage	5.9e-126	62.2

DBSCAN-SWA_6

57497 : 59654

Bacillus_phage(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557116.1\|57497_58247_+	SDR family oxidoreductase	W8CYX9	Bacillus_phage	2.2e-10	46.8
WP_022557117.1\|58319_58868_+	flavin reductase family protein	NA	NA	NA	NA
WP_022557118.1\|58922_59654_+	SDR family oxidoreductase	A0A0N9R355	Chrysochromulina_ericina_virus	1.2e-08	29.0

DBSCAN-SWA_7

64390 : 72532

Trichoplusia_ni_ascovirus(40.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557125.1\|64390_65377_+	alpha/beta hydrolase	M1PGN2	Moumouvirus	3.0e-07	24.0
WP_020808317.1\|65422_66232_-	SDR family NAD(P)-dependent oxidoreductase	Q06VL0	Trichoplusia_ni_ascovirus	6.1e-14	29.2
WP_034499614.1\|66278_66659_+	helix-turn-helix transcriptional regulator	NA	NA	NA	NA
WP_048903004.1\|66815_67349_+	TetR/AcrR family transcriptional regulator	NA	NA	NA	NA
WP_022557127.1\|67429_68848_+	MFS transporter	A0A0M3UL24	Mycobacterium_phage	2.0e-44	34.0
WP_013637163.1\|68936_69692_+	SDR family oxidoreductase	Q06VL0	Trichoplusia_ni_ascovirus	5.5e-17	34.1
WP_022557128.1\|69958_70648_+	aspartate/glutamate racemase family protein	NA	NA	NA	NA
WP_022557129.1\|70699_71611_-	AraC family transcriptional regulator	NA	NA	NA	NA
WP_020808319.1\|71704_72532_+	oxidoreductase	I7B2R4	Escherichia_phage	7.4e-07	27.8
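
The percentage shown next to each hit taxonomy appears consistent with the share of proteins with a viral hit that map to the most frequent taxon, with ties going to the taxon listed first. The sketch below reproduces the value for this region under that assumption; it is inferred from the tables in this report rather than from the tool's documentation, and `top_hit_taxonomy` is a hypothetical name.

```python
from typing import List, Optional


def top_hit_taxonomy(hit_defs: List[Optional[str]]) -> str:
    """Format the most frequent hit taxon as 'Taxon(percent%)'.

    Assumptions: proteins without a hit (None) are ignored, and ties are
    resolved in favour of the taxon encountered first.
    """
    counts = {}
    for hit in hit_defs:
        if hit is not None:
            counts[hit] = counts.get(hit, 0) + 1
    if not counts:
        raise ValueError("no proteins with a hit")
    # max() returns the first key with the highest count (dict keeps insertion order).
    top = max(counts, key=counts.get)
    percent = round(100 * counts[top] / sum(counts.values()), 2)
    return f"{top}({percent}%)"


# Hit_Def column of DBSCAN-SWA_7, with NA entries written as None.
swa_7 = [
    "Moumouvirus", "Trichoplusia_ni_ascovirus", None, None,
    "Mycobacterium_phage", "Trichoplusia_ni_ascovirus", None, None,
    "Escherichia_phage",
]
print(top_hit_taxonomy(swa_7))  # Trichoplusia_ni_ascovirus(40.0%)
```

Here two of the five proteins with hits map to Trichoplusia_ni_ascovirus, giving 40.0%.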

DBSCAN-SWA_8

82882 : 84894

Pandoravirus(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_034499639.1\|82882_83311_+	RidA family protein	S4W0G3	Pandoravirus	1.3e-07	32.7
WP_003519970.1\|84684_84894_+	cold-shock protein	A0A218MMZ6	uncultured_virus	1.2e-09	54.7

DBSCAN-SWA_9

93048 : 93810

Trichoplusia_ni_ascovirus(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557160.1\|93048_93810_+	SDR family oxidoreductase	Q06VL0	Trichoplusia_ni_ascovirus	3.1e-12	31.0

DBSCAN-SWA_10

99508 : 103621

Bacillus_virus(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557167.1\|99508_100573_+	ABC transporter ATP-binding protein	G3M9Y6	Bacillus_virus	8.2e-27	34.0
WP_048903007.1\|100572_101451_+	MurR/RpiR family transcriptional regulator	NA	NA	NA	NA
WP_084317464.1\|101494_103621_-	HAMP domain-containing protein	A0A2H4J162	uncultured_Caudovirales_phage	7.9e-13	38.1

DBSCAN-SWA_11

109468 : 116886

Emiliania_huxleyi_virus(25.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557178.1\|109468_111598_+	ParB/RepB/Spo0J family partition protein	G8DH78	Emiliania_huxleyi_virus	5.7e-35	29.3
WP_022557179.1\|111702_112359_+	hypothetical protein	NA	NA	NA	NA
WP_022557180.1\|112433_113624_+	hypothetical protein	A0A1V0DX75	Synechococcus_virus	1.5e-72	42.2
WP_034499700.1\|113730_113985_+	hypothetical protein	NA	NA	NA	NA
WP_022557183.1\|114112_114565_+	hypothetical protein	NA	NA	NA	NA
WP_022557185.1\|114577_115615_+	toprim domain-containing protein	A0A0U4B0G9	Pseudomonas_phage	5.1e-13	36.6
WP_022557186.1\|115944_116886_+	DUF2493 domain-containing protein	A0A291L9X7	Bordetella_phage	1.1e-19	41.4

DBSCAN-SWA_12

120621 : 120900

Burkholderia_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557192.1\|120621_120900_-	HU family DNA-binding protein	B5TA87	Burkholderia_phage	3.4e-09	43.7

DBSCAN-SWA_13

124187 : 125273

Escherichia_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_052349969.1\|124187_125273_+	lytic transglycosylase domain-containing protein	I6ZXX9	Escherichia_phage	2.4e-05	32.7

DBSCAN-SWA_14

128452 : 129625

Geobacillus_virus(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903064.1\|128452_129625_+	transglycosylase SLT domain-containing protein	A0A0H3V0Q1	Geobacillus_virus	8.8e-22	42.4

DBSCAN-SWA_15

143229 : 143976

Vibrio_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903067.1\|143229_143976_+	ParA family protein	Q8H9K9	Vibrio_phage	8.7e-07	26.9

DBSCAN-SWA_16

164119 : 165607

uncultured_Caudovirales_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903014.1\|164119_165607_+	SulP family inorganic anion transporter	A0A2H4J153	uncultured_Caudovirales_phage	1.7e-166	64.5

DBSCAN-SWA_17

170012 : 172783

uncultured_virus(66.67%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_006699024.1\|170012_170327_+	co-chaperone GroES	A0A221S4G8	uncultured_virus	1.8e-22	56.4
WP_022557254.1\|170381_172007_+	chaperonin GroEL	A0A240F779	uncultured_virus	2.3e-177	60.2
WP_022557256.1\|172435_172783_+	hypothetical protein	A0A223W0B5	Agrobacterium_phage	4.7e-24	47.7

DBSCAN-SWA_18

178518 : 188855

uncultured_Caudovirales_phage(20.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557266.1\|178518_180333_+	methyl-accepting chemotaxis protein	A0A2H4J162	uncultured_Caudovirales_phage	5.6e-07	25.4
WP_022557267.1\|180558_182514_-	acetate--CoA ligase	A0A2H4PQU7	Staphylococcus_phage	1.0e-83	37.5
WP_022557268.1\|182858_184481_-	MHS family MFS transporter	NA	NA	NA	NA
WP_022557269.1\|184495_185323_-	alpha/beta fold hydrolase	G9VYU4	Mycobacterium_phage	2.8e-06	23.9
WP_022557271.1\|185577_186936_+	sigma 54-interacting transcriptional regulator	NA	NA	NA	NA
WP_022557272.1\|187007_188210_-	PLP-dependent transferase	A0A0B5JD48	Pandoravirus	5.3e-14	27.9
WP_022557273.1\|188231_188855_-	helix-turn-helix domain-containing protein	I3VYZ1	Thermoanaerobacterium_phage	8.8e-05	41.4

DBSCAN-SWA_19

205084 : 206320

Pandoravirus(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557291.1\|205084_206320_-	aminotransferase class I/II-fold pyridoxal phosphate-dependent enzyme	A0A0B5JD48	Pandoravirus	3.2e-22	30.3

DBSCAN-SWA_20

209335 : 211341

Acanthocystis_turfacea_Chlorella_virus(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903022.1\|209335_210391_+	GDP-mannose 4,6-dehydratase	M1IBC3	Acanthocystis_turfacea_Chlorella_virus	1.0e-125	63.8
WP_048903072.1\|210396_211341_+	GDP-L-fucose synthase	D1LW79	Prochlorococcus_phage	5.1e-89	51.8

DBSCAN-SWA_21

217874 : 218953

Shigella_phage(100.0%)

transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_144115367.1\|217874_218953_-\|transposase	IS3 family transposase	U5P429	Shigella_phage	1.2e-49	40.6

DBSCAN-SWA_22

228921 : 230583

Trichoplusia_ni_ascovirus(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557314.1\|228921_230583_-	SDR family oxidoreductase	Q06VL0	Trichoplusia_ni_ascovirus	2.1e-16	32.7

DBSCAN-SWA_23

235801 : 239095

Synechococcus_phage(33.33%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557320.1\|235801_236341_+	peroxiredoxin	M1TUY3	Synechococcus_phage	9.5e-48	52.9
WP_022557322.1\|237596_237917_+	iron-sulfur cluster assembly accessory protein	A0A2H4N7M3	Lake_Baikal_phage	2.2e-12	40.2
WP_022557323.1\|238030_239095_+	aminotransferase class V-fold PLP-dependent enzyme	A0A141ZJV0	Faustovirus	4.5e-09	24.2

DBSCAN-SWA_24

249489 : 249951

Erythrobacter_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557339.1\|249489_249951_-	MucR family transcriptional regulator	A0A1P8VVG0	Erythrobacter_phage	2.2e-13	36.7

DBSCAN-SWA_25

258680 : 266052

Stx2-converting_phage(25.0%)

transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557351.1\|258680_259676_+\|transposase	IS66 family transposase	A0A0P0ZBS5	Stx2-converting_phage	8.2e-37	42.1
WP_158454204.1\|260201_260375_+	hypothetical protein	NA	NA	NA	NA
WP_022557353.1\|260628_261417_-	ABC transporter permease	NA	NA	NA	NA
WP_022557354.1\|261420_262335_-	nodulation factor ABC transporter ATP-binding protein NodI	W6JKT0	Anomala_cuprea_entomopoxvirus	2.5e-24	29.0
WP_022557355.1\|262331_264047_-	Nodulation protein U	A0A292GAM4	Xanthomonas_phage	3.7e-37	30.5
WP_048903028.1\|264077_264761_-	methyltransferase domain-containing protein	NA	NA	NA	NA
WP_022557357.1\|264798_266052_-	chitooligosaccharide synthase NodC	M1IKD7	Paramecium_bursaria_Chlorella_virus	1.7e-26	27.0

DBSCAN-SWA_26

269503 : 270602

Leptospira_phage(100.0%)

transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_144115371.1\|269503_270602_+\|transposase	IS3 family transposase	S5WIU1	Leptospira_phage	8.8e-48	39.8

DBSCAN-SWA_27

276131 : 281328

Gordonia_phage(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903029.1\|276131_278117_-	acyltransferase	A0A166XZF2	Gordonia_phage	2.8e-60	30.2
WP_048903080.1\|278208_279132_-	LysR family transcriptional regulator	NA	NA	NA	NA
WP_022557371.1\|279107_280037_-	dihydrodipicolinate synthase family protein	NA	NA	NA	NA
WP_022557291.1\|280092_281328_-	aminotransferase class I/II-fold pyridoxal phosphate-dependent enzyme	A0A0B5JD48	Pandoravirus	3.2e-22	30.3

DBSCAN-SWA_28

287275 : 288771

Cedratvirus(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557376.1\|287275_288007_-	ABC transporter ATP-binding protein	A0A285PWH2	Cedratvirus	5.0e-15	28.0
WP_048903031.1\|288003_288771_-	ABC transporter ATP-binding protein	G9BWD6	Planktothrix_phage	1.2e-14	29.0

DBSCAN-SWA_29

304141 : 307475

Staphylococcus_phage(66.67%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557393.1\|304141_305026_-	bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	A0A249XZQ2	Enterococcus_phage	2.3e-30	36.7
WP_048903033.1\|305031_305916_-	formyltetrahydrofolate deformylase	NA	NA	NA	NA
WP_022557395.1\|306028_306733_-	ABC transporter ATP-binding protein	A0A2H4PQG7	Staphylococcus_phage	3.4e-13	26.6
WP_048903034.1\|306725_307475_-	ABC transporter ATP-binding protein	A0A2H4PQG7	Staphylococcus_phage	1.2e-11	25.4

DBSCAN-SWA_30

331335 : 332106

Planktothrix_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557429.1\|331335_332106_+	amino acid ABC transporter ATP-binding protein	G9BWD6	Planktothrix_phage	2.3e-31	39.4

DBSCAN-SWA_31

340609 : 348328

Acanthocystis_turfacea_Chlorella_virus(25.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903087.1\|340609_342322_-	FAD-binding protein	M1I0Y3	Acanthocystis_turfacea_Chlorella_virus	8.1e-08	25.4
WP_022557440.1\|342333_343860_-	acetate CoA-transferase YdiF	NA	NA	NA	NA
WP_022557441.1\|343917_344709_-	SDR family NAD(P)-dependent oxidoreductase	Q06VL0	Trichoplusia_ni_ascovirus	8.0e-11	26.9
WP_048903038.1\|344892_346344_-	amidase	NA	NA	NA	NA
WP_022557443.1\|346340_347375_-	ATP-binding cassette domain-containing protein	G9BWD6	Planktothrix_phage	3.6e-19	32.9
WP_022557444.1\|347371_348328_-	ABC transporter ATP-binding protein	G3M9Y6	Bacillus_virus	8.2e-10	27.6

DBSCAN-SWA_32

353093 : 357989

Enterobacteria_phage(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557450.1\|353093_354146_+	LacI family DNA-binding transcriptional regulator	C6ZCU4	Enterobacteria_phage	1.8e-21	29.0
WP_022557451.1\|354355_354910_-	hemerythrin domain-containing protein	NA	NA	NA	NA
WP_022557452.1\|354906_355650_-	SDR family oxidoreductase	NA	NA	NA	NA
WP_022557453.1\|355743_356115_+	helix-turn-helix transcriptional regulator	NA	NA	NA	NA
WP_022557454.1\|356245_356395_-	hypothetical protein	NA	NA	NA	NA
WP_022557455.1\|356519_357989_-	FAD-binding protein	A0A2P0ZL82	Lactobacillus_phage	2.8e-25	25.8

DBSCAN-SWA_33

362672 : 364175

Planktothrix_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557462.1\|362672_364175_+	amino acid ABC transporter permease/ATP-binding protein	G9BWD6	Planktothrix_phage	1.1e-32	41.9

DBSCAN-SWA_34

375174 : 376677

Staphylococcus_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903094.1\|375174_376677_+	sugar ABC transporter ATP-binding protein	A0A2H4PQG7	Staphylococcus_phage	1.2e-18	28.0

DBSCAN-SWA_35

382767 : 384600

Ostreococcus_lucimarinus_virus(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903040.1\|382767_384600_+	3D-(3,5/4)-trihydroxycyclohexane-1,2-dione acylhydrolase (decyclizing)	A0A0P0CRC5	Ostreococcus_lucimarinus_virus	1.1e-23	23.4

DBSCAN-SWA_36

394107 : 397828

Oenococcus_phage(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557490.1\|394107_395301_-	mandelate racemase/muconate lactonizing enzyme family protein	Q6A202	Oenococcus_phage	9.2e-51	33.6
WP_022557491.1\|395397_396273_-	SMP-30/gluconolactonase/LRE family protein	NA	NA	NA	NA
WP_022557493.1\|396850_397828_-	ABC transporter ATP-binding protein	G3M9Y6	Bacillus_virus	8.4e-26	33.8

DBSCAN-SWA_37

403361 : 404378

Enterobacteria_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557498.1\|403361_404378_+	LacI family DNA-binding transcriptional regulator	C6ZCU4	Enterobacteria_phage	1.8e-18	27.9

DBSCAN-SWA_38

408020 : 410454

Escherichia_phage(50.0%)

transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557501.1\|408020_408806_+	DeoR/GlpR transcriptional regulator	A0A077SK06	Escherichia_phage	2.7e-19	28.7
WP_111818031.1\|409241_410454_+\|transposase	IS3 family transposase	A0A1B0Z042	Pseudomonas_phage	7.9e-66	48.8
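
Each Protein_ID packs the accession, coordinates, strand and any key-protein label into one field, and the region position matches the lowest start and highest end among its proteins. The sketch below parses the two rows of this region; it assumes the `accession\|start_end_strand\|keywords` layout seen in this report, and the helper names are illustrative only.

```python
from typing import List, NamedTuple, Tuple


class ProteinCoords(NamedTuple):
    accession: str
    start: int
    end: int
    strand: str
    keywords: Tuple[str, ...]


def parse_protein_id(protein_id: str) -> ProteinCoords:
    """Split a Protein_ID such as 'WP_x.1\\|409241_410454_+\\|transposase'."""
    # The report escapes the pipe as '\|'; normalise it before splitting.
    fields = protein_id.replace("\\|", "|").split("|")
    start, end, strand = fields[1].split("_")
    keywords = tuple(fields[2].split(",")) if len(fields) > 2 else ()
    return ProteinCoords(fields[0], int(start), int(end), strand, keywords)


def region_span(protein_ids: List[str]) -> Tuple[int, int]:
    """Return (lowest start, highest end) over all proteins in a region."""
    coords = [parse_protein_id(p) for p in protein_ids]
    return min(c.start for c in coords), max(c.end for c in coords)


# Protein_ID values copied from DBSCAN-SWA_38.
rows = [
    r"WP_022557501.1\|408020_408806_+",
    r"WP_111818031.1\|409241_410454_+\|transposase",
]
print(region_span(rows))  # (408020, 410454)
```

The result agrees with the region position 408020 : 410454 listed for DBSCAN-SWA_38.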

DBSCAN-SWA_39

417316 : 421575

Cedratvirus(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557511.1\|417316_418225_+	alpha/beta hydrolase	A0A2R8FCY4	Cedratvirus	1.5e-05	29.0
WP_022557515.1\|419376_419703_+	hypothetical protein	NA	NA	NA	NA
WP_048903045.1\|419820_420414_+	CGNR zinc finger domain-containing protein	NA	NA	NA	NA
WP_022557517.1\|421161_421575_-	DUF1810 domain-containing protein	A0A2H4UVK5	Bodo_saltans_virus	7.6e-13	41.8

DBSCAN-SWA_40

425826 : 429037

Orpheovirus(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557526.1\|425826_427494_+	FAD-dependent oxidoreductase	A0A2I2L5E1	Orpheovirus	9.9e-19	27.6
WP_022557528.1\|428056_429037_+	alpha/beta hydrolase	A0A286MQ79	Mycobacterium_phage	7.4e-14	30.0

DBSCAN-SWA_41

442952 : 443765

Halovirus(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557546.1\|442952_443765_-	CbbQ/NirQ/NorQ/GpvN family protein	R4TG24	Halovirus	4.8e-11	38.2

DBSCAN-SWA_42

450813 : 451791

Phage_TP(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557556.1\|450813_451791_+	U32 family peptidase	Q6DW11	Phage_TP	1.3e-18	34.5

DBSCAN-SWA_43

455058 : 460629

Bacillus_virus(50.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903049.1\|455058_455811_+	ATP-binding cassette domain-containing protein	G3M9Y6	Bacillus_virus	3.8e-26	32.4
WP_022557562.1\|455807_456995_+	NnrS family protein	NA	NA	NA	NA
WP_022557563.1\|457176_457362_+	periplasmic nitrate reductase, NapE protein	NA	NA	NA	NA
WP_034499579.1\|457369_457870_+	ferredoxin-type protein NapF	NA	NA	NA	NA
WP_003520944.1\|457862_458150_+	chaperone NapD	NA	NA	NA	NA
WP_022557565.1\|458124_460629_+	periplasmic nitrate reductase subunit alpha	A0A077SK27	Escherichia_phage	1.3e-09	22.9

DBSCAN-SWA_44

464576 : 465353

Sinorhizobium_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557577.1\|464576_465353_-	SOS response-associated peptidase	A0A291AUP1	Sinorhizobium_phage	7.3e-41	39.3

DBSCAN-SWA_45

474887 : 475877

Salmonella_phage(100.0%)

transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557583.1\|474887_475877_+\|transposase	IS5 family transposase	A0A1B0VFY5	Salmonella_phage	6.4e-50	37.9

DBSCAN-SWA_46

484764 : 500735

Yellowstone_lake_phycodnavirus(16.67%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557591.1\|484764_486519_-	thiamine pyrophosphate-binding protein	A0A0P0YLY7	Yellowstone_lake_phycodnavirus	1.5e-38	24.7
WP_022557592.1\|486648_487902_-	FAD-binding oxidoreductase	NA	NA	NA	NA
WP_173402668.1\|488094_488856_-	ATP-binding cassette domain-containing protein	G9BWD6	Planktothrix_phage	2.3e-31	42.1
WP_022557594.1\|488890_489550_-	amino acid ABC transporter permease	NA	NA	NA	NA
WP_048903053.1\|489618_490305_-	aspartate/glutamate racemase family protein	NA	NA	NA	NA
WP_022557596.1\|490405_491251_-	transporter substrate-binding domain-containing protein	NA	NA	NA	NA
WP_022557597.1\|491455_492355_+	LysR family transcriptional regulator	Q6JIH3	Burkholderia_virus	5.3e-19	26.2
WP_022557599.1\|493503_494358_+	transketolase	A0A2K9L6P9	Tupanvirus	2.9e-30	32.3
WP_022557600.1\|494354_495329_+	putative transketolase, alpha subunit	NA	NA	NA	NA
WP_022557601.1\|495412_496891_-	NAD-dependent succinate-semialdehyde dehydrogenase	NA	NA	NA	NA
WP_022557602.1\|496908_497943_-	L-idonate 5-dehydrogenase	A0A0G2YAX3	Acanthamoeba_polyphaga_mimivirus	9.5e-12	28.0
WP_022557603.1\|498022_498856_+	shikimate dehydrogenase	NA	NA	NA	NA
WP_022557604.1\|498852_499923_+	glycerol dehydrogenase	NA	NA	NA	NA
WP_022557605.1\|499925_500735_+	glucose 1-dehydrogenase	Q06VL0	Trichoplusia_ni_ascovirus	1.8e-18	32.1

DBSCAN-SWA_47

510904 : 512047

Planktothrix_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048903099.1\|510904_512047_-	ABC transporter ATP-binding protein	G9BWD6	Planktothrix_phage	2.4e-24	35.5

DBSCAN-SWA_48

525294 : 530349

Trichoplusia_ni_ascovirus(33.33%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_077981974.1\|525294_526044_+	SDR family oxidoreductase	Q06VL0	Trichoplusia_ni_ascovirus	3.1e-12	29.4
WP_022557631.1\|526047_526944_+	NAD(P)-dependent oxidoreductase	NA	NA	NA	NA
WP_022557632.1\|526940_527840_+	branched-chain amino acid ABC transporter permease	NA	NA	NA	NA
WP_144115389.1\|527892_528819_+	branched-chain amino acid ABC transporter permease	NA	NA	NA	NA
WP_022557634.1\|528818_529595_+	ABC transporter ATP-binding protein	A0A2H4PQG7	Staphylococcus_phage	1.9e-12	25.6
WP_022557635.1\|529587_530349_+	ABC transporter ATP-binding protein	W6JKT0	Anomala_cuprea_entomopoxvirus	1.1e-12	25.4

DBSCAN-SWA_49

540965 : 548398

Pseudomonas_phage(33.33%)

transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_111818031.1\|540965_542178_+\|transposase	IS3 family transposase	A0A1B0Z042	Pseudomonas_phage	7.9e-66	48.8
WP_022557646.1\|542661_543153_+	DUF4142 domain-containing protein	NA	NA	NA	NA
WP_022557647.1\|543454_543886_-	type II toxin-antitoxin system VapC family toxin	NA	NA	NA	NA
WP_022557648.1\|543882_544134_-	type II toxin-antitoxin system VapB family antitoxin	NA	NA	NA	NA
WP_022557651.1\|544628_544895_-	type II toxin-antitoxin system Phd/YefM family antitoxin	NA	NA	NA	NA
WP_022557652.1\|545033_545885_-	2-hydroxy-3-oxopropionate reductase	A0A077SLF7	Escherichia_phage	3.1e-24	28.3
WP_006699134.1\|545888_546674_-	hydroxypyruvate isomerase	NA	NA	NA	NA
WP_022557653.1\|546685_548398_-	ABC transporter ATP-binding protein	G9BWD6	Planktothrix_phage	1.2e-16	30.3

DBSCAN-SWA_50

554952 : 555966

Enterobacteria_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557658.1\|554952_555966_+	LacI family DNA-binding transcriptional regulator	C6ZCU4	Enterobacteria_phage	2.6e-14	25.9

DBSCAN-SWA_51

559242 : 561177

uncultured_Caudovirales_phage(100.0%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557666.1\|559242_561177_-	HAMP domain-containing protein	A0A2H4J162	uncultured_Caudovirales_phage	1.6e-12	33.0

DBSCAN-SWA_52

566467 : 577024

Escherichia_phage(16.67%)

transposase

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022557673.1\|566467_567181_+\|transposase	IS6 family transposase	A0A077SL39	Escherichia_phage	2.6e-16	33.5
WP_084317470.1\|568518_568827_-	hypothetical protein	NA	NA	NA	NA
WP_022557677.1\|569147_571799_+	DNA ligase D	A0A291AUV6	Sinorhizobium_phage	1.8e-46	38.9
WP_022557679.1\|572096_572396_-	DUF982 domain-containing protein	NA	NA	NA	NA
WP_003585428.1\|572464_572941_-	Hsp20 family protein	A0A2L0V0Y9	Agrobacterium_phage	6.1e-22	42.5
WP_006699351.1\|573110_574142_-	ATP-dependent DNA ligase	A0A068CDF3	Rhizobium_phage	6.3e-133	68.7
WP_022557681.1\|574138_574951_-	Ku protein	NA	NA	NA	NA
WP_022557682.1\|574962_575862_-	Ku protein	A0A0A1EPK3	Mycobacterium_phage	3.0e-14	27.7
WP_080823558.1\|576062_576239_+	hypothetical protein	NA	NA	NA	NA
WP_022557684.1\|576247_577024_+	exodeoxyribonuclease III	A0A0N9QXX6	Chrysochromulina_ericina_virus	9.6e-17	26.7

Anti-CRISPR protein detection

Acr ID	Acr position	Acr size	Homology with known anti	Neighbor HTH/AcRanker	Neighbor Aca	In prophage	Protospacer in prophage

3. NC_022535

CRISPR-Cas detection and classification

Self-targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_ID	Protospacer_location	Mismatch	Identity

MGE targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_phage_ID	Hit_phage_def	Protospacer_location	Mismatch	Identity

Prophage detection

Region	Region Position	Protein_number	Hit_taxonomy	Key_proteins	Att_site

Prophage annotation

DBSCAN-SWA_1

925740 : 935557

Paracoccus_phage(33.33%)

portal,tail

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022555858.1\|925740_926907_+\|portal	phage portal protein	W8ECU7	Geobacillus_phage	3.5e-63	39.0
WP_022555859.1\|927095_927416_+	hypothetical protein	NA	NA	NA	NA
WP_022555861.1\|928151_928559_+\|tail	phage major tail protein, TP901-1 family	NA	NA	NA	NA
WP_006700098.1\|928558_928918_+	gene transfer agent family protein	NA	NA	NA	NA
WP_084317300.1\|928914_929244_+\|tail	phage tail assembly chaperone	NA	NA	NA	NA
WP_004440761.1\|929183_929765_+\|tail	phage tail tape measure protein	C0LP53	Escherichia_virus	4.5e-11	43.1
WP_022555863.1\|929863_930502_+	TIGR02217 family protein	A0A0B5A2K3	Paracoccus_phage	6.4e-51	49.3
WP_022555864.1\|930498_931305_+	DUF2163 domain-containing protein	A0A0K1Y6Z2	Rhodobacter_phage	1.6e-30	37.0
WP_022555865.1\|931301_931736_+	C40 family peptidase	F4YXU4	Roseobacter_phage	4.0e-36	47.1
WP_022555866.1\|931753_935557_+\|tail	glycoside hydrolase/phage tail family protein	A0A0B5A7K5	Paracoccus_phage	2.2e-215	36.8

DBSCAN-SWA_2

1418490 : 1430543

Agrobacterium_phage(78.57%)

capsid

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_048902683.1\|1418490_1418832_-	hypothetical protein	A0A223W046	Agrobacterium_phage	5.1e-31	62.5
WP_022556140.1\|1418824_1419895_-	hypothetical protein	A0A223W0B5	Agrobacterium_phage	1.4e-188	97.1
WP_022556141.1\|1419777_1420464_-	hypothetical protein	A0A223W0Q1	Agrobacterium_phage	3.8e-102	96.3
WP_022556142.1\|1420463_1421165_-	hypothetical protein	A0A223W0X8	Agrobacterium_phage	6.4e-121	91.0
WP_022556143.1\|1421168_1421606_-	hypothetical protein	A0A2L0V102	Agrobacterium_phage	6.4e-26	96.6
WP_022556144.1\|1421668_1422130_-	hypothetical protein	A0A223W0C1	Agrobacterium_phage	2.1e-72	94.1
WP_022556145.1\|1422202_1423300_-\|capsid	N4-gp56 family major capsid protein	A0A2L0V108	Agrobacterium_phage	1.2e-211	99.7
WP_022556146.1\|1423448_1424201_+	Secretion activator protein	A0A223W052	Agrobacterium_phage	3.1e-105	99.6
WP_022556147.1\|1424197_1424722_+	hypothetical protein	A0A223W0D2	Agrobacterium_phage	4.5e-87	98.3
WP_022556148.1\|1424721_1425054_+	hypothetical protein	A0A223W041	Agrobacterium_phage	6.5e-55	100.0
WP_022556149.1\|1425258_1425489_+	AlpA family phage regulatory protein	A0A2L0V109	Agrobacterium_phage	5.5e-37	98.7
WP_004441749.1\|1425726_1426065_-	hypothetical protein	NA	NA	NA	NA
WP_006697463.1\|1426380_1427469_+	septal ring lytic transglycosylase RlpA family protein	F5B3X9	Synechococcus_phage	7.1e-18	50.0
WP_035208907.1\|1427540_1428713_+	D-alanyl-D-alanine carboxypeptidase	NA	NA	NA	NA
WP_004441759.1\|1428850_1429525_+	dTMP kinase	M1PSC7	Streptococcus_phage	1.1e-37	42.7
WP_022556152.1\|1429517_1430543_+	DNA polymerase III subunit delta'	M1NSC1	Streptococcus_phage	2.2e-05	28.5

DBSCAN-SWA_3

1591458 : 1599715

uncultured_Mediterranean_phage(66.67%)

tRNA

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022556252.1\|1591458_1592991_+	AMP-binding protein	A0A2K9L3I8	Tupanvirus	2.8e-20	24.4
WP_022556253.1\|1592974_1594252_+	type III PLP-dependent enzyme	NA	NA	NA	NA
WP_048902825.1\|1594252_1595410_-	aminodeoxychorismate synthase component I	S4VT78	Pandoravirus	1.2e-42	43.1
WP_004442089.1\|1595601_1596732_-\|tRNA	tRNA guanosine(34) transglycosylase Tgt	A0A1B1IVQ4	uncultured_Mediterranean_phage	2.1e-105	53.0
WP_022556255.1\|1596731_1597814_-\|tRNA	tRNA preQ1(34) S-adenosylmethionine ribosyltransferase-isomerase QueA	NA	NA	NA	NA
WP_022556256.1\|1598076_1598586_-	peptidylprolyl isomerase	A0A1B1IVS0	uncultured_Mediterranean_phage	1.1e-45	58.8
WP_004442095.1\|1598622_1599192_-	peptidylprolyl isomerase	A0A1B1IVS0	uncultured_Mediterranean_phage	8.5e-47	53.8
WP_006698870.1\|1599220_1599715_-	pantetheine-phosphate adenylyltransferase	A0A1B1IVQ3	uncultured_Mediterranean_phage	1.5e-26	37.7

DBSCAN-SWA_4

1616093 : 1624495

uncultured_Mediterranean_phage(75.0%)

tRNA

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_037093448.1\|1616093_1617659_-	peptidoglycan DD-metalloendopeptidase family protein	I3PV24	Clostridium_phage	1.1e-11	30.4
WP_022556269.1\|1618003_1618657_-	protein-L-isoaspartate(D-aspartate) O-methyltransferase	A0A1J0MC37	Streptomyces_phage	1.0e-11	35.1
WP_004442133.1\|1618653_1619424_-	5'/3'-nucleotidase SurE	A0A1B1ITZ2	uncultured_Mediterranean_phage	1.8e-23	29.8
WP_004442134.1\|1619597_1620881_-\|tRNA	serine--tRNA ligase	A0A1B1IVT2	uncultured_Mediterranean_phage	1.8e-97	45.8
WP_022556270.1\|1620982_1621786_-	twin-arginine translocase subunit TatC	A0A1B1IVR7	uncultured_Mediterranean_phage	2.4e-42	40.7
WP_022556271.1\|1621782_1622529_-	twin-arginine translocase subunit TatB	NA	NA	NA	NA
WP_003509722.1\|1622585_1622792_-	twin-arginine translocase TatA/TatE family subunit	A0A1B1IVR9	uncultured_Mediterranean_phage	2.1e-08	69.8
WP_048902690.1\|1622922_1623657_-	SMC-Scp complex subunit ScpB	A0A1B1IVT7	uncultured_Mediterranean_phage	3.4e-40	47.1
WP_022556273.1\|1623649_1624495_-	segregation/condensation protein A	A0A1B1IVW1	uncultured_Mediterranean_phage	8.5e-35	34.7

DBSCAN-SWA_5

1761573 : 1768497

Rhizobium_phage(66.67%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022556376.1\|1761573_1762230_-	putative deoxynucleotide monophosphate kinase	V9QJ84	Rhizobium_phage	1.6e-25	44.8
WP_052349964.1\|1762707_1762983_-	hypothetical protein	NA	NA	NA	NA
WP_022556379.1\|1762988_1763189_-	hypothetical protein	NA	NA	NA	NA
WP_048902829.1\|1763185_1763845_-	hypothetical protein	B4UTT6	Rhizobium_phage	2.7e-60	61.8
WP_052349965.1\|1763913_1764279_-	hypothetical protein	A4JWM9	Burkholderia_virus	5.9e-33	65.5
WP_158454197.1\|1764548_1764698_+	hypothetical protein	NA	NA	NA	NA
WP_022556383.1\|1764702_1765956_-	hypothetical protein	B4UTT2	Rhizobium_phage	8.1e-183	78.3
WP_022556384.1\|1766008_1766674_-	trypsin-like peptidase domain-containing protein	B4UTS4	Rhizobium_phage	2.0e-31	50.6
WP_022556386.1\|1767054_1767255_+	hypothetical protein	NA	NA	NA	NA
WP_006697486.1\|1767244_1767721_+	hypothetical protein	NA	NA	NA	NA
WP_022556387.1\|1767723_1768497_+	SOS response-associated peptidase	A0A291AUP1	Sinorhizobium_phage	7.8e-43	39.8

DBSCAN-SWA_6

1783178 : 1822766

Rhizobium_phage(53.85%)

portal,tail,terminase,capsid,head

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_022556408.1\|1783178_1785524_-\|tail	phage tail length tape measure family protein	A0A2D2W291	Sinorhizobium_phage	5.1e-53	31.8
WP_144115320.1\|1785538_1785913_+	hypothetical protein	NA	NA	NA	NA
WP_022556410.1\|1786090_1786801_+	hypothetical protein	NA	NA	NA	NA
WP_084317319.1\|1786883_1787087_-\|tail	phage tail assembly chaperone	NA	NA	NA	NA
WP_022556411.1\|1787083_1787443_-	hypothetical protein	NA	NA	NA	NA
WP_013636135.1\|1787444_1787876_-\|tail	phage tail protein	NA	NA	NA	NA
WP_022556412.1\|1787910_1788321_-	DUF3168 domain-containing protein	I3UM06	Rhodobacter_phage	1.0e-09	37.0
WP_022556413.1\|1788320_1788815_-	HK97 gp10 family phage protein	NA	NA	NA	NA
WP_022556414.1\|1788811_1789168_-\|head,tail	head-tail adaptor protein	NA	NA	NA	NA
WP_022556416.1\|1789293_1789641_-\|head,tail	phage gp6-like head-tail connector protein	NA	NA	NA	NA
WP_006697512.1\|1789642_1789984_-	hypothetical protein	A0A0H4INV0	Stenotrophomonas_phage	1.1e-06	52.6
WP_022556417.1\|1789961_1790222_-	hypothetical protein	NA	NA	NA	NA
WP_022556418.1\|1790269_1791478_-\|capsid	phage major capsid protein	Q6JIM7	Burkholderia_virus	4.8e-116	59.7
WP_022556419.1\|1791489_1792608_-	S49 family peptidase	K4HZZ6	Acidithiobacillus_phage	4.3e-34	34.7
WP_022556420.1\|1792617_1793847_-\|portal	phage portal protein	A0A1V0E8B9	Vibrio_phage	4.0e-65	36.4
WP_022556421.1\|1793871_1795515_-\|terminase	terminase large subunit	B4UTP0	Rhizobium_phage	1.9e-224	75.4
WP_022556422.1\|1795527_1795983_-	hypothetical protein	B4UTN9	Rhizobium_phage	2.0e-51	70.2
WP_048902713.1\|1796101_1796422_-	HNH endonuclease	NA	NA	NA	NA
WP_022556424.1\|1796482_1796905_+	hypothetical protein	NA	NA	NA	NA
WP_022556425.1\|1797213_1798158_-	hypothetical protein	NA	NA	NA	NA
WP_022556426.1\|1798195_1798669_-	VRR-NUC domain-containing protein	B4UTZ1	Rhizobium_phage	1.3e-48	71.8
WP_022556428.1\|1799036_1799984_-	hypothetical protein	NA	NA	NA	NA
WP_022556429.1\|1800213_1801581_-	putative transcriptional regulator	Q8W6N3	Burkholderia_virus	8.2e-88	41.5
WP_022556430.1\|1801661_1803884_-	bifunctional DNA primase/polymerase	A0A1B1INT2	uncultured_Mediterranean_phage	7.4e-54	35.4
WP_022556431.1\|1803961_1804783_+	HNH endonuclease	NA	NA	NA	NA
WP_022556432.1\|1804788_1805163_-	hypothetical protein	B4UTY6	Rhizobium_phage	1.1e-31	54.4
WP_022556433.1\|1805159_1805393_-	hypothetical protein	A0A1X9SGK7	Bradyrhizobium_phage	2.5e-05	51.1
WP_022556434.1\|1805389_1806448_-	site-specific DNA-methyltransferase	F4YCV3	Synechococcus_phage	3.2e-87	52.6
WP_048902834.1\|1806447_1807851_-	helicase	F8TUJ9	EBPR_podovirus	6.0e-118	48.9
WP_144115322.1\|1807958_1808708_+	hypothetical protein	NA	NA	NA	NA
WP_022556437.1\|1808711_1809239_-	siphovirus Gp157 family protein	NA	NA	NA	NA
WP_022556438.1\|1809411_1809861_+	hypothetical protein	NA	NA	NA	NA
WP_022556439.1\|1809893_1811648_-	DEAD/DEAH box helicase	B4UTX6	Rhizobium_phage	1.3e-138	43.9
WP_022556440.1\|1811707_1812199_+	hypothetical protein	NA	NA	NA	NA
WP_022556441.1\|1812195_1812732_-	sigma-70 family RNA polymerase sigma factor	B4UTX5	Rhizobium_phage	1.5e-08	29.7
WP_022556442.1\|1812752_1813682_-	hypothetical protein	B4UTX3	Rhizobium_phage	1.1e-155	83.1
WP_022556443.1\|1813722_1814778_-	DNA polymerase III subunit beta	B4UTW9	Rhizobium_phage	9.8e-97	56.0
WP_022556444.1\|1814835_1815669_-	hypothetical protein	K4FB09	Cronobacter_phage	1.4e-34	39.2
WP_022556445.1\|1815729_1816308_-	DUF669 domain-containing protein	NA	NA	NA	NA
WP_022556447.1\|1816490_1817264_-	ATP-binding protein	B4UTW5	Rhizobium_phage	9.3e-121	80.6
WP_022556448.1\|1817329_1817557_-	hypothetical protein	NA	NA	NA	NA
WP_022556449.1\|1817696_1818008_-	hypothetical protein	NA	NA	NA	NA
WP_022556450.1\|1818068_1818959_-	hypothetical protein	B4UTW2	Rhizobium_phage	4.2e-08	43.3
WP_022556451.1\|1818958_1819240_-	hypothetical protein	B4UTW1	Rhizobium_phage	7.4e-28	61.5
WP_022556452.1\|1819236_1819587_-	hypothetical protein	B4UTW0	Rhizobium_phage	6.3e-08	44.4
WP_048902835.1\|1819782_1820292_+	hypothetical protein	B4UTV8	Rhizobium_phage	2.1e-36	44.3
WP_022556454.1\|1820915_1821575_+	helix-turn-helix transcriptional regulator	NA	NA	NA	NA
WP_022556456.1\|1822018_1822372_+	hypothetical protein	NA	NA	NA	NA
WP_048902716.1\|1822514_1822766_+	hypothetical protein	V9QKZ2	Rhizobium_phage	3.1e-25	64.3

Anti-CRISPR protein detection

Acr ID	Acr position	Acr size	Homology with known anti	Neighbor HTH/AcRanker	Neighbor Aca	In prophage	Protospacer in prophage
