cd05276, p53_inducible_oxidoreductase, PIG3 p53-inducible quinone oxidoreductase. PIG3 p53-inducible quinone oxidoreductase, a medium chain dehydrogenase/reductase family member, acts in the apoptotic pathway. PIG3 reduces ortho-quinones, but its apoptotic activity has been attributed to oxidative stress generation, since overexpression of PIG3 accumulates reactive oxygen species. PIG3 resembles the MDR family member quinone reductases, which catalyze the reduction of quinone to hydroxyquinone. NAD(P)(H)-dependent oxidoreductases are the major enzymes in the interconversion of alcohols and aldehydes or ketones. Alcohol dehydrogenase in the liver converts ethanol and NAD+ to acetaldehyde and NADH, while in yeast and some other microorganisms ADH catalyzes the conversion acetaldehyde to ethanol in alcoholic fermentation. ADH is a member of the medium chain alcohol dehydrogenase family (MDR), which has a NAD(P)(H)-binding domain in a Rossmann fold of a beta-alpha form. The NAD(H)-binding region is comprised of 2 structurally similar halves, each of which contacts a mononucleotide. A GxGxxG motif after the first mononucleotide contact half allows the close contact of the coenzyme with the ADH backbone. The N-terminal catalytic domain has a distant homology to GroES. These proteins typically form dimers (typically higher plants, mammals) or tetramers (yeast, bacteria), and have 2 tightly bound zinc atoms per subunit, a catalytic zinc at the active site, and a structural zinc in a lobe of the catalytic domain. NAD(H) binding occurs in the cleft between the catalytic and coenzyme-binding domains at the active site, and coenzyme binding induces a conformational closing of this cleft. Coenzyme binding typically precedes and contributes to substrate binding. In human ADH catalysis, the zinc ion helps coordinate the alcohol, followed by deprotonation of a histidine, the ribose of NAD, a serine, then the alcohol, which allows the transfer of a hydride to NAD+, creating NADH and a zinc-bound aldehyde or ketone. In yeast and some bacteria, the active site zinc binds an aldehyde, polarizing it, and leading to the reverse reaction.
pfam12725, DUF3810, Protein of unknown function (DUF3810). This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria. Proteins in this family are typically between 333 and 377 amino acids in length. There is a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterized family of peptidase enzymes.
pfam13338, AbiEi_4, Transcriptional regulator, AbiEi antitoxin. AbiEi_4 is the cognate antitoxin of the type IV toxin-antitoxin 'innate immunity' bacterial abortive infection (Abi) system that protects bacteria from the spread of a phage infection. The Abi system is activated upon infection with phage to abort the cell thus preventing the spread of phage through viral replication. There are some 20 or more Abis, and they are predominantly plasmid-encoded lactococcal systems. TA, toxin-antitoxin, systems on plasmids function by killing cells that lose the plasmid upon division. AbiE phage resistance systems function as novel Type IV TAs and are widespread in bacteria and archaea. The cognate antitoxin is pfam13338.
pfam07929, PRiA4_ORF3, Plasmid pRiA4b ORF-3-like protein. Members of this family are similar to the protein product of ORF-3 found on plasmid pRiA4 in the bacterium Agrobacterium rhizogenes. This plasmid is responsible for tumorigenesis at wound sites of plants infected by this bacterium, but the ORF-3 product does not seem to be involved in the pathogenetic process. Other proteins found in this family are annotated as being putative TnpR resolvases, but no further evidence was found to back this. Moreover, another member of this family is described as a probable lexA repressor and in fact carries a LexA DNA binding domain (pfam01726), but no references were found to expand on this.
cd02612, HAD_PGPPase, phosphatidylglycerol-phosphate phosphatase, similar to Escherichia coli K-12 phosphatidylglycerol-phosphate phosphatase C. This family includes Escherichia coli K-12 phosphatidylglycerol-phosphate phosphatase C, PgpC (previously named yfhB) which catalyzes the dephosphorylation of phosphatidylglycerol-phosphate (PGP) to phosphatidylglycerol (PG). This family belongs to the haloacid dehalogenase-like (HAD) hydrolases, a large superfamily of diverse enzymes that catalyze carbon or phosphoryl group transfer reactions on a range of substrates, using an active site aspartate in nucleophilic catalysis. Members of this superfamily include 2-L-haloalkanoic acid dehalogenase, azetidine hydrolase, phosphonoacetaldehyde hydrolase, phosphoserine phosphatase, phosphomannomutase, P-type ATPases and many others. HAD hydrolases are found in all three kingdoms of life, and most genomes are predicted to contain multiple HAD-like proteins. Members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. HAD hydrolases are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
pfam14029, DUF4244, Protein of unknown function (DUF4244). This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria. Proteins in this family are typically between 66 and 95 amino acids in length. There is a conserved EYA sequence motif.
cd02966, TlpA_like_family, TlpA-like family; composed of TlpA, ResA, DsbE and similar proteins. TlpA, ResA and DsbE are bacterial protein disulfide reductases with important roles in cytochrome maturation. They are membrane-anchored proteins with a soluble TRX domain containing a CXXC motif located in the periplasm. The TRX domains of this family contain an insert, approximately 25 residues in length, which correspond to an extra alpha helix and a beta strand when compared with TRX. TlpA catalyzes an essential reaction in the biogenesis of cytochrome aa3, while ResA and DsbE are essential proteins in cytochrome c maturation. Also included in this family are proteins containing a TlpA-like TRX domain with domain architectures similar to E. coli DipZ protein, and the N-terminal TRX domain of PilB protein from Neisseria which acts as a disulfide reductase that can recylce methionine sulfoxide reductases.
pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III. Phage_holin_3_6 is a family of small hydrophobic proteins with two or three transmembrane domains of the Hol-X family. Holin proteins are produced by double-stranded DNA bacteriophages that use an endolysin-holin strategy to achieve lysis of their hosts. The endolysins are peptidoglycan-degrading enzymes that are usually accumulated in the cytosol until access to the cell wall substrate is provided by the holin membrane lesion.
cd01151, GCD, Glutaryl-CoA dehydrogenase. Glutaryl-CoA dehydrogenase (GCD). GCD is an acyl-CoA dehydrogenase, which catalyzes the oxidative decarboxylation of glutaryl-CoA to crotonyl-CoA and carbon dioxide in the catabolism of lysine, hydroxylysine, and tryptophan. It uses electron transfer flavoprotein (ETF) as an electron acceptor. GCD is a homotetramer. GCD deficiency leads to a severe neurological disorder in humans.
cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain. Members of the MBL-fold metallohydrolase superfamily are mainly hydrolytic enzymes which carry out a variety of biological functions. The class B metal beta-lactamases (MBLs) for which this fold was named perform only a small fraction of the activities included in this superfamily.Activities carried out by superfamily members include class B beta-lactamases, hydroxyacylglutathione hydrolases, AHL (acyl homoserine lactone) lactonases, persulfide dioxygenases, flavodiiron proteins, cleavage and polyadenylation specificity factors such as the Int9 and Int11 subunits of Integrator, Sdsa1-like and AtsA-like arylsulfatases, 5'-exonucleases human SNM1A and yeast Pso2p, ribonuclease J and ribonuclease Z, cyclic nucleotide phosphodiesterases, insecticide hydrolases, and proteins required for natural transformation competence. Classical members of the superfamily are di-, or less commonly mono-, zinc-ion-dependent hydrolases, however the diversity of biological roles is reflected in variations in the active site metallo-chemistry.
TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein. Members of this protein family belong to the MinD/ParA family of P-loop NTPases, and in particular show homology to the CpaE family of pilus assembly proteins (see ). Nearly all members are found, not only in a gene context consistent with pilus biogenesis or a pilus-like secretion apparatus, but also near a DEAD/DEAH-box helicase, suggesting an involvement in DNA transfer activity. The model describes a clade restricted to the Actinobacteria.
TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein. Members of this small, highly hydrophobic protein family occur in a pilus/secretion-like region that usually is next to an uncharacterized DEAH-box helicase, in Actinobacteria. Members show sequence similarity to the TadE-like family described by pfam07811. The function is unknown. [Unknown function, General].
cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes. The need of CoAses mainly arises under conditions of oxidative stress. CoAse has a conserved Nudix fold and requires a single divalent cation for catalysis. In addition to a signature Nudix motif G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val, CoAse contains an additional motif upstream called the NuCoA motif (LLTXT(SA)X3RX3GX3FPGG) which is postulated to be involved in CoA recognition. CoA plays a central role in lipid metabolism. It is involved in the initial steps of fatty acid sythesis in the cytosol, in the oxidation of fatty acids and the citric acid cycle in the mitochondria, and in the oxidation of long-chain fatty acids in peroxisomes. CoA has the important role of activating fatty acids for further modification into key biological signalling molecules.
TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase. Members of this protein family comprise a distinct clade of putative ATPase associated with an integral membrane complex likely to act in pilus formation, secretion, or conjugal transfer. The association of most members with a nearby gene for a DEAH-box helicase suggests a role in conjugal transfer.
cd03450, NodN, NodN (nodulation factor N) contains a single hot dog fold similar to those of the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. Rhizobium and related species form nodules on the roots of their legume hosts, a symbiotic process that requires production of Nod factors, which are signal molecules involved in root hair deformation and meristematic cell division. The nodulation gene products, including NodN, are involved in producing the Nod factors, however the role played by NodN is unclear.
pfam02674, Colicin_V, Colicin V production protein. Colicin V production protein is required in E. Coli for colicin V production from plasmid pColV-K30. This protein is coded for in the purF operon.
cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas2 is present in majority of CRISPR/Cas systems along with Cas1; RNAse specific to U-rich regions; Possesses an RRM/ferredoxin fold.
COG4759, COG4759, Uncharacterized protein conserved in bacteria containing thioredoxin-like domain [Posttranslational modification, protein turnover, chaperones].
TIGR03462, Bifunctional_lycopene_cyclase/phytoene_synthase, lycopene cyclase domain. This domain is often repeated twice within the same polypeptide, as is observed in Archaea, Thermus, Sphingobacteria and Fungi. In the fungal sequences, this tandem domain pair is observed as the N-terminal half of a bifunctional protein, where it has been characterized as a lycopene beta-cyclase and the C-terminal half is a phytoene synthetase. In Myxococcus and Actinobacterial genomes this domain appears as a single polypeptide, tandemly repeated and usually in a genomic context consistent with a role in carotenoid biosynthesis. It is unclear whether any of the sequences in this family truly encode lycopene epsilon cyclases. However a number are annotated as such. The domain is generally hydrophobic with a number of predicted membrane spanning segments and contains a distinctive motif (hPhEEhhhhhh). In certain sequences one of either the proline or glutamates may vary, but always one of the tandem pair appear to match this canonical sequence exactly.
TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc. This model describes a CRISPR-associated putative DEAH-box helicase, or Cas3, of a subtype found in Actinomyces naeslundii MG1, Geobacter sulfurreducens PCA, Gemmata obscuriglobus UQM 2246, and Desulfotalea psychrophila. This protein includes both DEAH and HD motifs.
cd03468, PolY_like, DNA Polymerase Y-family. Y-family DNA polymerases are a specialized subset of polymerases that facilitate translesion synthesis (TLS), a process that allows the bypass of a variety of DNA lesions. Unlike replicative polymerases, TLS polymerases lack proofreading activity and have low fidelity and low processivity. They use damaged DNA as templates and insert nucleotides opposite the lesions. The active sites of TLS polymerases are large and flexible to allow the accomodation of distorted bases. Expression of Y-family polymerases is often induced by DNA damage and is believed to be highly regulated. TLS is likely induced by the monoubiquitination of the replication clamp PCNA, which provides a scaffold for TLS polymerases to bind in order to access the lesion. Because of their high error rates, TLS polymerases are potential targets for cancer treatment and prevention.
TIGR03462, Bifunctional_lycopene_cyclase/phytoene_synthase, lycopene cyclase domain. This domain is often repeated twice within the same polypeptide, as is observed in Archaea, Thermus, Sphingobacteria and Fungi. In the fungal sequences, this tandem domain pair is observed as the N-terminal half of a bifunctional protein, where it has been characterized as a lycopene beta-cyclase and the C-terminal half is a phytoene synthetase. In Myxococcus and Actinobacterial genomes this domain appears as a single polypeptide, tandemly repeated and usually in a genomic context consistent with a role in carotenoid biosynthesis. It is unclear whether any of the sequences in this family truly encode lycopene epsilon cyclases. However a number are annotated as such. The domain is generally hydrophobic with a number of predicted membrane spanning segments and contains a distinctive motif (hPhEEhhhhhh). In certain sequences one of either the proline or glutamates may vary, but always one of the tandem pair appear to match this canonical sequence exactly.
cd09764, Csb3_I-U, CRISPR/Cas system-associated RAMP superfamily protein Csb3. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; RAMP superfamily protein; Might be a catalytically active RNA endoribonuclease.
TIGR02734, Phytoene_desaturase_lycopene-forming, phytoene desaturase. Phytoene is converted to lycopene by desaturation at four (two symmetrical pairs of) sites. This is achieved by two enzymes (crtP and crtQ) in cyanobacteria (Gloeobacter being an exception) and plants, but by a single enzyme in most other bacteria and in fungi. This single enzyme is called the bacterial-type phytoene desaturase, or CrtI. Most members of this family, part of the larger pfam01593, which also contains amino oxidases, are CrtI itself; it is likely that all members act on either phytoene or on related compounds such as dehydrosqualene, for carotenoid biosynthesis. [Biosynthesis of cofactors, prosthetic groups, and carriers, Other].
pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053). This entry is found in CRISPR-associated (cas) proteins in the genomes of Geobacter sulfurreducens PCA and Desulfotalea psychrophila LSv54 (both Desulfobacterales from the Deltaproteobacteria), Gemmata obscuriglobus (a Planctomycete), and Actinomyces naeslundii MG1 (Actinobacteria).
TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4. This model represents a family of proteins associated with CRISPR repeats in a wide set of prokaryotic genomes. This scope of this model has been broadened since it was first built to describe an archaeal subset only. The function of the protein is undefined. Distantly related proteins, excluded from this model, include ORFs from Mycobacteriophage D29 and Sulfolobus islandicus filamentous virus and a region of the Schizosaccharomyces pombe DNA replication helicase Dna2p.
TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system. This model represents a CRISPR-associated protein from the Dpsyc subtype (a type I-C variant), named for Desulfotalea psychrophila LSv54. CRISPR systems confer resistance in prokaryotes to invasive DNA or RNA, including phage and plasmids. CRISPR-associated proteins typically are found near CRISPR repeats and other CRISPR-associated proteins, have low levels of sequence identify, have sequence relationships that suggest lateral transfer, and show some sequence similarity to DNA-active proteins such as helicases and repair proteins.
pfam01867, Cas_Cas1, CRISPR associated protein Cas1. Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. This family of proteins corresponds to Cas1, a CRISPR-associated protein. Cas1 may be involved in linking DNA segments to CRISPR.
cd02651, nuc_hydro_IU_UC_XIUA, nuc_hydro_IU_UC_XIUA: inosine-uridine preferring, xanthosine-inosine-uridine-adenosine-preferring and, uridine-cytidine preferring nucleoside hydrolases. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains proteins similar to nucleoside hydrolases which hydrolyze both pyrimidine and purine ribonucleosides: the inosine-uridine preferring nucleoside hydrolase from Crithidia fasciculata, the inosine-uridine-xanthosine preferring nucleoside hydrolase RihC from Escherichia coli and the xanthosine-inosine-uridine-adenosine-preferring nucleoside hydrolase RihC from Salmonella enterica serovar Typhimurium. This group also contains proteins similar to the pyrimidine-specific uridine-cytidine preferring nucleoside hydrolases URH1 from Saccharomyces cerevisiae, E. coli RihA and E. coli RihB. E. coli RihA is equally efficient with uridine and cytidine, E. coli RihB prefers cytidine over uridine. S. cerevisiae URH1 prefers uridine over cytidine. .
cd01004, PBP2_MidA_like, Mimosine binding domain of ABC-type transporter MidA and similar proteins; the type 2 periplasmic binding protein fold. This subgroup includes the periplasmic binding component of ABC transporter involved in uptake of mimosine MidA and its similar proteins. This periplasmic binding domain belongs to the type 2 periplasmic-binding fold protein (PBP2) superfamily, whose members are involved in chemotaxis and uptake of nutrients and other small molecules from the extracellular space as a primary receptor. PBP2 typically comprises of two globular subdomains connected by a flexible hinge and bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they can interact with a cognate membrane transport complex comprised of two integral membrane domains and two receptor cytoplasmically-located ATPase domains. This interaction triggers the ligand translocation across the cytoplasmic membrane energized by ATP hydrolysis.
cd19127, AKR_AKR5B1, AKR5B family of aldo-keto reductase (AKR). Pseudomonas putida morphine 6-dehydrogenase (M6DH) is a founding member of the aldo-keto reductase family 5 member B1 (AKR5B1). M6DH (EC 1.1.1.218), also called naloxone reductase, oxidizes the C-6 hydroxy group of morphine and codeine.
cd17504, MFS_MMR_MDR_like, Methylenomycin A resistance protein (also called MMR peptide)-like multidrug resistance (MDR) transporters of the Major Facilitator Superfamily. This subfamily is composed of putative multidrug resistance (MDR) transporters including Chlamydia trachomatis antiseptic resistance protein QacA_2, and Serratia sp. DD3 Bmr3. MDR transporters are drug/H+ antiporters (DHA) that mediate the efflux of a variety of drugs and toxic compounds, and confer resistance to these compounds. This subfamily belongs to the Methylenomycin A resistance protein (also called MMR peptide) and similar multidrug resistance (MDR) transporters (MMR-like MDR transporter) family of the Major Facilitator Superfamily (MFS) of transporters. MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement.
TIGR01995, beta-glucosides_PTS_EIIBCA, PTS system, beta-glucoside-specific IIABC component. This model represents a family of PTS enzyme II proteins in which all three domains are found in the same polypeptide chain and which appear to have a broad specificity for beta-glucosides including salicin (beta-D-glucose-1-salicylate) and arbutin (Hydroquinone-O-beta-D-glucopyranoside). These are distinct from the closely related sucrose-specific and trehalose-specific PTS transporters.
pfam06889, DUF1266, Protein of unknown function (DUF1266). This family consists of several hypothetical bacterial proteins of around 235 residues in length. Members of this family seem to be found exclusively in the Enterobacteria Salmonella typhimurium and Escherichia coli. The function of this family is unknown.
pfam00582, Usp, Universal stress protein family. The universal stress protein UspA is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. UspA enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. The crystal structure of Haemophilus influenzae UspA reveals an alpha/beta fold similar to that of the Methanococcus jannaschii MJ0577 protein, which binds ATP, though UspA lacks ATP-binding activity.
TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin. Several proteins, such as protein disulfide isomerase, have two or more copies of a domain closely related to thioredoxin. This model is designed to recognize authentic thioredoxin, a small protein that should be hit exactly once by this model. Any protein that hits once with a score greater than the second (per domain) trusted cutoff may be taken as thioredoxin. [Energy metabolism, Electron transport].
The bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')