pfam01867, Cas_Cas1, CRISPR associated protein Cas1. Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. This family of proteins corresponds to Cas1, a CRISPR-associated protein. Cas1 may be involved in linking DNA segments to CRISPR.
cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins. This subfamily consists of uncharacterized hypothetical bacterial proteins with similarity to Escherichia coli sugar kinase ygcE , whose functional roles are not yet clear. Escherichia coli ygcE is recognized by this model, but is not present in the alignment as it contains a deletion relative to other members of the group. These proteins belong to the FGGY family of carbohydrate kinases, the monomers of which contain two large domains, which are separated by a deep cleft that forms the active site. This model includes both the N-terminal domain, which adopts a ribonuclease H-like fold, and the structurally related C-terminal domain.
pfam13280, WYL, WYL domain. WYL is a Sm-like SH3 beta-barrel fold containing domain. It is a member of the WYL-like superfamily, named for three conserved amino acids found in a subset of the superfamily. However, these residues are not strongly conserved throughout the family. Rather, the conservation pattern includes four basic residues and a position often occupied by a cysteine, which are predicted to line a ligand-binding groove typical of the Sm-like SH3 beta-barrels. A WYL domain protein (sll7009) is a negative regulator of the I-D CRISPR-Cas system in Synechocystis sp. It is predicted to be a ligand-sensing domain that could bind negatively charged ligands, such as nucleotides or nucleic acid fragments, to regulate CRISPR-Cas and other defense systems such as the abortive infection AbiG system.
pfam07892, DUF1667, Protein of unknown function (DUF1667). Hypothetical archaeal and bacterial proteins make up this family. A few proteins are annotated as being potential metal-binding proteins, and in fact the members of this family have four highly conserved cysteine residues, but no further literature evidence was found in this regard.
cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas2 is present in majority of CRISPR/Cas systems along with Cas1; RNAse specific to U-rich regions; Possesses an RRM/ferredoxin fold.
pfam18320, Csc2, Csc2 Crispr. The Csc2 Crispr family of proteins forms a core RNA recognition motif-like domain, flanked by three peripheral insertion domains: a lid domain, a Zinc-binding domain and a helical domain. The CRISPR-Cas system is possibly a mechanism of defence against invading pathogens and plasmids that functions analogously to the RNA interference (RNAi) systems in eukaryotes.
cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Diverged DNA helicase Cas3'; signature gene for Type I and subtype I-D.
TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This broadly distributed, highly divergent Cas family is now characterized as an endoribonuclease that generates guide RNAs for host defense against phage and other invaders. The family contains a C-terminal motif GXGXXXXXGXG, where the each X between two Gly is hydrophobic and the spacer XXXXX contains (usually) one Arg or Lys. The seed alignment for the current version of this model has gappy columns removed. Members of this protein family are found associated with several different CRISPR/cas system subtypes, and consequently we designate this family Cas6.
cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons. The polymerase reaction of this enzyme leads to the production of a unique RNA-DNA complex called msDNA (multicopy single-stranded (ss)DNA) in which a small ssDNA branches out from a small ssRNA molecule via a 2'-5'phosphodiester linkage. Bacterial retron RTs produce cDNA corresponding to only a small portion of the retron genome.
TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a widespread family of prokaryotic direct repeats with spacers of unique sequence between consecutive repeats. This protein family is a CRISPR-associated (Cas) family strictly associated with the Cyano subtype of CRISPR/Cas locus, found in several species of Cyanobacteria and several archaeal species. This family is designated Csc1 for CRISPR/Cas Subtype Cyano protein 1, as it is often the first gene upstream of the core cas genes, cas3-cas4-cas1-cas2.
cd07010, cupin_PMI_type_I_N_bac, Phosphomannose isomerase in bacteria and archaea, N-terminal cupin domain. This subfamily contains type I phosphomannose isomerase (PMI; E.C. 5.3.1.8; also known as mannose-6-phosphate isomerase) found in many bacteria (e.g. Bacillus subtilis) and archaea. PMI catalyzes the reversible isomerization of fructose-6-phosphate (F6P) and mannose-6-phosphate (M6P), the first committed step in the synthesis of mannosylated glycoproteins. The active site, located within the N-terminal jelly roll-like beta-barrel cupin fold, contains a single essential zinc atom and forms a deep, open cavity large enough to contain M6P or F6P. PMI type I also has a C-terminal beta-barrel fold which has diverged considerably from the N-terminal domain and is not included here. This subfamily does not contain an alpha helical domain that exists in eukaryotic and some prokaryotic PMIs. F6P is a substrate for glycolysis and gluconeogenesis, while M6P is a substrate for production of activated mannose donor guanosine 5'-diphosphate D-mannose, an important precursor of mannosylated biomolecules such as glycoproteins, bacterial exopolysaccharides and fungal cell wall components. PMI is also essential for survival, virulence and possibly pathogenicity of some bacteria and protozoan parasites, as well as for cell wall integrity of certain yeasts. Thus, PMI is a potential target against fungal infections causing serious illness or death.
pfam03747, ADP_ribosyl_GH, ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisingly the family also includes jellyfish crystallins, these proteins appear to have lost the presumed active site residues.
pfam03961, FapA, Flagellar Assembly Protein A. Members of this family include FapA (flagellar assembly protein A), found in Vibrio vulnificus. The synthesis of flagella allows bacteria to respond to chemotaxis by facilitating motility. Studies examining the role of FapA show that the loss or delocalization of FapA results in a complete failure of the flagellar biosynthesis and motility in response to glucose mediated chemotaxis. The polar localization of FapA is required for flagellar synthesis, and dephosphorylated EIIAGlc (Glucose-permease IIA component) inhibited the polar localization of FapA through direct interaction.
TIGR03824, FlgM_jcvi, flagellar biosynthesis anti-sigma factor FlgM. FlgM interacts with and inhibits the alternative sigma factor sigma(28) FliA. The C-terminus of FlgM contains the sigma(28)-binding domain.
cd19964, PBP1_BMP-like, periplasmic binding component of a basic membrane lipoprotein (BMP) from Aeropyrum pernix K1 and its close homologs in other bacteria. Periplasmic binding component of a family of basic membrane lipoproteins from Aeropyrum pernix K1 and various putative lipoproteins from other bacteria. These outer membrane proteins include Med, a cell-surface localized protein regulating the competence transcription factor gene comK in Bacillus subtilis, and PnrA, a periplasmic purine nucleoside binding protein of an ATP-binding cassette (ABC) transport system in Treponema pallidum. All contain the type 1 periplasmic sugar-binding protein-like fold.
cd06580, TM_PBP1_transp_TpRbsC_like, Transmembrane subunit (TM) of Treponema pallidum (Tp) RbsC-1, RbsC-2 and related proteins. This is a functionally uncharacterized subgroup of TMs which belong to a larger group of TMs of Periplasmic Binding Protein (PBP)-dependent ATP-Binding Cassette (ABC) transporters, which are mainly involved in the uptake of branched-chain amino acids (AAs) or in the uptake of monosaccharides including ribose, galactose, and arabinose, and which generally bind type 1 PBPs. PBP-dependent ABC transporters consist of a PBP, two TMs, and two cytoplasmic ABCs, and are mainly involved in importing solutes from the environment. The solute is captured by the PBP, which delivers it to a gated translocation pathway formed by the two TMs. The two ABCs bind and hydrolyze ATP and drive the transport reaction.
TIGR02550, Flagellar_hook-associated_protein_3, flagellar hook-associated protein 3. This protein family consists of flagellar hook-associated proteins designated FlgL (or HAP3) encoded in bacterial flagellar operons. A N-terminal region of about 150 residues and a C-terminal region of about 85 residues are conserved. Members show considerable length heterogeneity between these two well-conserved terminal regions; the seed alignment 486 columns, 393 of which are represented in the model, while members of this family are from 287 to over 500 residues in length. This model distinguishes FlgL from the flagellin gene product FliC. [Cellular processes, Chemotaxis and motility].
TIGR00135, Glutamyl-tRNAGln_amidotransferase_subunit_C, aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase, C subunit. Archaea, organelles, and many bacteria charge Gln-tRNA by first misacylating it with Glu and then amidating Glu to Gln. This small protein is part of the amidotransferase heterotrimer and appears to be important to the stability of the amidase subunit encode by gatA, but its function may not be required in every organism that expresses gatA and gatB. The seed alignment for this model does not include any eukaryotic sequence and is not guaranteed to find eukaryotic examples, although it does find some. Saccharomyces cerevisiae, which expresses the amidotransferase for mitochondrial protein translation, seems to lack a gatC ortholog. This model has been revised to remove the candidate sequence from Methanococcus jannaschii, now part of a related model. [Protein synthesis, tRNA aminoacylation].
The bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')