Gene expression is the process by which genetic information flows from DNA to RNA to the protein. The translation of DNA into RNA is termed transcription; protein synthesis from RNA templates is called translation. Details on gene expression and transcription can be found in a separate article.
Translation is carried out by ribosomes, which are large molecular complexes of ribosomal RNA (rRNA) and proteins. Ribosomes bind to RNA templates, also termed messenger RNA (mRNA), and catalyze the formation of a polypeptide based on this template. In the process, a charged transfer RNA (tRNA) recognizes a nucleotide triplet of mRNA that matches a specific amino acid (AA). The new AA is then linked to the next AA of the growing polypeptide on the ribosome. Translation ends once a specific nucleotide sequence of the mRNA is reached (a stop codon). The ribosome subsequently dissociates and the mRNA and newly synthesized protein are released. Before proteins are functional, a proper shape and destination are both necessary. Proteins begin to fold into their three-dimensional structure during translation according to the AA sequence and local chemical forces and reactions. Various specialized proteins (folding catalysts, chaperones) also help the newly formed proteins to fold properly and reach their correct destinations (e.g., cytosol, organelles, extracellular matrix) via protein modifications. The translation rate of proteins is adjusted to the current conditions of the cell and bodily demands, and is affected by the presence or absence of certain nutrients.
The genetic code
- Description: relationship between the DNA (or mRNA) nucleotide sequence and the respective amino acid sequence of a protein
- Codon: sequence of three nucleotides (a triplet) of mRNA that codes for a specific AA
- Anticodon: sequence of three nucleotides in tRNA that is complementary to the codon on mRNA
- The codon and anticodon are always paired in an antiparallel manner.
- There are 64 combinations of codons: 61 for 20 AAs (including the start codon), and 3 for stop codons
- Notable codons
- Start codon: AUG (located at the 5' end of mRNA), initiates translation of the mRNA
- Stop codons: UGA, UAG, or UAA end translation
|Features of the genetic code|
|Unambiguity (genetic code)|| |
|Commaless, non-overlapping (genetic code)|| |
|Redundancy, degeneracy (genetic code)|| |
|Universality (genetic code)|
Start codon: AUG AUthorizes Growth.
- Description: binding of tRNA to its corresponding AA
- Relevant enzymes: aminoacyl-tRNA synthetases
- Reaction mechanism
- Mischarged tRNA
Ribosome binding sites
- See “Ribosomes” for more details.
- The ribosome is composed of rRNA and proteins that form subunits.
- Binding site for mRNA is on the small ribosomal subunit.
Both ribosomal subunits jointly form the binding sites for tRNA.
- Aminoacyl site (A site)
- Peptidyl site (P site)
- Exit site (E site)
- Because of the length of most mRNA, more than one ribosome can bind them, allowing synthesis of multiple polypeptides at once = polysome.
- Description: assembly of functional ribosomes with the help of initiation factors (IFs): and recognition of the start codon (AUG) on the mature mRNA by the initiator methionyl-tRNA (met-tRNA)
- Initiator met-tRNA, eukaryotic IF2 (eIF2), and GTP bind to the small ribosomal subunit to form a preinitiation complex (initially a 43s preinitiation complex).
mRNA is recognized by eIF4 and binds to the preinitiation complex, leading to the formation of the 48s preinitiation complex; eIF4 recognizes mRNA
- Usually at the 5' cap in eukaryotes
- Sometimes at an internal ribosome entry site (IRES)
- Initiator met-tRNA recognizes the start codon (typically the first AUG triplet after the 5' cap of the mRNA) and binds the P site.
Shine-Dalgarno sequence: ribosomal binding site contained in prokaryotic mRNA that enables the initiation of protein synthesis by aligning start codon and ribosome
- Generally found ∼ 8 bases upstream of start codon
- Common in bacteria
- GTP hydrolysis provides energy for the release of eIF2, allowing the large and small ribosomal subunits to assemble into a functional ribosome (the final initiation complex; 80S in eukaryotes, 70S in prokaryotes).
- Directions of processes
- Initiator met-tRNA is located at the P-site, or another previously matching aminoacyl-tRNA is bound there.
- An aminoacyl-tRNA complex with eukaryotic elongation factor 1 (eEF1) hydrolyzes GTP, thereby releasing eEF1 and GDP and providing the energy for aminoacyl-tRNA to bind the A site (anticodon matches the codon of the mRNA).
- The polypeptide is elongated by the stepwise addition of AAs via between the AAs bound to the A-sites and P-site (via tRNA).
- Bond created by a dehydration reaction catalyzed by a peptidyl transferase that is intrinsic to the rRNA (“ribozyme”) of the large complex
- The ribosome moves one triplet along the mRNA in the 3' direction.
- After translocation, the tRNA that was in the A-site is now in the P-site, and the tRNA that was in the P-site is now in the E-site.
- The unloaded tRNA is released from the E-site.
- A release factor recognizes the stop codon, halts translation, and hydrolytically cleaves the peptidyl tRNA bonds (requires GTP), leading to release of the protein.
Protein folding and misfolding
Description: The process by which a protein goes from an unfolded native state to form a three-dimensional structure via progressive stabilization of the intermediate states until the most favorable energy level is achieved.
- Correct protein folding begins during translation and is required for a protein to perform its function within the cell or organism.
- Intrinsic factors
Regulatory proteins: proteins that help other proteins to form their native structure
- Intracellular protein complexes that prevent protein aggregation during synthesis (thus prevent making proteins nonfunctional) and permit refolding of misfolded proteins in a protected environment
- Assist in transporting (precursor) proteins
- ATP is consumed in the process.
- Example: heat shock proteins (e.g., Hsp70, Hsp60, Hsp90)
Folding catalysts: enzymes that accelerate rate-limiting steps during protein folding
- Protein disulfide-isomerase: Catalyzes the formation of thermodynamically favorable disulfide bonds within proteins (in proteins that possess more than two cysteine residues), if less energetically favorable disulfide bonds are formed.
- Prolyl isomerase: Assists in finding the energetically favorable conformation of a peptide bond with the AA proline.
- Chaperone proteins
- Description: lack of folding or non-native protein folding as a result of denaturing factors
- Mechanisms of misfolding
- Intracellular reaction: Misfolded proteins are identified and either rescued by chaperones or are ubiquitinated and labeled for proteasomal degradation (see “P ”).
- Consequences of misfolding
- Many proteins require specific covalent alterations (co- or posttranslational modifcations) in addition to correct folding to function properly.
- Examples include glycosylation, lipid anchors, phosphorylation, acetylation, ubiquitination, ADP-ribosylation, biotinylation, carboxylation, methylation, and hydroxylation.
- Enzymatic attachment of a carbohydrate to specific AA side chains of proteins via N-glycosidic (N-linked glycosylation) or O-glycosidic bonds (O-linked glycosylation), forming a glycoprotein
- Glycoproteins are usually cell membrane, lysosomal, or secretory proteins (e.g., serum proteins such as erythropoietin).
|N-linked vs. O-linked glycosylation|
|N-linked glycosylation||O-linked glycosylation|
|% of all glycosylations|| || |
|Amino acids involved|
|Carbohydrate side chains|| || |
|Sugar residues involved|| || |
|Enzymes/molecules involved|| |
Enzymatic glycosylation should not be confused with nonenzymatic glycation. In glycation, aldoses (e.g., glucose) spontaneously bind to the amino groups of proteins and may influence their function. A classic example is HbA1c, whose function is unaffected by glycation.
- Acylation: linkage with long chain fatty acids, e.g., palmitic acid
Isoprenylation: linkage of a cysteine side chain of the protein with polyisoprene via a thioester bond, such as in:
- Farnesylation: linkage with a farnesyl residue (three isoprene units, total of 15 C atoms)
- Geranylgeranylation: linkage with a geranylgeranyl residue (four isoprene units, total of 20 C atoms)
- GPI anchor: linkage with glycosylphosphatidylinositol (GPI), a glycolipid
Reversible covalent alterations
Enzymatic reversible protein modification alters the protein's spatial structure (conformation), thereby allowing its activity to be regulated. For example, a protein may interact with other proteins and/or become recognizable as a substrate. Reversible protein modification essentially allows the protein to be switched on or off.
- Phosphorylation: attachment of phosphate residues to the hydroxyl group (OH group) of serine or threonine or of tyrosine residues by kinases
- Acetylation: linkage with a CO-CH3 group by acetyltransferases
- Ubiquitination: attachment of ubiquitin (a small protein) to the ε-amino group of lysine residues in proteins, especially in proteins to be degraded (e.g., cell cycle regulation via ubiquitination and degradation of cyclins)
- SUMOylation: attachment of a small ubiquitin-like modifier (SUMO) protein to lysine residues (similar to ubiquitination)
- ADP-ribosylation: transfer of an ADP-ribose residue from NAD+ by ADP-ribosyltransferase (e.g., histone modification by poly ADP-ribosylation)
- Biotinylation: attachment of biotin (e.g., pyruvate carboxylase uses a biotin cofactor to catalyze the ATP-dependent carboxylation of pyruvate to oxaloacetate)
- Carboxylation: attachment of a carboxylic acid group (R–COOH) (e.g., vitamin K-dependent carboxylation is involved in liver synthesis of clotting factors that bind Ca2+)
- Hydroxylation: attachment of hydroxy groups (-OH), typically to proline and sometimes lysine (e.g., important for collagen synthesis to ensure crosslinking outside the cell )
- Methylation: attachment of methyl groups
Irreversible covalent alterations
|Overview of protein transportation|
|Cytosol|| || |
|Mitochondria, peroxisomes, or nucleus|| || |
|Cell membrane, lysosomes, or extracellular (secretory proteins)|| |
- Proteins that leave the cell (secretory proteins), as well as membrane and lysosomal proteins, are initially synthesized on free ribosomes of the cytosol.
- However, their synthesis is paused shortly after starting and the ribosome is transported to the cytosolic side of the rough ER.
- Protein synthesis then recommences and the protein is directly synthesized into the ER lumen.
- Initiation of translation on the free ribosomes in the cytosol
- If a signal sequence (specific amino acid sequence of 9–12 amino acids) is synthesized, it is bound to a signal recognition particle (SRP, a cytosolic ribonucleoprotein ).
- SRP induces a pause in translation and transports the ribosome with the peptide chain (polypeptide-ribosome complex) across the ER membrane.
- SRP facilitates binding of the ribosome with the signal peptide to the SRP receptor on the ER membrane.
- SRP and the SRP receptor are both bound to GTP, which is hydrolyzed to GDP; SRP is released and can bind to a new signal sequence.
- The ribosome is transferred to a translocon, a protein-lined channel composed of a complex of proteins spanning the ER membrane, with opening of the translocon channel. This translocon is termed the sec61 channel.
- Translation resumes and the protein is synthesized in the ER lumen.
- The signal sequence is cut off from the growing protein by a signal peptidase.
- After termination of translation, the ribosome is released into the cytosol.
- The translocon channel closes and the synthesized protein is left in the ER. During translation, the protein is folded into its native conformation.
- Protein modification in the ER
Proteins destined for the cell membrane are directly anchored in the ER membrane.
- Process: Hydrophobic segments are usually present in the amino acid sequence, which are then integrated into the ER membrane.
- Soluble proteins for lysosomes or proteins destined to leave the cell via exocytosis remain following completion of translation until they are transported into the ER lumen.
- All newly synthesized proteins in the ER are transported via the Golgi network to their target destination by transport vesicles.
- Lysosomal proteins contain a mannose 6-phosphate residue, which is recognized and bound by the membrane-bound mannose 6-phosphate receptor in the trans-Golgi network and transports the proteins to lysosomes in vesicles.
- Soluble proteins destined to remain in the ER are labeled by the signal sequence KDEL (encodes the letters of the amino acids lysine-aspartate-glutamate-leucine) on their C-terminus.
- Only properly folded proteins can leave the ER and travel to their final destination.
- Proteins destined for the cell membrane are directly anchored in the ER membrane.
- Main component: initiation phase of translation (regulation via initiation factors)
- Decreased formation of the ternary complex of eIF2, GTP, and initiating tRNA through phosphorylation of the initiation factor eIF2
- Phosphorylated eIF2 in the GDP-bound form can no longer be converted to its active GTP-bound form. The ternary complex necessary for initiating translation can no longer be formed and the translation rate is reduced.
- Regulation of the cap recognition process by eIF4
- Decreased formation of the ternary complex of eIF2, GTP, and initiating tRNA through phosphorylation of the initiation factor eIF2