TRANSLATION, PART 2
CONTENTS
The
Concept of Gene Expression
Cells and Proteins
Transcription
Amino
Acids are carried by t-RNA
Codon-Anticodon
Match-up
Some
Details of the Cellular Components Used for Protein Synthesis
The Ribosomes
Ribosomal Size
Polyribosomes
and m-RNA
The
Genetic Code of Messenber RNA
64 Possible Codons
Start and
Stop Codons
The Code
is Degenerate
Ribosomal RNA
Transfer RNA
t-RNA
Must Bond to the Correct Amino Acid
t-RNa
Must Bond to the Correct Codon on m-RNA
t-RNA
must Bond to the Correct Spot on the Ribosome
t-RNA
Molecules as "Adaptors"
Translation
Review
of the Components for Protein Synthesis
Initiation
Elongation
The
Forming Polypeptide Assumes its Secondary and Tertiary Structure
Termination
The
Concept of Gene Expression
Cells
and Proteins
Who we are in both
our mental and physical attributes begins with the expression of our genetic
heritage. Our genes must express themselves to provide us with a physical structure
(our anatomy) and with functionality and regulation for our parts (physiology).
Gene expression starts with the code that is the sequence of bases composing
the DNA molecules locked inside the nuclei of our cells. But gene expression
requres the translation of that sequence of bases in DNA into a sequence of
amino acids in proteins. Those proteins, tens of thousands of different ones,
are organized into our living cells and express our genes through their structure
and actions. This is the premiere process occurring in cells; cells devote more
energy to protein synthesis than to any other aspect of metabolism. A typical
bacterial cell (e.g., E. coli) contains about 200,000 ribosomes. Eucaryotic
cells (like yours) contain even more of them. This is no surprise. Our proteins,
interacting with other macromolecules, are the genetic basis of what and who
we are.
Transcription
The first step of translation occurs in the nucleus
and is the transcription of DNA's code onto molecules of messenger RNA (m-RNA).
We considered this process of transcription in the previous exercise of this
WebLab. Then the m-RNA molecules must leave the nucleus, enter the cytoplasm
of the cell, and associate with ribosomes. Ribosomes are the protein synthesizing
factories of cells. But where did the ribosomes come from? They are constructed
from ribosomal RNA (r-RNA) molecules (also constructed from DNA templates in
the cell's nucleus) and proteins.
Amino
Acids are carried by t-RNA
After we imagine m-RNA molecules asociated with
ribosomes ready to produce proteins, we must ask: where do the amino acids come
from and how do they get to the ribosomes? Amino acids come from the proteins
we eat. We digest those proteins into amino acids and absorb them into our blood
from the small intestine. Bacterial cells would get amino acids by absorbing
them from the liquid or solid meduim on which they are living. Once transported
through the cells outer membrane (called the cell membrane or plasma membrane)
the amino acids enter the cell's cytoplasm. Then the amino acids attach to transfer
RNA molecules (t-RNA) that carry individual amino acids to the ribosomes. Transfer
RNA molecules are also constructed from DNA templates in the nucleus.
Codon-Anticodon
Match-up
Another important question is: how are the amino
acids placed in the precise order required by the genetic code? In other words,
how is the primary structure of a protein (polypeptide) insured to come out
correctly? These details are handled by the t-RNA molecules as they link up
with the m-RNA molecules on the ribosomes. Each three bases of m-RNA is called
a codon. Each codon specifies, or stands for, a distinct amino acid.
The t-RNA molecules possess anticodons. These are base sequences complementary
to the genetic code's codons. Codon-anticodon match-up insures that each t-RNA
aligns itself at the right spot along the m-RNA molecule. This insures that
the correct amino acid gets into the right spot in the growing polypeptide chain.
Some Details of the Cellular Components Used for Protein Synthesis
The Ribosomes
Ribosomal Size
Ribosomes found in
all cells are composed of approximately 50% ribosomal RNA and 50% protein. They
all have the same basic three dimensional structure. There are just differences
in overall size. The largest ribosomes come from eucaryotic cells (these are the
cells from all the kingdoms of living organisms except the two bacterial kingdoms
- the Archaebacteria and the Eubacteria). Ribosomes from procaryotic cells (the
two bacterial kingdoms) are slightly smaller than eucaryotic ribosomes. The smallest
ribosomes are found inside mitochondria and chloroplasts (that these organelles
possess their own ribosomes might come as a surprise to you if you don't already
know about the theory of endosymbiosis; you could look up the word!).
Polyribosomes
and m-RNA
Each ribosome consists of two pieces, each one looking
roughly like an ellipse, that come together just prior to protein synthesis. The
two pieces separate just after protein synthesis is complete. Ribosomes work together
in groups called polyribosomes. A messenger RNA molecule threads itself
along a groove at the junction between the two ribosomal subunits. Then, as it
is being "read" or translated by that ribosome, it emerges out from that ribosome
and threads itself into the next ribosome, and so on. In this way, several ribosomes
are translating the genetic code on the messenger simultaneously, and each ribosome
produces a copy of the specified protein. Very efficient! To visualize this process,
go to
The Control of Gene Expression and scroll down to the second illustration
which consists of three images - OBSERVE THE MIDDLE IMAGE and look for the structures
labeled m-RNA, ribosome, 1st polypeptide, and ribosomal subunits. Ignore the other
labels.
The first ribosome threaded into by the messenger
finishes translation first and the completed protein drops off and the ribosome's
two components separate. Then the second ribosome that the messenger associated
with finishes translation, and its protein drops off and that ribosome's components
separate. And so on.
The Genetic Code of Messenger
RNA
64 Possible Codons
Translation is the process of converting a sequence
of bases in DNA into a sequence of amino acids in a protein. The sequence of bases
transcribed onto m-RNA, and delived to the ribosome as the m-RNA molecule, is
called the genetic code. There are 4 bases used in m-RNA - adenine, guanine,
uracil, and cytosine - and the genetic code is a triplet code, meaning that each
sequence of 3 bases codes for one amino acid. Each group of 3 bases coding for
one amino acid is called a codon. So the first base of the codon can be any one
of the 4 possibilities. Similarly the second base of the codon can be any one
of the 4 bases. And finally, the third position of the codon can be any one of
the 4 bases. This is expressed mathematically as 4 to the third power, or 43.
This is computed as 4 x 4 x 4 which equals 64. What this means is that there are
64 possible codons.
Start and Stop
Codons
Experiments have proven that 61 of the codons are
specific for amino acids. The remaining 3 codons signal polypeptide chain termination
(in other words, stop translation and let the completed protein fall off the ribosome).
Interestingly, the codon for the amino acid, methionine, is the signal to "start"
translation. This means that every protein starts out having methionine as its
first amino acid. However, after translation is completed, the methionine is often
removed in what is called post-translational modification.
The Code is Degenerate
Another aspect of the genetic code is that most
amino acids are coded for by several different codons. Actually, only two amino
acids have just one codon that represents them. This is expected since the genetic
code specifies only 20 amino acids, and yet there are 61 functional codons. So
some amino acids must be coded for by more than one codon! Because of this fact,
the code is said to be degenerate. You can verify this by re-looking at
the What is a
Gene Translation Site and scrolling down to the table presenting the genetic
code. Find out which are the two amino acids that have only one codon representing
them.
Ribosomal RNA
Recall that the ribosomes are composed of about
50% protein and 50% RNA. The type of RNA in the ribosome is called specifically
ribosomal RNA (r-RNA). In eucaryotic cells r-RNA is synthesized in specific regions
of the nucleus called the nucleolus. This area appears as a dark spot or
several spots within the nucleus. The nucleolus (or nucleoli) is a region of the
nucleus where there are many copies of ribosomal RNA-coding genes. After the r-RNA
is synthesized, it is modified and constructed into ribosomal subunits (recall
that each ribosome is composed of two subunits). The subunits can then migrate
out from the nucleus into the cytoplasm where they can associate with each other
to become complete ribosomes - the cellular factories for protein synthesis.
Transfer RNA
Transfer RNA molecules carry amino acids to the
ribosomes where they can be linked together in the proper sequence to form polypeptides.
Transfer RNA molecules possess truly precise and extensive specificity. Take a three dimensional look at a transfer RNA molecule by going to The 3D t-RNA Site.
t-RNA
Must Bond to the Correct Amino Acid
Transfer RNA molecules must be recognized by the
correct enzyme (called an aminoacyl t-RNA synthetase) so they can be loaded
with the correct amino acid.
Just imagine the cytoplasm of a cell where tens
of thousands of amino acid molecules are located. A given t-RNA must locate and
bind the specific one of the 20 different amino acids that it is constructed to
pick up. How can this happen? On the surface of an enzyme specific for that purpose,
of course! So there is a different aminoacyl t-RNA synthetase enzyme for each
different t-RNA molecule and the amino acid to which it binds. So the specific
shape of the t-RNA must conform to a specific shape on the enzyme. Then the enzyme
works in two steps. First the amino acid is activated using a molecule of ATP
(the cell's most commonly used high energy molecule), and then the activated amino
acid is attached to the t-RNA molecule. The resulting molecule is represented
as t-RNA~aa (It is called an aminoacyl-t-RNA. The squiggly line between
the t-RNA and the aa (amino acid) means that the bond is a high energy bond.
t-RNA
Must Bond to the Correct Codon on m-RNA
Transfer RNA molecules possess the correct anticodon
so that, upon reaching the ribosome, they can align with the proper codon on the
messengeg RNA. This assures that the amino acid carried by that t-RNA will be
added to the growing polypeptide in the correct sequence as coded for in the sequence
of m-RNA bases. Since there are 61 codons that actually code for amino acids,
there are 61 anti-codons found on the different types of t-RNA molecules.
t-RNA
Must Bond to the Correct Spot on the Ribosome
Transfer RNA molecules bind to the correct spot
on the ribosomal surface. Compared with the size of an individual t-RNA molecule,
a ribosome is much larger. The t-RNA must "dock" itself into the correct spot.
This requires a complementary shape so that there is correct placement and alignment
with respect to both the m-RNA molecule and the previously attached t-RNA.
t-RNA Molecules
as "Adaptors"
Overall, what we see are transfer RNA molecules
acting as "adaptor" molecules. The t-RNA's are the intermediaries between the
genetic code, on the m-RNA threaded along the ribosomes, and the amino acids that
each t-RNA is carrying. The anticodon area of the t-RNA and the overall, specific
shape of the t-RNA determines its fit onto the ribosome at the exact moment that
the amino acid it is carrying is called for. When you consider that multiple copies
of proteins containing 300 hundred to a thousand amino acids can be synthesized
typically in under a minute, the efficiency of these processes is truly astounding.
Translation
Review
of the Components for Protein Synthesis
The previous discussion has presented details of
the components of cells used for protein synthesis: m-RNA carrying the nuclear
DNA-derived genetic code for how to build a protein, ribosomes as the cytoplasmic
sites for protein synthesis, and t-RNA molecules carrying amino acids to be incorporated
into synthesizing polypeptide chains, and possessing anticodons to match up with
the m-RNA codons.
Initiation
The first step in translation is initiation.
The m-RNA from the nucleus attaches to the smaller of the two ribosomal subunits.
Then the anticodon of the first t-RNA~aa attaches to the start, or initiation,
codon (always AUG) of the messenger RNA. Initiation is completed as the larger
ribosomal subunit is attached to the smaller subunit. All these initial "attachments"
require special protein initiation factors and an energy source. The energy
source is, in this case, a compound related to ATP - it is called GTP (guanosine
triphosphate). So at the end of initiation, we have 1) a complete ribosome (both
subunits have come together), 2) the m-RNA is in place threaded along the ribosomal
surface, and 3) the first t-RNA is already attached. Remember that the first t-RNA
is always the same: the signal to "start" a protein is the codon AUG, which binds
to the t-RNA anticodon, UAC, which is always on a t-RNA carrying methionine.
Elongation
Before beginning to read this section take a look at this Translation Diagram from "To Know Ourselves" by the U.S. Dept.of Energy. You will then have a visual image of what is being described.
The second translational step is called elongation.
This is a step that will repeat itself as many times as is necessary to complete
the synthesis of the polypeptide. With each round of the elongation process, one
additional amino acid is added to the growing polypeptide.
t-RNA binds to the ribosome The
first step in elongation is a t-RNA~aa binding to the ribosome right next to the
preceding t-RNA~aa. Which t-RNA binds second depends on the codon right after
the initiation codon. So the second t-RNA~aa now binds to the ribosome and that
t-RNA is parked right next to the first (initiator) t-RNA.
peptide bond formation
At this point, the second step of elongation is the formation of a peptide bond
between the first and second amino acids that are positioned right next to each
other. This peptide bond formation requires an enzyme, peptidyl transferase, and
an energy source which is GTP.
ribosome shifts its position
The last step in elongation is the physical shifting of the ribosome along the
m-RNA molecule to align the next codon of m-RNA for binding to the anti-codon
of the next (in this case the third) t-RNA. This physical shifting of the ribosome
along the m-RNA requires the energy of GTP. In addition, elongation requires special
proteins called elongation factors.
So at the end of the first round of elongation,
can you picture the ribosome? It has the m-RNA threaded in it, and there are two
t-RNA's attached to it: the initial t-RNA that always codes for methionine, and
the second t-RNA which codes for the second amino acid, which is whatever amino
acid was called for in the second codon of the m-RNA. Which t-RNA is holding the
dipeptide, which is as much of a polypeptide that has been synthesized so far?
The second t-RNA! And at the end of the third round of elongation, the third
t-RNA that has attached to the ribosome will be holding the tripeptide, and so
on.
The
Forming Polypeptide Assumes its Secondary and Tertiary Structure
As each amino acid is added to the growing chain,
the t-RNA molecules that have given up their amino acids drop off the ribosome
and can be used again to pick up other amino acids. As the polypeptide elongates
(from a dipeptide, to tripeptide and up to however long the polypeptide is coded
for by m-RNA) it spirals and pleats (secondary structure), and folds (tertiary
structure) into its natural three dimensional conformation. It does this right
on the surface of the ribosome as the polypeptide is still forming. In the bacterium
E. coli, amino acids are added at a rate of about one each 0.05 seconds, or at
a rate of about 500 amino acids (the size of a typical protein) per 25 seconds.
Termination
The third translational step is called termination.
It occurs when any of the three termination codons (UAA, UAG, or UGA) shows up
along the m-RNA as it is read on the ribosome. The termination codons are recognized
by release factors which add a water molecule to the previous amino acid,
instead of another amino acid. Now the m-RNA detaches from the ribosome, the polypeptide
drops off the ribosome, and the ribosome separates into its two subunits. The
two ribosomal subunits are now ready to begin a new round of protein synthesis
repeating the above steps.
RETURN
TO WEBLAB#5: EXERCISE#2