If the amine and carboxylic acid functional groups in amino acids join together to form amide bonds, a chain of amino acid units, called a peptide, is formed. A simple tetrapeptide structure is shown in the following diagram. By convention, the amino acid component retaining a free amine group is drawn at the left end (the
N-terminus) of the peptide chain, and the amino acid retaining a free carboxylic acid is drawn on the right (the C-terminus). As expected, the free amine and carboxylic acid functions on a peptide chain form a zwitterionic structure at their isoelectric pH. Show
The conformational flexibility of peptide chains is limited chiefly to rotations about the bonds leading to the alpha-carbon atoms. This restriction is due to the rigid nature of the amide (peptide) bond. As shown in the following diagram, nitrogen electron pair delocalization into the carbonyl group results in significant double bond character between the carbonyl carbon and the nitrogen. This keeps the peptide links relatively planar and resistant to conformational change. The color shaded rectangles in the lower structure define these regions, and identify the relatively facile rotations that may take place where the corners meet (i.e. at the alpha-carbon). This aspect of peptide structure is an important factor influencing the conformations adopted by proteins and large peptides. 2. The Primary Structure of PeptidesBecause the N-terminus of a peptide chain is distinct from the C-terminus, a small peptide composed of different aminoacids may have a several constitutional isomers. For example, a dipeptide made from two different amino acids may have two different structures. Thus, aspartic acid (Asp) and phenylalanine (Phe) may be combined to make Asp-Phe or Phe-Asp, remember that the amino acid on the left is the N-terminus. The methyl ester of the first dipeptide (structure on the right) is the artificial sweetener aspartame, which is nearly 200 times sweeter than sucrose. Neither of the component amino acids is sweet (Phe is actually bitter), and derivatives of the other dipeptide (Phe-Asp) are not sweet.A tripeptide composed of three different amino acids can be made in 6 different constitutions, and the tetrapeptide shown above (composed of four different amino acids) would have 24 constitutional isomers. When all twenty of the natural amino acids are possible components of a peptide, the possible combinations are enormous. Simple statistical probability indicates that the decapeptides made up from all possible combinations of these amino acids would total 2010! Natural peptides of varying complexity are abundant. The simple and widely distributed tripeptide glutathione (first entry in the following table), is interesting because the
side-chain carboxyl function of the N-terminal glutamic acid is used for the peptide bond. An N-terminal glutamic acid may also close to a lactam ring, as in the case of TRH (second entry). The abbreviation for this transformed unit is pGlu (or pE), where p stands for "pyro" (such ring closures often occur on heating). The larger peptides in the table also demonstrate the importance of amino acid abbreviations, since a full structural formula for a nonapeptide (or larger) would prove to be
complex and unwieldy. The formulas using single letter abbreviations are colored red. Some Common Natural Peptides
The different amino acids that make up a peptide or protein, and the order in which they are joined together by peptide bonds is referred to as the primary structure. From the examples shown above, it should be evident that it is not a trivial task to determine the primary structure of such compounds, even modestly sized ones. Partial hydrolysis will produce a mixture of shorter peptides and some amino acids. If the primary structures of these fragments are known, it is sometimes possible to deduce part or all of the original structure by taking advantage of overlapping pieces. For example, if a heptapeptide was composed of three glycines, two alanines, a leucine and a valine, many possible primary structures could be written. On the other hand, if partial hydrolysis gave two known tripeptide and two known dipeptide fragments, as shown on the right, simple analysis of the overlapping units identifies the original primary structure. Of course, this kind of structure determination is very inefficient and unreliable. First, we need to know the structures of all the overlapping fragments. Second, larger peptides would give complex mixtures which would have to be separated and painstakingly examined to find suitable pieces for overlapping. It should be noted, however, that modern mass spectrometry uses this overlap technique effectively. The difference is that bond cleavage is not achieved by hydrolysis, and computers assume the time consuming task of comparing a multitude of fragments. 3. N-Terminal Group AnalysisOver the years that chemists have been studying these important natural products, many
techniques have been used to investigate their primary structure or amino acid sequence. Indeed, commercial instruments that automatically sequence peptides and proteins are now available. A few of the most important and commonly used techniques will be described here. Repeated clicking of the "Next Diagram" button displays the mechanism of this important analytical method. 4. C-Terminal Group Analysis Chemical Analysis Enzymatic Analysis 5. Selective Peptide Cleavage
Mechanisms for the enzymatic reactions are not as easily formulated. Other enzymatic cleavages have been developed, but the two listed here will serve to illustrate their application. An Example of Primary Structure AnalysisTo see how these procedures can be combined to elucidate the primary structure of a peptide, consider the melanocyte stimulating hormone isolated from pigs. This octadecapeptide (18 amino acid units) has the composition: Arg,Asp2,Glu2,Gly2,His,Lys2,Met,Phe,Pro3,Ser,Tyr2, and is abbreviated P18. The following diagram, which begins with the results of terminal unit analysis, illustrates the logical steps that could be used to solve the structural problem. By clicking the "Next Stage" button the results and conclusions from each step will be displayed. Comments about each stage are presented under the diagram. a) Cyanogen bromide cleavage gives two peptide fragments, the longer of which has all the units on the C-terminal side of methionine. 6. Cyclic PeptidesIf the carboxyl function at the C-terminus of a peptide forms a peptide bond with the N-terminal amine group a cyclic peptide is formed. Carboxyate and amine functions on side chains may also combine to form rings. Cyclic peptides are most commonly found in microorganisms, and often incorporate some D-amino acids as well as unusual amino acids such as ornithine (Orn). The decapeptide antibiotic gramacidin S, produced by a strain of Bacillus brevis, is one example of this interesting class of natural products. The structure of gramicidin S is shown in the following diagram. The atypical amino acids are colored. When using a shorthand notation for cyclic structures, the top line is written by the usual convention (N-group on the left), but vertical and lower lines must be adjusted to fit the bonding. Arrows on these bonds point in the CO-N direction of each peptide bond. To see a model of another cyclic peptide, having potentially useful medicinal properties Click Here. Structure-Property RelationshipsThe compounds we call proteins exhibit a broad range of physical and biological properties. Two general categories of simple proteins are commonly recognized.
1. The Secondary and Tertiary Structure of Large Peptides and ProteinsThe various properties of peptides and proteins depend not only on their component amino acids and their bonding sequence in peptide chains, but also on the way in which the peptide chains are stretched, coiled and folded in space. Because of their size, the orientational options open to these macromolecules might seem nearly infinite. Fortunately, several factors act to narrow the structural options, and it is possible to identify some common structural themes or secondary structures that appear repeatedly in different molecules. These conformational segments are sometimes described by the dihedral angles Φ & Ψ, defined in the diagram on the right below. Most proteins and large peptides do not adopt completely uniform conformations, and full descriptions of their preferred three dimensional arrangements are defined as tertiary structures.
A. Helical Coiling Examine the drawing activated by the middle button. The N-terminal residue (Ala) is on the left, and the C-terminal Gly on the right. The alpha-helix is right-handed, which means that it rotates clockwise as it spirals away from a viewer
at either end. Other structural features that define an alpha-helix are: the relative locations of the donor and acceptor atoms of the hydrogen bond, the number of amino acid units per helical turn and the distance the turn occupies along the helical axis. The first hydrogen bond (from the N-terminal end) is from the carbonyl group of the alanine to the N-H group of the phenylalanine. Three amino acids, Thr, Gly & Ala, fall entirely within this turn. Parts of the N-terminal alanine acceptor
and the phenylalanine donor also fall within this helical turn, and careful analysis of the structure indicates there are 3.6 amino acid units per turn. The distance covered by the turn is 5.4 Å. Using the dihedral angle terminology noted above, a perfect α-helix has Φ = -58º and Ψ = -47º. In natural proteins the values associated with α-helical conformations range from -57 to -70º for Φ, and from -35 to -48º for Ψ. To examine a model of this alpha-helix, click on the green circle. Once this
display is activated, the important hormone insulin may be shown by clicking the appropriate button in the blue-shaded rows. B. β-Pleated Sheets Some proteins have layered stacks of β-sheets, which impart structural integrity and may open to form a cavity (a beta barrel). An example is human retinol binding protein, which has a cavity formed by eight β-sheet strands. A model of
this interesting protein may be displayed by clicking the upper button in the blue-shaded rows. C. Other Structures Additional ExamplesA full description and discussion of protein structure is beyond the scope of this text, but a few additional examples will be instructive. In addition to the tertiary structures that will be displayed, attention must also be given to the way in which peptide structures may aggregate to form dimeric, trimeric and tetrameric clusters.
These assemblies, known as quaternary structures, have characteristic properties different from their monomeric components. The examples of mellitin, collagen and hemoglobin, shown below demonstrate this feature. Glycoproteins, incorporating polysaccharide prosthetic groups (e.g. collagen and mucus). The seven illustrations shown below identify a set of peptides and proteins that may be examined as Jmol models by clicking on a selected picture. Endothelin
& Angiogenin are small peptides that have important and selective physiological properties.
2. Quaternary Structures of ProteinsMany proteins are actually assemblies of several polypeptides, which in the context of the larger aggregate are known as protein subunits. Such multiple-subunit proteins possess a quaternary structure, in addition to the tertiary structure of the subunits. The subunits of a quaternary structure are held together by the same forces that are responsible for tertiary structure stabilization. These include hydrophobic attraction of nonpolar side chains in contact regions of the subunits, electrostatic interactions between ionic groups of opposite charge: hydrogen bonds between polar groups; and disulfide bonds. Examples of proteins having a quaternary (or quartary) structure include hemoglobin, HIV-1 protease and the insulin hexamer. The hemoglobin molecule is an assembly of four protein subunits, two alpha units and two beta units. Each protein chain folds into a set of alpha-helix structural segments connected together in a globin arrangement, so called because this arrangement is the same folding motif used in other heme/globin proteins such as myoglobin. This folding pattern contains a pocket which strongly binds the heme group. The four polypeptide chains are bound to each other by salt bridges, forming a tetrameric quaternary structure. A model of hemoglobin was shown above, and may also be examined by clicking the image on the left.In animals, hemoglobin transports oxygen from the lungs or gills to the rest of the body, where it releases the oxygen for cell use. Hemoglobin's oxygen-binding capacity is decreased in the presence of carbon monoxide because both gases compete for the same binding sites on hemoglobin. The binding affinity of hemoglobin for CO is 200 times greater than its affinity for oxygen. When hemoglobin combines with CO, it forms a very bright red compound called carboxyhemoglobin, which may cause the skin of CO poisoning victims to appear pink in death. In heavy smokers, up to 20% of the oxygen-active sites can be blocked by CO. Similarly, hemoglobin has a competitive binding affinity for cyanide, sulfur monoxide, nitrogen dioxide and sulfides including hydrogen sulfide . All of these bind to the heme iron without changing its oxidation state, causing grave toxicity. Insulin is a peptide hormone composed of 51 amino acids, with a molecular weight of 5808 Da. Insulin has a strong effect on metabolism and other body functions, causing cells in the liver, muscle, and fat tissue to take up glucose from the blood, storing it as glycogen in the liver and muscle. Insulin is formed in the islets of Langerhans in the pancreas. The molecular structure of insulin varies slightly between species of animals. Porcine (pig) insulin is especially close to the human version. Insulin molecules have a tendency to form dimers in solution due to hydrogen-bonding between the C-termini of B chains. In the presence of zinc ions, insulin dimers associate into hexamers. Insulin is stored in the body as a hexamer, whereas the active form is the monomer. These interactions have important clinical ramifications. Monomers and dimers readily diffuse into blood; hexamers diffuse poorly. By clicking the image on the far left, a model of the insulin monomer will be displayed . A model of the hexamer will be shown by clicking its image. HIV-1 protease is an enzyme made by the HIV virus that is essential for it's life-cycle. The virus makes certain proteins that need to be cleaved or cut, in order to transform into functional proteins that enable the virus to infect new cells. HIV-1 protease cleaves the nascent proteins into their functional form. The enzyme is composed of two symmetrically related subunits, shown here in cartoon backbone representation to highlight the secondary structure. Each subunit consists of the same small chain of 99 amino acids, which come together in such as way as to form a tunnel where they meet. The protein to be cleaved sits in this tunnel, which houses the active site of the enzyme. Two Asp-Thr-Gly catalytic triads, one on each chain, compose the active site. The two Asp's act as the main catalytic agents, and together with a water molecule cleave the protein chain bound in the tunnel. Without effective HIV-1 protease, HIV virions remain uninfectious. Because of its role in HIV replication, HIV-1 protease has been a target for antiviral drugs. Such drugs function as inhibitors, binding to the active site by mimicking the tetrahedral intermediate of its substrate, thus disabling the enzyme. The structure of one such inhibitor, BEA388, will be displayed on the left by clicking here. TropomyosinThe following animation shows a segment of the fibrous protein tropomyosin, a common muscle regulator. The peptide chains are largely alpha-helices. These are wrapped in superhelix pairs, which are then aligned in a parallel array.If animation is not occurring, click on the drawing or reload. Peptide SynthesisIn order to synthesize a peptide from its component amino acids, two obstacles must be overcome. The first of these is statistical in nature, and is illustrated by considering the dipeptide Ala-Gly as a proposed target. If we ignore the chemistry involved, a mixture of equal molar amounts of
alanine and glycine would generate four different dipeptides. These are: Ala-Ala, Gly-Gly, Ala-Gly & Gly-Ala. In the case of tripeptides, the number of possible products from these two amino acids rises to eight. Clearly, some kind of selectivity must be exercised if complex mixtures are to be avoided. From the perspective of an organic chemist, peptide synthesis requires selective acylation of a free amine. To accomplish the desired amide bond formation, we must first deactivate all extraneous amine functions so they do not compete for the acylation reagent. Then we must selectively activate the designated carboxyl function
so that it will acylate the one remaining free amine. Fortunately, chemical reactions that permit us to accomplish these selections are well known. The strategy for peptide synthesis, as outlined here, should now be apparent. The following example shows a selective synthesis of the dipeptide Ala-Gly. An important issue remains to be addressed. Since the N-protective group is an amide, removal of this function might require conditions that would also cleave the just formed peptide bond. Furthermore, the harsh conditions often required for amide hydrolysis might cause extensive racemization of the amino acids in the resulting peptide. This problem strikes at the heart of our strategy, so it is important to give careful thought to the design of specific N-protective groups. In particular, three qualities are desired: 1) The protective amide should be easy to attach to amino acids. A number of protective groups that satisfy these conditions have been devised; and two of the most widely used, carbobenzoxy (Cbz) and t-butoxycarbonyl (BOC or t-BOC), are described here. The reagents for introducing these N-protective groups are the acyl chlorides or anhydrides shown in the left portion of the above diagram. Reaction with a free amine function of an amino acid occurs rapidly to give the "protected" amino acid derivative shown in the center. This can then be used to form a peptide (amide) bond to a second amino acid. Once the desired peptide bond is created the protective group can be removed under relatively mild non-hydrolytic conditions. Equations showing the protective group removal will be displayed above by clicking on the diagram. Cleavage of the reactive benzyl or tert-butyl groups generates a common carbamic acid intermediate (HOCO-NHR) which spontaneously loses carbon dioxide, giving the corresponding amine. If the methyl ester at the C-terminus is left in place, this sequence of reactions may be repeated, using a different N-protected amino acid as the acylating reagent. Removal of the protective groups would then yield a specific tripeptide, determined by the nature of the reactants and order of the reactions. The synthesis of a peptide of significant length (e.g. ten residues) by this approach requires many steps, and the product must be carefully purified after each step to prevent unwanted cross-reactions. To facilitate the tedious and time consuming purifications, and reduce the material losses that occur in handling, a clever modification of this strategy has been developed. This procedure, known as the Merrifield Synthesis after its inventor R. Bruce Merrifield, involves attaching the C-terminus of the peptide chain to a polymeric solid, usually having the form of very small beads. Separation and purification is simply accomplished by filtering and washing the beads with appropriate solvents. The reagents for the next peptide bond addition are then added, and the purification steps repeated. The entire process can be automated, and peptide synthesis machines based on the Merrifield approach are commercially available. A series of equations illustrating the Merrifield synthesis may be viewed by clicking on the following diagram. The final step, in which the completed peptide is released from the polymer support, is a simple benzyl ester cleavage. This is not shown in the display. The Merrifield Peptide SynthesisTwo or more moderately sized peptides can be joined together by selective peptide bond formation, provided side-chain functions are protected and do not interfere. In this manner good sized peptides and small proteins may be synthesized in the laboratory. However, even if chemists assemble the primary structure of a natural protein in this or any other fashion, it may not immediately adopt its native secondary, tertiary and quaternary structure. Many factors, such as pH, temperature and inorganic ion concentration influence the conformational coiling of peptide chains. Indeed, scientists are still trying to understand how and why these higher structures are established in living organisms. DenaturationThe natural or native structures of proteins may be altered, and their biological activity changed or destroyed by treatment that does not disrupt the primary structure.
This denaturation is often done deliberately in the course of separating and purifying proteins. For example, many soluble globular proteins precipitate if the pH of the solution is set at the pI of the protein. Also, addition of trichloroacetic acid or the bis-amide urea (NH2CONH2) is commonly used to effect protein precipitation. Following denaturation, some proteins will return to their native structures under proper conditions; but extreme conditions, such as
strong heating, usually cause irreversible change.
Not all proteins are easily denatured. As noted above, fibrous proteins such as keratins, collagens and elastins are robust, relatively insoluble, quaternary structured proteins that play important roles in the physical structure of organisms. Secondary structures such as the α-helix and β-sheet take on a dominant role in the architecture and
aggregation of keratins. In addition to the intra- and intermolecular hydrogen bonds of these structures, keratins have large amounts of the sulfur-containing amino acid Cys, resulting in disulfide bridges that confer additional strength and rigidity. The more flexible and elastic keratins of hair have fewer interchain disulfide bridges than the keratins in mammalian fingernails, hooves and claws. Keratins have a high proportion of the smallest amino acid, Gly, as well as the next smallest, Ala.
In the case of β-sheets, Gly allows sterically-unhindered hydrogen bonding between the amino and carboxyl groups of peptide bonds on adjacent protein chains, facilitating their close alignment and strong binding. Fibrous keratin chains then twist around each other to form helical filaments. Globular proteins are more soluble in aqueous solutions, and are generally more sensitive to temperature and pH change than are their fibrous counterparts; furthermore, they do not have the high glycine content or the repetitious sequences of the fibrous proteins. Globular proteins incorporate a variety of amino acids, many with
large side chains and reactive functional groups. The interactions of these substituents, both polar and nonpolar, often causes the protein to fold into spherical conformations which gives this class its name. In contrast to the structural function played by the fibrous proteins, the globular proteins are chemically reactive, serving as enzymes (catalysts), transport agents and regulatory messengers. By clicking the cartoon image on the left, an interactive model of ribonuclease A will be displayed.
This page is the property of William Reusch. Comments, questions and errors should be sent to . Where do unused amino acids go?Dietary protein is used to replace proteins which were previously broken down and used by the body. Extra protein does not get stored. Instead, excess amino acids get converted to carbohydrate or fat.
Are amino acids stored when excess?Amino acids consumed in excess of the amounts needed for the synthesis of nitrogenous tissue constituents are not stored but are degraded; the nitrogen is excreted as urea, and the keto acids left after removal of the amino groups are either utilized directly as sources of energy or are converted to carbohydrate or fat ...
Are amino acids stored for later use?Unlike fat and starch, the human body does not store excess amino acids for later use—the amino acids must be in the food every day.
What are the end products of protein?Hence, the end product of protein digestion is amino acids.
|