Presentation is loading. Please wait.

Presentation is loading. Please wait.

1001 Stories of Protein Folding

Similar presentations


Presentation on theme: "1001 Stories of Protein Folding"— Presentation transcript:

1 1001 Stories of Protein Folding
Talk at U of Maryland CS882, Fall 2006 1001 Stories of Protein Folding Ming Li School of Computer Science University of Waterloo This is Ming Li’s talk 1.

2 By the time I finish telling these protein stories, I hope we know better how to fold them by computers.

3 Prelude: Why should you care?
Through 3 billion years of evolution, nature has created an enormous number of protein structures for different biological functions. Understanding these structures is key to proteomics. Fast computation of protein structures is one of the most important unsolved problems in science today. Much more important than, for example, the P≠NP conjecture. We now have a real chance to solve it. This course: I do ½ of the course, so that we understand everything about proteins. You do ½ of the course, to present all methods for protein folding. 50% marks. You do a final project designing your method for folding proteins. 50% marks.

4 Proteins – the life story
Proteins are building blocks of life. In a cell, 70% is water and 15%-20% are proteins. Examples: hormones – regulate metabolism structures – hair, wool, muscle,… antibodies – immune response enzymes – chemical reactions Sickle-cell anemia: hemoglobin protein is made of 4 chains, 2 alphas and 2 betas. Single mutation from Glu to Val happens at residue 6 of the beta chain. This is recessive. Homozygotes die but Heterozygotes have resistance to malaria, hence it had some evolutionary advantage in Africa.

5 mRNA Protein cDNA Human: 3 billion bases, 30k genes.
E. coli: 5 million bases, 4k genes T A A T C G cDNA T A reverse transcription A T transcription translation G C mRNA Protein C G (20 amino acids) G (A,C,G,U) C Codon: three nucleotides encode an amino acid. 64 codons 20 amino acids, some w/more codes A T C G T A A T

6 They are built from 20 amino acids and fold in space into functional shapes

7 Several polypeptide chains can form more complex structures:

8 What happened in sickle-cell anemia
Mutating to Valine. Hydrophobic patch on the surface. Mutating to Valine. Hydrophobic patch on the surface. Hemoglobin

9 Amino acids stories There are 500 amino acids in nature. Only 20 (22) are used in proteins. The first amino acid was discovered from asparagus, hence called Asparagine, in All 20 amino acids in proteins are discovered by 1935. Traces of glycin, alanine etc were found in a meteorite in Australia in That brings the conjecture that life began from extraterrestrial origin.

10 20 Amino acids – the boring part
Hydrophobic amino acids Alanine Valine Phenylalanine Proline Methionine Isoleucine Lucine Polar amino acids Serine Threonine Tyrosine Histidine Cysteine Asparagine Glutamine Tryptophan Neutral Non-polar Polar: one positive and one negative charged ends, e.g. H2O is polar, oil is non-polar. Charged Amino Acids Aspartic acid Glutamic acid Lysine Arginine Simplest Amino Acid Glycine

11 Why do protein fold? Some philosophy
The folded structure of a protein is actually thermodynamically less favorable because it reduces the disorder or entropy of the protein.  So, why do proteins fold?  One of the most important factors driving the folding of a protein is the interaction of polar and nonpolar side chains with the environment.  Nonpolar (water hating) side chains tend to push themselves to the inside of a protein while polar (water loving) side chains tend to place themselves to the outside of the molecule.  In addition, other noncovalent interactions including electrostatic and van der Waals will enable the protein once folded to be slightly more stable than not.  When oil, a nonpolar, hydrophobic molecule, is placed into water, they push each other away. Since proteins have nonpolar side chains their reaction in a watery environment is similar to that of oil in water.  The nonpolar side chains are pushed to the interior of the protein allowing them to avoid water molecule and giving the protein a globular shape. There is, however, a substantial difference in how the polar side chains react to the water.  The polar side chains place themselves to the outside of the protein molecule which allows for their interact with water molecules by forming hydrogen bonds.  The folding of the protein increases entropy by placing the nonpolar molecules to the inside, which in turn, compensates for the decrease in entropy as hydrogen bonds form with the polar side chains and water molecules.  

12 1 letter label & how to remember them
If only one amino acid begins with a letter, that letter is used: C = Cys = Cysteine H = His = Histidine I = Ile = Isoleucine M = Met = Methionine S = Ser = Serine V = Val = Valine Otherwise the letter is assigned to the more frequent one: A = Ala = Alanine G = Gly = Glycine L = Leu = Leucine P = Pro = Proline T = Thr = Threonine The losers try phonetically F = Phe = Phenylalanine R = Arg = Arginine Y = Tyr = Tyrosine W = Trp = Trptophan (double ring) When everything fails: D = Asp = Aspartic acid N = Asn = Asparagine E = Glu = Glutamic acid Q = Gln = Glutamine K = Lys = Lysine

13 They really look all the same:
One amino acid. The difference is only in the side chain R. Lose H2O Many amino acids connected to a polypeptide chain

14 The amino acids are connected to form polypeptide chains: going from N terminal to C terminal
Planar, rigid, with known bond distances and angles. Lose water H2O when forming the peptide bond

15 They could have been different
L-form vs D-form: Looking down the H-Cα bond from H, the L-form is CORN. The D-form is NRCO All amino acids occur in proteins have L-form. It is unclear why D-form was not chosen Mirror image In functioning proteins, only L-form occur In nature, L, D- forms occur with equal chance.

16 Story of cysteines Two cysteine residues in different (non-adjacent) parts of a protein sequence can be oxidized to form a disulfide bridge, as end product of air oxidation: 2 cysteines + ½ O2 = 2 linked cysteines + H2O They have the functions: Stablize single protein fold Linking two chains (linking A and B chains in insulin)

17 Disulfide bond between two cystines:
SH | CH2 Note: We will not study amino acids one by one, but we will study their structures when we meet them. Red bond connects to Cα

18 The Φ and Ψ angles The angle at N-Cα is Φ angle
The angle at Cα-C’ is Ψ angle No side chain is involved (which is at Cα) These angles determine backbone structure.

19 The Ramachandran plot L-amino acids cannot form
Large left-handed helix, but Gly (also apn, asp) can form short left-handed helix, with side chain forming hydrogen bound with main chain. Red: good Yellow: ok White: forbidden Except Glycine

20 The story of Glycine Glycines have no side-chain (just H), so it can adopt phi and psi angles in all 4 quadrants of the Ramachadran plot. Thus, it frequently occur in turn regions of proteins where any other residue would be sterically hindered. Glycine: H |

21 Staggered carbon atoms for side chains
Most favorable rotations Ethan: CH3CH3 Aligned, too crowded Valine: (b) is more favorable, least crowded


Download ppt "1001 Stories of Protein Folding"

Similar presentations


Ads by Google