Department of Mechanical Engineering

Slides:



Advertisements
Similar presentations
Review.
Advertisements

Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
Protein Secondary Structure II Lecture 2/24/2003.
Proteins Function and Structure.
Proteins. Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Concept 5.4: Proteins have many structures, resulting in a wide range.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Amino Acid and Protein1. 2  The formation of a peptide bond between glycine and alanine is shown in Figure 5.8. The product is called dipeptide, the.
The Structure and Functions of Proteins BIO271/CS399 – Bioinformatics.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
Protein folding kinetics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Proteins Function and Structure. Proteins more than 50% of dry mass of most cells functions include – structural support – storage, transport – cellular.
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
Protein Structure Elements Primary to Quaternary Structure.
Chapter 3 The Chemistry of Organic Molecules
Protein Structure FDSC400. Protein Functions Biological?Food?
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Marlou Snelleman 2012 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Proteins account for more than 50% of the dry mass of most cells
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Proteins account for more than 50% of the dry mass of most cells
Diverse Macromolecules. V. proteins are macromolecules that are polymers formed from amino acids monomers A. proteins have great structural diversity.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
Protein Secondary Structure Lecture 2/19/2003. Three Dimensional Protein Structures Confirmation: Spatial arrangement of atoms that depend on bonds and.
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Protein “folding” occurs due to the intrinsic chemical/physical properties of the 1° structure “Unstructured” “Disordered” “Denatured” “Unfolded” “Structured”
Bioinformatics: Practical Application of Simulation and Data Mining Protein Folding I Prof. Corey O’Hern Department of Mechanical Engineering & Materials.
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
Amino acids and proteins … for AS Biology. Amino acids Proteins are macromolecules consisting of long unbranched chains of amino acids. All amino acids.
BIOL 200 (Section 921) Lecture # 2, June 20, 2006 Reading for lecture 2: Essential Cell Biology (ECB) 2nd edition. Chap 2 pp 55-56, 58-64, 74-75; Chap.
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
1 10/26/2015 MOLECULES. 2 10/26/2015 H 2 N-CH-C-OH O R Monomer E.g. protein Monomer vs polymer amino acid monomer R is a side group.
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Protein Structure 1 Primary and Secondary Structure.
The α-helix forms within a continuous strech of the polypeptide chain 5.4 Å rise, 3.6 aa/turn  1.5 Å/aa N-term C-term prototypical  = -57  ψ = -47 
Protein Structure (Foundation Block) What are proteins? Four levels of structure (primary, secondary, tertiary, quaternary) Protein folding and stability.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
3-D Structure of Proteins
Marlou Snelleman 2011 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
Proteins.
BIOLOGICALLY IMPORTANT MACROMOLECULES PROTEINS. A very diverse group of macromolecules characterized by their functions: - Catalysts - Structural Support.
Chapter 3 Proteins.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Proteins: 3D-Structure Chapter 6 (9 / 17/ 2009)
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Doug Raiford Lesson 14.  Reminder  Involved in virtually every chemical reaction ▪ Enzymes catalyze reactions  Structure ▪ muscle, keratins (skin,
Structure and Function
Mir Ishruna Muniyat. Primary structure (Amino acid sequence) ↓ Secondary structure ( α -helix, β -sheet ) ↓ Tertiary structure ( Three-dimensional.
Protein Structure BL
Proteins account for more than 50% of the dry mass of most cells
Protein Structure and Properties
Transport proteins Transport protein Cell membrane
Proteins account for more than 50% of the dry mass of most cells
Diverse Macromolecules
Haixu Tang School of Inforamtics
Chapter 3 Proteins.
Fig. 5-UN1  carbon Amino group Carboxyl group.
Packet #9 Supplement.
Packet #9 Supplement.
Proteins account for more than 50% of the dry mass of most cells
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
Levels of Protein Structure
Understanding protein folding via free-energy surfaces from theory and experiment  Aaron R Dinner, Andrej Šali, Lorna J Smith, Christopher M Dobson, Martin.
Proteins Proteins have many structures, resulting in a wide range of functions Proteins do most of the work in cells and act as enzymes 2. Proteins are.
The Three-Dimensional Structure of Proteins
Presentation transcript:

Department of Mechanical Engineering Bioinformatics: Practical Application of Simulation and Data Mining Protein Folding I Prof. Corey O’Hern Department of Mechanical Engineering Department of Physics Yale University

Protein Folding Statistical Mechanics Biochemistry Overlap between biochemistry and physics Outline of class: Biochemistry of amino acids/proteins Physics of protein folding/Energy landscape Coarse-Grained Models for Proteins

What are proteins? Linear polymer Folded state Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed of tens (peptides) to thousands (proteins) of monomers Monomers are 20 naturally occurring amino acids Different proteins have different amino acid sequences Structureless, extended unfolded state Compact, ‘unique’ native folded state (with secondary and tertiary structure) required for biological function Sequence determines protein structure (or lack thereof) Proteins unfold or denature with increasing temperature or chemical denaturants

Side chains differentiate amino acid repeat units Amino Acids I N-terminal C C-terminal peptide bonds R Amino acids strung together via peptide bonds; N terminus, C-terminus. variable side chain Side chains differentiate amino acid repeat units Peptide bonds link residues into polypeptides

Amino Acids II Polar-hydrophilic; nonpolar-hydrophobic Acid+ Base-

The Protein Folding Problem: What is ‘unique’ folded 3D structure of a protein based on its amino acid sequence? Sequence  Structure Lys-Asn-Val-Arg-Ser-Lys-Val-Gly-Ser-Thr-Glu-Asn-Ile-Lys- His-Gln-Pro- Gly-Gly-Gly-…

Folding: hydrophobicity, hydrogen bonding, van der Driving Forces Folding: hydrophobicity, hydrogen bonding, van der Waals interactions, … Unfolding: increase in conformational entropy, electric charge… Hydrophobicity index inside H (hydrophobic) outside P (polar)

Higher-order Structure

Secondary Structure: Loops, -helices, -strands/sheets -helix -strand -sheet 5Å Not proline (no hydrogen bonding, steric clash), not glycine(flexibility) for helix formation Right-handed; three turns Vertical hydrogen bonds between NH2 (teal/white) backbone group and C=O (grey/red) backbone group four residues earlier in sequence Side chains (R) on outside; point upwards toward NH2 Each amino acid corresponds to 100, 1.5Å, 3.6 amino acids per turn (,)=(-60,-45) -helix propensities: Met, Ala, Leu, Glu 5-10 residues; peptide backbones fully extended NH (blue/white) of one strand hydrogen-bonded to C=O (black/red) of another strand C ,side chains (yellow) on adjacent strands aligned; side chains along single strand alternate up and down (,)=(-135,-135) -strand propensities: Val, Thr, Tyr, Trp, Phe, Ile

Backbonde Dihedral Angles

Ramachandran Plot: Determining Steric Clashes Non-Gly Gly Backbone dihedral angles  theory  4 atoms define dihedral angle:  PDB CCNC =0,180 CN CC  N CCN vdW radii backbone flexibility < vdW radii

How can structures from PDB exist outside Ramachadran bounds? Studies were performed on alanine dipeptide Fixed bond angle (=110) [105,115] J. Mol. Biol. (1963) 7, 95-99

Side-Chain Dihedral Angles 4: Lys, Arg 5: Arg Side chain: C-CH2-CH2-CH2-CH2-NH3 Use NCCCCCN to define 1, 2, 3, 4

Molecular biologist Biological Physicist Your model is oversimplified and has nothing to do with biology! Your model is too complicated and has no predictive power! Molecular biologist Biological Physicist

Possible Strategies for Understanding Protein Folding For all possible conformations, compute free energy from atomic interactions within protein and protein- solvent interactions; find conformation with lowest free energy…e.g using all-atom molecular dynamics simulations Not possible?, limited time resolution Use coarse-grained models with effective interactions between residues and residues and solvent General, but qualitative

 = # allowed dihedral angles Why do proteins fold (correctly & rapidly)?? Levinthal’s paradox: For a protein with N amino acids, number of backbone conformations/minima  = # allowed dihedral angles How does a protein find the global optimum w/o global search? Proteins fold much faster. Nc~ 3200 ~1095 fold ~ Nc sample ~1083 s vs fold ~ 10-6-10-3 s universe ~ 1017 s

Energy Landscape U, F =U-TS S12 S23 S-1 M2 M3 M1 minimum saddle point Locate M and S by setting gradient = 0 Assess stability using second derivative. M1 minimum all atomic coordinates; dihedral angles saddle point maximum

Roughness of Energy Landscape smooth, funneled rough (Wolynes et. al. 1997)

Folding Pathways dead end similarity to native state

Folding Phase Diagram smooth rough

NP Hard Problem! Open Questions What differentiates the native state from other low-lying energy minima? How many low-lying energy minima are there? Can we calculate landscape roughness from sequence? What determines whether protein will fold to the native state or become trapped in another minimum? What are the pathways in the energy landscape that a given protein follows to its native state? NP Hard Problem!

Digression---Number of Energy Minima for Sticky Spheres Nm Ns Np 4 1 1 sphere packings 5 1 6 6 2 50 7 5 486 Why does number of energy minima grow exponentially with Nm? Systems can be described as collections of weakly correlated subsystems. Torsion and bond bending break energy degeneracies. 8 13 5500 polymer packings 9 52 49029 10 - - Ns~exp(aNm); Np~exp(bNm) with b>a