Bioinformatics: Practical Application of Simulation and Data Mining Protein Folding I Prof. Corey O’Hern Department of Mechanical Engineering & Materials.

Slides:



Advertisements
Similar presentations
Review.
Advertisements

Review of Basic Principles of Chemistry, Amino Acids and Proteins Brian Kuhlman: The material presented here is available on the.
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Concept 5.4: Proteins have many structures, resulting in a wide range of functions.
Proteins Function and Structure.
Proteins. Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Concept 5.4: Proteins have many structures, resulting in a wide range.
The Structure and Functions of Proteins BIO271/CS399 – Bioinformatics.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
Amino Acids and Proteins 1.What is an amino acid / protein 2.Where are they found 3.Properties of the amino acids 4.How are proteins synthesized 1.Transcription.
Protein folding kinetics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Proteins Function and Structure. Proteins more than 50% of dry mass of most cells functions include – structural support – storage, transport – cellular.
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
Protein Structure Elements Primary to Quaternary Structure.
Chapter 3 The Chemistry of Organic Molecules
Protein Structure FDSC400. Protein Functions Biological?Food?
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Marlou Snelleman 2012 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Proteins account for more than 50% of the dry mass of most cells
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Proteins account for more than 50% of the dry mass of most cells
Diverse Macromolecules. V. proteins are macromolecules that are polymers formed from amino acids monomers A. proteins have great structural diversity.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
Protein Secondary Structure Lecture 2/19/2003. Three Dimensional Protein Structures Confirmation: Spatial arrangement of atoms that depend on bonds and.
Proteins: Secondary Structure Alpha Helix
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
Protein “folding” occurs due to the intrinsic chemical/physical properties of the 1° structure “Unstructured” “Disordered” “Denatured” “Unfolded” “Structured”
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
Amino acids and proteins … for AS Biology. Amino acids Proteins are macromolecules consisting of long unbranched chains of amino acids. All amino acids.
BIOL 200 (Section 921) Lecture # 2, June 20, 2006 Reading for lecture 2: Essential Cell Biology (ECB) 2nd edition. Chap 2 pp 55-56, 58-64, 74-75; Chap.
Department of Mechanical Engineering
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Protein Structure 1 Primary and Secondary Structure.
The α-helix forms within a continuous strech of the polypeptide chain 5.4 Å rise, 3.6 aa/turn  1.5 Å/aa N-term C-term prototypical  = -57  ψ = -47 
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
Marlou Snelleman 2011 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
5.4: Proteins Introduction
Proteins.
BIOLOGICALLY IMPORTANT MACROMOLECULES PROTEINS. A very diverse group of macromolecules characterized by their functions: - Catalysts - Structural Support.
Chapter 3 Proteins.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Doug Raiford Lesson 14.  Reminder  Involved in virtually every chemical reaction ▪ Enzymes catalyze reactions  Structure ▪ muscle, keratins (skin,
Structure and Function
Protein Structure BL
Proteins account for more than 50% of the dry mass of most cells
Protein Structure and Properties
The heroic times of crystallography
Amino Acids and Proteins
Proteins.
Proteins account for more than 50% of the dry mass of most cells
Diverse Macromolecules
Haixu Tang School of Inforamtics
Chapter 3 Proteins.
Fig. 5-UN1  carbon Amino group Carboxyl group.
Packet #9 Supplement.
Packet #9 Supplement.
Proteins account for more than 50% of the dry mass of most cells
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
Understanding protein folding via free-energy surfaces from theory and experiment  Aaron R Dinner, Andrej Šali, Lorna J Smith, Christopher M Dobson, Martin.
The Chemical Building Blocks of Life
Proteins Proteins have many structures, resulting in a wide range of functions Proteins do most of the work in cells and act as enzymes 2. Proteins are.
Presentation transcript:

Bioinformatics: Practical Application of Simulation and Data Mining Protein Folding I Prof. Corey O’Hern Department of Mechanical Engineering & Materials Science Department of Physics Department of Applied Physics Program in Computation Biology & Bioinformatics Integrated Graduate Program in Physical & Engineering Biology Yale University 1

Statistical Mechanics Biochemistry Protein Folding 2

What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed of tens (peptides) to thousands (proteins) of monomers Monomers are 20 naturally occurring amino acids Different proteins have different amino acid sequences Structureless, extended unfolded state Compact, ‘unique’ native folded state (with secondary and tertiary structure) required for biological function Sequence determines protein structure (or lack thereof) Proteins unfold or denature with increasing temperature or chemical denaturants Linear polymerFolded state 3

Amino Acids I N-terminalC-terminal CC variable side chain R peptide bonds Side chains differentiate amino acid repeat units Peptide bonds link residues into polypeptides 4

Amino Acids II 5

The Protein Folding Problem: What is ‘ unique ’ folded 3D structure of a protein based on its amino acid sequence? Sequence  Structure Lys-Asn-Val-Arg-Ser-Lys-Val-Gly-Ser-Thr-Glu-Asn-Ile-Lys- His-Gln-Pro- Gly-Gly-Gly-… 6

Folding: hydrophobicity, hydrogen bonding, van der Waals interactions, … Unfolding: increase in conformational entropy, electric charge… H (hydrophobic) P (polar) inside outside Hydrophobicity index Driving Forces 7

Higher-order Structure 8

Secondary Structure: Loops,  -helices,  -strands/sheets  -sheet  -strand Right-handed; three turns Vertical hydrogen bonds between NH 2 (teal/white) backbone group and C=O (grey/red) backbone group four residues earlier in sequence Side chains (R) on outside; point upwards toward NH 2 Each amino acid corresponds to 100 , 1.5Å, 3.6 amino acids per turn ( ,  )=(-60 ,-45  )  -helix propensities: Met, Ala, Leu, Glu  -helix 5-10 residues; peptide backbones fully extended NH (blue/white) of one strand hydrogen-bonded to C=O (black/red) of another strand C ,side chains (yellow) on adjacent strands aligned; side chains along single strand alternate up and down ( ,  )=(-135 ,135  )  -strand propensities: Val, Thr, Tyr, Trp, Phe, Ile 5Å 9

Backbonde Dihedral Angles 10

Ramachandran Plot: Determining Steric Clashes CC  NC C  N CC  N CC  N Backbone dihedral angles    =0,180  Non-Gly Gly theory PDB   vdW radii < vdW radii backbone flexibility 4 atoms define dihedral angle: 14

Backbone dihedral angles from PDB

16

Prof. Fred Richards, Yale MB&B

Side-Chain Dihedral Angles Side chain: C  -CH 2 -CH 2 -CH 2 -CH 2 -NH 3  4 : Lys, Arg  5 : Arg Use NC  C  C  C  C  N  to define  1,  2,  3,  4 18

Sidechain Dihedral Angle Distributions for Leu PDB Hard-sphere model

Molecular biologistBiological Physicist Your model is oversimplified and has nothing to do with biology! Your model is too complicated and has no predictive power! 21

Folding Transition T>T m T<T m Oxygen: Red Carbon: Cyan Nitrogen: Dark Blue Sulfur: Yellow

For all possible conformations, compute free energy from atomic interactions within protein and protein- solvent interactions; find conformation with lowest free energy…e.g using all-atom molecular dynamics simulations Possible Strategies for Understanding Protein Folding Not possible?, limited time resolution Use coarse-grained models with effective interactions between residues and residues and solvent General, but qualitative 23

Why do proteins fold (correctly & rapidly)??  = # allowed dihedral angles How does a protein find the global optimum w/o global search? Proteins fold much faster. For a protein with N amino acids, number of backbone conformations/minima Levinthal ’ s paradox: N c ~ ~10 95  fold ~ N c  sample ~10 83 s  universe ~ s vs  fold ~ s 24

U, F =U-TS Energy Landscape all atomic coordinates; dihedral angles M1M1 M3M3 M2M2 S 12 S 23 S -1 minimum saddle point maximum 25

Roughness of Energy Landscape smooth, funneled rough (Wolynes et. al. 1997) 26

Folding Pathways similarity to native state dead end 27

Folding Phase Diagram smoothrough 28

What differentiates the native state from other low-lying energy minima? How many low-lying energy minima are there? Can we calculate landscape roughness from sequence? What determines whether protein will fold to the native state or become trapped in another minimum? What are the pathways in the energy landscape that a given protein follows to its native state? Open Questions NP Hard Problem! 29

Digression---Number of Energy Minima for Sticky Spheres NmNm NsNs NpNp sphere packings polymer packings N s ~exp(aN m ); N p ~exp(bN m ) with b>a 30