Molecular Dynamics Valerie Daggett Bioengineering Department University of Washington...everything that living things do can be understood in terms of.

Slides:



Advertisements
Similar presentations
Time averages and ensemble averages
Advertisements

Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Evaluating Free Energies of Binding using Amber: The MM-PBSA Approach.
Molecular Dynamics at Constant Temperature and Pressure Section 6.7 in M.M.
Lecture 13: Conformational Sampling: MC and MD Dr. Ronald M. Levy Contributions from Mike Andrec and Daniel Weinstock Statistical Thermodynamics.
Lecture 14: Special interactions. What did we cover in the last lecture? Restricted motion of molecules near a surface results in a repulsive force which.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Thermodynamics of Protein Folding
Computational methods in molecular biophysics (examples of solving real biological problems) EXAMPLE I: THE PROTEIN FOLDING PROBLEM Alexey Onufriev, Virginia.
A New Analytical Method for Computing Solvent-Accessible Surface Area of Macromolecules.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Molecular Dynamics, Monte Carlo and Docking Lecture 21 Introduction to Bioinformatics MNW2.
The Calculation of Enthalpy and Entropy Differences??? (Housekeeping Details for the Calculation of Free Energy Differences) first edition: p
Protein Rigidity and Flexibility: Applications to Folding A.J. Rader University of Pittsburgh Center for Computational Biology & Bioinformatics.
Protein folding kinetics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Protein Tertiary Structure Prediction. Protein Structure Prediction & Alignment Protein structure Secondary structure Tertiary structure Structure prediction.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Joo Chul Yoon with Prof. Scott T. Dunham Electrical Engineering University of Washington Molecular Dynamics Simulations.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Molecular Modeling Part I Molecular Mechanics and Conformational Analysis ORG I Lab William Kelly.
Molecular Mechanics, Molecular Dynamics, and Docking
Ana Damjanovic (JHU, NIH) JHU: Petar Maksimovic Bertrand Garcia-Moreno NIH: Tim Miller Bernard Brooks OSG: Torre Wenaus and team.
Max Shokhirev BIOC585 December 2007
1 Physical Chemistry III Molecular Simulations Piti Treesukol Chemistry Department Faculty of Liberal Arts and Science Kasetsart University :
Algorithms and Software for Large-Scale Simulation of Reactive Systems _______________________________ Ananth Grama Coordinated Systems Lab Purdue University.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Bioinformatics: Practical Application of Simulation and Data Mining Protein Folding I Prof. Corey O’Hern Department of Mechanical Engineering & Materials.
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Biomolecular Nuclear Magnetic Resonance Spectroscopy BASIC CONCEPTS OF NMR How does NMR work? Resonance assignment Structure determination 01/24/05 NMR.
Department of Mechanical Engineering
Computational Chemistry Molecular Mechanics/Dynamics F = Ma Quantum Chemistry Schr Ö dinger Equation H  = E 
Computer Simulation of Biomolecules and the Interpretation of NMR Measurements generates ensemble of molecular configurations all atomic quantities Problems.
Biomolecular Nuclear Magnetic Resonance Spectroscopy FROM ASSIGNMENT TO STRUCTURE Sequential resonance assignment strategies NMR data for structure determination.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
A Technical Introduction to the MD-OPEP Simulation Tools
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Molecular simulation methods Ab-initio methods (Few approximations but slow) DFT CPMD Electron and nuclei treated explicitly. Classical atomistic methods.
Molecular Mechanics Studies involving covalent interactions (enzyme reaction): quantum mechanics; extremely slow Studies involving noncovalent interactions.
Altman et al. JACS 2008, Presented By Swati Jain.
7. Lecture SS 2005Optimization, Energy Landscapes, Protein Folding1 V7: Diffusional association of proteins and Brownian dynamics simulations Brownian.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Interacting Molecules in a Dense Fluid
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Dynameomics: Protein Mechanics, Folding and Unfolding through Large Scale All-Atom Molecular Dynamics Simulations INCITE 6 David A. C. Beck Valerie Daggett.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Molecular dynamics (MD) simulations  A deterministic method based on the solution of Newton’s equation of motion F i = m i a i for the ith particle; the.
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
1 of 21 SDA development -Description of sda Description of sda-5a - Sda for docking.
Computational Structure Prediction
Implementation of the TIP5P Potential
Molecular Modelling - Lecture 3
Volume 84, Issue 6, Pages (June 2003)
Algorithms and Software for Large-Scale Simulation of Reactive Systems
Computational Analysis
Enzyme Kinetics & Protein Folding 9/7/2004
An Integrated Approach to Protein-Protein Docking
Giovanni Settanni, Antonino Cattaneo, Paolo Carloni 
Understanding protein folding via free-energy surfaces from theory and experiment  Aaron R Dinner, Andrej Šali, Lorna J Smith, Christopher M Dobson, Martin.
Algorithms and Software for Large-Scale Simulation of Reactive Systems
Experimental Overview
Conformational Search
Protein Folding and Unfolding at Atomic Resolution
Presentation transcript:

Molecular Dynamics Valerie Daggett Bioengineering Department University of Washington...everything that living things do can be understood in terms of the jigglings and wigglings of atoms. Richard Feynman

Protein Dynamics Proteins are not static Motion is an incontrovertible consequence of room temperature (or any T > 0 K) Kinetic energy per atom is ~ 1 298K (25°C)  several Å/ps Motion recognized to be important early on. Kendrew (1950s) solved crystal structure of myoglobin (Perutz, phasing)

Myoglobin No pathway for O 2  heme!

Protein Dynamics Kendrew: “Perhaps the most remarkable features of the molecule are its complexity and its lack of symmetry. The arrangement seems to be almost totally lacking in the kind of regularities which one instinctively anticipates, and it is more complicated than has been predicted by any theory of protein structure.” Situation gets worse when you consider dynamics. But proteins are dynamic and dynamic behavior critical for function. So, static, average structures are only part of the story.

Function from static structure?

Dynamics necessary for function

Snapshots

Static and/or average structures may not be representative of conformations critical to function

Theory Experiment clearly demonstrates that proteins are mobile, but no single experiment or combination of experiments can provide an all-inclusive view of the dynamic behavior of all atoms in a protein. Computer simulations can however –ea. atom as a function of time

Why Molecular Dynamics? Most realistic simulation method available Can provide structural and dynamic information unobtainable by experiment, but is experimentally testable Native and nonnative interactions apparent But Sampling is limited, the goal is to sample experimentally relevant regions of conformational space, not all of conformational space

Molecular Dynamics Potential function for MD 1,2 [sum of following terms] U = Bond + Angle + Dihedral + van der Waals + Electrostatic 1.Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) 91: Levitt M. et al. J. Phys. Chem. B (1997) 101:

Molecular Dynamics Potential function for MD U = Bond + Angle + Dihedral + van der Waals + Electrostatic

Molecular Dynamics Potential function for MD U = Bond + Angle + Dihedral + van der Waals + Electrostatic b0b0

Molecular Dynamics Potential function for MD U = Bond + Angle + Dihedral + van der Waals + Electrostatic θ0θ0

Molecular Dynamics Potential function for MD U = Bond + Angle + Dihedral + van der Waals + Electrostatic Φ0Φ0

Molecular Dynamics Potential function for MD U = Bond + Angle + Dihedral + van der Waals + Electrostatic

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic To a large degree, protein structure is dependent on non-bonded atomic interactions

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic

Molecular Dynamics Non-bonded components of potential function + -

Molecular Dynamics Non-bonded components of potential function +

Molecular Dynamics Non-bonded components of potential function NOTE: Sum over all pairs of N atoms, or pairs N is often between 5x10 5 to 5x10 6 For 5x10 5 that is 1.25x10 11 pairs THAT IS A LOT OF POSSIBLE PAIRS!

What can you do with a force field? Generation of ‘experimental’ structures Refinement of ‘experimental’ structures Monte Carlo Scoring functions Energy minimization Analysis Perform MD simulations etc.

Molecular Dynamics Time dependent integration of classical equations of motion

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration Evaluate forces and perform integration for every atom Each picosecond of simulation time requires 500 iterations of cycle E.g. w/ 50,000 atoms, each ps ( s) involves 25,000,000 evaluations

Molecular Dynamics Actual integration the equations of motion Conserves energy Smooth, robust

Molecular Dynamics Determination of temperature

Methods Molecular dynamics (MD) –Brooks-Beeman integration algorithm –Microcanonical ensemble (NVE) Number of atoms, box volume, & energy are conserved Energy conservation is naturally satisfied with classical equations of motion Energy conservation is an inherent check on the implementation Free from coupling the microscopic system to macroscopic variables as do NVT and NPT

Molecular dynamics Microcanonical ensemble, all atoms, solvent, fully flexible molecules, continuous trajectories, no restraints/biases predictive MD---expt to check No Ewald --- artificial periodicity, altered conformational and dynamical properties No fictitious bonds between H atoms of water No Shake Correct masses Good simple, flexible water correct D and RDF

Methods Molecular dynamics (MD) –Temperature in NVE Mean T over hundreds of steps Energy drift in ilmm is primarily kinetic resulting from numerical round-off –Over thousands of steps mean T can be monitored for energy conservation –Velocity rescales once per 10ns

Implementation Written in C –Ubiquitous, standardized, optimized language 64 bit math Software design –Kernel Compiles user’s molecular mechanics programs Schedules execution across processor and machines –Modules, e.g. Energy minimization Molecular Dynamics Monte Carlo Analysis REMD RDCs + others

Implementation Dual mode parallelization –Standardized tools available on modern platforms POSIX threads –Distribute computations across multiple CPUs in a single computer Message Passing Interface (MPI) –Distribute computations across multiple computers on a high speed network –Benefit is scalability

State of the Art MD What can be done with PCs? Environment: Possible to characterize solvent-dependent conformational behavior Proteins in membranes Size: ≤ 500 residues (more possible if willing to dedicate resources to it, our record is 2519 residues in solvated membrane) Timescale: Multiple ns simulations fairly routine for proteins  s possible if willing to dedicate resources to it

Molecular Dynamics MD provides atomic resolution of native dynamics PDB ID: 3chy, E. coli CheY 1.66 Å X-ray crystallography

Molecular Dynamics MD provides atomic resolution of native dynamics PDB ID: 3chy, E. coli CheY 1.66 Å X-ray crystallography

Molecular Dynamics MD provides atomic resolution of native dynamics 3chy, hydrogens added

Molecular Dynamics MD provides atomic resolution of native dynamics 3chy, waters added (i.e. solvated)

Molecular Dynamics MD provides atomic resolution of native dynamics 3chy, waters and hydrogens hidden

Molecular Dynamics MD provides atomic resolution of native dynamics native state simulation of 3chy at 298 Kelvin, waters and hydrogens hidden

Molecular Dynamics MD provides atomic resolution of native dynamics native state simulation of 3chy at 298 Kelvin, waters and hydrogens hidden

Molecular Dynamics MD provides atomic resolution of folding / unfolding unfolding simulation (reversed) of 3chy at 498 Kelvin, waters & hydrogens hidden

Average may not be representive

Storch et al., Biochem, 1995, 1999a,b, 2000 Dynamic cleft discovered through MD Cytochrome b 5

Storch et al., Biochem, 1999 Bill Atkins, Patricia Campbell S18 R47 Construction of mutants to test whether cleft forms

Construction of cyt c – cyt b 5 complexes

Changes in cyt b 5 upon binding cyt c = Predicted binding surface = Change in chemical shift Hom et al., Biochem, 2000

Cleft allows for electron transfer through the protein in channel lined with aromatics = Nonpolar = Polar

Validation Validation, how do you know if a simulation is correct? How do you know it is done?

Starting a Molecular Dynamics Simulation Crystal or NMR Structure Solvate with water or other solvent Å from protein Heat to desired temperature and allow motion to evolve over time T = 298 K r = gm/ml T = 498 K r = gm/ml All atoms present Fully flexible water NVE

Native Dynamics at 25 ºC All C  atoms C  RMSD (Å) Time (ns) Active site loop and N-terminus removed = 1.7 Å = 0.7 Å Crystal structure NOEs reproduced

Crystal StructureAfter 50 ns of MD Structural Changes to Native State During MD = Xtal Structure = 50 ns MD Active Site Loop N-terminus Turn

Chymotrypsin Inhibitor 2 Expt = Shaw et al (1995) Biochem 34:2225 MD = Li & Daggett (1995) Prot. Eng. 8(11) N 15 –H no mobility high

Temperatur e (K) Cutoff range (Å) Time a (ns) NOEs satisfied b (% of 603) Xtal No restraints 22 simulations >1.2  s NOEs courtesy of Stefan Freund & Trevor Rutherford, ARF

Pushing to high temperature Taking excursions farther from the native state, will the force fields and methods hold? Thermal unfolding of proteins

Thermal Denaturation of CI2 at 498 K Ca RMSD (Å) Time (ns) Li & Daggett, 1994, PNAS; JMB, 1996 N TS? D

TS not localized to single bond, distributed and ensemble From MD cannot calculate DG along reaction coordinate Structure-based definition of TS: Kinetically, protein will not succeed in every attempt to cross TS but will change rapidly afterwards A process with a large change in energy but small change in entropy  large change in free energy N TS D Reaction Coordinate H G -TS Identifying Transition States in MD Trajectories

TS D Projection of Trajectory in RMSD Space Calculate the RMSD between all structures---15,000 x 15,000 dimensional space Reduce to 3-dimensions, distance between points  RMSD between structures Clusters indicate similar conformations, conformational states Li & Daggett, 1994, 1996 N

Main-chain Fold Preserved in Transition State a Crystal Structure Average TS1 Structure ~4 Å, 43 % native H-Bonds b1 b2 b3b3

Packing is Disrupted in Transition State Crystal Structure WT TS1 32 % SASA

Structure of TS from Experiment  = 1  site of mutation native-like in TS  = 0  site of mutation unfolded in TS Fractional  values  partial structure in TS  =  G TS-D /  G N-D = 1 N TS D  G N-D  G TS-D  G N- D  =  G TS-D /  G N-D = 0 N TS D  G N-D  G TS-D  G N- D NTSD Matouschek, Kellis, Serrano, Fersht, 1989, Nature, Fersht, Leatherbarrow, Wells, 1986, Nature; 1987 Biochem, +++

Calculation of S Values for Comparison with Experimental  F Values S = structure index = (S 2º ) (S 3º ) % native secondary structure (  ) % tertiary structure, contacts For each residue calculate: Daggett & Li, 1994, PNAS; Daggett, Li, Itzhaki, Otzen & Fersht, 1996, JMB

Residue Number Phi or S Value  S Value Phi Value Comparison of Calculated S Values and Experimental  Values Otzen et al., 1994, PNAS; Itzhaki et al., 1995, JMB; Li & Daggett, 1994,1996, JMB; Daggett et al., 1996, JMB

Overall TS Structure and Unfolding Pathway are Independent of Temperature 373 K 398 K 448 K 473 K 498 K (21 ns) (0.3 ns) (0.57 ns) (1.44 ns) (8.26 ns) TS1 TS2 TS3 TS4 498 K (0.225 ns) (0.335 ns) (0.1 ns) (0.07 ns) TT

Rotate 90 to right Conformational Heterogeneity of TS = Crystal Structure = TS1-4, 498 K = TS5-9,  T XTAL  MD  4.5 Å

N TS D Free Energy Calculations for Direct Determination of  WT Mutant NTSD N'TS'D'  G N  TS  G TS  D  G' TS  D  G' N  TS GNGN  G TS GDGD GNDGND  G' N  D  G N-TS =  G N-TS -  G' N-TS =  G N -  G TS  G TS-D =  G TS-D -  G' TS-D =  G TS -  G D  G N-D =  G N-D -  G’ N-D =  G N -  G D  F =  G TS-D /  G N-D

R = 0.85 R = 0.91 (no V47A) FEP Calculations for Hydrophobic Core Mutants (kcal/mole) Pan & Daggett, Biochem, 2001 Extended peptide NOT a good model of D

DA 23 TS A23 K2 E7 WT TSRF48 TS K2 E7 D23 R48 R62 F50 F48 R62 F50 Ladurner, Itzhaki, Daggett, Fersht, 1998, PNAS Designing Faster Folding Forms of CI2 Based on MD-Generated TS Models

Removal of Unfavorable Interactions Identified in TS Models Accelerates Folding Removal of charge repulsion and improvement of packing in the TS yields fastest-folding form of CI2.

Thermal Denaturation of CI2 at 498 K C  RMSD (Å) Time (ns) D 40,000 structures

 The Denatured State of CI2 Distances in N: W5-V14 15 Å I30-Y42 13 Å P33-I37 11 Å V14 Y42 I37 I57 I30 L49 P33 Kazmirski et al., PNAS, 2001 Experimental Results expt = 7.2 Hz MD = 7.0 Hz  V19-L21, I30-T36 Nearly random coil (res ) hydrophobic clustering W5

Summary of CI2 Simulations N is well behaved and in good agreement with experiment. TS is an expanded version of N with disrupted core and loops and frayed secondary structure. Validity of MD-generated TS models tested through indirect comparison with experimental  values, direct comparison of  Gs, behavior when T is quenched, and design of faster folding mutants. WT D is very disrupted with only minor amounts of hydrophobic clustering and fluctuating helical structure. Nearly random coil.

k obs (s -1 ) k unf kfkf En-HD unfolds at 348 & 373 K on the same timescale by simulation and experiment 47,000 s -1 Time to reach TS in MD simulation But, it is not enough to get the timescale right, must get pathway too! 10 °C Mayor et al., Nature 2003

Development of information-rich property space Low information content property: Main-chain non-polar SASA No discrimination between native & non-native states Native Nonnative

Development of information-rich property space High information content property: CONGENEAL structural dissimilarity score 1 Excellent discrimination between native & non-native states 1. Yee and Dill, Protein Science, Native Nonnative

Mean distance in PS (32 -> 10 properties) for a given conformation to the folded or native state ‘cluster’ is acceptable reaction coordinate –Value increases with distance from native cluster –Native cluster is bounded Development of a reaction coordinate from property space Foldedness ↔ location along folding reaction coordinate