Advanced Sampling Techniques When molecular dynamics is just not enough.

Slides:



Advertisements
Similar presentations
Time averages and ensemble averages
Advertisements

Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Review Of Statistical Mechanics
Lecture 13: Conformational Sampling: MC and MD Dr. Ronald M. Levy Contributions from Mike Andrec and Daniel Weinstock Statistical Thermodynamics.
Monte Carlo Methods and Statistical Physics
Rare Event Simulations Theory 16.1 Transition state theory Bennett-Chandler Approach 16.2 Diffusive Barrier crossings 16.3 Transition path ensemble.
TOPIC A: EQUILIBRIUM Equilibrium, Le Chatelier’s Principle, Acid- Base Equilibrium, Ksp, pH.
Introduction to Sampling based inference and MCMC Ata Kaban School of Computer Science The University of Birmingham.
Lecture 14: Advanced Conformational Sampling
The Calculation of Enthalpy and Entropy Differences??? (Housekeeping Details for the Calculation of Free Energy Differences) first edition: p
Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation.
P2P-based Simulator for Protein Folding Shun-Yun Hu 2005/06/03.
Planning operation start times for the manufacture of capital products with uncertain processing times and resource constraints D.P. Song, Dr. C.Hicks.
Behaviour of velocities in protein folding events Aldo Rampioni, University of Groningen Leipzig, 17th May 2007.
Practical Guide to Umbrella Sampling
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Leipzig, 17 May Markov Models of Protein Folding - Application to Molecular Dynamics Simulations Christian Hedegaard Jensen.
Optimization of thermal processes2007/2008 Optimization of thermal processes Maciej Marek Czestochowa University of Technology Institute of Thermal Machinery.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Coordinates and Pathways in MM and QM/MM modeling Haiyan Liu School of Life Sciences, University of Science and Technology of China.
Protein Dynamics Studies of Structure-Switching in Lymphotactin By Max Shokhirev Final Project for PHAR201 Fall 2008.
Advanced methods of molecular dynamics Monte Carlo methods
Geometry Optimisation Modelling OH + C 2 H 4 *CH 2 -CH 2 -OH CH 3 -CH 2 -O* 3D PES.
Relating computational and physical complexity Computational complexity: How the number of computational steps needed to solve a problem scales with problem.
‘Tis not folly to dream: Using Molecular Dynamics to Solve Problems in Chemistry Christopher Adam Hixson and Ralph A. Wheeler Dept. of Chemistry and Biochemistry,
Introduction to (Statistical) Thermodynamics
1 IE 607 Heuristic Optimization Simulated Annealing.
ExTASY 0.1 Beta Testing 1 st April 2015
Free energies and phase transitions. Condition for phase coexistence in a one-component system:
Conformational Sampling
Stochastic Algorithms Some of the fastest known algorithms for certain tasks rely on chance Stochastic/Randomized Algorithms Two common variations – Monte.
Deca-Alanine Stretching
Lecture 19: Free Energies in Modern Computational Statistical Thermodynamics: WHAM and Related Methods Dr. Ronald M. Levy Statistical.
Basic Monte Carlo (chapter 3) Algorithm Detailed Balance Other points.
Monte Carlo Methods in Statistical Mechanics Aziz Abdellahi CEDER group Materials Basics Lecture : 08/18/
Simulated Annealing.
Chem. 860 Molecular Simulations with Biophysical Applications Qiang Cui Department of Chemistry and Theoretical Chemistry Institute University of Wisconsin,
Rates of Reactions Why study rates?
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 14 Some materials adapted from Prof. Keith E. Gubbins:
Computer Simulation of Biomolecules and the Interpretation of NMR Measurements generates ensemble of molecular configurations all atomic quantities Problems.
A chaotic collection of thoughts on stochastic transport what are the issues that M3D must consider to accurately determine heat transport which analytical.
Copyright©2004 by Houghton Mifflin Company. All rights reserved. 1 Introductory Chemistry: A Foundation FIFTH EDITION by Steven S. Zumdahl University of.
A Technical Introduction to the MD-OPEP Simulation Tools
Understanding Molecular Simulations Introduction
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Exploring the connection between sampling problems in Bayesian inference and statistical mechanics Andrew Pohorille NASA-Ames Research Center.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
By Steven S. Zumdahl & Don J. DeCoste University of Illinois Introductory Chemistry: A Foundation, 6 th Ed. Introductory Chemistry, 6 th Ed. Basic Chemistry,
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling I Prof. Corey O’Hern Department of Mechanical Engineering Department.
13. Extended Ensemble Methods. Slow Dynamics at First- Order Phase Transition At first-order phase transition, the longest time scale is controlled by.
An Introduction to Monte Carlo Methods in Statistical Physics Kristen A. Fichthorn The Pennsylvania State University University Park, PA
Chapter 16 Kinetic Theory of Gases. Ideal gas model 2 1. Large number of molecules moving in random directions with random speeds. 2. The average separation.
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
MSc in High Performance Computing Computational Chemistry Module Parallel Molecular Dynamics (i) Bill Smith CCLRC Daresbury Laboratory
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Basic Monte Carlo (chapter 3) Algorithm Detailed Balance Other points non-Boltzmann sampling.
Statistical Mechanics for Free Energy Calculations
Lecture 14: Advanced Conformational Sampling Dr. Ronald M. Levy Statistical Thermodynamics.
Computational Physics (Lecture 11) PHY4061. Variation quantum Monte Carlo the approximate solution of the Hamiltonian Time Independent many-body Schrodinger’s.
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
Simulated Annealing Chapter
Overview of Molecular Dynamics Simulation Theory
Lecture 17: Kinetics and Markov State Models
Maria Okuniewski Nuclear Engineering Dept.
Rare Event Simulations
Enzyme Kinetics & Protein Folding 9/7/2004
Driven Adiabatic Dynamics Approach to the Generation of Multidimensional Free-Energy Surfaces. Mark E. Tuckerman, Dept. of Chemistry, New York University,
Lecture 17: Kinetics and Markov State Models
Large Time Scale Molecular Paths Using Least Action.
Constrained Molecular Dynamics as a Search and Optimization Tool
Presentation transcript:

Advanced Sampling Techniques When molecular dynamics is just not enough.

Why do we need enhanced sampling?

MD N Trying to fold a peptide from two different initial conditions, gives two different free energy surfaces !

Rates depend on barriers and T

What can we do to speed things up? It’s basically a problem of rates, or kinetics – Rates are exponentially dependent on temperature – higher T = higher KE = ability to cross higher barriers Can we just simulate at a higher T? Yes! Folding is faster at higher T.

Why is convergence so slow? We want the average of observable properties- so we need to include in that weighted average all of the important values that it might have Structural averages can be slow to converge- you need to wait for the structure to change, and do it many, many times. Sampling an event one time doesn’t tell you the correct probability (or weight) for that event – What is the proper weight? Boltzmann’s equation

What can we do to speed things up? It’s basically a problem of rates, or kinetics – Rates are exponentially dependent on temperature – higher T = higher KE = ability to cross higher barriers Can we just simulate at a higher T? Yes! Folding is faster at higher T. but…we mess up the thermodynamics. – Equilibrium constants (weights) also depend on T

Populations depend on ΔE and T

Free energies along a defined reaction coordinate via Umbrella Sampling

Umbrella Sampling How to force barrier crossings without compromising thermodynamic properties? Very slow transitions

Umbrella Sampling trapped, bad ΔG good for ΔG

One could just run dynamics and wait until all space has been sampled. Then, if one extracts P(x k ) from the trajectory, the PMF can be written as: However, it takes forever to properly sample all conformations, and to jump over the barrier. The solution is to bias the system towards whatever value of the coordinate we want. This is called unbiased sampling

AMBER WS 2012 CECAM Lausanne Umbrella Sampling True PMF Ideal Biasing Potential No barrier, perfect sampling We could BIAS the simulation, but we do not really know how to do it exactly.

AMBER WS 2012 CECAM Lausanne Umbrella Sampling True PMF Windows: choose i, k and x i system-dependent Introduce biasing potentials along the reaction coordinate

AMBER WS 2012 CECAM Lausanne Adding a quadratic biasing potential

Check for sufficient overlap Histograms from neighboring windows should overlap strongly, all points on the RC must be sampled suffciently.

AMBER WS 2012 CECAM Lausanne Umbrella Sampling Simulation Window Histogram Part of PMF Final computed PMF from many windows Solved iteratively using e.g. the WHAM program by Alan Grossfield Constructing the PMF

AMBER WS 2012 CECAM Lausanne Umbrella Sampling Histograms from neighboring windows should overlap strongly, all points on the RC must be sampled suffciently. Solved iteratively using e.g. the WHAM program by Alan Grossfield ( content/wham) Check for sufficient overlap between sampled regions

Histograms & free energy profiles Umbrella run needs many simulations Do NOT need to sample full range in 1 simulation G= - RTlnP/P 0

Comparing 2 conformations Song, Hornak, de los Santos, Grollman and Simmerling, Biochemistry 2006 It will take much too long to get precise populations for these 2 minima just by running MD.

8OG binding mode in complex: dihedral umbrella sampling syn anti Song, Hornak, de los Santos, Grollman and Simmerling, Biochemistry 2006

Simulations reveal how the energy profile changes if a mutation is made syn anti Song, Hornak, de los Santos, Grollman and Simmerling, Biochemistry 2006 Effect of mutations

 Conformational space has many local minima  Barriers hinder rapid escape from high-energy minima  How can we search and score all conformations? Replica Exchange Coordinate Energy

Early tries: Run MD at T low, once in a while heat up to T high, run some more. Change to T low Repeat… The problem is that the statistical weights of the first and second T low are unknown and unknowable.

Metropolis Monte Carlo, one of the most important algorithms of the 20 th century.

MANIAC computer. Los Alamos national Lab Kbytes memory 11Khz Clock

When Bugs were real (Mark II computer, 1945)

Coordinate Energy We want to sample all conformations in a way that when we are done, the probability of finding a particular conformation is

Moreover, in equilibrium, we require that the rates from minimum A to B be exactly equal to the rate from minimum B to A. This is called Microscopic Reversibility. Coordinate Energy

Temperature Replica Exchange (Parallel tempering) 375K 350K 325K 300K REMD Hansmann, U., CPL 1997 Sugita & Okamoto, CPL 1999

 Impose desired weighting (limiting distributionon exchange calculation 375K 350K 325K 300K  Impose reversibility/detailed balance Rewrite using only potential energies, by rescaling kinetic energy for both replicas. Exchange?

Temperature Replica Exchange  MD runs over range of T  Periodically swap structures  Faster convergence than regular MD  Populations as a function of T  Drawback: ΔE must be small  small ΔT for large systems Number of replicas ~ N 1/2

Does it really converge faster than regular MD? Yes ! But not really as much as one would think (or hope, or dream) 5k B T For a barrier of ~5k B T low, and a T high= 2T low, the barrier crossing rate is x12 Better methods use replicas in Hamiltonian space.  Different biasing for each replica

Do you have an MD code?  Build yourself a REMD code in minutes !  Just write a wrapper ! START REMD Spawn replicas and run MD at different temperatures Stop MDs Decide to exchange or not

Advanced methods can help MD Replica exchange N Roe, Hornak and Simmerling, J. Mol. Biol. 2006

What if you don’t know the pathway for the change?

in silico structure interconversions ? “endpoint method”: link many simulations timescale independent Bergonzo, Campbell, Walker & Simmerling, Intl J Quantum Chem 2009

NEB : use many simulations to study slow dynamics R P Coordinate 2 F || F┴F┴ Coordinate 1 Final path Standard forces Spring forces

Reaction Path Calculations: NEB kcal/mol NEB requires coupling of multiple simulations using “springs” A B How can we get from A to B?

Other techniques Accelerated MD Hamiltonian Replica Exchange Multi Dimensional Replica Exchange Constant pH ( + REMD) Steered Molecular Dynamics

Accelerated Molecular Dynamics de Oliveira C.A.F., Hamelberg, D., McCammon, J.A., On the Application of Accelerated Molecular Dynamics to Liquid Water Simulations. J. Phys. Chem. B Hamelberg, D., de Oliveira C.A.F., McCammon J.A., Sampling of slow diffusive conformational transitions with accelerated molecular dynamics. The Journal of chemical physics, Grant, B.J., Gorfe, A.A., and McCammon, J.A., Ras conformational switching: simulating nucleotide-dependent conformational transitions with accelerated molecular dynamics. PLoS computational biology, de Oliveira, C.A.F., et al., Large-Scale Conformational Changes of Trypanosoma cruzi Proline Racemase Predicted by Accelerated Molecular Dynamics Simulation. PLoS computational biology, 2011.

AMD effect on the Potential

Principal Component Analysis Build PC space based on the 500ns aMD simulation Project X-ray, Shaw structures, 500ns cMD, and 1ms cMD Cost: several million dollars Energy consumption ~116.5KW Code mostly hardwired to the hardware Cost: $2000 Energy consumption ~0.9KW AMBER 12 Routine Access to Millisecond Time Scale Events with Accelerated Molecular Dynamics. Levi Pierce, Romelia Salomon-Ferrer, Cesar Augusto de Oliveira, J. Andrew McCammon and Ross C. Walker. J. Chem. Theory Comput., 2012, 8 (9), pp 2997–3002 DOI: /ct300284c