Coordinates and Pathways in MM and QM/MM modeling Haiyan Liu School of Life Sciences, University of Science and Technology of China.

Slides:



Advertisements
Similar presentations
Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Advertisements

Chemical Kinetics : rate of a chemical reaction Before a chemical reaction can take place the molecules involved must be raised to a state of higher potential.
Multiscale Dynamics of Bio-Systems: Molecules to Continuum February 2005.
SCCDFTB as a bridge between MM and high-level QM. Jan Hermans University of North Carolina 1.
Molecular Dynamics at Constant Temperature and Pressure Section 6.7 in M.M.
Principal Component Analysis in MD Simulation Speaker: ZHOU Chen-Yang Supervisor: Wu Yun-Dong.
Computational methods in molecular biophysics (examples of solving real biological problems) EXAMPLE I: THE PROTEIN FOLDING PROBLEM Alexey Onufriev, Virginia.
Lecture 14: Advanced Conformational Sampling
The total energy as a function of collective coordinates, ,  for upright and tilted structures are plotted. Two types of frequency spectrum are calculated:
Presented by Next Generation Simulations in Biology: Investigating biomolecular structure, dynamics and function through multiscale modeling Pratul K.
The Calculation of Enthalpy and Entropy Differences??? (Housekeeping Details for the Calculation of Free Energy Differences) first edition: p
Computational Materials Science Network Grain Boundary Migration Mechanism:  Tilt Boundaries Hao Zhang, David J. Srolovitz Princeton Institute for the.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Generation, comparison and merging of pathways between protein conformations: Gating in K-channels * Generation, comparison and merging of pathways between.
Potential Energy Surfaces
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Behaviour of velocities in protein folding events Aldo Rampioni, University of Groningen Leipzig, 17th May 2007.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Molecular Dynamics and Normal Mode Analysis of WW domain Santanu Chatterjee 1, Christopher Sweet 1, Tao Peng 2, John Zintsmaster 2, Brian Wilson 2, Jesus.
Leipzig, 17 May Markov Models of Protein Folding - Application to Molecular Dynamics Simulations Christian Hedegaard Jensen.
The Geometry of Biomolecular Solvation 1. Hydrophobicity Patrice Koehl Computer Science and Genome Center
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
Constraints-based Motion Planning for an Automatic, Flexible Laser Scanning Robotized Platform Th. Borangiu, A. Dogar, A. Dumitrache University Politehnica.
ExTASY 0.1 Beta Testing 1 st April 2015
Statistical Physics of the Transition State Ensemble in Protein Folding Alfonso Ramon Lam Ng, Jose M. Borreguero, Feng Ding, Sergey V. Buldyrev, Eugene.
Deca-Alanine Stretching
1 Heat Conduction in One- Dimensional Systems: molecular dynamics and mode-coupling theory Jian-Sheng Wang National University of Singapore.
Chem. 860 Molecular Simulations with Biophysical Applications Qiang Cui Department of Chemistry and Theoretical Chemistry Institute University of Wisconsin,
Molecular Dynamics Simulation
Computer Simulation of Biomolecules and the Interpretation of NMR Measurements generates ensemble of molecular configurations all atomic quantities Problems.
Approximation of Protein Structure for Fast Similarity Measures Fabian Schwarzer Itay Lotan Stanford University.
Folding of Distinct Sequences into Similar Functional Topologies Hossein Mohammadiarani and Harish Vashisth (Advisor) Department of Chemical Engineering,
Stochastic Thermodynamics in Mesoscopic Chemical Oscillation Systems
A Technical Introduction to the MD-OPEP Simulation Tools
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Enrico Spiga 1,2 Andrea Scorciapino 1, Arturo Robertazzi 1, 2, Roberto Anedda 2, Mariano Casu 2, Paolo Ruggerone 1 and Matteo Ceccarelli 1 1 University.
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling I Prof. Corey O’Hern Department of Mechanical Engineering Department.
Advanced Sampling Techniques When molecular dynamics is just not enough.
Molecular dynamics (2) Langevin dynamics NVT and NPT ensembles
Modeling Protein Flexibility with Spatial and Energetic Constraints Yi-Chieh Wu 1, Amarda Shehu 2, Lydia Kavraki 2,3  Provided an approach to generating.
Assignment for the course: “Introduction to Statistical Thermodynamics of Soft and Biological Matter” Dima Lukatsky In the first.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
The Mechanical Simulation Engine library An Introduction and a Tutorial G. Cella.
Lecture 14: Advanced Conformational Sampling Dr. Ronald M. Levy Statistical Thermodynamics.
Mechanism of Curcumin Inhibiting Amyloid-  Peptides Aggregation by Molecular Dynamics Simulations Zhenyu Qian, and Guanghong Wei State Key Laboratory.
1 Xin Zhou Asia Pacific Center for Theoretical Physics, Dep. of Phys., POSTECH, Pohang, Korea Structuring and Sampling in Complex Conformational.
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Dr. Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
The Nudged Elastic Band Method and its Implementation in VASP
Parastou Sadatmousavi§, & Ross C. Walker*
The Accelerated Weighted Ensemble
Maintaining Adiabaticity in Car-Parrinello Molecular Dynamics
Large Time Scale Molecular Paths Using Least Action.
Volume 108, Issue 3, Pages (February 2015)
Transconformations of the SERCA1 Ca-ATPase: A Normal Mode Study
The Closing Mechanism of DNA Polymerase I at Atomic Resolution
Protein structure prediction.
Understanding protein folding via free-energy surfaces from theory and experiment  Aaron R Dinner, Andrej Šali, Lorna J Smith, Christopher M Dobson, Martin.
Xin-She Yang, Nature-Inspired Optimization Algorithms, Elsevier, 2014
Volume 16, Issue 5, Pages (May 2008)
Experimental Overview
Close Correspondence between the Motions from Principal Component Analysis of Multiple HIV-1 Protease Structures and Elastic Network Modes  Lei Yang,
Probing the Energy Landscape of the Membrane Protein Bacteriorhodopsin
Molecular Mechanism for Stabilizing a Short Helical Peptide Studied by Generalized- Ensemble Simulations with Explicit Solvent  Yuji Sugita, Yuko Okamoto 
Multiple Folding Pathways of the SH3 Domain
Volume 80, Issue 1, Pages (January 2001)
Po-chia Chen, Jochen S. Hub  Biophysical Journal 
Amir Marcovitz, Yaakov Levy  Biophysical Journal 
Presentation transcript:

Coordinates and Pathways in MM and QM/MM modeling Haiyan Liu School of Life Sciences, University of Science and Technology of China

In MM and QM/MM modeling of biomolecules,we often aim at understanding mechanisms of processes, many of which too slow to be investigated by direct simulations. Examples To study protein functions: Possible chemical/conformational (sub)states ? Mechanism of transitions between them? To study protein/peptide folding: any preferred “pathways” or “order of events”? Roles of topologies and sequences?

Two (?) basic causes for macroscopic slowness Need to overcome major enthalpic barriers (e.g., chemical reactions…) Need to “zoom” into a very limited region in the conformational space (e.g., protein folding, binding…)

Among major obstacles in simulations Sampling (in)efficiency time A state B state Waiting time Transition time

Two basic types of approaches A. Connecting known terminal states A1 “forced” barrier crossing Umbrella sampling, Targeting or Steered MD, Drawbacks: projecting a many-dimensional system onto a few pre-assumed reaction coordinates A projected representation of the many-dimensional problem

Reaction coordinates (Rc) Problem associated with Improper projection Environ. Degrees of Freedom Restrained optimization: discontinuous environment Potential of mean forces along Rc: sampling minima but not transition states

A2 Chain of states or path optimization methods Discrete representation of pathways (a pathway is represented by a chain of replicas) “enforced” continuity of the pathway A parametric representation of the many- dimensional problem

B. Introducing more frequent transitions between states Accelerate minimum-escaping (elevated temperature simulations, conformational flooding or local elevation, parallel replica simulations, potential energy function deformation) The key is to avoid over-expanding the accessible conformational space.

Accelerated sampling approaches Potential energy-based v.s. kinetic energy- based Equilibrium v.s. non-equilibrium sampling Degree of freedom (DOF)-specific and degree of freedom-nonspecific –delocalized (collective) DOF or local DOF

coordinates (or order parameters) are essential, provided that we have good enough energy model… “forced” transitions and free energy surfaces: which coordinates to project onto? Chain of states method: enforcing continuity on which coordinates? Accelerated sampling: which coordinates to apply the bias?

Examples Local elevation –Potential energy-based, non-equilibrium,DOF-specific, local DOFs, Conformational flooding –Potential energy-based, non-equilibrium, DOF- specific, delocalized DOFs Temperature REMD –Kinetic energy-based, equilibrium, DOF-non specific Amplified collective motion (ACM) model –Kinetic energy-based, non-equilibrium, DOF-specific, delocalized DOFs …

Our works in recent years Amplified collective motion MD simulation (B) Obtaining minimum energy paths in QM/MM modeling of enzymatic reactions with a modified nudged elastic band method (A2) coarsely-guided sampling of folding trajectories of a small protein domain in implicit solvent (A1) Hamiltonian replica change simulation with free energy-surface-derived umbrella potentials (B)

Accelerate conformation search by Amplifying collective motions Collective coordinates have been used in the analysis of protein dynamics for a long time: Normal mode analysis Principal component (or essential dynamics) analysis of conformational sets Coarse grained elastic net work models.

Several important observations from such studies: Protein motions (e.g. atomic positional fluctuations) are dominated by a very small number of slow modes. These slow modes often correspond to functional motions. The low frequency space is insensitive to details of models

Zhang et al Biophys. J., 2003, 84, 3583 He , et al J. Chem. Phys. 2003, 119, 4005.

Derive low frequency collective modes using the coarse-grained elastic network model no need for exact minimum but use only a single conformation; low frequency modes can be updated on the fly in a simulation; correctly captures the low frequency modes along the “valley” on the energy surface (for compact structures)

Higher T Lower T Coupling the low-frequency and higher frequency modes to different temperature baths Zhang et al Biophys. J., 2003, 84, 3583

Advantages Sampling in conformational space extended along “valleys” of the energy landscape. No “melting” of local structures. Lower frequency subspace updated on the fly. No deformation of potential energy surface. No pre-definition of “path” or “reaction coordinates”.

Drawbacks Functionally important motions may not correspond to the slowest few modes Does not correspond to any equilibrium ensemble. Difficult to be quantitative

Test systems Inter-domain motions of T4 lysozyme in explicit solvent. Folding of a S-peptide analog (in implicit solvent described by a Generalized-Born model)

env 1 (0.40 nm) env 2 (0.13 nm) X-ray structures Bacteriophage T4 lysozyme First three modes of the coarse grained model: 80% of the variations

Atom position RMS fluctuations in MD (300 K dashed line) and ACM- MD (Three slowest modes: 800 K, other modes: 300 K) ACM-MD produces larger fluctuations Zhang et al Biophys. J., 2003, 84, 3583

Projection on the two largest principal components of the crystal structures(dots), MD trajectory (red), and ACM-MD trajectory(blue). ACM-MD sampled larger variations in the two PCA direction. Zhang et al Biophys. J., 2003, 84, 3583

N-term domain C-term domain RMSD from native structure Number of residues In secondary structures Solid: MD Dotted: ACM-MD ACM-MD and normal MD are similar in intra-domain motions Zhang et al Biophys. J., 2003, 84, 3583

ab MD ACM-MD MD ACM-MD Folding of a S-peptide analog

MD , start from nativeACM-md , start from native MD , start from unfolded ACM-md , start from unfolded Solid : RMS deviation from unfolded as functions of time Dotted : RMSD from native as functions of time ACM-MD refolds the peptide while normal MD cannot Zhang et al Biophys. J., 2003, 84, 3583

The ACM method: Collective DOF; kinetic energy based; improves sampling; non-equilibrium ensemble thus difficult to go quantitative Application by another group: Biochemistry, 2006, 45 (51) :

Chain of states method in path optimization The nudged elastic band method Each replica moves to minimize the force perpendicular to the path. and to maintain even distribution of the replicas along the path  Force: Reaction coordinate driven

Problems for enzymatic reactions Enzyme systems contain many floppy degrees of freedom. Impractically small radius of convergence. Advantages: No pre-assumed reaction coordinate. Suits for parallel computations

Soft spectator degree of freedom Y spoils the NEB calculation Xie et al J.Chem. Phys., 2004, 120,8039.

d f(d) Heuristic solution: Exclude spectator degrees of freedom Use a set of inter-atomic distances (chemical subspace) Multiple step reactions Xie et al J.Chem. Phys., 2004, 120,8039.

Active site groups of A-type beta-lactamase

The acylation step of type A beta-lactamase

Xie et al J.Chem. Phys., 2004, 120,8039.

Energy decomposition TS stabilization Xie et al J.Chem. Phys., 2004, 120,8039.

An application Metal-preferences of metallo- proteases E-coli peptide deformylase: prefers Fe++ over Zn++ Thermolysin: prefers Zn++ Dong et al, J.Phys.Chem. B, 2008 ( 112 ),

comparative modeling of Zn-TLN and Zn-PDF using NEB Dong et al, J.Phys.Chem. B, 2008 ( 112 ),

ab initio QM/MM Potential energy surfaces reproduce metal preferences

Dong et al, J.Phys.Chem. B, 2008 ( 112 ),

Summary Some general discussions on “coordinates”- based or DOF specific approaches to accelerate the modeling of slow processes Two particular types of approaches –Amplified collective motions –NEB adapted for the simulations of enzyme reactions An example showing comparative modeling provides biochemical insights

Acknowledgements Zhiyong Zhang, Jianbin He (ACM) Li Xie (adapted NEB) Minghui Dong (PDF and TLN) All former and current group members Adapted NEB: Weitao Yang and group Funding: CAS, NSCFC

谢 谢! Thanks