1 Xin Zhou Asia Pacific Center for Theoretical Physics, Dep. of Phys., POSTECH, Pohang, Korea Structuring and Sampling in Complex Conformational.

Slides:



Advertisements
Similar presentations
Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Advertisements

Chemical Kinetics : rate of a chemical reaction Before a chemical reaction can take place the molecules involved must be raised to a state of higher potential.
Mutidimensional Data Analysis Growth of big databases requires important data processing.  Need for having methods allowing to extract this information.
Lecture 13: Conformational Sampling: MC and MD Dr. Ronald M. Levy Contributions from Mike Andrec and Daniel Weinstock Statistical Thermodynamics.
Single-molecule fluorescence spectroscopy maps the folding landscape of a large protein Menahem Pirchi, Guy Ziv, Inbal Riven, sharona sedghani Cohen, nir.
Lecture 14: Advanced Conformational Sampling
The Calculation of Enthalpy and Entropy Differences??? (Housekeeping Details for the Calculation of Free Energy Differences) first edition: p
1 2 Extreme Pathway Lengths and Reaction Participation in Genome Scale Metabolic Networks Jason A. Papin, Nathan D. Price and Bernhard Ø. Palsson.
Dimensional reduction, PCA
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Statistical Mechanics
Mechanics.
Potential Energy Surfaces
Bayesian belief networks 2. PCA and ICA
What is Cluster Analysis?
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Leipzig, 17 May Markov Models of Protein Folding - Application to Molecular Dynamics Simulations Christian Hedegaard Jensen.
22/5/2006 EMBIO Meeting 1 EMBIO Meeting Vienna, 2006 Heidelberg Group IWR, Computational Molecular Biophysics, University of Heidelberg Kei Moritsugu MD.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
ChE 551 Lecture 19 Transition State Theory Revisited 1.
Modeling molecular dynamics from simulations Nina Singhal Hinrichs Departments of Computer Science and Statistics University of Chicago January 28, 2009.
1 CE 530 Molecular Simulation Lecture 7 David A. Kofke Department of Chemical Engineering SUNY Buffalo
Advanced methods of molecular dynamics Monte Carlo methods
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 11 Some materials adapted from Prof. Keith E. Gubbins:
Monte-Carlo simulations of the structure of complex liquids with various interaction potentials Alja ž Godec Advisers: prof. dr. Janko Jamnik and doc.
Molecular Information Content
Spatial Statistics Applied to point data.
Conformational Sampling
Javier Junquera Molecular dynamics in the microcanonical (NVE) ensemble: the Verlet algorithm.
Molecular Dynamics Simulation Solid-Liquid Phase Diagram of Argon ZCE 111 Computational Physics Semester Project by Gan Sik Hong (105513) Hwang Hsien Shiung.
Lecture 19: Free Energies in Modern Computational Statistical Thermodynamics: WHAM and Related Methods Dr. Ronald M. Levy Statistical.
Minimization v.s. Dyanmics A dynamics calculation alters the atomic positions in a step-wise fashion, analogous to energy minimization. However, the steps.
How and why things crackle We expect that there ought to be a simple, underlying reason that earthquakes occur on all different sizes. The very small earthquake.
Molecular Dynamics Simulation
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 14 Some materials adapted from Prof. Keith E. Gubbins:
8. Selected Applications. Applications of Monte Carlo Method Structural and thermodynamic properties of matter [gas, liquid, solid, polymers, (bio)-macro-
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling II Prof. Corey O’Hern Department of Mechanical Engineering Department.
Study of Pentacene clustering MAE 715 Project Report By: Krishna Iyengar.
Molecular Mechanics Studies involving covalent interactions (enzyme reaction): quantum mechanics; extremely slow Studies involving noncovalent interactions.
7. Lecture SS 2005Optimization, Energy Landscapes, Protein Folding1 V7: Diffusional association of proteins and Brownian dynamics simulations Brownian.
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Exploring the connection between sampling problems in Bayesian inference and statistical mechanics Andrew Pohorille NASA-Ames Research Center.
Rotational spectra of molecules in small Helium clusters: Probing superfluidity in finite systems F. Paesani and K.B. Whaley Department of Chemistry and.
Insight into peptide folding role of solvent and hydrophobicity dynamics of conformational transitions.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
ChE 452 Lecture 25 Non-linear Collisions 1. Background: Collision Theory Key equation Method Use molecular dynamics to simulate the collisions Integrate.
3.The Canonical Ensemble 1.Equilibrium between a System & a Heat Reservoir 2.A System in the Canonical Ensemble 3.Physical Significance of Various Statistical.
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling I Prof. Corey O’Hern Department of Mechanical Engineering Department.
An Introduction to Monte Carlo Methods in Statistical Physics Kristen A. Fichthorn The Pennsylvania State University University Park, PA
Review of Matrix Operations Vector: a sequence of elements (the order is important) e.g., x = (2, 1) denotes a vector length = sqrt(2*2+1*1) orientation.
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
ECE-7000: Nonlinear Dynamical Systems 2. Linear tools and general considerations 2.1 Stationarity and sampling - In principle, the more a scientific measurement.
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Chapter 7 The electronic theory of metal Objectives At the end of this Chapter, you should: 1. Understand the physical meaning of Fermi statistical distribution.
Review of Matrix Operations
Modeling molecular dynamics from simulations
Lecture 17: Kinetics and Markov State Models
Statistical Mechanics and Multi-Scale Simulation Methods ChBE
Maintaining Adiabaticity in Car-Parrinello Molecular Dynamics
Energy Profiles of Reactions
Rare Event Simulations
Driven Adiabatic Dynamics Approach to the Generation of Multidimensional Free-Energy Surfaces. Mark E. Tuckerman, Dept. of Chemistry, New York University,
Lecture 17: Kinetics and Markov State Models
Large Time Scale Molecular Paths Using Least Action.
Implications of Microgravity on Calcium Dynamics in the Cardiac Troponin Complex Morgan Beckett University of Arizona Department of Chemistry & Biochemistry.
Understanding protein folding via free-energy surfaces from theory and experiment  Aaron R Dinner, Andrej Šali, Lorna J Smith, Christopher M Dobson, Martin.
Experimental Overview
Universality and diversity of the protein folding scenarios:a comprehensive analysis with the aid of a lattice model  Leonid A Mirny, Victor Abkevich,
Atomic Detail Peptide-Membrane Interactions: Molecular Dynamics Simulation of Gramicidin S in a DMPC Bilayer  Dan Mihailescu, Jeremy C. Smith  Biophysical.
Presentation transcript:

1 Xin Zhou Asia Pacific Center for Theoretical Physics, Dep. of Phys., POSTECH, Pohang, Korea Structuring and Sampling in Complex Conformational Space Weighted Ensemble Dynamics Simulation Structuring and Sampling in Complex Conformational Space Weighted Ensemble Dynamics Simulation Oct 12, 2009 Beijing

Independent Junior Research Group Multiscale Modeling & Simulations in soft materials One available position for postdoctor/exchange Ph.D student Shun Xu (Ph.D candidate, ) Linchen Gong (Ph.D candidate, ) Shijing Lu (Ph.D candidate, ) Members: X.Z. (Leader, ) Pakpoom Reunchan (Postdoctor, )

Understanding results of simulations Improve efficiencies of simulations traditional: project to low-dimensional (reaction coordinates) space new : kinetic transition network coarse-graining, enhanced sampling, accelerated slow-dynamics Extend spatial and temporal scales but keep necessary details Multiscale simulations: 1.More sufficient simulation provides more complete understanding 2.The understanding of systems is helpful to design more efficient simulation algorithm

Vibration of bonds: second Protein folding > second There are coupling among different scales ! multiple scales

Energetic barrier Entropic barrier Due to high free energy (energy and/or entropy) barriers, standard MC/MD simulations need very long time to reach equilibrium Current advanced simulation techniques are not very helpful in overcoming entropic barriers Barriers

A. F. Voter (1998) V. S. Pande (2000) Ensemble Dynamics Independently generate multiple short trajectories Statistically analyze slow transition dynamics n trajectories: transition rate

Weighted Ensemble Dynamics arbitrarily select initial conformations Independently generate multiple short trajectories weight the trajectories Statistically analyze slow transition dynamics analyze state structure and equilibrium properties Linchen Gong & X.Z. (2009)

Weighted Ensemble Dynamics A single t-length MD trajectory is not sufficient to reach global equilibrium Multiple t-length trajectories can be used to reproduce global equilibrium properties by reweighting the trajectories Each trajectory has an unique weight (w i ) in contributing to equilibrium properties The weight of trajectory is only dependent on its initial conformation

Weighted Ensemble Dynamics {W i } satisfies a self-consistent equation for any selected initial conformations The initial distribution might be unknown The fluctuation of weights might be too huge to be practice in reproducing equilibrium properties Usually impractical

Expansion of Probability Density

Theory of WED Self-consistent equation: the (short) initial segments of trajectories replace the initial configurations

Theory of WED A symmetric linear homogeneous equation:

The ground state of H (eigenvector with zero eigenvalue) gives weights of trajectories If the ground state of H is non-degenerate, a unique w is obtained, the equilibrium distribution is reproduced The ground state of H (eigenvector with zero eigenvalue) gives weights of trajectories If the ground state of H is non-degenerate, a unique w is obtained, the equilibrium distribution is reproduced

parallel simulation from any initial conformations Equilibrium Criterion In principlenot practice In practice for any A(x) for complete independent basis functions WED: Judge if simulations reach equilibrium Reweighting trajectories to reach equilibrium distribution

If the ground state is degenerate, the trajectories are limited in different conformational regions, which are separated each other within the scale of total simulation time: meta-stable states The number of degenerated ground states equals to the number of meta-stable states in the total simulation time scale Meta-stable states Simulation trajectories visit in a few completely separated conformation regions, the relative weights of the regions are unknown

States and eigenvalues of H Eigenvalue = 0 : separated states in the time scale Eigenvalue = 1 : trajectories in a same state 0<eigenvalue<1 : partially separated states in the time scale a (small) fraction of trajectories happen transitions between states

Weights and eigenvector Trajectories are grouped into states Transition trajectories slightly split the degenerate ground states The weights of trajectories in the same state are almost constant

Projection in ground states S1 (1.75) : 77 S2 (0.75) : 92 S3 (-0.25) : 37 S4 (-1.25) : 66 S4-S3-S2 : 9 S4-S3 : 119 Non-transition trajectories transition trajectories 1. Non-transition trajectories inside a state are mapped to the same point 2. Transition trajectories between two states are mapped to the line connected by the states 3. Transition trajectories among three states are mapped to the plane of the states 1. Non-transition trajectories inside a state are mapped to the same point 2. Transition trajectories between two states are mapped to the line connected by the states 3. Transition trajectories among three states are mapped to the plane of the states

Occupation fraction vs projection Multi-time transition trajectories single-time transition trajectories The occupation fraction of a trajectory in states is linearly related to its projection

Transition time vs projection single-time transition trajectories Without requiring knowledge of states and transitions Transition state ensemble

Free energy reconstruction Two different initial distributions are re-weighted to the accurate free energy profile Weights of trajectories started from the same state are almost same

Transition network in 2D multi-well potential Topology of transition network is kept

Mexico-hat: entropy effects Eigenvalues gradually increase from 0 to 1 Topology of transition network is kept

Alanine dipeptide in waters An alanine dipeptide solvated in 522 TIP3P water molecules: 1588 atoms 500 initial conformations, generated from a 10 ns simulation at T=600K 500 WED trajectories (600 ps each)

Eigenvalue of H

450K and 300K Initial psi Started from C eq 7 Started from Alpha 7 Started from C ax 7

Modified potential at 300K projection Free energy reconstruction

Occupation fraction vs projection (300K mod) Occupation fraction vs projection (300K mod) Single-time transition trajectories

Transition time vs projection (300K mod) Transition time vs projection (300K mod) Single-time transition trajectories Real transition time by checking along the single-time transition trajectories

Dipeptide at 150K Eigenvalues of H continuously increase from zero to unity at 150K: Inter-trajectory difference due to entropy effects makes multiple eigenvalues be smaller than (but close to) unity

150K Count of traj. 300K modified More dispersive projection

Diffusive dynamics at 150K Histogram at transition regions is significant Diffusively cross the transition regions Trajectories do not sufficiently cover whole the state at the low temperature Statistical difference between trajectories is large

Solvent effects Include solvent-related functions in expansion

Completeness of Basis functions S quickly reaches saturation while the number of basis functions is far smaller than the size of sample It does not require the expansion is accurate at everywhere, but distinguish conformational regions

More complex cases While there are n small eigenvalues, trajectories should be projected to n-1 dimensional space Cluster analysis is required Trajectories might need to be split into multiple shorter segments to distinguish transition and non- transition trajectories Meta-stable states can be clustered in different time scales

Generalization The overlap matrix of trajectories i=1,…,n trajectories generated from the same potential but different initial configurations, the distributions are denoted as P i (x) Each trajectory (set of conformations) is mapped to a point

mapping 01 If two samples come from the same distribution, their mapped points locate at the same position The error satisfies a Gaussian distribution (the center limit theorem) If two samples come from the same distribution, their mapped points locate at the same position The error satisfies a Gaussian distribution (the center limit theorem)

Trajectory mapping 01 t-length MD trajectory: 1.Inside a state and reach local equilibrium 2.Transition among a few states and reach local equilibrium in each of the states, but not reach the inter-state equilibrium 3.Inside a state (conformational region) but not reach local equilibrium Equilibrium distribution of meta-stable states

Trajectory clustering t-length MD trajectory: 1.Concentrated points (clusters): non-transition 2. Points on lines connected with clusters: transition 3. Diffusion dynamics in entropy-dominated regions

Dimension of manifold n d is the dimension of trajectories basis functions n s is number of meta- stable states It is an equality while the set of applied basis functions is sufficient

Hierarchic kinetic network Split trajectory into short segments: detect kinetic network in shorter time scales Hierarchic meta-stable state structure

Fraction in states: correlation: 1.Form hierarchic Kinetic network which involving complete equilibrium and transition kinetic/dynamical properties 2. Calculate weights of trajectories and and correlation in diffusion regions The trajectory ensemble is mapped as a trajectory Understanding of system

74 atoms, charged terminals; Implicit solvent simulation: Generalized Born; 1000 trajectories; 20 ns and conformations per trajectory. Example

12-alanine peptide ns trajectories Application 172 basis functions from torsion angles

Dimension of manifold Principle Component Analysis Total dimension: 172

clustering minimal spanning tree clustering algorithm 2

sub-trajectory clustering (1 ns)

G1, 1ns clustering

G1, 2D free energy profile 2D might be insufficient

G11, sub-trajectory clustering

Hierarchic kinetic transition network put everything together to form a network 1.meta-stable states in different time scales 2.Transition connections among states 3.Transition rates, transition states and transition paths 4.Typical (or average) configurations of states 10 ns 1 ns 0.1 ns

summary 1.A complex conformational space can be understood by constructing hierarchical meta-stable state structure in time scales 2.WED generates multiple trajectories from different initial configurations, the trajectories are mapped to the average values of some independent physical variables 3.Clustering algorithm groups these trajectories to form the meta- stable state structure 4.Equilibrium properties can be reproduced based on overlapping of trajectories, thus the sampling is further enhanced 5.Dynamics and kinetics within the total simulation time can be obtained from the WED simulations 6.Dynamics in longer time scales might be much more easily obtained based on the state structure

Thanks for your attention!