Princeton University IMA, Minneapolis, January 2008 I.G. Kevrekidis -C. W. Gear, G. Hummer, R. Coifman and several other good people- Department of Chemical.

Slides:



Advertisements
Similar presentations
Nonlinear Dimension Reduction Presenter: Xingwei Yang The powerpoint is organized from: 1.Ronald R. Coifman et al. (Yale University) 2. Jieping Ye, (Arizona.
Advertisements

Princeton University Projective Integration - a sequence of outer integration steps based on inner simulator + estimation (stochastic inference) Accuracy.
Princeton University Using the computer to select the right variables Rationale: Lake Carnegie, Princeton, NJ Straight Line Distance Actual transition.
Topology-Invariant Similarity and Diffusion Geometry
1 Numerical Geometry of Non-Rigid Shapes Diffusion Geometry Diffusion geometry © Alexander & Michael Bronstein, © Michael Bronstein, 2010 tosca.cs.technion.ac.il/book.
Ronald R. Coifman , Stéphane Lafon, 2006
1 CE 530 Molecular Simulation Lecture 8 Markov Processes David A. Kofke Department of Chemical Engineering SUNY Buffalo
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Lattice regularized diffusion Monte Carlo
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Transport Equations and Flux Laws Basic concepts and ideas 1.A concept model of Diffusion 2.The transient Diffusion Equation 3.Examples of Diffusion Fluxes.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Three Algorithms for Nonlinear Dimensionality Reduction Haixuan Yang Group Meeting Jan. 011, 2005.
Mathematical Modelling Classes of models Ordinary d.e. (Box models) Partial d.e. (Diffusion & Advection) Stochastic (different time & length scales) Discrete.
Anders Eriksson Complex Systems Group Dept. Energy and Environmental Research Chalmers EMBIO Cambridge July 2005 Complex Systems at Chalmers Information.
A Global Geometric Framework for Nonlinear Dimensionality Reduction Joshua B. Tenenbaum, Vin de Silva, John C. Langford Presented by Napat Triroj.
Chapter 4 Numerical Solutions to the Diffusion Equation.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Monte Carlo Methods in Partial Differential Equations.
Physics of fusion power
Diffusion Maps and Spectral Clustering
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
6. Markov Chain. State Space The state space is the set of values a random variable X can take. E.g.: integer 1 to 6 in a dice experiment, or the locations.
Modeling molecular dynamics from simulations Nina Singhal Hinrichs Departments of Computer Science and Statistics University of Chicago January 28, 2009.
Introduction to Non-Archimedean Physics of Proteins.
Manifold learning: Locally Linear Embedding Jieping Ye Department of Computer Science and Engineering Arizona State University
Max Shokhirev BIOC585 December 2007
A Study of The Applications of Matrices and R^(n) Projections By Corey Messonnier.
Motion and Stress Analysis by Vector Mechanics Edward C. Ting Professor Emeritus of Applied Mechanics Purdue University, West Lafayette, IN National Central.
Equation-Free (EF) Uncertainty Quantification (UQ): Techniques and Applications Ioannis Kevrekidis and Yu Zou Princeton University September 2005.
Statistical Physics of the Transition State Ensemble in Protein Folding Alfonso Ramon Lam Ng, Jose M. Borreguero, Feng Ding, Sergey V. Buldyrev, Eugene.
Lecture 19: Free Energies in Modern Computational Statistical Thermodynamics: WHAM and Related Methods Dr. Ronald M. Levy Statistical.
Minimization v.s. Dyanmics A dynamics calculation alters the atomic positions in a step-wise fashion, analogous to energy minimization. However, the steps.
Forward-Scan Sonar Tomographic Reconstruction PHD Filter Multiple Target Tracking Bayesian Multiple Target Tracking in Forward Scan Sonar.
Materials Process Design and Control Laboratory MULTISCALE MODELING OF ALLOY SOLIDIFICATION LIJIAN TAN NICHOLAS ZABARAS Date: 24 July 2007 Sibley School.
Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)
Surface and Bulk Fluctuations of the Lennard-Jones Clusrers D. I. Zhukhovitskii.
Protein Structure 1 Primary and Secondary Structure.
Princeton University Department of Chemical Engineering and PACM Equation-Free Uncertainty Quantification on Stochastic Chemical Reactions SIAM Annual.
Stochastic Thermodynamics in Mesoscopic Chemical Oscillation Systems
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling II Prof. Corey O’Hern Department of Mechanical Engineering Department.
A Technical Introduction to the MD-OPEP Simulation Tools
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Serge Andrianov Theory of Symplectic Formalism for Spin-Orbit Tracking Institute for Nuclear Physics Forschungszentrum Juelich Saint-Petersburg State University,
NCN nanoHUB.org Wagner The basics of quantum Monte Carlo Lucas K. Wagner Computational Nanosciences Group University of California, Berkeley In collaboration.
ECE-7000: Nonlinear Dynamical Systems Overfitting and model costs Overfitting  The more free parameters a model has, the better it can be adapted.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Incremental Integration of Computational Physics into Traditional Undergraduate Courses Kelly R. Roos, Department of Physics, Bradley University Peoria,
Protein folding dynamics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Collective diffusion of the interacting surface gas Magdalena Załuska-Kotur Institute of Physics, Polish Academy of Sciences.
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling I Prof. Corey O’Hern Department of Mechanical Engineering Department.
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Math 285 Project Diffusion Maps Xiaoyan Chong Department of Mathematics and Statistics San Jose State University.
NO EQUATIONS, NO VARIABLES NO PARAMETERS:
Global Illumination (3) Path Tracing. Overview Light Transport Notation Path Tracing Photon Mapping.
Lecture 14: Advanced Conformational Sampling Dr. Ronald M. Levy Statistical Thermodynamics.
Intrinsic Data Geometry from a Training Set
Modeling molecular dynamics from simulations
Monte Carlo methods 10/20/11.
Lecture 17: Kinetics and Markov State Models
Volume 106, Issue 6, Pages (March 2014)
Computational Analysis
Lecture 17: Kinetics and Markov State Models
Lecture 4 - Transverse Optics II
Understanding protein folding via free-energy surfaces from theory and experiment  Aaron R Dinner, Andrej Šali, Lorna J Smith, Christopher M Dobson, Martin.
Lecture 4 - Transverse Optics II
Probing the Energy Landscape of the Membrane Protein Bacteriorhodopsin
Michael Schlierf, Felix Berkemeier, Matthias Rief  Biophysical Journal 
Multiple Folding Pathways of the SH3 Domain
Computer simulation studies of forced rupture kinetics of
Presentation transcript:

Princeton University IMA, Minneapolis, January 2008 I.G. Kevrekidis -C. W. Gear, G. Hummer, R. Coifman and several other good people- Department of Chemical Engineering, PACM & Mathematics Princeton University, Princeton, NJ Computational Experiments in coarse graining atomistic simulations (Equation-free -& Variable Free- Framework For Complex/Multiscale Systems)

Princeton University SIAM– July, 2004 Clustering and stirring in a plankton model Young, Roberts and Stuhne, Nature 2001

Princeton University Dynamics of System with convection

Princeton University Simulation Method Random (equal) birth and death, probability: = . Brownian motion. Advective stirring.  are random phases) IC: particles randomly placed in 1*1 box Analytical Equation for G(r):

Princeton University Stirring by a random field (color = y)

Princeton University Dynamics of System with convection

Princeton University

Projective Integration: From t=2,3,4,5 to 10

Princeton University RESTRICTION - a many-one mapping from a high-dimensional description (such as a collection of particles in Monte Carlo simulations) to a low-dimensional description - such as a finite element approximation to a distribution of the particles. LIFTING - a one-many mapping from low- to high-dimensional descriptions. We do the step-by-step simulation in the high-dimensional description. We do the macroscopic tasks in the low-dimensional description.

Princeton University So, the main points: You have a “microscopic code” Somebody tells you what are good coarse variable(s) Somebody tells you what KIND of equation this variable satisfies (deterministic, stochastic…) but NOT what the equation looks like. Then you can use the IDEA that such an equation exists and closes to accelerate the simulation/ extraction of information. KEY POINT – make micro states consistent with macro states

Princeton University Application to Micelle Formation Hydrophobic tail (T) Hydrophilic head (H)

Princeton University Surfactant = chain of H and T beads (H 4 T 4 ) No explicit solvent Hydrophobic-hydrophilic interactions modeled as direct interaction between H and T beads Pairwise interactions with nearest sites: Lattice Model (Larson et al., 1985)

Princeton University Snapshot of Micellar System T = 7.0, µ =

Princeton University Kinetic Approach to Rare Events (Hummer and Kevrekidis, JCP 118, (2003)) Evolution of coarse variables is slow – micelle birth/death are rare events Reconstruction of free energy surface: – long equilibrium simulation – series of short nonequilibrium simulations

Princeton University Reconstruction of Free Energy Surface Obtain G(  ) and  (  ) from short-scale nonequilibrium simulations

Princeton University Reconstructed free energy curve

Princeton University Results: Micelle Destruction Rate Kramers’ formula Result of nonequilibrium simulation: k = 5.58 x Equilibrium result: k = 7.70 x CPU time required: less than 7% of equilibrium simulation Extension to multi-dimensional systems (Hummer and Kevrekidis, 2003) – Chapman-Kolmogorov equation

Princeton University Reverse Projective Integration – a sequence of outer integration steps backward; based on forward steps + estimation We are studying the accuracy and stability of these methods Reverse Integration: a little forward, and then a lot backward !

Princeton University Reverse coarse integration from both sides

Princeton University Reconstructed free energy curve

Princeton University Details of Multidimensional Dynamics Small clusters: 2d dynamicsLarger clusters: 1d dynamics

Princeton University Multidimensional Dynamics (2 nd variable = E)

Princeton University Alanine Dipeptide In 700 tip3p waters w/ Gerhard Hummer, NIDDK / J.Chem.Phys. 03 The waters The dipeptide and the Ramachandran plot

Princeton University 1.Start with constrained MD 2.Let 50 configurations free 3.Estimate d/dt of average 4.Perform projective step

Princeton University Alanine Dipeptide Energy Landscape G.Hummer, I.G.Kevrekidis. Coarse molecular dynamics of a peptide fragment: Free energy, kinetics, and long-time dynamics computations J.Chem. Phys. 118 (2003). 

Princeton University

Protocols for Coarse MD (CMD) using Reverse Ring Integration Step Ring BACKWARD in Energy MD Simulations run FORWARD in Time Initialize ring nodes

Princeton University Fokker-Planck Equation (2D) for distribution P(x 1,x 2 ) 2D FPE: Probability Currents Drift Coefficient Diffusion Coefficients

Princeton University Drift and Diffusion Coefficients MD Simulations run FORWARD in Time Ring node ICs   multiple replicas per node compute drift (v) and diffusion (D) coefficients use (v, D) estimates to check for existence of potential (  ) and compute potential values at each node Dihedral angles P

Princeton University Ring at  +  Ring at  Ring Stepping in Generalized Potential Goal: Compute potential  associated with ring nodes (ring “height” above x 1 - x 2 plane)

Princeton University Right-handed  -helical minimum

Princeton University Potential Conditions I Case I : Diffusion matrix is proportional to unit matrix: ( D = scalar) Probability Current: Probability Current vanishes  Conditions for existence of generalized potential  : Path Drift Coefficient Potential Conditions

Princeton University CMD Exploration of Alanine Dipeptide using drift coefficients Reverse ring integration stagnates at saddle points on coarse free energy landscape Ring nodes Short bursts of MD simulation initialized at each node in the ring Data analysis of forward in time MD provides gradient information small step FORWARD in time large step BACKWARD in energy   ONLY drifts estimated here ring step size is scaled by unknown diffusivity D stepsize ~ nodal “heights” unknown

Princeton University Src homology 3 (SH3) domain Small Modular Domain amino acids long Characteristic fold consisting of five or six β-strands arranged as two tightly packed anti-parallel β sheets blue (N-terminus) to red (C-terminus) Distal Loop n-Src Diverging Turn Protein modeled using off-lattice C  representation associated with simplified minimally frustrated Hamiltonian P. Das, S. Matysiak, and C. Clementi, Proc. Natl. Acad. Sci. USA 102, (2005). N-terminus C-terminus Fraction native contacts formed Fraction non-native contacts formed FOLDED UNFOLDED TRANSITION STATE Protein folding intrinsically low-dimensional “Collective” (coarse, slow) coordinates fully describe long time system dynamics

Princeton University Reverse Ring Integration and MD Coarse MD (CMD) Transition state identification using CMD

Princeton University So, again, the main points Somebody needs to tell you what the coarse variables are And what TYPE of equation they satisfy And then you can use this information to bias the simulations “intelligently” accelerating the extraction of information (also need hypothesis testing).

Princeton University and now for something completely different: Little stars ! (well…. think fishes)

Princeton University

Fish Schooling Models Initial State Compute Desired Direction Update Direction for Informed Individuals ONLY Zone of Deflection R ij <  Zone of Attraction R ij <  Normalize  INFORMED UNINFORMED Update Positions Position, Direction, Speed Couzin, Krause, Franks & Levin (2005) Nature (433) 513

Princeton University STUCK ~ typically around rxn coordinate value of about 0.5 INFORMED DIRN STICK STATES INFORMED individual close to front of group away from centroid

Princeton University SLIP ~ wider range of rxn coordinate values for slip 0  0.35 INFORMED DIRN SLIP STATES INFORMED individual close to group centroid

Princeton University Effective Fokker-Planck Equation Drift Coefficient Diffusion Coefficient FPE: Potential:

Princeton University Coarse Free Energy Calculation Estimate Drift and Diffusion coefficients numerically from simulation “bursts” X0X0 X0X0 X0X0 Kopelevich, Panagiotopoulos & Kevrekidis J Chem Phys 122 (2005) Hummer & Kevrekidis J Chem Phys 118 (2003) Improved estimates using Maximum Likelihood Estimation (MLE) Y. Aït-Sahalia. Maximum Likelihood Estimation of Discretely Sampled Diffusions: A Close-Form Approximation Approach. Econometrica 70 (2002).

Princeton University STUCK SLIP Energy Landscape – Fish Swarming Problem CENTROID Informed DIRN

Princeton University Using the computer to select variable Rationale: Lake Carnegie, Princeton, NJ Straight Line Distance between locations NOT representative of actual transition difficulty\distance

Princeton University Using the computer to select variable Rationale: Lake Carnegie, Princeton, NJ Straight Line Distance Actual transition difficulty represented by curved path Curved Transition Distance

Princeton University Using the computer to select good variables Rationale: Lake Carnegie, Princeton, NJ Straight Line Distance IS representative of actual transition difficulty\distance in small LOCAL patches Patch size related to problem “geography”

Princeton University Using the computer to select variable Rationale: Euclidean Distance Selected Datapoint X Y Z 3D Dataset with 2D manifold Euclidean distance in input space may be weak indicator of INTRINSIC similarity of datapoints Geodesic distance is good for this dataset

Princeton University W 12 W 23 vertices edges weights parameter t Dataset as Weighted Graph

Princeton University Multiple random walks through simulation data initialized at Unequal separation (Euclidean distance) between IC ( ) and limits of random walk (, )

Princeton University Parameter Local neighborhood size Compute N  N “neighborhood” matrix K Compute diagonal normalization matrix D Compute Markovian matrix M N datapoints Require: Eigenvalues λ and Eigenvectors Φ of M A few Eigenvalues\Eigenvectors provide meaningful information on dataset geometry Top 2 nd N th

Princeton University Dataset in x, y, zDataset Diffusion Map N datapoints eigencomputation Diffusion Maps R. Coifman, S. Lafon, A. Lee, M. Maggioni, B. Nadler, F. Warner, and S. Zucker, Geometric diffusions as a tool for harmonic analysis and structure definition of data: Diffusion maps. PNAS 102 (2005). B. Nadler, S. Lafon, R. Coifman, and I. G. Kevrekidis, Diffusion maps, spectral clustering and reaction coordinates of dynamical systems. Appl. Comput. Harmon. Anal. 21 (2006).

Princeton University Diffusion Map (  2,  3 ) 22 33 22 ABSOLUTE Coordinates SIGNED Coordinates Report absolute distance of all uninformed individuals to informed individual to DMAP routine Report (signed) distance of all uninformed individuals to informed individual to DMAP routine STICK SLIP STICK SLIP Reaction Coordinate

Princeton University MAN MACHINE MAN MACHINE ABSOLUTE Coordinates SIGNED Coordinates

Princeton University So, again, the same simple theme If there is some reason to believe that there exist slow, effective dynamics in some smart collective variables Then this can be used to accelerate some features of the computation Tools for data-based detection of coarse variables… MAIN DIFFICULTY: finding physical initial conditions consistent with desired coarse initial conditions