Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling I Prof. Corey O’Hern Department of Mechanical Engineering Department.

Slides:



Advertisements
Similar presentations
PRAGMA – 9 V.S.S.Sastry School of Physics University of Hyderabad 22 nd October, 2005.
Advertisements

Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Multiscale Dynamics of Bio-Systems: Molecules to Continuum February 2005.
Molecular Dynamics at Constant Temperature and Pressure Section 6.7 in M.M.
Mesoscopic nonequilibrium thermoydnamics Application to interfacial phenomena Dynamics of Complex Fluid-Fluid Interfaces Leiden, 2011 Miguel Rubi.
Computational methods in molecular biophysics (examples of solving real biological problems) EXAMPLE I: THE PROTEIN FOLDING PROBLEM Alexey Onufriev, Virginia.
Emergence of Quantum Mechanics from Classical Statistics.
The Screened Coulomb Potential-Implicit Solvent Model (SCP-ISM) is used to study the alanine dipeptide in aqueous solution and the discrimination of native.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
Lecture 3: Markov processes, master equation
DARPA Oasis PI Meeting Hilton Head, SC March 2002.
1 CE 530 Molecular Simulation Lecture 8 Markov Processes David A. Kofke Department of Chemical Engineering SUNY Buffalo
A microstate of a gas of N particles is specified by: 3N canonical coordinates q 1, q 2, …, q 3N 3N conjugate momenta p 1, p 2, …, p 3N Statistical Mechanics.
1 Markov Chains Tom Finke. 2 Overview Outline of presentation The Markov chain model –Description and solution of simplest chain –Study of steady state.
Locating structural energy minima of biological molecules in explicit solvent APS March Meeting Baltimore 2006 Eric Dykeman Otto Sankey Arizona State University.
Theoretical and computationalphysical chemistry group Theoretical characterization of  -helix and  -hairpin folding kinetics Theoretical characterization.
Protein folding kinetics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Chess Review May 10, 2004 Berkeley, CA A stability criterion for Stochastic Hybrid Systems A. Abate, L. Shi, S. Simic and S.S. Sastry University of California.
Finding the Tools--and the Questions--to Understand Dynamics in Many Dimensions R. Stephen Berry The University of Chicago TELLURIDE, APRIL 2007.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Molecular Dynamics and Normal Mode Analysis of WW domain Santanu Chatterjee 1, Christopher Sweet 1, Tao Peng 2, John Zintsmaster 2, Brian Wilson 2, Jesus.
Leipzig, 17 May Markov Models of Protein Folding - Application to Molecular Dynamics Simulations Christian Hedegaard Jensen.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
6. Markov Chain. State Space The state space is the set of values a random variable X can take. E.g.: integer 1 to 6 in a dice experiment, or the locations.
Coordinates and Pathways in MM and QM/MM modeling Haiyan Liu School of Life Sciences, University of Science and Technology of China.
Modeling molecular dynamics from simulations Nina Singhal Hinrichs Departments of Computer Science and Statistics University of Chicago January 28, 2009.
Introduction to Non-Archimedean Physics of Proteins.
Department of Mechanical Engineering
Graph mining in bioinformatics Laur Tooming. Graphs in biology Graphs are often used in bioinformatics for describing processes in the cell Vertices are.
Monte Carlo Methods: Basics
Deca-Alanine Stretching
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Lecture 19: Free Energies in Modern Computational Statistical Thermodynamics: WHAM and Related Methods Dr. Ronald M. Levy Statistical.
1 CE 530 Molecular Simulation Lecture 6 David A. Kofke Department of Chemical Engineering SUNY Buffalo
Quantal and classical geometric phases in molecules: Born-Oppenheimer Schrodinger equation for the electronic wavefunction Florence J. Lin University of.
Department of Mechanical Engineering
Chem. 860 Molecular Simulations with Biophysical Applications Qiang Cui Department of Chemistry and Theoretical Chemistry Institute University of Wisconsin,
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 14 Some materials adapted from Prof. Keith E. Gubbins:
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling II Prof. Corey O’Hern Department of Mechanical Engineering Department.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Markov Cluster (MCL) algorithm Stijn van Dongen.
STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis.
Exploring the connection between sampling problems in Bayesian inference and statistical mechanics Andrew Pohorille NASA-Ames Research Center.
Path integral Monte Carlo
Insight into peptide folding role of solvent and hydrophobicity dynamics of conformational transitions.
Schematic representation of ‘Microstates’. Integretion over a Markovian Web. G. C. Boulougouris, D. Frenkel, J. Chem. Theory Comput. 2005, 1,
Lecture 2 Molecular dynamics simulates a system by numerically following the path of all particles in phase space as a function of time the time T must.
Protein folding dynamics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Data Mining Using Eigenpattern Analysis in Simulations and Observed Data Woodblock Print, from “Thirty-Six Views of Mt. Fuji”, by K. Hokusai, ca
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Molecular simulations of polypeptides under confinement CHEN633: Final Project Rafael Callejas-Tovar Artie McFerrin Department of Chemical Engineering.
EPSRC Portfolio Partnership in Complex Fluids and Complex Flows Computational Rheology The Numerical Prediction of Complex Flows of Complex Fluids Gigantic.
Slow Relaxations in Complex Fluids: Origin and Nature of Dynamical Heterogeneities B. Chakraborty, Brandeis University, DMR Materials as diverse.
Network Science K. Borner A.Vespignani S. Wasserman.
Statistical Mechanics and Multi-Scale Simulation Methods ChBE
Basic Monte Carlo (chapter 3) Algorithm Detailed Balance Other points non-Boltzmann sampling.
Monte Carlo Simulation of Canonical Distribution The idea is to generate states i,j,… by a stochastic process such that the probability  (i) of state.
1 Xin Zhou Asia Pacific Center for Theoretical Physics, Dep. of Phys., POSTECH, Pohang, Korea Structuring and Sampling in Complex Conformational.
CF14 EGI-XSEDE Workshop Session Tuesday, May 20 Helsinki, Findland Usecase 2 TTU-COMPCHEM Collaboration on Direct Classical and Semiclassical Dynamics.
Modeling molecular dynamics from simulations
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Nov 22, 2013: Topological methods for exploring low-density.
Lecture 17: Kinetics and Markov State Models
ReMoDy Reactive Molecular Dynamics for Surface Chemistry Simulations
Maria Okuniewski Nuclear Engineering Dept.
Department of physics, Fudan University, Shanghai, China
Lecture 17: Kinetics and Markov State Models
Universality and diversity of the protein folding scenarios:a comprehensive analysis with the aid of a lattice model  Leonid A Mirny, Victor Abkevich,
Deep learning enhanced Markov State Models (MSMs)
Presentation transcript:

Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling I Prof. Corey O’Hern Department of Mechanical Engineering Department of Physics Yale University

Protein folding kinetics configuration space e N local energy minima (configurations) connected via transitions Random walk on network from initial to native state States and transition probabilities obtained from simulations basin of attraction transition initial state native state energy minimum

“Describing protein folding kinetics by Molecular Dynamics Simulations. 1. Theory” W. C. Swope, J. W. Pitera, and F. Suits, J. Phys. Chem. B 108 (2004) Markov Modeling of Proteins “Describing protein folding kinetics by Molecular Dynamics Simulations. 2. Example applications to Alanine Dipeptide and a  -hairpin peptide” W. C. Swope, J. W. Pitera, et al., J. Phys. Chem. B 108 (2004) 6582.

Additional Reading 1. “Molecular simulation of ab Initio protein folding for a Millisecond folder NTL9(1-39),” JACS 132 (2010) “Using massively parallel simulation and Markovian models to study protein folding: Examining the dynamics of the Villin headpiece,” J. Chem. Phys. 124 (2006) “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) “Using generalized ensemble simulations and Markov state models to identify conformational states,” Methods 49 (2009) “Stochastic dynamics of model proteins on a directed graph,” Phys. Rev. Lett. 79 (2009)

Markov Modeling Describes temporal evolution of state of the system No memory; transition probabilities only depend on current state; satisfied by MD trajectories Time domain (continuous or discrete); state space (continuous or discrete) Statistical description: What is probability that member of the ensemble of systems will be in a given state at time t? How does one choose set of states for Markov model of protein dynamics---continuous degrees of freedom yields infinite number of states? Number of native contacts…but not specific enough

initial state native state Lumping of States: From 11 to 3 Are transitions among aggregated states (A, B, C) Markovian? Yes, at sufficiently long time scales. How does one decide on lumping scheme? A B C

Mathematical Description T to, from i,j=1,N s Elements non-negative Columns sum to 1 Eigenvalues  i ≤ 1: T  =   (  =1) gives steady-state probability distribrution Detailed balance (no net flow) Eigenvectors form complete set N s - 1 eigenvalues determine relaxation rates

Toy Model

T ij = =1=1 Transition Matrix Lumped Transition Matrix =1=1 L

State Probabilities

Results from Toy Model 9 microstates L(T n ) (L(T)) n Larger deviations; practical

ii log  i Eigenvalue Spectra Small deviations