Structure Refinement BCHM 5984 September 7, 2009.

Slides:



Advertisements
Similar presentations
Optimization.
Advertisements

Optimization : The min and max of a function
A Digital Laboratory “In the real world, this could eventually mean that most chemical experiments are conducted inside the silicon of chips instead of.
Optimization of thermal processes
Optimization 吳育德.
Steepest Decent and Conjugate Gradients (CG). Solving of the linear equation system.
Jonathan Richard Shewchuk Reading Group Presention By David Cline
Geometry Optimization Pertemuan VI. Geometry Optimization Backgrounds Real molecules vibrate thermally about their equilibrium structures. Finding minimum.
Function Optimization Newton’s Method. Conjugate Gradients
Motion Analysis (contd.) Slides are from RPI Registration Class.
CSci 6971: Image Registration Lecture 4: First Examples January 23, 2004 Prof. Chuck Stewart, RPI Dr. Luis Ibanez, Kitware Prof. Chuck Stewart, RPI Dr.
Optimization Methods One-Dimensional Unconstrained Optimization
Molecular Mechanics force fields minimization. Force Fields good review: MacKerell (2004) JCompChem, 25:1584 FF typically contains terms for: –bonds and.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Advanced Topics in Optimization
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
Molecular Modeling: Geometry Optimization C372 Introduction to Cheminformatics II Kelsey Forsythe.

9 1 Performance Optimization. 9 2 Basic Optimization Algorithm p k - Search Direction  k - Learning Rate or.
UNCONSTRAINED MULTIVARIABLE
Geometry Optimisation Modelling OH + C 2 H 4 *CH 2 -CH 2 -OH CH 3 -CH 2 -O* 3D PES.
Molecular Modeling Part I. A Brief Introduction to Molecular Mechanics.
Molecular Modeling Ka-Lok Ng Asia University. Topics to be covered Useful concepts in molecular modelling, useful mathematics Quantum mechanics and chemistry.
84 b Unidimensional Search Methods Most algorithms for unconstrained and constrained optimisation use an efficient unidimensional optimisation technique.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Qualifier Exam in HPC February 10 th, Quasi-Newton methods Alexandru Cioaca.
Nonlinear programming Unconstrained optimization techniques.
Chem Math 252 Chapter 5 Regression. Linear & Nonlinear Regression Linear regression –Linear in the parameters –Does not have to be linear in the.
1 Optimization Multi-Dimensional Unconstrained Optimization Part II: Gradient Methods.
Computer Animation Rick Parent Computer Animation Algorithms and Techniques Optimization & Constraints Add mention of global techiques Add mention of calculus.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Multivariate Unconstrained Optimisation First we consider algorithms for functions for which derivatives are not available. Could try to extend direct.
Covalent interactions non-covalent interactions + = structural stability of (bio)polymers in the operative molecular environment 1 Energy, entropy and.
MODELING MATTER AT NANOSCALES 3. Empirical classical PES and typical procedures of optimization Classical potentials.
MODELING MATTER AT NANOSCALES 3. Empirical classical PES and typical procedures of optimization Geometries from energy derivatives.
559 Fish 559; Lecture 5 Non-linear Minimization. 559 Introduction Non-linear minimization (or optimization) is the numerical technique that is used by.
Quasi-Newton Methods of Optimization Lecture 2. General Algorithm n A Baseline Scenario Algorithm U (Model algorithm for n- dimensional unconstrained.
Simplistic Molecular Mechanics Force Field Van der WaalsCharge - Charge Bond Angle Improper Dihedral  
Molecular Mechanics a. Force fields b. Energy minimization / Geometry optimization c. Molecular mechanics examples.
Data Modeling Patrice Koehl Department of Biological Sciences National University of Singapore
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Lecture 13. Geometry Optimization References Computational chemistry: Introduction to the theory and applications of molecular and quantum mechanics, E.
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Chapter 2-OPTIMIZATION G.Anuradha. Contents Derivative-based Optimization –Descent Methods –The Method of Steepest Descent –Classical Newton’s Method.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Computational Biology BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Lecture 7: Molecular Mechanics: Empirical Force Field Model Nanjie Deng Structural Bioinformatics II.
The Nudged Elastic Band Method and its Implementation in VASP
Computational Structure Prediction
Non-linear Minimization
Protein Structure Prediction and Protein Homology modeling
Chapter 14.
Energies, forces, bonds - only shapes of atomic electron orbitals give us a hint of three-dimensional shapes of molecules more quantitative treatment requires.
CS5321 Numerical Optimization
Non-linear Least-Squares
Collaborative Filtering Matrix Factorization Approach
Computational Analysis
Chapter 10. Numerical Solutions of Nonlinear Systems of Equations
Introduction to Scientific Computing II
Introduction to Scientific Computing II
Introduction to Scientific Computing II
Large Time Scale Molecular Paths Using Least Action.
Optimization Methods TexPoint fonts used in EMF.
Conformational Search
Neural Network Training
Performance Optimization
Outline Preface Fundamentals of Optimization
Section 3: Second Order Methods
Computer Animation Algorithms and Techniques
Protein structure prediction
Presentation transcript:

Structure Refinement BCHM 5984 September 7, 2009

Methods of 3-D Structure Prediction De novo structure prediction Derived completely from sequence Comparative modeling Fold recognition / Threading (when no homologs exist) Homology modeling (when clear homologs exist)

Steps in Homology Modeling Identify homologues Determine sequence identity Align sequences Identify conserved / non-conserved regions Generate model for conserved regions Generate model for non-conserved regions Build sidechains Evaluate and refine

Understanding the Problem Leach, AR (2001) “Molecular Modelling Principles and Applications”. 2nd Ed. Prentice Hall Publishers.

Proteins are Complicated Many more atoms Must consider: Ideal bond lengths, bond angles, dihedrals Electrostatic interaction (hydrogen bonds, ionic interactions) and van der Waals interactions

How to Solve the Problem Given a function f which depends on one or more independent variables x1, x2, …, xi, find the values of those variables where f has a minimum value. At a minimum point, the first derivative of the function with respect to each of the variables is zero and the second derivatives are all positive: The function f is the potential energy The variables xi are the atomic Cartesian coordinates Change the position of the coordinates (xi) until we find the position with the smallest potential energy

Examples of the Functions E = Ecovalent + Enoncovalent can be further expanded to: Ecovalent = Ebond + Eangle + Edihedral Enoncovalent = Eelectrostatic + Evdw

Examples of the Functions Enoncovalent = Eelectrostatic + Evdw Coulombic Potential Lennard-Jones Potential

Examples of the Functions Ecovalent = Ebond + Eangle + Edihedral Bond Stretching Potential Harmonic Angle Potential Dihedral Potential

What You Need to Start Cartesian coordinates of your model (x, y, z) for every atom = 3N variables where N is the number of atoms Energy minimizer program Knows potential energy functions for minimization Knows ideal bond lengths, bond angles, etc. for all atomic interactions, covalent and non-covalent (also called force fields)

Energy Minimization Methods Non-derivative methods Simplex First-order derivative methods Steepest descents Conjugate gradients Second-order derivative methods Newton-Raphson Quasi-Newton methods Davidson-Fletcher-Powell (DFP) Broyden-Fletcher-Goldfarb-Shanno (BFGS)

Energy Minimization Methods Non-derivative methods Simplex First-order derivative methods Steepest descents Conjugate gradients Second-order derivative methods Newton-Raphson Quasi-Newton methods Davidson-Fletcher-Powell (DFP) Broyden-Fletcher-Goldfarb-Shanno (BFGS)

Simplex Method Moves around like an “amoeba”

Non-Derivative Methods Advantages: Works well when starting configuration is very high in potential energy Disadvantages: Surprisingly slow (calculations are fast, but it takes many iterations) Not good for large biomolecules

Energy Minimization Methods Non-derivative methods Simplex First-order derivative methods Steepest descents Conjugate gradients Second-order derivative methods Newton-Raphson Quasi-Newton methods Davidson-Fletcher-Powell (DFP) Broyden-Fletcher-Goldfarb-Shanno (BFGS)

Steepest Descents Method 1.) Evaluate the sum of all forces on the system (first derivative of potential energy functions) 2.) Move in the direction of the force until potential energy stops decreasing 3.) Turn 90° and return to step 2 sx = -gx / |gx| s = step direction g = gradient direction x = coordinates of system The next step is orthogonal: gx  gx-1 = 0

SD: When to Turn Line Search Arbitrary Step Find three points along a line where the middle point is less than the other two points Calculates a function for the three points and determines the minimum The minimum becomes the middle point, and repeat Arbitrary Step Try a small step size to see that potential energy decreases Iteratively increase step size until potential energy is increased Multiply the final step size by 0.5

SD: When to Stop After a predefined energy minimum has been reached For example, < 1.0 kJ / mol After a predefined number of steps For example, after 1000 orthogonal steps

SD: Searching Problem Does not work well in (relatively) flat energy wells Takes too many steps / too long to finish

Conjugate Gradients Method 1.) Evaluate the sum of all forces on the system (first derivative of potential energy functions) 2.) Move in the direction of the force until potential energy stops decreasing 3.) Return to step 1 Red line = Conjugate gradients Green line = Steepest descents

Steepest Descents vs. Conjugate Gradients Stable and rigorous Generally slower and takes more steps than CG in flat wells Can take bigger steps and finish faster in steep wells Conjugate gradients: Slower in the beginning, but can be faster overall (takes fewer steps) in flat wells Less stable than SD (may need restarting) Both methods (ideally) converge to the same local energy minimum

Energy Minimization Methods Non-derivative methods Simplex First-order derivative methods Steepest descents Conjugate gradients Second-order derivative methods Newton-Raphson Quasi-Newton methods Davidson-Fletcher-Powell (DFP) Broyden-Fletcher-Goldfarb-Shanno (BFGS)

Second-Order Derivative Methods Use first derivatives to see which way the gradient flows Use second derivatives to see changes in the way the gradient flows Tries to predict the best spot to “jump” to Newton-Raphson method: xn = current position xn+1 = next position f’(xn) = first derivative of energy function f'’(xn) = second derivative of energy function

Newton-Raphson can be Slow A Hessian matrix is a matrix of second-order derivatives of a function Must be calculated in each step for Newton-Raphson method

The BFGS Assumption Calculating second-order derivatives is hard and time consuming Never actually calculates a Hessian matrix, just estimates it as it goes along Estimated by looking at successive gradients Not technically a “true” second-order derivative method

Second-Order Derivative Methods Advantages Takes the fewest steps Fastest (for small molecules) Disadvantages For big systems, can require too much memory Best suited for small molecules Red line = BFGS method Green line = Conjugate gradients

Choosing an EM Method Depends on: Storage / computational capabilities Number of atoms in the system When working with proteins, always steepest descents or conjugate gradients Initial Refinement (Av. gradient < 1 kcal Å-2) Stringent Minimization (Av. gradient < 0.1 kcal Å-2) Method CPU time (s) # of iterations Steepest descents 67 98 1405 1893 Conjugate gradients 149 213 257 367

Inherent Problem of EM Only finds local minima No method available can find the global minimum from any starting point

Performing Energy Minimization Links Dundee PRODRG2 Server (http://davapc1.bioch.dundee.ac.uk/prodrg/) Swiss-PDBViewer (http://ca.expasy.org/spdbv/text/energy.htm) GROMACS (http://www.gromacs.org/) NAMD (http://www.ks.uiuc.edu/Research/namd/) AMBER (http://ambermd.org/) CHARMM (http://www.charmm.org/) Methods of EM in GROMACS Steepest descents Conjugate gradients L-BFGS (limited-memory Broyden-Fletcher-Goldfarb-Shanno quasi-Newtonian minimizer)

Discuss Homework 2