Global Optimization: For Some Problems, There's HOPE Daniel M. Dunlavy Sandia National Laboratories, Albuquerque, NM, USA Dianne P. O’Leary Dept. of Computer.

Slides:



Advertisements
Similar presentations
Computational methods in molecular biophysics (examples of solving real biological problems) EXAMPLE I: THE PROTEIN FOLDING PROBLEM Alexey Onufriev, Virginia.
Advertisements

On Constrained Optimization Approach To Object Segmentation Chia Han, Xun Wang, Feng Gao, Zhigang Peng, Xiaokun Li, Lei He, William Wee Artificial Intelligence.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
With thanks to Zhijun Wu An introduction to the algorithmic problems of Distance Geometry.
Spie98-1 Evolutionary Algorithms, Simulated Annealing, and Tabu Search: A Comparative Study H. Youssef, S. M. Sait, H. Adiche
Structural bioinformatics
Global Optimization: For Some Problems, There’s HOPE Daniel M. Dunlavy University of Maryland, College Park Applied Mathematics and Scientific Computation.
Taking a Numeric Path Idan Szpektor. The Input A partial description of a molecule: The atoms The bonds The bonds lengths and angles Spatial constraints.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Genetic Threading By J.Yadgari and A.Amir Published: special issue on Bioinformatics in Journal of Constraints, June 2001 Alexandre Tchourbanov University.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Protein Structures.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Department of Mechanical Engineering
LTE Review (September 2005 – January 2006) January 17, 2006 Daniel M. Dunlavy John von Neumann Fellow Optimization and Uncertainty Estimation (1411) (8962.
1 Hybrid methods for solving large-scale parameter estimation problems Carlos A. Quintero 1 Miguel Argáez 1 Hector Klie 2 Leticia Velázquez 1 Mary Wheeler.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
COMPARATIVE or HOMOLOGY MODELING
Bayesian parameter estimation in cosmology with Population Monte Carlo By Darell Moodley (UKZN) Supervisor: Prof. K Moodley (UKZN) SKA Postgraduate conference,
Stochastic sleep scheduling (SSS) for large scale wireless sensor networks Yaxiong Zhao Jie Wu Computer and Information Sciences Temple University.
Adaptive CSMA under the SINR Model: Fast convergence using the Bethe Approximation Krishna Jagannathan IIT Madras (Joint work with) Peruru Subrahmanya.
Protein Folding in the 2D HP Model Alexandros Skaliotis – King’s College London Joint work with: Andreas Albrecht (University of Hertfordshire) Kathleen.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Statistical Physics of the Transition State Ensemble in Protein Folding Alfonso Ramon Lam Ng, Jose M. Borreguero, Feng Ding, Sergey V. Buldyrev, Eugene.
Doshisha Univ. JapanGECCO2002 Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Takeshi YoshidaTomoyuki.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
De novo Protein Design Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera.
SAND C 1/17 Coupled Matrix Factorizations using Optimization Daniel M. Dunlavy, Tamara G. Kolda, Evrim Acar Sandia National Laboratories SIAM Conference.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Computer Simulation of Biomolecules and the Interpretation of NMR Measurements generates ensemble of molecular configurations all atomic quantities Problems.
Approximation of Protein Structure for Fast Similarity Measures Fabian Schwarzer Itay Lotan Stanford University.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Conformational Space.  Conformation of a molecule: specification of the relative positions of all atoms in 3D-space,  Typical parameterizations:  List.
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
A Technical Introduction to the MD-OPEP Simulation Tools
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Danny Dunlavy, Andy Salinger Sandia National Laboratories Albuquerque, New Mexico, USA SIAM Parallel Processing February 23, 2006 SAND C Sandia.
Protein Structure Prediction
New Strategies for Protein Folding Joseph F. Danzer, Derek A. Debe, Matt J. Carlson, William A. Goddard III Materials and Process Simulation Center California.
Vaida Bartkutė, Leonidas Sakalauskas
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Sandia is a multi-program laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Multifidelity Optimization Via Pattern Search and Space Mapping Genetha Gray Computational Sciences & Mathematics Research Sandia National Labs, Livermore,
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
MINRMS: an efficient algorithm for determining protein structure similarity using root-mean-squared-distance Andrew I. Jewett, Conrad C. Huang and Thomas.
Multifidelity Optimization Using Asynchronous Parallel Pattern Search and Space Mapping Techniques Genetha Gray*, Joe Castro i, Patty Hough*, and Tony.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Zhijun Wu Department of Mathematics Program on Bio-Informatics and Computational Biology Iowa State University Joint Work with Tauqir Bibi, Feng Cui, Qunfeng.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
Constrained Optimization by the  Constrained Differential Evolution with an Archive and Gradient-Based Mutation Tetsuyuki TAKAHAMA ( Hiroshima City University.
On the Computation of All Global Minimizers Through Particle Swarm Optimization IEEE Transactions On Evolutionary Computation, Vol. 8, No.3, June 2004.
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
Protein dynamics Folding/unfolding dynamics
Computational Analysis
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Protein structure prediction.
謝孫源 (Sun-Yuan Hsieh) 成功大學 電機資訊學院 資訊工程系
Protein structure prediction
Presentation transcript:

Global Optimization: For Some Problems, There's HOPE Daniel M. Dunlavy Sandia National Laboratories, Albuquerque, NM, USA Dianne P. O’Leary Dept. of Computer Science and UMIACS University of Maryland, College Park, MD, USA Copper Mountain Conference on Iterative Methods April 3, 2006 SAND C Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear Security Administration under contract DE-AC04-94AL85000.

Outline Unconstrained Minimization Problem Homotopy Optimization Methods Numerical Experiments Protein Structure Prediction Problem Numerical Experiments Concluding Remarks

Problem Solve the unconstrained minimization problem Function Characteristics –Solution exists –Smooth ( ) –Multimodal, deep local minima –Good starting points unavailable

Some Related Methods Stochastic search methods –Random perturbations Simulated annealing –Perturbations, acceptance criterion, schedule Evolutionary algorithms –Ensembles/populations Smoothing methods –Deformation of function being minimized Homotopy/continuation methods –Nonlinear equations,

Outline Unconstrained Minimization Problem Homotopy Optimization Methods Numerical Experiments Protein Structure Prediction Problem Numerical Experiments Concluding Remarks

Homotopy Optimization Method (HOM) Goal –Minimize complicated nonlinear target function Steps to solution –Easy template function:, is a known local min. –Define a continuous homotopy function: –Produce sequence of minimizers of w.r.t. starting at and ending at

Illustration of HOM

Homotopy Optimization using Perturbations & Ensembles (HOPE) Improvements over HOM –Produce ensembles of local minimizers of by perturbing intermediate results –Increase likelihood of predicting a global minimizer Algorithmic considerations –Maximum ensemble size –Determining ensemble members

Illustration of HOPE Maximum ensemble size = 2

Considerations using HOPE Template function –Known solution, relation to target function Homotopy function –Continuous, but smoothness may help Parameters –Maximum ensemble size –Perturbation function/amount of perturbation –Number of perturbed versions produced –Number of steps taken in Metric for choosing ensemble members

Convergence of HOPE HOPE parameterized as existing methods –Probability-one homotopy methods for NLPs –Stochastic search methods Pure Adaptive Search/Improving Hit-and-Run –Simulated Annealing HOPE converges in probability –Smooth homotopy: –Closed form for probability –Conditions for convergence with probability one Extension to general homotopy maps –One spatial dimension, –Basins of attraction must be known

Outline Unconstrained Minimization Problem Homotopy Optimization Methods Numerical Experiments Protein Structure Prediction Problem Numerical Experiments Concluding Remarks

Numerical Experiments Test Problems –N-modal sine function: –Moré, Garbow, Hillstrom test functions 5 functions where local methods failed –Pintér test function Random multimodal functions, unique minimizer known Standard homotopy functions –Convex, probability-one Comparisons with local and stochastic search –More computation using HOM/HOPE –Better results and ensembles of local minimizers

Outline Unconstrained Minimization Problem Homotopy Optimization Methods Numerical Experiments Protein Structure Prediction Problem Numerical Experiments Concluding Remarks

Protein Structure Prediction AlaArgAspAsnArg Protein StructureAmino Acid Sequence CC CN R H OH Given the amino acid sequence of a protein (1D), is it possible to predict its native structure (3D)?

Protein Structure Prediction: Computational Methods Molecular dynamics –Langevin dynamics approximated using stochastic differential equation Bioinformatics –Sequence/structure matches to experimentally determined native structures Comparative modeling (threading, homology modeling) Energy Minimization –Find lowest energy conformation Native structure [Anfinsen, 1973]

Some Existing Energy Minimization Methods Local methods –Truncated Newton and quasi-Newton methods Memory efficient (second derivatives not stored) Global methods –Stochastic search, simulated annealing, evolutionary algorithms, and smoothing methods –Other methods Convex global underestimation Stochastic tunneling Packet annealing Derivative-free pattern search

Energy Minimization using HOM Goal –Minimize energy function of target protein Steps to solution –Energy of template protein: –Define a homotopy function: Deforms template protein into target protein –Produce sequence of minimizers of starting at and ending at

Energy Minimization using HOPE Extensions of HOM –Perturbations Specific to protein structure Bond length, bond angle, and particle perturbations –Ensembles Ensembles chosen using homotopy function value Benefits over existing minimization methods –Take advantage of sequence-related structural properties of template and target proteins

Outline Problem and Existing Methods Homotopy Optimization Methods Numerical Experiments Protein Structure Prediction Problem Numerical Experiments Concluding Remarks

Backbone Model: Particle Properties Backbone model – Single chain of particles with residue attributes – Particles model C  atoms in proteins Properties of particles – Hydrophobic, Hydrophilic, Neutral – Diverse hydrophobic-hydrophobic interactions [Veitshans, et al., 1996.]

Backbone Model: Energy Function

Backbone Model: Experiments 9 chains (22 particles) with known structure Loop Region HydrophobicHydrophilicNeutral ABCDEFGHIABCDEFGHI

Backbone Model: Experiments 62 template-target pairs –10 pairs had identical native structures Methods –HOM vs. Newton’s method w/trust region (N-TR) –HOPE vs. ensemble-based simulated annealing (SA) Different ensemble sizes (2,4,8,16) Averaged over 10 runs Perturbations where sequences differ Measuring success –Structural overlap function: Percentage of interparticle distances off by more than 20% of the average bond length ( ) –Root mean-squared deviation (RMSD)

Backbone Model: Results

Success of HOPE and SA with ensembles of size 16 for each template-target pair. The size of each circle represents the percentage of successful predictions over the 10 runs. SAHOPE

Outline Problem and Existing Methods Homotopy Optimization Methods Numerical Experiments Protein Structure Prediction Problem Numerical Experiments Concluding Remarks

Conclusion New homotopy optimization methods –HOM: sequence of minimizers vs. path tracing –HOPE: perturbations and ensembles –Convergence (cast as existing methods) Numerical Experiments –HOM/HOPE outperform several standard methods –Standard test functions Standard homotopy functions used –Protein structure prediction Problem-specific homotopy functions Take advantage of sequence-related protein properties

Future Directions Protein structure prediction –More realistic energy functions (AMBER, CHARMM) –Protein Data Bank (templates) –Different size chains for template/target HOPE for large-scale problems –Inherently parallelizable –Communication: initializing ensembles at each step HOPE for other optimization problems –Constrained problems HOPE for other applications

Thank You Daniel M. Dunlavy – HOPE Publications D.M. Dunlavy, D.P. O'Leary, D. Klimov, and D. Thirumalai HOPE: A Homotopy Optimization Method for Protein Structure Prediction J. Comput. Biol., 12(10): Dec HOPE: A Homotopy Optimization Method for Protein Structure Prediction D. M. Dunlavy and D.P. O'Leary Homotopy Optimization Methods for Global Optimization Sandia National Laboratories, SAND Dec Homotopy Optimization Methods for Global Optimization D.M. Dunlavy Homotopy Optimization Methods and Protein Structure Prediction Ph.D. Thesis, University of Maryland, College Park, Aug Homotopy Optimization Methods and Protein Structure Prediction D.M. Dunlavy Global Optimization of a Simplified Protein Energy Model In preparation Global Optimization of a Simplified Protein Energy Model