Global Optimization: For Some Problems, There’s HOPE Daniel M. Dunlavy University of Maryland, College Park Applied Mathematics and Scientific Computation.

Slides:



Advertisements
Similar presentations
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Advertisements

Dukka Application of Monte Carlo Simulation: Removing averaging artifacts in protein structure prediction.
Protein Structure Prediction With Evolutionary Algorithms Natalio Krasnogor, U of the West of England William Hart, Sandia National Laboratories Jim Smith,
Global Optimization: For Some Problems, There's HOPE Daniel M. Dunlavy Sandia National Laboratories, Albuquerque, NM, USA Dianne P. O’Leary Dept. of Computer.
Structural bioinformatics
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
“Inverse Kinematics” The Loop Closure Problem in Biology Barak Raveh Dan Halperin Course in Structural Bioinformatics Spring 2006.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Using Pictorial Structures to Identify Proteins in X-ray Crystallographic Electron Density Maps Frank DiMaio Jude Shavlik
Energetics and kinetics of protein folding. Comparison to other self-assembling systems?
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Genetic Threading By J.Yadgari and A.Amir Published: special issue on Bioinformatics in Journal of Constraints, June 2001 Alexandre Tchourbanov University.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Digital Filter Stepsize Control in DASPK and its Effect on Control Optimization Performance Kirsten Meeker University of California, Santa Barbara.
Protein Structures.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Protein Tertiary Structure Prediction
Department of Mechanical Engineering
LTE Review (September 2005 – January 2006) January 17, 2006 Daniel M. Dunlavy John von Neumann Fellow Optimization and Uncertainty Estimation (1411) (8962.
JM - 1 Introduction to Bioinformatics: Lecture XVI Global Optimization and Monte Carlo Jarek Meller Jarek Meller Division of Biomedical.
1 Hybrid methods for solving large-scale parameter estimation problems Carlos A. Quintero 1 Miguel Argáez 1 Hector Klie 2 Leticia Velázquez 1 Mary Wheeler.
COMPARATIVE or HOMOLOGY MODELING
Basic Computations with 3D Structures
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
A new Approach to Structural Prediction of Proteins Heiko Schröder Bertil Schmidt Jiujiang Zhu School of Computer Engineering Nanyang Technological University.
Protein Folding in the 2D HP Model Alexandros Skaliotis – King’s College London Joint work with: Andreas Albrecht (University of Hertfordshire) Kathleen.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Statistical Physics of the Transition State Ensemble in Protein Folding Alfonso Ramon Lam Ng, Jose M. Borreguero, Feng Ding, Sergey V. Buldyrev, Eugene.
Doshisha Univ. JapanGECCO2002 Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Takeshi YoshidaTomoyuki.
De novo Protein Design Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera.
Rotamer Packing Problem: The algorithms Hugo Willy 26 May 2010.
Biomolecular Nuclear Magnetic Resonance Spectroscopy BASIC CONCEPTS OF NMR How does NMR work? Resonance assignment Structure determination 01/24/05 NMR.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Biomolecular Nuclear Magnetic Resonance Spectroscopy FROM ASSIGNMENT TO STRUCTURE Sequential resonance assignment strategies NMR data for structure determination.
Approximation of Protein Structure for Fast Similarity Measures Fabian Schwarzer Itay Lotan Stanford University.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
A Technical Introduction to the MD-OPEP Simulation Tools
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Protein Structure Prediction
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Data and Knowledge Engineering Laboratory Clustered Segment Indexing for Pattern Searching on the Secondary Structure of Protein Sequences Minkoo Seo Sanghyun.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
We propose an accurate potential which combines useful features HP, HH and PP interactions among the amino acids Sequence based accessibility obtained.
Find the optimal alignment ? +. Optimal Alignment Find the highest number of atoms aligned with the lowest RMSD (Root Mean Squared Deviation) Find a balance.
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Bounded Nonlinear Optimization to Fit a Model of Acoustic Foams
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
Simplified picture of the principles used for multiple copy simultaneous search (MCSS) and for computational combinatorial ligand design (CCLD). Simplified.
Protein dynamics Folding/unfolding dynamics
Protein Structures.
3-Dimensional Structure
Yang Zhang, Andrzej Kolinski, Jeffrey Skolnick  Biophysical Journal 
Protein structure prediction.
謝孫源 (Sun-Yuan Hsieh) 成功大學 電機資訊學院 資訊工程系
Protein folding kinetics: timescales, pathways and energy landscapes in terms of sequence-dependent properties  Thomas Veitshans, Dmitri Klimov, Devarajan.
Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Doshisha University, Kyoto, Japan Takeshi Yoshida.
Protein structure prediction
Presentation transcript:

Global Optimization: For Some Problems, There’s HOPE Daniel M. Dunlavy University of Maryland, College Park Applied Mathematics and Scientific Computation

1/29 Outline Problem and Existing Methods Homotopy Optimization Methods Protein Structure Prediction Problem Numerical Experiments Conclusions/Future Directions

2/29 Problem Solve the unconstrained minimization problem Function Characteristics –Solution exists, smooth ( ) –Complicated (multiple minima, deep local minima) –Good starting points unknown/difficult to compute Challenges –Finding solution in reasonable amount of time –Knowing when solution has been found

3/29 Exhaustive/enumerative search Stochastic search [Spall, 2003] ; adaptive [Zabinsky, 2003] “Globalized” local search [Pinter, 1996] Branch and bound [Horst and Tuy, 1996] Genetic/evolutionary [Voss, 1999] Smoothing methods [Piela, 2002] Simulated annealing [Salamon, et al., 2002] Homotopy/continuation methods [Watson, 2000] Some Existing Methods

4/29 Outline Problem and Existing Methods Homotopy Optimization Methods Protein Structure Prediction Problem Numerical Experiments Conclusions/Future Directions

5/29 Goal –Solve complicated nonlinear target system Steps to solution –Easy template system: –Define a continuous homotopy function: Example (convex): –Trace path of from to Homotopy Methods for Solving Nonlinear Equations

6/29 Goal –Minimize complicated nonlinear target function Steps to solution –Easy template function: –Define a continuous homotopy function: Example (convex): –Produce sequence of minimizers of w.r.t. starting at and ending at Homotopy Optimization Methods (HOM)

7/29 Illustration of HOM

8/29 Homotopy Optimization using Perturbations & Ensembles (HOPE) Improvements over HOM –Produces ensemble of sequences of local minimizers of by perturbing intermediate results –Increases likelihood of predicting global minimizer Algorithmic considerations –Maximum ensemble size –Determining ensemble members

9/29 Illustration of HOPE

10/29 Convergence of HOPE path tracing random walk

11/29 Convergence of HOPE

12/29  Convergence of HOPE

13/29 Outline Problem and Existing Methods Homotopy Optimization Methods Protein Structure Prediction Problem Numerical Experiments Conclusions/Future Directions

14/29 Protein Structure Prediction AlaArgAspAsnArg Protein StructureAmino Acid Sequence CC CN R H OH Given the amino acid sequence of a protein (1D), is it possible to predict its native structure (3D)?

15/29 Protein Structure Prediction Given: –Protein model Molecular properties Potential energy function (force field) Goal: –Predict lowest energy conformation Native structure [Anfinsen, 1973] –Develop hybrid method, combining: Energy minimization [numerical optimization] Comparative modeling [bioinformatics] –Use template (known structure) to predict target structure

16/29 Backbone model –Single chain of particles with residue attributes –Particles model C  atoms in proteins Properties of particles –Hydrophobic, Hydrophilic, Neutral –Diverse hydrophobic-hydrophobic interactions Protein Model: Particle Properties [Veitshans, Klimov, and Thirumalai. Protein Folding Kinetics, 1996.]

17/29 Protein Model: Energy Function

18/29 Goal –Minimize energy function of target protein Steps to solution –Energy of template protein: –Define a homotopy function: Deforms template protein into target protein –Produce sequence of minimizers of starting at and ending at Homotopy Optimization Method for Proteins

19/29 Outline Problem and Existing Methods Homotopy Optimization Methods Protein Structure Prediction Problem Numerical Experiments Conclusions/Future Directions

20/29 Numerical Experiments 9 chains (22 particles) with known structure Loop Region HydrophobicHydrophilicNeutral ABCDEFGHIABCDEFGHI ABCDEFGHI A100 B77100 C D E F G H I Sequence Matching (%)

21/29 Numerical Experiments

22/29 Numerical Experiments 62 template-target pairs –10 pairs had identical native structures Methods –HOM vs. Newton’s method w/trust region (N-TR) –HOPE vs. simulated annealing (SA) Different ensemble sizes (2,4,8,16) Averaged over 10 runs Perturbations where sequences differ Measuring success –Structural overlap function: Percentage of interparticle distances off by more than 20% of the average bond length ( ) –Root mean-squared deviation (RMSD) Ensemble SA Basin hopping T 0 = 10 5 Cycles = 10 Berkeley schedule

23/29 Results

24/29 Results Success of HOPE and SA with ensembles of size 16 for each template-target pair. The size of each circle represents the percentage of successful predictions over the 10 runs. SAHOPE

25/29 Outline Problem and Existing Methods Homotopy Optimization Methods Protein Structure Prediction Problem Numerical Experiments Conclusions/Future Directions

26/29 Conclusions Homotopy optimization methods –More successful than standard minimizers HOPE –For problems with readily available –Solves protein structure prediction problem –Outperforms ensemble-based simulated annealing SA parameters not optimal

27/29 HOPEful Directions Protein structure prediction –Protein Data Bank (templates) –Different size chains for template/target HOPE for other minimization problems –Standard test problems –Probabilistic convergence analysis ( ) HOPE for large-scale problems –Inherently parallelizable –Communication: enforce maximum ensemble size

28/29 Other Work/Interests Optimization –Surrogate models in APPSPACK (pattern search) Linear Algebra –Structure preserving eigensolvers Quaternion-based Jacobi-like methods –RF circuit design – efficient DAE solvers Preconditioners, harmonic-balance methods Information processing/extraction –Entity recognition/disambiguation Persons, locations, organization –Querying, clustering and summarizing documents

29/29 Acknowledgements Dianne O’Leary (UM) –Advisor Dev Thirumalai (UM), Dmitri Klimov (GMU) –Model, suggestions Ron Unger (Bar-Ilan) –Problem formulation National Library of Medicine (NLM) –Grant: F37-LM008162

30/29 Thank You Daniel Dunlavy – HOPE

31/29 Homotopy Parameter Functions Split low/high frequency dihedral terms Homotopy parameter functions for each term

32/29 Different for individual energy terms Homotopy Function for Proteins TemplateTarget

33/29 HOPE Algorithm

34/29 Structural Overlap Function NativePredicted

35/29 RMSD Measures the distance between corresponding particles in the predicted and lowest energy conformations when they are optimally superimposed. where is a rotation and translation of