Taking a Numeric Path Idan Szpektor. The Input A partial description of a molecule: The atoms The bonds The bonds lengths and angles Spatial constraints.

Slides:



Advertisements
Similar presentations
Reactive and Potential Field Planners
Advertisements

Artificial Intelligence Presentation
Optimization : The min and max of a function
Manipulator’s Inverse kinematics
Optimization of thermal processes
L.M. McMillin NOAA/NESDIS/ORA Regression Retrieval Overview Larry McMillin Climate Research and Applications Division National Environmental Satellite,
CHAPTER 8 A NNEALING- T YPE A LGORITHMS Organization of chapter in ISSO –Introduction to simulated annealing –Simulated annealing algorithm Basic algorithm.
Gizem ALAGÖZ. Simulation optimization has received considerable attention from both simulation researchers and practitioners. Both continuous and discrete.
Geometric Algorithms for Conformational Analysis of Long Protein Loops J. Cortess, T. Simeon, M. Remaud- Simeon, V. Tran.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Classification and Prediction: Regression Via Gradient Descent Optimization Bamshad Mobasher DePaul University.
1 Last lecture  Configuration Space Free-Space and C-Space Obstacles Minkowski Sums.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
1Notes  Handing assignment 0 back (at the front of the room)  Read the newsgroup!  Planning to put 16mm films on the web soon (possibly tomorrow)
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Iterative closest point algorithms
Motion Analysis (contd.) Slides are from RPI Registration Class.
CSci 6971: Image Registration Lecture 4: First Examples January 23, 2004 Prof. Chuck Stewart, RPI Dr. Luis Ibanez, Kitware Prof. Chuck Stewart, RPI Dr.
“Inverse Kinematics” The Loop Closure Problem in Biology Barak Raveh Dan Halperin Course in Structural Bioinformatics Spring 2006.
Optimization via Search CPSC 315 – Programming Studio Spring 2009 Project 2, Lecture 4 Adapted from slides of Yoonsuck Choe.
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
FLEX* - REVIEW.
Advanced Computer Graphics (Fall 2010) CS 283, Lecture 20: Inverse Kinematics Ravi Ramamoorthi Most slides courtesy.
Optimization Methods One-Dimensional Unconstrained Optimization
SubSea: An Efficient Heuristic Algorithm for Subgraph Isomorphism Vladimir Lipets Ben-Gurion University of the Negev Joint work with Prof. Ehud Gudes.
Chapter 5: Path Planning Hadi Moradi. Motivation Need to choose a path for the end effector that avoids collisions and singularities Collisions are easy.
A Kinematic View of Loop Closure EVANGELOS A. COUTSIAS, CHAOK SEOK, MATTHEW P. JACOBSON, KEN A. DILL Presented by Keren Lasker.
1 Numerical geometry of non-rigid shapes Non-Euclidean Embedding Non-Euclidean Embedding Lecture 6 © Alexander & Michael Bronstein tosca.cs.technion.ac.il/book.
RAPID: Randomized Pharmacophore Identification for Drug Design PW Finn, LE Kavraki, JC Latombe, R Motwani, C Shelton, S Venkatasubramanian, A Yao Presented.
Optimization Methods One-Dimensional Unconstrained Optimization
Monte Carlo Methods in Partial Differential Equations.
UNCONSTRAINED MULTIVARIABLE
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Dr. Hala Moushir Ebied Faculty of Computers & Information Sciences
Geometry Optimisation Modelling OH + C 2 H 4 *CH 2 -CH 2 -OH CH 3 -CH 2 -O* 3D PES.
Conformational Sampling
© Manfred Huber Autonomous Robots Robot Path Planning.
1 CE 530 Molecular Simulation Lecture 17 Beyond Atoms: Simulating Molecules David A. Kofke Department of Chemical Engineering SUNY Buffalo
Simulation and Animation
Nonlinear programming Unconstrained optimization techniques.
CS B659: Principles of Intelligent Robot Motion Configuration Space.
Computing Missing Loops in Automatically Resolved X-Ray Structures Itay Lotan Henry van den Bedem (SSRL)
Computer Animation Rick Parent Computer Animation Algorithms and Techniques Optimization & Constraints Add mention of global techiques Add mention of calculus.
Introduction The berthing assignment problem requires that a detailed time-and-space-schedule be planned for incoming ships, with the goal of minimizing.
2005MEE Software Engineering Lecture 11 – Optimisation Techniques.
Thursday, May 9 Heuristic Search: methods for solving difficult optimization problems Handouts: Lecture Notes See the introduction to the paper.
Iterative Improvement Algorithm 2012/03/20. Outline Local Search Algorithms Hill-Climbing Search Simulated Annealing Search Local Beam Search Genetic.
FORS 8450 Advanced Forest Planning Lecture 5 Relatively Straightforward Stochastic Approach.
1cs426-winter-2008 Notes  Will add references to splines on web page.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Chapter 10 Minimization or Maximization of Functions.
Modeling Protein Flexibility with Spatial and Energetic Constraints Yi-Chieh Wu 1, Amarda Shehu 2, Lydia Kavraki 2,3  Provided an approach to generating.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Protein structure prediction Computer-aided pharmaceutical design: Modeling receptor flexibility Applications to molecular simulation Work on this paper.
Searching a Linear Subspace Lecture VI. Deriving Subspaces There are several ways to derive the nullspace matrix (or kernel matrix). ◦ The methodology.
Ch. Eick: Randomized Hill Climbing Techniques Randomized Hill Climbing Neighborhood Hill Climbing: Sample p points randomly in the neighborhood of the.
1 Minimum Interference Algorithm for Integrated Topology Control and Routing in Wireless Optical Backbone Networks Fangting Sun Mark Shayman University.
Numerical Methods for Inverse Kinematics Kris Hauser ECE 383 / ME 442.
Find the optimal alignment ? +. Optimal Alignment Find the highest number of atoms aligned with the lowest RMSD (Root Mean Squared Deviation) Find a balance.
Fundamentals of Computer Animation
Elastic models of conformational transitions in macromolecules
Character Animation Forward and Inverse Kinematics
Clustering (3) Center-based algorithms Fuzzy k-means
Dr. Arslan Ornek IMPROVING SEARCH
Constraints: Connectivity and Mobility, Lecture 2
Instructor: Shengyu Zhang
Randomized Hill Climbing
Collaborative Filtering Matrix Factorization Approach
Conformational Search
Chapter 4 . Trajectory planning and Inverse kinematics
Presentation transcript:

Taking a Numeric Path Idan Szpektor

The Input A partial description of a molecule: The atoms The bonds The bonds lengths and angles Spatial constraints on some of the atoms of a molecule.

The Questions Is there a conformation of the molecule that answers the spatial constraints? Provide explicit description of such conformations. How many “really” different conformations are there? Which one is the best?

Hard life… Torsion angles are (usually) unconstrained  a large number of degrees of freedom  The answers to the questions are not easy to find analytically. They are also resource consuming (running time).

A Numeric Path Two algorithms take different approaches to limit the space of possible conformations to search in: Numeric randomized algorithm Semi numeric algorithm on a simpler problem

A Randomized Kinematics-Based Approach to Pharmacophore- Constrained Conformational Search and Database Screening S. M. Lavalle, P. W. Finn, L. E. Kavraki, J. C. Latombe (2000)

The Pharmacophore Problem The problem: to find a molecule conformation that satisfies a set of constraints out of a database of flexible molecules (e.g. for ligand docking). Constraints: Different atoms and their features in the molecule (types, charges etc.) The bond length and angles between bonds Static bond torsion angles Relative locations of some atoms from an anchor atom

The Molecule Model – no rings

The Molecule Model (Cont…) An atom a i carries standard information A bond b i carries the following information: l i – the bond length α i – the angle from the previous bond The set of possible torsion angles θ i  [0, 2π) All info besides the torsion angles is fixed. Θ is the m dimensional vector of variables that defines the conformation

The Pharmacophore Model A finite set of features corresponding to a subset of atoms Constraints on the relative positions between features (the atoms), when one of the features is designated as a anch, the origin of a global xyz coordinate

The Kinematic Model The bond length, angles and the torsion angles Θ can be used to give the positions of all of the atoms, relative to a anch We look at each atom center as a local coordinate frame. We would like to use the transformation from one coordinate frame to the another.

The Kinematic Model (Cont…)

The homogeneous transformation is:

The Kinematic Model (Cont…) The xyz position of atom a i is given by:

The Kinematic Model (Cont…) The coordinate frame of the molecule could be rotated with respect to the global coordinate frame of the pharmacophore feature positions  Another global coordinate frame transformation is needed

The Kinematic Model (Cont…) Global rotation transformation based on Euler angles γ,φ,ψ:

The Kinematic Model (Cont…) The complete xyz position of atom a i is given by:

The Kinematic Error Function Given Θ, γ,φ and ψ, the total amount of error between the requested feature positions G and the actual feature positions g can be measured as:

The Energy Function In a sense, the energy function measures the likelihood that the molecule will achieve a conformation in nature. An example for an energy function:

Two Questions For a given molecule from a database and a pharmacophore: Can the molecule achieve a low-energy conformation that satisfies the given pharmacophore? What are the “distinct” low-energy conformations that satisfy the pharmacophore?

Randomized Conformation Search with Constraints – The Motivation For the problem, the system of equations is generally under constrained, which leads to a complicated multidimensional solution set. This consideration, and the need for efficiency, led to the choice of a numerical randomized technique.

Randomized Conformation Search with Constraints – The Approach

The Search – Gradient Descent Randomly sample the neighborhood of P i Search for a point P i+1 such that f(P i+1 ) < f(P i ) If such P i+1 is found, move to P i+1 and repeat the search

Distance Minimization

Energy Minimization

Integrating into a Database Search A small set (~several hundreds) of candidate molecules (configurations) are chosen from a database (using 2D information) Kinematics-based conformational search is performed to further reduce the set of candidates

Search in a Set of Candidates Each time a sample conformation fails to match, the likelihood that the molecule will ever succeed decrease On the other hand, after any number of ‘fail’ iterations, it is impossible to conclude that the molecule will never succeed

Search in a Set of Candidates (cont…) Perform one attempt (random sample + match search) per molecule Go through all the molecules in the set and repeat Stop when a requested number of matches was found or a maximum number of iterations was reached

Conformation Clustering In general, having alternative low-energy conformations is useful because in many cases it is not the lowest-energy conformation that results in docking.

Conformation Clustering (cont...) Use a metric m(θ 1, θ 2 ), such as RMS of the displacements of the atoms between two conformations A threshold m max is the maximum distance that still regards two conformations as identical (in the same cluster)

Clustering Algorithm – Idea Always keep one representative for each cluster The representative is the conformation with the lowest energy

Clustering Algorithm – Detailed for a new conformation θ i : if exists another conformation θ k such that m(θ k, θ i ) ≤ m max and e(θ k ) ≤ e(θ i ), discard θ i otherwise: add θ i as a new cluster remove all θ k such that m(θ k, θ i ) ≤ m max

Experiments 2 different pharmacophores, for ACE and Thermolysin inhibitors 6 different molecules for each pharmacophore, kept in the database in randomly picked conformations The docked conformations are known

Experiments (Cont…) Cluster distance m max =1.5 Ǻ 20 iterations for the different candidates

Experiments (Cont…) In general, conformations’ energies are within 2–7 kcal/mol of the energy of the known docked conformation. The RMS distance of the conformations from the known docked conformations: Thermolysin inhibitors – 0.50, 2.96, 0.59, 0.81, 2.40, and 2.56 Å. ACE inhibitors – 1.26, 1.79, 0.94, 2.03, 1.87, and 1.98 Å.

Experiments (Cont…) A sufficient clustering record for a single molecule required about 5–20 min. A claim: “Our previous work with randomized techniques has shown that if we continue iterating our algorithm we increase our chances of covering the conformational space of the molecule, and hence, our chances of providing exhaustive information about the constrained conformations of the molecule.”

A Cluster record

Cyclic coordinate descent: A robotics algorithm for protein loop closure A. A. Canutescu and R. L. Dunbrack Jr. (2003)

The Loop Closure Problem The problem: matching a given loop to a given backbone (e.g. for Homology modeling). Constraint: connecting the two protein segments on either end of the loop, termed N and C-terminal anchors.

Loop Closure Problem (Cont…)

Previous Solutions Number of available conformation is enormous. Analytical: for 6 degrees of freedom. Numerical: changing all torsion angles at once to the next “best” position. Numerical methods are computationally expensive and sometimes unstable.

Cyclic Coordinate Descent (CCD) Originally developed for robotics. An iterative relaxation algorithm. Adjust only one degree of freedom at a time.

CCD Algorithm Proceed in an iterative fashion along the chain of degrees of freedom. Modify each torsion angle so that the end of the loop gets as close as possible to the desired position.

CCD Simplicity One equation in one unknown for each degree of freedom. The equation provides: Optimum setting for the variable First and second derivatives

The Main Equation

The Main Equation (Cont...)

Multiplying the last two terms by: Defining: We get:

The Main Equation (Cont...)

CCD Benefits Computationally Fast. Analytically simple - no singularities. Constraints can be placed on any degree of freedom. Using derivatives, small increments in change can be done in preference to of large changes.

Test 1 – Success Percentage 2752 different loops. 100 randomly different starting conformations for each loop. A match (closed loop) is when distance from the terminals is less than 0.08 Ǻ. Maximum 5000 iterative cycles (through all torsion angles) per search.

Test 1 (cont…) Two test frameworks: No constraints Using a Ramachandran Map When using the map: for a new proposed angle θ new, find (ψ new,θ new ) in the map and compare to (ψ old,θ old ) Change to θ new with probability:

Results for Test 1

Analysis of Test 1 Loops that fail to converge arrive to a local minimum (usage of Monte Carlo method to escape local minimum). CCD favors large changes in the first residues. Loops that failed to converge are usually extended loops.

Test 2 – Minimum RMS to Original CCD is meant to be a component of a matching algorithm that should also include loop generation and energy function. How well CCD alone converge to the original loop conformation. 30 original loops. For each loop, 5000 random conformations were generated and closed.

Results for Test 2

Results for Test 2 (Cont...)

Test 3 – Structure Convergence Using Ramachandran map and starting with the same loop conformation. Each time using different random generated numbers. 500 tests. do all final conformations are the same? Compared to 500 closures of the same loop with different starting conformations.

Test 3 – Results

Comparison to Random TWEAK 5000 trails (10 loops, 500 initial configurations each). Closure when RMS < 0.08 Ǻ. CCD – closed ~7 min running time TWEAK – closed ~40 min running time