Alignment of Flexible Molecular Structures. Motivation Proteins are flexible. One would like to align proteins modulo the flexibility. Hinge and shear.

Slides:



Advertisements
Similar presentations
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Advertisements

Taking Geometry to its Edge: Fast Rigid (and Hinge-Bent) Docking Algorithms. Haim Wolfson 1, Dina Duhovny 1, Yuval Inbar 1, Vladimir Polak 1, Ruth Nussinov.
Bio-CAD M. Ramanathan Bio-CAD. Molecular surfaces Bio-CAD.
Proximity graphs: reconstruction of curves and surfaces
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.
Iterative Relaxation of Constraints (IRC) Can’t solve originalCan solve relaxed PRMs sample randomly but… start goal C-obst difficult to sample points.
Geometric Algorithms for Conformational Analysis of Long Protein Loops J. Cortess, T. Simeon, M. Remaud- Simeon, V. Tran.
Seminar in structural bioinformatics Multiple structural alignment of proteins By Elad Kaspani.
Protein Structure Alignment Human Myoglobin pdb:2mm1 Human Hemoglobin alpha-chain pdb:1jebA Sequence id: 27% Structural id: 90% Another example: G-Proteins:
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Structural Bioinformatics Workshop Max Shatsky Workshop home page:
Docking Algorithm Scheme Part 1: Molecular shape representation Part 2: Matching of critical features Part 3: Filtering and scoring of candidate transformations.
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
Agenda A brief introduction The MASS algorithm The pairwise case Extension to the multiple case Experimental results.
Seminar in BioInformatics A Method for Biomolecular Structural Recognition and Docking Allowing Conformational Flexibility (1998) Bilha Sandak, Ruth Nussinov.
Docking of Protein Molecules
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
FLEX* - REVIEW.
Structural Bioinformatics Workshop Max Shatsky Workshop home page:
SiteEngine: Functional Sites Structural Search Engine
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
QSD – Quadratic Shape Descriptors Surface Matching and Molecular Docking Using Quadratic Shape Descriptors Goldman BB, Wipke WT. Quadratic Shape Descriptors.
Stochastic roadmap simulation for the study of ligand-protein interactions Mehmet Serkan Apaydin, Carlos E. Guestrin, Chris Varma, Douglas L. Brutlag and.
Dali: A Protein Structural Comparison Algorithm Using 2D Distance Matrices.
Object Recognition. Geometric Task : find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding.
A unified statistical framework for sequence comparison and structure comparison Michael Levitt Mark Gerstein.
1 Alignment of Flexible Protein Structures Based on: FlexProt: Alignment of Flexible Protein Structures Without a Pre-definition of Hinge Regions / M.
Geometric molecular surface modeling using mathematical morphology operators. Journal of Molecular Graphics Volume 13, Issue 6, Pages (December.
Unbound Docking of Rigid Molecules. Problem Definition Given two molecules find their correct association: +=
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Calmodulin Alignment of Flexible Protein Structures.
Model Database. Scene Recognition Lamdan, Schwartz, Wolfson, “Geometric Hashing”,1988.
Identifying similar surface patches on proteins using a spin-image surface representation M. E. Bock Purdue University, USA G. M. Cortelazzo, C. Ferrari,
Geometric molecular surface modeling using mathematical morphology operators. Journal of Molecular Graphics Volume 13, Issue 6, Pages (December.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
The Geometry of Biomolecular Solvation 1. Hydrophobicity Patrice Koehl Computer Science and Genome Center
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Structural alignments of Proteins using by TOPOFIT method Vitkup D., Melamud E., Moult J., Sander C. Completeness in structural genomics. Nature Struct.
A computational study of protein folding pathways Reducing the computational complexity of the folding process using the building block folding model.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Representations of Molecular Structure: Bonds Only.
Alignment of Flexible Molecular Structures. Motivation Proteins are flexible. One would like to align proteins modulo the flexibility. Hinge and shear.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Docking III: Matching via Critical Points Yusu Wang Joint Work with P. K. Agarwal, H. Edelsbrunner, J. Harer Duke University.
MINRMS: an efficient algorithm for determining protein structure similarity using root-mean-squared-distance Andrew I. Jewett, Conrad C. Huang and Thomas.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Topics in bioinformatics CS697 Spring 2011 Class 12 – Mar Molecular distance measurements Molecular transformations.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Local Flexibility Aids Protein Multiple Structure Alignment Matt Menke Bonnie Berger Lenore Cowen.
An Improved Search Algorithm for Optimal Multiple-Sequence Alignment Paper by: Stefan Schroedl Presentation by: Bryan Franklin.
Find the optimal alignment ? +. Optimal Alignment Find the highest number of atoms aligned with the lowest RMSD (Root Mean Squared Deviation) Find a balance.
1 of 21 SDA development -Description of sda Description of sda-5a - Sda for docking.
Bayesian Refinement of Protein Functional Site Matching
Volume 95, Issue 8, Pages (October 2008)
Volume 13, Issue 4, Pages (February 2004)
G. Fiorin, A. Pastore, P. Carloni, M. Parrinello  Biophysical Journal 
Binding Dynamics of Isolated Nucleoporin Repeat Regions to Importin-β
Volume 90, Issue 1, Pages (July 1997)
Antonina Roll-Mecak, Chune Cao, Thomas E. Dever, Stephen K. Burley 
Replica Exchange Molecular Dynamics Simulations Provide Insight into Substrate Recognition by Small Heat Shock Proteins  Sunita Patel, Elizabeth Vierling,
Structural Flexibility of CaV1. 2 and CaV2
Protein Structure Alignment
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

Alignment of Flexible Molecular Structures

Motivation Proteins are flexible. One would like to align proteins modulo the flexibility. Hinge and shear protein domain motions (Gerstein, Lesk, Chotia). Conformational flexibility in drugs.

Problem definition

Flexible Geometric Hashing Exploit the fact that neighboring parts share the joint - accumulate mutual information at the joint. Achieve complexity of the same order of magnitude as in rigid alignment.

Flexible protein alignment without prior hinge knowledge FlexProt - algorithm detects automatically flexibility regions, exploits amino acid sequence order.

Motivation

Geometric Representation 3-D Curve {v i }, i=1…n

Experimental Results

FlexProt Algorithm two protein molecules A and B, each being represented by the sequence of the 3-D coordinates of its C  atoms. Input: two protein molecules A and B, each being represented by the sequence of the 3-D coordinates of its C  atoms. largest flexible alignment by decomposing the two molecules into a minimal number of rigid fragment pairs having similar 3-D structure. Task: largest flexible alignment by decomposing the two molecules into a minimal number of rigid fragment pairs having similar 3-D structure.

Detection of Congruent Rigid Fragment Pairs Joining Rigid Fragment Pairs Rigid Structural Comparison Clustering (removing ins/dels) FlexProt Main Steps

Structural Similarity Matrix Congruent Rigid Fragment Pair

j i+1 j+1 i j-1 i-1 v i-1 v i v i+1 w j-1 w j w j+1 Frag kt (l) = v k … v i... v k+l-1 w t … w j … w t+l-1 RMSD (Frag kt (l) ) <  Detection of Congruent Rigid Fragment Pairs k t k+l-1 t+l-1

RMSD Computation V i …... V i+l W j...… W j+l W j...… W j+l V k …... V k+m W t...… W t+m W t...… W t+m P=P=P=P= Q= P U Q RMSD( P U Q ) in O(1) time NOT O( |P|+|Q| ) RMSD( P ) RMSD( Q )

Detection of Congruent Rigid Fragment Pairs Joining Rigid Fragment Pairs Rigid Structural Comparison Clustering (removing ins/dels) FlexProt Main Steps

How to Join Rigid Fragment Pairs ?

Graph Representation Graph Node Graph Edge

Graph Representation The fragments are in ascending order.The fragments are in ascending order. The gaps (ins/dels) are limited.The gaps (ins/dels) are limited. Allow some overlapping.Allow some overlapping. W + Size of the rigid fragment pair (node b) - Gaps (ins/dels) - Overlapping Penalties a b

Graph Representation W _i W _k W _t W _m W _n DAG (directed acyclic graph) DAG (directed acyclic graph)

Optimal Solution ? “All Shortest Paths”“All Shortest Paths” O(|E| * |V|+|V| 2 ) (for DAG) O(|E| * |V|+|V| 2 ) (for DAG) W _i W _k W _t W _m W _n “Single-source shortest paths”“Single-source shortest paths” O(|E|+|V|) O(|E|+|V|)

Detection of Congruent Rigid Fragment Pairs Joining Rigid Fragment Pairs Rigid Structural Comparison Clustering (removing ins/dels) FlexProt Main Steps

Clustering (removing ins/dels) T1T1 T2T2 If joining two fragment pairs gives small RMSD (T 1 ~ T 2 ) then put them into one cluster.

Detection of Congruent Rigid Fragment Pairs Joining Rigid Fragment Pairs Rigid Structural Comparison Clustering (removing ins/dels) FlexProt Main Steps

Correspondence Problem

Molecular Surface Representation Applications to docking

Motivation Prediction of biomolecular recognition. Detection of drug binding ‘cavities’. Molecular Graphics.

1. Solvent Accessible Surface – SAS 2. Connolly Surface

Connolly’s MS algorithm A ‘water’ probe ball ( A diameter) is rolled over the van der Waals surface. Smoothes the surface and bridges narrow ‘inaccessible’ crevices.

Connolly’s MS algorithm - cont. Convex, concave and saddle patches according to the no. of contact points between the surface atoms and the probe ball. Outputs points+normals according to the required sampling density (e.g. 10 pts/A 2 ).

Example - the surface of crambin

Critical points based on Connolly rep. (Lin, Wolfson, Nussinov) Define a single point+normal for each patch. Convex-caps, concave-pits, saddle - belt.

Critical point definition

Connolly => Shou Lin

Solid Angle local extrema knob hole

Chymotrypsin surface colored by solid angle (yellow-convex, blue-concave)