Development of SCWRL4 for improved prediction of protein side-chain conformations In collaboration with Moscow Engineering & Physics Institute © George.

Slides:



Advertisements
Similar presentations
François Fages MPRI Bio-info 2007 Formal Biology of the Cell Protein structure prediction with constraint logic programming François Fages, Constraint.
Advertisements

CSE 681 Bounding Volumes. CSE 681 Bounding Volumes Use simple volume enclose object(s) tradeoff for rays where there is extra intersection test for object.
Collision Detection CSCE /60 What is Collision Detection?  Given two geometric objects, determine if they overlap.  Typically, at least one of.
Computer graphics & visualization Collisions. computer graphics & visualization Simulation and Animation – SS07 Jens Krüger – Computer Graphics and Visualization.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Crystallography -- lecture 21 Sidechain chi angles Rotamers Dead End Elimination Theorem Sidechain chi angles Rotamers Dead End Elimination Theorem.
Lactate dehydrogenase + 38 ATP + 2 ATP. How does lactate dehydrogenase perform its catalytic function ?
4/15/2017 Using Gaussian Process Regression for Efficient Motion Planning in Environments with Deformable Objects Barbara Frank, Cyrill Stachniss, Nichola.
Short fast history of protein design Site-directed mutagenesis -- protein engineering (J. Wells, 1980's) Coiled coils, helix bundles (W. DeGrado, 1980's-90's)
Basis Expansion and Regularization Presenter: Hongliang Fei Brian Quanz Brian Quanz Date: July 03, 2008.
Week 14 - Monday.  What did we talk about last time?  Bounding volume/bounding volume intersections.
Geometric Algorithms for Conformational Analysis of Long Protein Loops J. Cortess, T. Simeon, M. Remaud- Simeon, V. Tran.
Rapid Protein Side-Chain Packing via Tree Decomposition Jinbo Xu Department of Mathematics Computer Science and AI Lab MIT.
Chapter 6: Vertices to Fragments Part 2 E. Angel and D. Shreiner: Interactive Computer Graphics 6E © Addison-Wesley Mohan Sridharan Based on Slides.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Implicit Hitting Set Problems Richard M. Karp Harvard University August 29, 2011.
Ensemble Results of PIM1 PIM PIM Ensemble Results of GSK3 GSK GSK GSK
A Combinatorial Maximum Cover Approach to 2D Translational Geometric Covering Karen Daniels, Arti Mathur, Roger Grinde University of Massachusetts Lowell.
Thomas Blicher Center for Biological Sequence Analysis
Using Pictorial Structures to Identify Proteins in X-ray Crystallographic Electron Density Maps Frank DiMaio Jude Shavlik
FLEX* - REVIEW.
Solving the Protein Threading Problem in Parallel Nocola Yanev, Rumen Andonov Indrajit Bhattacharya CMSC 838T Presentation.
OBBTree: A Hierarchical Structure for Rapid Interference Detection Gottschalk, M. C. Lin and D. ManochaM. C. LinD. Manocha Department of Computer Science,
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Apex Point Map for Constant-Time Bounding Plane Approximation Samuli Laine Tero Karras NVIDIA.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Protein Side Chain Packing Problem: A Maximum Edge-Weight Clique Algorithmic Approach Dukka Bahadur K.C, Tatsuya Akutsu and Tomokazu Seki Proceedings of.
12/4/2001CS 638, Fall 2001 Today Using separating planes/axes for collision testing Collision detection packages.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University COT 5410 – Spring 2004.
Chapter 9 Superposition and Dynamic Programming 1 Chapter 9 Superposition and dynamic programming Most methods for comparing structures use some sorts.
COMPARATIVE or HOMOLOGY MODELING
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Rotamer Packing Problem: The algorithms Hugo Willy 26 May 2010.
Department of Mechanical Engineering
Secondary structure prediction
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
CIS 350 – I Game Programming Instructor: Rolf Lakaemper.
Altman et al. JACS 2008, Presented By Swati Jain.
1Computer Graphics Implementation II Lecture 16 John Shearer Culture Lab – space 2
Implementation II Ed Angel Professor of Computer Science, Electrical and Computer Engineering, and Media Arts University of New Mexico.
Implementation II.
Spectral Sequencing Based on Graph Distance Rong Liu, Hao Zhang, Oliver van Kaick {lrong, haoz, cs.sfu.ca {lrong, haoz, cs.sfu.ca.
Linear Models for Classification
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
Fixed parameter algorithms for protein similarity search under mRNA structure constrains A joint work by: G. Blin, G. Fertin, D. Hermelin, and S. Vialette.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University.
Presented by Paul Phipps
Computer Graphics I, Fall 2010 Implementation II.
Bioinformatics 2 -- lecture 9
Solving and Analyzing Side-Chain Positioning Problems Using Linear and Integer Programming Carleton L. Kingsford, Bernard Chazelle and Mona Singh Bioinformatics.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Protein Structure Visualisation
Computer Graphics Implementation II
Core-Sets and Geometric Optimization problems.
Collision Detection Spring 2004.
Parts of these slides are based on
Algorithm design techniques Dr. M. Gavrilova
Reduce the need for human intervention in protein model building
A Smoothed Backbone-Dependent Rotamer Library for Proteins Derived from Adaptive Kernel Density Estimates and Regressions  Maxim V. Shapovalov, Roland L.
Implementation II Ed Angel Professor Emeritus of Computer Science
Lecture 4: Econometric Foundations
Protein structure prediction.
Computer Animation Algorithms and Techniques
Introduction to Computer Graphics with WebGL
Collision Detection.
Implementation II Ed Angel Professor Emeritus of Computer Science
Algorithm Course Algorithms Lecture 3 Sorting Algorithm-1
Presentation transcript:

Development of SCWRL4 for improved prediction of protein side-chain conformations In collaboration with Moscow Engineering & Physics Institute © George Krivov Georgii Krivov, Maxim Shapovalov and Roland L. Dunbrack Jr.

SCWRL ® program SCWRL Version 3 was written by Adrian A. Canutescu and Dr. Roland L. Dunbrack Jr. Main assumption: backbone  There is a finite set of possible conformations (called rotamers) for each amino acid residue. pair-wise interactions

1.Obtain spatial backbone structure and aminoacid sequence 2.For each residue build possible side-chain conformations (rotamers) using rotamer library 3.Build interaction graph –each vertex denotes a certain residue –an edge between vertices indicates that there is an interaction between some rotamers of the corresponding residues 4.Find optimal assignment of side-chain conformations by graph decomposition and dynamic programming 5.Save resolved structure into file SCWRL’s sidechains packing algorithm PDB Res 2 Res 1 Res 3 Res 4

Inproved dynamic programming A tree-decomposition of a graph is a pair, where  based on a tree-decomposition of the interaction graph – is a tree with a set of vertices I and a set of edges F – is a family of subsets of the set V, associated with the vertices of T, such that which satisfies the conditions: h kh k h lh l b c a c d b e g f d c e d e h i g e e g h a c b d e i f g h k l Res 2 Res 1 Res 3 Res 4 Hans L. Bodlaender. 1992

combinatorial complexity blows-up hardly feasible even with new algorithm  Involve more rotamers  more interaction to evaluate  more combinations to enumerate More realistic potentials  longer interaction range  more edges in the graph  less decomposable Described combinatorial algorithm… resolves global optimum (avoids stochastics) accuracy of prediction entirely depends on rotamer library and energy potentials is capable of larger and denser graphs than one based on biconnected components SCWRL4 is capable of significantly larger proteins than SCWRL3 typically finishes pretty quicklyno coffee-breaks… However… However… a better accuracy is desired quick collision detection algorithm thermodynamic fluctuations via Flexible Rotamers Model

combinatorial complexity blows-up hardly feasible even with new algorithm  Involve more rotamers  more interaction to evaluate  more combinations to enumerate More realistic potentials  longer interaction range  more edges in the graph  less decomposable Hierarchies of bounding boxes  enable efficient search for intersections between two groups of geometric figures 2. Check each combination for overlapping 4. Continue recursively on each clashing pair 1. If overlap then split each 3. Disregard boxes that don’t clash Given two groups of figures enclosed into k-dops… quick collision detection algorithm James T. Klosowski, et.al. 1998

Cubic (k = 3) Tetrahedral (k = 4) k = 2 k = 3 k = 4 examples: combinatorial complexity blows-up hardly feasible even with new algorithm  quick collision detection algorithm … works best in conjunction with k-Discrete Oriented Polytopes – a class of convex polytopes with 2k planes any plane is orthogonal to one of k basic axes which remain fixed – easy to enclose a ball – easy to merge – easy clash check – almost rotatable Involve more rotamers  more interaction to evaluate  more combinations to enumerate More realistic potentials  longer interaction range  more edges in the graph  less decomposable

Cubic (k = 3) Tetrahedral (k = 4) k = 2 k = 3 k = 4 examples: combinatorial complexity blows-up hardly feasible even with new algorithm  basic axis x i min i max i i = 1..k simple projection onto all basic axes  easy to enclose a ball easy to merge easy clash check almost rotatable quick collision detection algorithm … works best in conjunction with k-Discrete Oriented Polytopes – a class of convex polytopes with 2k planes any plane is orthogonal to one of k basic axes which remain fixed Involve more rotamers  more interaction to evaluate  more combinations to enumerate More realistic potentials  longer interaction range  more edges in the graph  less decomposable

Cubic (k = 3) Tetrahedral (k = 4) k = 2 k = 3 k = 4 examples: combinatorial complexity blows-up hardly feasible even with new algorithm  easy to enclose a ball  easy to merge easy clash check almost rotatable quick collision detection algorithm … works best in conjunction with k-Discrete Oriented Polytopes – a class of convex polytopes with 2k planes any plane is orthogonal to one of k basic axes which remain fixed Involve more rotamers  more interaction to evaluate  more combinations to enumerate More realistic potentials  longer interaction range  more edges in the graph  less decomposable

Cubic (k = 3) Tetrahedral (k = 4) k = 2 k = 3 k = 4 examples: combinatorial complexity blows-up hardly feasible even with new algorithm  A doesn’t clash B if exists axis x i (1≤i ≤ k) such that easy to enclose a ball easy to merge  easy clash check almost rotatable k-DOP A k-DOP B ? quick collision detection algorithm … works best in conjunction with k-Discrete Oriented Polytopes – a class of convex polytopes with 2k planes any plane is orthogonal to one of k basic axes which remain fixed Involve more rotamers  more interaction to evaluate  more combinations to enumerate More realistic potentials  longer interaction range  more edges in the graph  less decomposable

More realistic potentials  longer interaction range  more edges in the graph  less decomposable Fast anisotropic hydrogen bond potential Fast anisotropic hydrogen bond potential

For more relevant comparison it make sense to predict a crystal not the ASU Amount of sidechains relative surface accessibility (%)

Extra percent in average accuracy due to crystal awareness Knowledge of crystal symmetry enables higher accuracy …

Tuning parameters of the Flexible Rotamer Model due to backbone and frame sample around rotamer library’s conformation due to sidechains’ interaction may be setup independently for each type of amino acid from rotamer library search for optimal values in high- dimensional space optimize one amino acid type in a time (and loop for all)

Optimizing expensive function in multidimensional space 1.Generate sample of arguments and evaluate function at these points 2.Assume that second order approximation works well 3.From the linear regression resolve coefficients and their covariance 4.Maximization of quadratic form is relatively simple, provided that we can resolve eigenvalues and eigenvectors 5.Hence, generate sample of quadratic forms, maximize each of them and aggregate robust for non-convex functions!

Traces through the optimization of the FRM parameters Training on 40 proteins ( ~ residues ) + 24 proteins more and continue ( ~ residues ) Testing on 130 proteins ( ~ residues )

Conditional accuracy (%) Confidence of side-chain placement (derived from experimental EDS maps) CYSASNASP ARG ILE GLN GLU HIS PHE LEU LYS METPRO SER THR TRP ALL TYRVAL sliding frame - 20% Measurement: Side chains with better electron density are easier to predict Shapovalov et.al. 2007

backbone PDB file SCWRL 3.exe rotamer library output PDB file functionality of SCWRL4 is available as library  enables direct manipulation of the model via C++ API class SCWRL { … }; SCWRL4.DLL all this good with Improved usability (coming soon)

Acknowledgements Dr. Roland Dunbrack Prof. Nickolai Kudryashov Colleagues: Adrian Canutescu Guoli Wang Maxim Shapovalov Qiang Wang Qifang Xu Questions, Comments, Suggestions ? Thanks for Your Attention! Have a Nice Day and welcome to our poster!