Computing Missing Loops in Automatically Resolved X-Ray Structures Itay Lotan Henry van den Bedem (SSRL)

Slides:



Advertisements
Similar presentations
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Advertisements

Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
3D-STAF: Scalable Temperature and Leakage Aware Floorplanning for Three-Dimensional Integrated Circuits Pingqiang Zhou, Yuchun Ma, Zhouyuan Li, Robert.
Crystallography -- lecture 21 Sidechain chi angles Rotamers Dead End Elimination Theorem Sidechain chi angles Rotamers Dead End Elimination Theorem.
NUS CS5247 Motion Planning for Camera Movements in Virtual Environments By Dennis Nieuwenhuisen and Mark H. Overmars In Proc. IEEE Int. Conf. on Robotics.
ICT 1 A heuristic for maritime inventory routing Oddvar Kloster, Truls Flatberg Molde,
A Combined Optimization Method for Solving the Inverse Kinematics Problem of Mechanical Manipulators Roland Mai B659, Spring 2010 Indiana University.
Reference Assisted Nucleic Acid Sequence Reconstruction from Mass Spectrometry Data Gabriel Ilie 1, Alex Zelikovsky 2 and Ion Măndoiu 1 1 CSE Department,
Iterative Relaxation of Constraints (IRC) Can’t solve originalCan solve relaxed PRMs sample randomly but… start goal C-obst difficult to sample points.
Geometric Algorithms for Conformational Analysis of Long Protein Loops J. Cortess, T. Simeon, M. Remaud- Simeon, V. Tran.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
MAE 552 Heuristic Optimization Instructor: John Eddy Lecture #35 4/26/02 Multi-Objective Optimization.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
The TEXTAL System: Automated Model-Building Using Pattern Recognition Techniques Dr. Thomas R. Ioerger Department of Computer Science Texas A&M University.
MAE 552 – Heuristic Optimization Lecture 8 February 8, 2002.
Motion Analysis (contd.) Slides are from RPI Registration Class.
CSci 6971: Image Registration Lecture 4: First Examples January 23, 2004 Prof. Chuck Stewart, RPI Dr. Luis Ibanez, Kitware Prof. Chuck Stewart, RPI Dr.
Taking a Numeric Path Idan Szpektor. The Input A partial description of a molecule: The atoms The bonds The bonds lengths and angles Spatial constraints.
“Inverse Kinematics” The Loop Closure Problem in Biology Barak Raveh Dan Halperin Course in Structural Bioinformatics Spring 2006.
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
Expected accuracy sequence alignment
Thomas Blicher Center for Biological Sequence Analysis
Inverse Kinematics How do I put my hand here? IK: Choose these angles!
Algorithm for Fast MC Simulation of Proteins Itay Lotan Fabian Schwarzer Dan Halperin Jean-Claude Latombe.
Robotics Algorithms for the Study of Protein Structure and Motion Based on Itay Lotan’s PhD Jean-Claude Latombe Computer Science Department Stanford University.
Simulated Annealing Van Laarhoven, Aarts Version 1, October 2000.
Bioinformatics Challenge  Learning in very high dimensions with very few samples  Acute leukemia dataset: 7129 # of gene vs. 72 samples  Colon cancer.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
The Crystallographic Refinement of TM1389- A methyl-transferase from Thermotoga maritima Rosanne Joseph SLAC Summer Intern Joint Center for Structural.
Identification of Domains using Structural Data Niranjan Nagarajan Department of Computer Science Cornell University.
Serial and Parallel Manipulators
A Kinematic View of Loop Closure EVANGELOS A. COUTSIAS, CHAOK SEOK, MATTHEW P. JACOBSON, KEN A. DILL Presented by Keren Lasker.
Computing Protein Structures from Electron Density Maps: The Missing Fragment Problem Itay Lotan † Henry van den Bedem* Ashley M. Deacon* Jean-Claude Latombe.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Protein Structure Alignment by Incremental Combinatorial Extension (CE) of the Optimal Path Ilya N. Shindyalov, Philip E. Bourne.
COMPARATIVE or HOMOLOGY MODELING
Inverse Kinematics. Inverse Kinematics (IK) T q1q1 q2q2 q3q3 q4q4 q5q5 Given a kinematic chain (serial linkage), the position/orientation of one end relative.
Rotamer Packing Problem: The algorithms Hugo Willy 26 May 2010.
Altman et al. JACS 2008, Presented By Swati Jain.
Protein Structure Prediction
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Active Vision Sensor Planning of CardEye Platform Sherif Rashad, Emir Dizdarevic, Ahmed Eid, Chuck Sites and Aly Farag ResearchersSponsor.
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
1cs426-winter-2008 Notes  Will add references to splines on web page.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
By: David Gelbendorf, Hila Ben-Moshe Supervisor : Alon Zvirin
Modeling Protein Flexibility with Spatial and Energetic Constraints Yi-Chieh Wu 1, Amarda Shehu 2, Lydia Kavraki 2,3  Provided an approach to generating.
Conformational Space of a Flexible Protein Loop Jean-Claude Latombe Computer Science Department Stanford University (Joint work with Ankur Dhanik 1, Guanfeng.
Before Beginning – Must copy over the p4p file – Enter../xl.p4p. – Enter../xl.hkl. – Do ls to see the files are there – Since the.p4p file has been created.
Expected accuracy sequence alignment Usman Roshan.
Forward and inverse kinematics in RNA backbone conformations By Xueyi Wang and Jack Snoeyink Department of Computer Science UNC-Chapel Hill.
RAPPER Nick Furnham Blundell Group – Department of Biochemistry Cambridge University UK
Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
CSE280Stefano/Hossein Project: Primer design for cancer genomics.
The Annealing Algorithm Revisited L.P.P.P. van Ginneken DigiPen Institute of Technology.
Computational Aspects of the Protein Target Selection, Protein Production Management and Structure Analysis Pipeline.
Line Fitting James Hayes.
Reduce the need for human intervention in protein model building
Itay Lotan† Henry van den Bedem* Ashley M. Deacon*
Itay Lotan† Henry van den Bedem* Ashley M. Deacon*
Lecture #7: FASTA & LFASTA
Homology Modeling.
Protein structure prediction.
CRISP: Consensus Regularized Selection based Prediction
Volume 15, Issue 9, Pages (September 2007)
Robust and automated crystal screening
Presentation transcript:

Computing Missing Loops in Automatically Resolved X-Ray Structures Itay Lotan Henry van den Bedem (SSRL)

Bioinformatics core UCSD, SDSC Crystallomics core TSRI, GNF Structure Determination Core SSRL Crystal screening / X-ray data collection Structure determination Structure refinement Funding from NIH Protein Structure Initiative 10 centers Funding for five years from July 2000 Ongoing projects at SDC: Beam line automation: Sample mounting robotics, automated diffraction quality assessment Automated structure determination: Structure Solution Pipeline Joint Center for Structural Genomics : Create new technologies to drive high throughput structure determination

From Model Building to Refinement Structure Solution Pipeline Initial Model(s) Diffraction Images Final Model Mostly Automated Manual Finalizing model: Labor intensive, time consuming. Existing tools to assist in model building unsatisfactory: 1.Produce incorrect configurations 2.Lack meaningful scoring algorithm to rank configurations 3.Remain highly interactive – difficult to integrate in Structure Solution Pipeline Initial models (RESOLVE, ARP/WARP): Several chains and gaps

The Problem We are given: –A density map –A solved structure with a gap (5 – 15 res.) Goal: –Automatically compute backbone conformation for the gap region

Gaps The structure is solved automatically Gaps appear in areas of “poor” density –Signal is indistinguishable from noise –Disconnected iso-surfaces –Automatic solver bails out

Things we can use The loop-closure constraint What density there is The solved structure The sequence is known (C β atoms) Preferred backbone angles (Ramchandran plots)

Loop Closure: CCD algorithm Robot Inverse Kinematics ( Wang & Chen ’91 ) Protein loops ( Canutescu & Dunbrack ’03 ) Algorithm: 1.Fix loop at one end 2.Repeat until closure For each DOF of loop Minimize closure score for DOF

CCD for Proteins Closure score: Sum of squared distances of N, C α and C atoms of final residue from their target positions

Our Approach 1. Generate closed loops using density, Ramachandran plot bias and solved structure 2. Optimize highest scoring loops using density and solved structure

Stage 1: Generate Closed Loops Perform one big CCD run For residue i : –Compute closure moves of ( φ,ψ ) angles –Compute max density of residue i+1 –Combine and bias toward peaks in Ramachandran plot Weight of closure move is increased gradually

Stage 2: Loop Optimization Choose residue i and φ or ψ DOF at random –Apply random change –Use DOFs of residues [ i-1, i+2 ] to close loop using CCD –Compute new score Accept change using Metropolis-like criterion Slowly decrease temperature and reduce StDev of random changes

Score Density: Weighted sum of density at atom centers and points away from center along coordinate axes. Collision: Penalize overlap of loop atoms with solved structure atoms as function penetration depth. Self Collision: Penalize overlap of atoms in loop

Local Loop Changes My CCD method: –Choose DOF at random (from ALL DOFs) with biases –Compute Direction of change –Move only a little –Allowed change in N-Cα and Cα-C bond lengths, N-Cα-C angle and Ω angle decreases with distance from optimal value Repeat until closed or maximum iterations

3.7Å 0.35Å 8 Residue Loop: Example 1

8 Residue Loop: Example 2 0.3Å 2.79Å

12 Residue Loop: 1.29Å0.28Å

9 Residue Loop: 3Å0.32Å

Open Issues Many parameters that are determined arbitrarily –Annealing regimen –Weight of collision penalty –Acceptance criterion Have one set of parameters that works for all loops lengths and density qualities