Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.

Slides:



Advertisements
Similar presentations
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Advertisements

Contextual Classification by Melanie Ganz Lecture 6, Medical Image Analysis 2008.
Molecular Docking G. Schaftenaar Docking Challenge Identification of the ligand’s correct binding geometry in the binding site ( Binding Mode ) Observation:
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
With thanks to Zhijun Wu An introduction to the algorithmic problems of Distance Geometry.
Structural bioinformatics
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
EMBIO – Cambridge Particle Swarm Optimization applied to Automated Docking Automated docking of a ligand to a macromolecule Particle Swarm Optimization.
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
Docking of Protein Molecules
FLEX* - REVIEW.
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
Development and Validation of a Genetic Algorithm for Flexible Docking Gareth Jones, Peter Willet, Robert C. Glen, Andrew R. Leach and Robin Taylor J.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
An Integrated Approach to Protein-Protein Docking
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Physics of Protein Folding. Why is the protein folding problem important? Understanding the function Drug design Types of experiments: X-ray crystallography.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Comparative Evaluation of 11 Scoring Functions for Molekular Docking Authors: Renxiao Wang, Yipin Lu and Shaomeng Wang Presented by Florian Lenz.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.
Chapter 9 Superposition and Dynamic Programming 1 Chapter 9 Superposition and dynamic programming Most methods for comparing structures use some sorts.
Conformational Sampling
Prediction of avian Influenza A binding preference to human receptor using conformational analysis of receptor bound to hemagglutinin Wanwimon Mokmak,
Confab – Systematic generation of diverse low-energy conformers Noel M O’Boyle, 1 Tim Vandermeersch, 2 and Geoffrey R Hutchison. 3 1 Analytical and Biological.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
De novo Protein Design Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera.
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
A Technical Introduction to the MD-OPEP Simulation Tools
SimBioSys Inc.© 2004http:// Conformational sampling in protein-ligand complex environment Zsolt Zsoldos SimBioSys Inc., © 2004 Contents:
Altman et al. JACS 2008, Presented By Swati Jain.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Protein Structure Prediction
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Protein structure prediction Computer-aided pharmaceutical design: Modeling receptor flexibility Applications to molecular simulation Work on this paper.
Ab-initio protein structure prediction ? Chen Keasar BGU Any educational usage of these slides is welcomed. Please acknowledge.
Zhijun Wu Department of Mathematics Program on Bio-Informatics and Computational Biology Iowa State University Joint Work with Tauqir Bibi, Feng Cui, Qunfeng.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Elon Yariv Graduate student in Prof. Nir Ben-Tal’s lab Department of Biochemistry and Molecular Biology, Tel Aviv University.
Computational Structure Prediction
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
Simplified picture of the principles used for multiple copy simultaneous search (MCSS) and for computational combinatorial ligand design (CCLD). Simplified.
Computational design of protein function
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Virtual Screening.
Volume 84, Issue 5, Pages (May 2003)
Ligand Docking to MHC Class I Molecules
Identification of Structural Mechanisms of HIV-1 Protease Specificity Using Computational Peptide Docking: Implications for Drug Resistance  Sidhartha.
An Integrated Approach to Protein-Protein Docking
3-Dimensional Structure
Protein structure prediction.
Low-Resolution Structures of Proteins in Solution Retrieved from X-Ray Scattering with a Genetic Algorithm  P. Chacón, F. Morán, J.F. Díaz, E. Pantos,
Ligand Binding to the Voltage-Gated Kv1
Combining Efficient Conformational Sampling with a Deformable Elastic Network Model Facilitates Structure Refinement at Low Resolution  Gunnar F. Schröder,
Building and using sequence‐expression‐activity maps (SEAMAPs) AThe RBS Library Calculator designs a synthetic RBS library to efficiently search a multi‐dimensional.
Conformational Search
Protein-Protein Docking: From Interaction to Interactome
Protein structure prediction
Insights from Free-Energy Calculations: Protein Conformational Equilibrium, Driving Forces, and Ligand-Binding Modes  Yu-ming M. Huang, Wei Chen, Michael J.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University of Madras, Guindy Campus Chennai

In this talk Introduction Mean Field Theory Mutually Orthogonal Latin Squares Applications to – Peptide structure Energy landscape Protein structure Ligand Docking

The Problem: Given the sequence of a protein, can we use information from Physics, Chemistry, …… to calculate its three dimensional structure ? Introduction

‘Native Structure’ is one with minimum potential energy Choose a model + force field (potential function) Optimize structure with respect to potential function

Introduction Optimisation techniques Monte Carlo Molecular Dynamics – simulated annealing Genetic algorithms …… Mean Field Technique

Mean field technique has been applied, e.g. to predict protein side chain structure Φ is the conformational search space This is divided into a number of subspaces φ i Each such subspace has a number of states φ ij each with a probability of occurrence ρ ij

First ρ ij = 1/n i n i is number of states of subspace i Next, the effective potential due to a state φ rs of a subspace φ r is evaluated as V eff (φ rs ) = ∑ i,j ρ ij V(φ rs, φ ij ) Next, ρ rs re-evaluated as ρ rs = exp{-V eff (φ rs )/RT} / ∑ q exp{-V eff (φ rq )/RT} Mean Field Technique

Application to peptide/protein (backbone) structure - Torsion angles as subspaces? Extension to torsion angle space is not straight- forward The interaction between a pair of subspaces (when the subspaces are the torsion angles) does not depend only on their respective states, but is a function of the states of all other subspaces. i.e. V(φ rs, φ ij ) is not meaningful - we need V(φ rs, φ ij, ……) Mean Field Technique

To avoid combinatorial explosion we use a small sample (~ n 2 ) of the possible (m n ) combinations We use mutually orthogonal Latin squares (MOLS) to identify the sample e.g. 3 torsion angles, 5 values each - Mutually Orthogonal Latin Squares φ ij, i = 1, 3; j = 1, 5 33 11 22

Mutually Orthogonal Latin Squares φ 11, φ 21 φ 31 φ 15, φ 24 φ 33 φ 14, φ 22 φ 35 φ 13, φ 25 φ 32 φ 12, φ 23 φ 34 φ 12, φ 22 φ 32 φ 11, φ 25 φ 34 φ 15, φ 23 φ 31 φ 14, φ 21 φ 33 φ 13, φ 24 φ 35 φ 13, φ 23 φ 33 φ 12, φ 21 φ 35 φ 11, φ 24 φ 32 φ 15, φ 22 φ 34 φ 14, φ 25 φ 31 φ 14, φ 24 φ 34 φ 13, φ 22 φ 31 φ 12, φ 25 φ 33 φ 11, φ 23 φ 35 φ 15, φ 21 φ 32 φ 15, φ 25 φ 35 φ 14, φ 23 φ 32 φ 13, φ 21 φ 34 φ 12, φ 24 φ 31 φ 11, φ 22 φ 33 3 MOLS of order 5

Mutually Orthogonal Latin Squares The effective energy is now V eff (φ rs ) = ∑ q w q V q (φ rs …) w q = exp{-V q (φ rs …)/RT} / ∑ q exp{-V q (φ rs …)/RT} The summation is over all the points in the MOLS grid in which φ rs occurs Note: w q is calculated from V q, not from V eff

φ 11, φ 21 φ 31 φ 15, φ 24 φ 33 φ 14, φ 22 φ 35 φ 13, φ 25 φ 32 φ 12, φ 23 φ 34 φ 12, φ 22 φ 32 φ 11, φ 25 φ 34 φ 15, φ 23 φ 31 φ 14, φ 21 φ 33 φ 13, φ 24 φ 35 φ 13, φ 23 φ 33 φ 12, φ 21 φ 35 φ 11, φ 24 φ 32 φ 15, φ 22 φ 34 φ 14, φ 25 φ 31 φ 14, φ 24 φ 34 φ 13, φ 22 φ 31 φ 12, φ 25 φ 33 φ 11, φ 23 φ 35 φ 15, φ 21 φ 32 φ 15, φ 25 φ 35 φ 14, φ 23 φ 32 φ 13, φ 21 φ 34 φ 12, φ 24 φ 31 φ 11, φ 22 φ 33 Mutually Orthogonal Latin Squares e.g. for φ 11

Mutually Orthogonal Latin Squares Since w q is calculated from V q, not from V eff Therefore w q is not ρ ij and the procedure is not iterative Parameterize the search space Use these to build a set of MOLS (chosen at random) to globally sample the space Analyze the samples to obtain a low energy conformation (This is followed by gradient minimization) Another low energy conformation?

Mutually Orthogonal Latin Squares We obtain ~1500 low energy structures By clustering, we show these may be reduced to ~ 50 mutually dissimilar structures e.g. 23 structures for Met-enkephalin

Mutually Orthogonal Latin Squares The search is exhaustive Plot of ‘new’ structure versus structure number Sample Overlap First sample Second sample

Applications – Peptide Structure Peptide Conformation search – the procedure identifies all computed and experimental structures e.g. Met-enkephalin, ECEPP/3 force field MOLS GEM of Scheraga, et al. MOLS Crystal Structure We use the MOLS procedure to create structure libraries

Applications – Peptide Structure Energy landscape, ECEPP/3 force field Minimal energy envelope for Met-enkephalin

Applications – Protein Structure MOLS libraries + genetic algorithms (ECEPP/3) (AMBER + ‘hydrophobic’) Selection Initialization Mutation Variation Diversity Fragment Libraries Fragments MOLS Minimization & Clustering Phase 1 Phase 2 Cross over

Applications – Protein Structure Avian Pancreatic Polypeptide 4.0 A Villin Head Piece 5.2 A Mellitin 4.3 c-MYB 6.1 A Tryptophan zipper 1.8 A Left  Predicted; Right  Experimental

Applications – Protein loops Loops in protein crystal structures classified by size. Structure predicted using ECEPP/3 …DSPEF… …VNRKSD… …INMTSQQ… …TNASTLDT… …KAPGGGCND… …RNLTKDRCKP… Blue – Predicted structure Orange – Crystal structure

Applications – Docking Ligand (drug) docking to proteins Redefine the search space as the conformational space of peptide ligand (i.e. n torsion angles φ 1 to φ n ), plus the ‘docking’ space (i.e. the rotation and translation parameters of the peptide in receptor site, r 1 to r 6 ). Composite scoring function is now f 1 {φ 1 to φ n } + f 2 {r 1 …r 6 } conformational energy + ‘docking’ energy

PDB ID: 1sua, RMSD = 1.50Å Sequence: ALAL PDB ID: 1a30, RMSD = 0.66Å Sequence: EDL PDB ID: 8gch, RMSD = 1.04Å Sequence: GAW PDB ID: 1b32, RMSD = 0.51Å Sequence: KMK PDB ID: 1dkx, RMSD = 1.35Å Sequence: NRLLLTG PDB ID: 1awq, RMSD = 1.50Å Sequence: HAGPIA Blue – predicted structure Red – Crystal structure

Acknowledgements Z A Rafi, K Vengadesan, J Arunachalam, V Kanakasabai, P Arun Prasad CSIR DST-FIST UGC-SAP Thank You