Molecular Mechanics and docking Lecture 22 Introduction to Bioinformatics 2007 C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E.

Slides:



Advertisements
Similar presentations
Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Advertisements

Crystallography, Birkbeck MOLECULAR SIMULATIONS ALL YOU (N)EVER WANTED TO KNOW Julia M. Goodfellow Dynamic Processes: Lecture 1 Lecture Notes.
Protein Docking Rong Chen Boston University. BU Bioinformatics The Lowest Binding Free Energy  G water R L R L L R L R L R.
Molecular Dynamics: Review. Molecular Simulations NMR or X-ray structure refinements Protein structure prediction Protein folding kinetics and mechanics.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Questions 1) Are the values of r0/theta0 approximately what is listed in the book (in table 3.1 and 3.2)? -> for those atom pairs/triplets yes; 2) In the.
How to approximate complex physical and thermodynamic interactions? Employ rigid or flexible structures for ligand and receptor (Side-chains or Back-bone.
Molecular Dynamics, Monte Carlo and Docking Lecture 21 Introduction to Bioinformatics MNW2.
Molecular Docking G. Schaftenaar Docking Challenge Identification of the ligand’s correct binding geometry in the binding site ( Binding Mode ) Observation:
The Calculation of Enthalpy and Entropy Differences??? (Housekeeping Details for the Calculation of Free Energy Differences) first edition: p
Molecular Modeling of Crystal Structures molecules surfaces crystals.
Molecular Simulation. Molecular Simluation Introduction: Introduction: Prerequisition: Prerequisition: A powerful computer, fast graphics card, A powerful.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Protein Tertiary Structure Prediction. Protein Structure Prediction & Alignment Protein structure Secondary structure Tertiary structure Structure prediction.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
An Integrated Approach to Protein-Protein Docking
Structure-Function Analysis 117 Jan 2006 DNA/Protein structure-function analysis and prediction Protein-protein Interaction (PPI) and Docking: Protein-protein.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Experimentally solving protein structures, protein-protein interactions and simulating protein dynamics Lecture 15 Introduction to Bioinformatics 2007.
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
The Geometry of Biomolecular Solvation 1. Hydrophobicity Patrice Koehl Computer Science and Genome Center
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Molecular Modeling Part I Molecular Mechanics and Conformational Analysis ORG I Lab William Kelly.
8/30/2015 5:26 PM Homology modeling Dinesh Gupta ICGEB, New Delhi.
Molecular Modeling Part I. A Brief Introduction to Molecular Mechanics.
ClusPro: an automated docking and discrimination method for the prediction of protein complexes Stephen R. Comeau, David W.Gatchell, Sandor Vajda, and.
Structure-Function Analysis 117 Jan 2006 DNA/Protein structure-function analysis and prediction Protein-protein Interaction (PPI): Protein-protein Interaction.
Conformational Sampling
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Department of Mechanical Engineering
8. Selected Applications. Applications of Monte Carlo Method Structural and thermodynamic properties of matter [gas, liquid, solid, polymers, (bio)-macro-
Computer Simulation of Biomolecules and the Interpretation of NMR Measurements generates ensemble of molecular configurations all atomic quantities Problems.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
Potential energy surface, Force field & Molecular Mechanics 3N (or 3N-6 or 3N-5) Dimension PES for N-atom system x E’ =  k i (l i  l 0,i ) +  k i ’
A Technical Introduction to the MD-OPEP Simulation Tools
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Molecular Mechanics Studies involving covalent interactions (enzyme reaction): quantum mechanics; extremely slow Studies involving noncovalent interactions.
Altman et al. JACS 2008, Presented By Swati Jain.
Solving protein structures,molecular mechanics, and docking Lecture 18 Introduction to Bioinformatics 2006.
Homework 2 (due We, Feb. 1): Reading: Van Holde, Chapter 1 Van Holde Chapter 3.1 to 3.3 Van Holde Chapter 2 (we’ll go through Chapters 1 and 3 first. 1.Van.
Lecture 5 Barometric formula and the Boltzmann equation (continued) Notions on Entropy and Free Energy Intermolecular interactions: Electrostatics.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Simplistic Molecular Mechanics Force Field Van der WaalsCharge - Charge Bond Angle Improper Dihedral  
LSM3241: Bioinformatics and Biocomputing Lecture 6: Fundamentals of Molecular Modeling Prof. Chen Yu Zong Tel:
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Molecular Dynamics, Monte Carlo and Docking Lecture 21 Introduction to Bioinformatics MNW2.
Molecular Mechanics (Molecular Force Fields). Each atom moves by Newton’s 2 nd Law: F = ma E = … x Y Principles of M olecular Dynamics (MD): F =
1 Molecular Simulations Macroscopic EOS (vdW, PR) Little molecular detail Empirical parameters (  ) Seeking understanding of complex systems Surfactants.
Structure/function studies of HIV proteins HIV gp120 V3 loop modelling using de novo approaches HIV protease-inhibitor binding energy prediction.
Molecular dynamics (MD) simulations  A deterministic method based on the solution of Newton’s equation of motion F i = m i a i for the ith particle; the.
Lecture 7: Molecular Mechanics: Empirical Force Field Model Nanjie Deng Structural Bioinformatics II.
Elon Yariv Graduate student in Prof. Nir Ben-Tal’s lab Department of Biochemistry and Molecular Biology, Tel Aviv University.
1 of 21 SDA development -Description of sda Description of sda-5a - Sda for docking.
Computational Structure Prediction
Rong Chen Boston University
Computational Analysis
Driven Adiabatic Dynamics Approach to the Generation of Multidimensional Free-Energy Surfaces. Mark E. Tuckerman, Dept. of Chemistry, New York University,
An Integrated Approach to Protein-Protein Docking
CZ5225 Methods in Computational Biology Lecture 7: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Conformational Search
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

Molecular Mechanics and docking Lecture 22 Introduction to Bioinformatics 2007 C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E

Today’s lecture 1.Protein interaction and docking a)Zdock method 2.Molecular motion simulated by molecular mechanics

Docking - ZDOCK Protein-protein docking –3-dimensional (3D) structure of protein complex –starting from 3D structures of receptor and ligand Rigid-body docking algorithm (ZDOCK) –pairwise shape complementarity function –all possible binding modes –using Fast Fourier Transform algorithm Refinement algorithm (RDOCK) –Take top 2000 predicted structures from ZDOCK (RDOCK is too computer intensive to refine very many possible dockings) –three-stage energy minimization –electrostatic and desolvation energies molecular mechanical software (CHARMM) statistical energy method (Atomic Contact Energy) 49 non-redundant unbound test cases: –near-native structure (<2.5Å) on top for 37% test cases for 49% within top 4

Protein-protein docking Finding correct surface match Systematic search: –2 times 3D space! Define functions: –‘1’ on surface –‘  ’ or ‘  ’ inside –‘0’ outside  

Protein-protein docking Correlation function: C  = 1/N 3  o  p  q exp[2  i(o  + p  + q  )/N] C o,p,q

Docking Programs ZDOCK, RDOCK AutoDock Bielefeld Protein Docking DOCK DOT FTDock, RPScore and MultiDock GRAMM Hex 3.0 ICM Protein-Protein docking (Abagyan group, currently the best) KORDO MolFit MPI Protein Docking Nussinov-Wolfson Structural Bioinformatics Group …

Docking Programs Issues: Rigid structures or made flexible? –Side-chains –Main-chains Full atomic detail or simplified models? Docking energy functions (purpose built force fields)

Molecular motions Proteins are very dynamic systems Protein folding Protein structure Protein function (e.g. opening and closing of oxygen binding site in hemoglobin)

Protein motion Principles Simulation –MD –MC

The Ramachandran plot Allowed phi-psi angles Red areas are preferred, yellow areas are allowed, and white is avoided

Molecular mechanics techniques Two basic techniques: Molecular Dynamics (MD) simulations Monte Carlo (MC) techniques

Molecular Dynamics (MD) simulation MD simulation can be used to study protein motions. It is often used to refine experimentally determined protein structures. It is generally not used to predict structure from sequence or to model the protein folding pathway. MD simulation can fold extended sequences to `global' potential energy minima for very small systems (peptides of length ten, or so, in vacuum), but it is most commonly used to simulate the dynamics of known structures. Principle: an initial velocity is assigned to each atom, and Newton's laws are applied at the atomic level to propagate the system's motion through time MD simulation incorporates a notion of time

q = coordinates p = momentum K = kinetic energy V = potential energy

Molecular Dynamics Knowledge of the atomic forces and masses can be used to solve the position of each atom along a series of extremely small time steps (on the order of femtoseconds = seconds). The resulting series of snapshots of structural changes over time is called a trajectory. The use of this method to compute trajectories can be more easily seen when Newton's equation is expressed in the following form: The "leapfrog" method is a common numerical approach to calculating trajectories based on Newton's equation. This method gets its name from the way in which positions (r) and velocities (v) are calculated in an alternating sequence, `leaping' past each other in time The steps can be summarized as follows: v = dr i /dt a = d 2 r i /d 2 t

Force field The potential energy of a system can be expressed as a sum of valence (or bond), crossterm, and nonbond interactions: The energy of valence interactions comprises bond stretching (E bond ), valence angle bending (E angle ), dihedral angle torsion (E torsion ), and inversion (also called out-of- plane interactions) (E inversion or E oop ) terms, which are part of nearly all force fields for covalent systems. A Urey-Bradley term (E UB ) may be used to account for interactions between atom pairs involved in 1-3 configurations (i.e., atoms bound to a common atom): E valence = E bond + E angle + E torsion + E oop + E UB Modern (second-generation) forcefields include cross terms to account for such factors as bond or angle distortions caused by nearby atoms. Cross terms can include the following terms: stretch-stretch, stretch-bend-stretch, bend-bend, torsion-stretch, torsion-bend-bend, bend-torsion-bend, stretch-torsion-stretch. The energy of interactions between nonbonded atoms is accounted for by van der Waals (E vdW ), electrostatic (E Coulomb ), and (in some older forcefields) hydrogen bond (E hbond ) terms: E nonbond = E vdW + E Coulomb + E hbond

Force field

f = a/r 12 - b/r 6 Van der Waals forces distance energy The Lennard-Jones potential is mildly attractive as two uncharged molecules or atoms approach one another from a distance, but strongly repulsive when they approach too close. The resulting potential is shown (in pink). At equilibrium, the pair of atoms or molecules tend to go toward a separation corresponding to the minimum of the Lennard--Jones potential (a separation of 0.38 nanometers for the case shown in the Figure)

Thermal bath

Figure: Snapshots of ubiquitin pulling with constant velocity at three different time steps.

Docking example: antibody HyHEL-63 (cyan) complexed with Hen Egg White Lysozyme (yellow) The X-ray structure of the antibody HyHEL-63 (cyan) uncomplexed and complexed with Hen Egg White Lysozyme (yellow) has shown that there are small but significant, local conformational changes in the antibody paratope on binding. The structure also reveals that most of the charged epitope residues face the antibody. Details are in Li YL, Li HM, Smith-Gill SJ and Mariuzza RA (2000) The conformations of the X-ray structure Three-dimensional structures of the free and antigen-bound Fab from monoclonal antilysozyme antibody HyHEL-63. Biochemistry 39: Salt links and electrostatic interactions provide much of the free energy of binding. Most of the charged residues face in interface in the X-ray structure. The importance of the salt link between Lys97 of HEL and Asp27 of the antibody heavy chain is revealed by molecular dynamics simulations. After 1NSec of MD simulation at 100°C the overall conformation of the complex has changed, but the salt link persists. Details are described in Sinha N and Smith-Gill SJ (2002) Electrostatics in protein binding and function. Current Protein & Peptide Science 3: Important for binding is a salt bridge (i.e. charge complementary interaction) between Lys97 of HEL and Asp27 of the antibody heavy chain, as demonstrated by Molecular Dynamics (MD)

Monte Carlo (MC) simulation "Monte Carlo Simulation" is a term for a general class of optimization methods that use randomization. The general idea is, given the current configuration and some figure of merit, e.g., the energy of the folded configuration, to generate a new configuration at random (or semi-random):  If the energy of the new configuration is smaller than the old configuration, always accept it as the next configuration;  if it is worse than the current configuration, accept or reject it it with some probability dependent on how much larger the new energy is than the old energy.  E = E(new)-E(old) If  E<0 then accept else if random[0, 1] < e -  E /kT then accept else reject Boltzmann -- probability of conformation c: P(c) = e -E(c)/kT E P

Monte Carlo (MC) simulation The idea is that by always accepting a better configuration, on the average the system will tend to move toward a (local) energy minimum, while conversely, by sometimes accepting worse configurations, the system will be able to "climb" out of a sub-optimal local minima, and perhaps fall into the basin of attraction of the global minimum. The specific algorithms for probabilistically generating and accepting new configurations define the type of "Monte Carlo" algorithm; some common methods are "Metropolis," "Gibbs Sampler," "Heat Bath," "Simulated Annealing," "Great Deluge," etc. MC techniques are computationally more efficient than MD MC simulations do not incorporate a notion of time! E Configuration space (models) Local minimum Global minimum

#! /usr/bin/perl #=============================================================================== # # $Id: mcdemo.pl,v /03/12 16:13:28 jkleinj Exp $ # # mcdemo: Demo program for MC simulation of the number pi # # (C) 2003 Jens Kleinjung # # Dr Jens Kleinjung, Room P440 | # Bioinformatics Unit, Faculty of Sciences | Tel # Free University Amsterdam | Fax # De Boelelaan 1081A, 1081 HV Amsterdam | # #=============================================================================== # preset parameters $hits = 1; $miss = 1; for ($i=0; $i<100000; $i++) { # assign random x,y coordinates $x = rand; $y = rand; # calculate radius $r = sqrt(($x*$x)+($y*$y)); # sum up hits and misses if ($r <= 1) { $hits++; } else { $miss++; } # calculate pi $pi = (4*$hits)/($hits +$miss); # print pi if ($i%100 == 0) { print("$i $pi\n"); } } #===============================================================================

In many conformational search methods based on Monte Carlo (MC), after a MC move, the system is energy minimised, i.e. put in the lowest local energy conformation, for example by gradient descent (steepest descent).

What can be done with MD and MC Dynamics of proteins Protein folding – very difficult Protein unfolding – done with MD Structure refinement – most frequent application –After experimental structure elucidation –After some model building operation PPI – Interaction dynamics, Docking Hydrophobic patch dynamics

Take home messages Experimentally determining protein structures –X-ray diffraction From crystallised protein sample to electron density map –Structure descriptors: resolution, R-factor –Nuclear magnetic resonance (NMR) Based on atomic nuclear spin Produces set of distances between residues (distance restraints) Distances are used to build protein model using Distance Geometry Protein dynamics simulation –Molecular dynamics Follows Newton’s equations of motion Simulates molecular movements through time Very small time steps (typically 2 femtoseconds = 2* seconds) Protein conformational search –Monte Carlo Conformations are randomly changed Uses Mitropolis criterion to decide between conformation i and i+1 based on conformational internal energy and the Boltzmann equation Has no notion of time, is a conformational search protocol –Normally faster than MD so more conformations can be generated