Molecular dynamics and applications to amyloidogenic sequences Nurit Haspel, David Zanuy, Ruth Nussinov (in cooperation with Ehud Gazit’s group)

Slides:



Advertisements
Similar presentations
Time averages and ensemble averages
Advertisements

Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Statistical mechanics
A Digital Laboratory “In the real world, this could eventually mean that most chemical experiments are conducted inside the silicon of chips instead of.
Survey of Molecular Dynamics Simulations By Will Welch For Jan Kubelka CHEM 4560/5560 Fall, 2014 University of Wyoming.
LINCS: A Linear Constraint Solver for Molecular Simulations Berk Hess, Hemk Bekker, Herman J.C.Berendsen, Johannes G.E.M.Fraaije Journal of Computational.
Solvation Models. Many reactions take place in solution Short-range effects Typically concentrated in the first solvation sphere Examples: H-bonds,
Introduction to Molecular Orbitals
Questions 1) Are the values of r0/theta0 approximately what is listed in the book (in table 3.1 and 3.2)? -> for those atom pairs/triplets yes; 2) In the.
Computational methods in molecular biophysics (examples of solving real biological problems) EXAMPLE I: THE PROTEIN FOLDING PROBLEM Alexey Onufriev, Virginia.
Computational Chemistry
Ion Solvation Thermodynamics from Simulation with a Polarizable Force Field Gaurav Chopra 07 February 2005 CS 379 A Alan GrossfeildPengyu Ren Jay W. Ponder.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
2. Modeling of small systems Building the model What is the optimal conformation of a molecule? What is the relative energy of a given conformation? What.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Protein Tertiary Structure Prediction. Protein Structure Prediction & Alignment Protein structure Secondary structure Tertiary structure Structure prediction.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Molecular Dynamics Classical trajectories and exact solutions
Joo Chul Yoon with Prof. Scott T. Dunham Electrical Engineering University of Washington Molecular Dynamics Simulations.
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
Coarse grained simulations of p7 folding

Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Molecular Modeling Part I Molecular Mechanics and Conformational Analysis ORG I Lab William Kelly.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Geometry Optimisation Modelling OH + C 2 H 4 *CH 2 -CH 2 -OH CH 3 -CH 2 -O* 3D PES.
Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.
Molecular Modeling Part I. A Brief Introduction to Molecular Mechanics.
Molecular Dynamics Simulations
Javier Junquera Molecular dynamics in the microcanonical (NVE) ensemble: the Verlet algorithm.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Molecular Dynamics Simulation Solid-Liquid Phase Diagram of Argon ZCE 111 Computational Physics Semester Project by Gan Sik Hong (105513) Hwang Hsien Shiung.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Minimization v.s. Dyanmics A dynamics calculation alters the atomic positions in a step-wise fashion, analogous to energy minimization. However, the steps.
Basics of molecular dynamics. Equations of motion for MD simulations The classical MD simulations boil down to numerically integrating Newton’s equations.
1 CE 530 Molecular Simulation Lecture 6 David A. Kofke Department of Chemical Engineering SUNY Buffalo
Rates of Reactions Why study rates?
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
Chemical Reactions in Ideal Gases. Non-reacting ideal gas mixture Consider a binary mixture of molecules of types A and B. The canonical partition function.
A Technical Introduction to the MD-OPEP Simulation Tools
Molecular Dynamics simulations
Molecular Mechanics Studies involving covalent interactions (enzyme reaction): quantum mechanics; extremely slow Studies involving noncovalent interactions.
 We just discussed statistical mechanical principles which allow us to calculate the properties of a complex macroscopic system from its microscopic characteristics.
Computational Biology BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Ch 24 pages Lecture 11 – Equilibrium centrifugation.
Simplistic Molecular Mechanics Force Field Van der WaalsCharge - Charge Bond Angle Improper Dihedral  
LSM3241: Bioinformatics and Biocomputing Lecture 6: Fundamentals of Molecular Modeling Prof. Chen Yu Zong Tel:
Quantum Mechanics/ Molecular Mechanics (QM/MM) Todd J. Martinez.
Interacting Molecules in a Dense Fluid
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Molecular Mechanics (Molecular Force Fields). Each atom moves by Newton’s 2 nd Law: F = ma E = … x Y Principles of M olecular Dynamics (MD): F =
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
--Experimental determinations of radial distribution functions --Potential of Mean Force 1.
Statistical Mechanics and Multi-Scale Simulation Methods ChBE
Computational Biology BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Molecular dynamics (MD) simulations  A deterministic method based on the solution of Newton’s equation of motion F i = m i a i for the ith particle; the.
Lecture 7: Molecular Mechanics: Empirical Force Field Model Nanjie Deng Structural Bioinformatics II.
Overview of Molecular Dynamics Simulation Theory
Molecular dynamics (MD) simulations
Molecular Modelling - Lecture 3
Computational Analysis
Molecular dynamics (MD) simulations
Protein Structure Prediction
Large Time Scale Molecular Paths Using Least Action.
Intro to Molecular Dynamics (MD) Simulation using CHARMM
CZ5225 Methods in Computational Biology Lecture 7: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Presentation transcript:

Molecular dynamics and applications to amyloidogenic sequences Nurit Haspel, David Zanuy, Ruth Nussinov (in cooperation with Ehud Gazit’s group)

Contents Molecular dynamics: goals, applications and basic principles: Molecular dynamics: goals, applications and basic principles: Newton’s equations of motion. Newton’s equations of motion. Energy conservation and equations. Energy conservation and equations. The force field. The force field. Solvation models. Solvation models. Periodic boundary conditions. Periodic boundary conditions. A molecular dynamic protocol A molecular dynamic protocol Energy minimization. Energy minimization.

Contents (cont.) MD protocol (cont) MD protocol (cont) Assignment of initial velocities. Assignment of initial velocities. Equilibration. Equilibration. Methods of integration. Methods of integration. Case study: Human calcitonin hormone Case study: Human calcitonin hormone Basic background. Basic background. Simulated models. Simulated models. Initial results and future work plans. Initial results and future work plans.

The goal of MD Predicting the structure and energy of molecular systems (in our case – short peptide structures). Predicting the structure and energy of molecular systems (in our case – short peptide structures). Simulating the behavior of the molecules in the solution (by solving the energy equations at every time interval). Simulating the behavior of the molecules in the solution (by solving the energy equations at every time interval). Trying to find a model that explains the behavior of the system. Trying to find a model that explains the behavior of the system.

Applications of MD Sampling the conformational space over time. Important for ligand docking, for example. Sampling the conformational space over time. Important for ligand docking, for example. Determine equilibrium averages, structural and motional properties of the system. Determine equilibrium averages, structural and motional properties of the system. Study the time development of the system. Study the time development of the system. Today, most (if not all) biomolecular structures obtained by X-ray crystallography or NMR are MD refined. Today, most (if not all) biomolecular structures obtained by X-ray crystallography or NMR are MD refined.

Types of simulated systems Peptidic systems Peptidic systems Micelle formation Micelle formation Nucleotides Nucleotides Small molecules Small molecules Ligand docking Ligand docking … Note: Each type of system has its own unique parameters and equations. Note: Each type of system has its own unique parameters and equations.

The basic principle Solving the classical mechanics equations (Newton’s equations) over the pairs of atom distances, angles, dihedrals, VdW interactions and electrostatics in small time intervals (other parameters can be added). classical equations are usually sufficient for large scale systems. Quantum mechanical modifications are extremely costly and are used only on small scale system or where more accuracy is needed.

Newton’s mechanical equation (based on Newton’s second law) F V1 V2 Or, with a small enough time interval Δt: F = Ma = M*(dv/dt) = M*(d 2 r/dt 2 ) ΔV = (F/M)* Δt → V2 = V1+(F/M)* Δt

Newton equations (cont.) The new position, r 2 is determined by the old position, r 1 and the velocity v 2 over time Δt (which should be very small!). The above equation describes the changes in the positions of the atoms over time.

The process of MD The simulation is the numerical integration of the Newton equations over time. Positions and velocities at time t Positions and velocities at time t+dt. Positions + velocities = trajectory.

The connection between force and energy U = the energy (scalar). r = the position vector. F=-dU/dr →U=-∫Fdr=-1/2*Mv 2

Conservation of energy The potential energy is taken from the force field parameters. ½*ΣM i V i 2 + ΣE pot,i =const

The potential energy equations – bonded interactions U(R) = bond bond angle angle dihedral dihedral

The potential energy equations (cont., non-bonded interactions) Van der Waals Van der Waals electrostatic electrostaticEtc… The energy parameters are defined in the force field

The force field definition All the equations and the adjusted parameters that allow to describe quantitatively the energy of the chemical system. Note, that mixing equations and parameters from different systems always results in errors! Force field examples: FF2, FF3, Sybyl, charmm etc.

Solvation models No solvent – constant dielectric. No solvent – constant dielectric. Continuum – referring to the solvent as a bulk. No explicit representation of atoms (saving time). Continuum – referring to the solvent as a bulk. No explicit representation of atoms (saving time). Explicit – representing each water molecule explicitly (accurate, but expensive). Explicit – representing each water molecule explicitly (accurate, but expensive). Mixed – mixing two models (for example: explicit + continuum. To save time). Mixed – mixing two models (for example: explicit + continuum. To save time).

Periodic boundary conditions Problem: Only a small number of molecules can be simulated and the molecules at the surface experience different forces than those at the inner side. The simulation box is replicated infinitely in three dimensions (to integrate the boundaries of the box). The simulation box is replicated infinitely in three dimensions (to integrate the boundaries of the box). When the molecule moves, the images move in the same fashion. When the molecule moves, the images move in the same fashion. The assumption is that the behavior of the infinitely replicated box is the same as a macroscopic system. The assumption is that the behavior of the infinitely replicated box is the same as a macroscopic system.

Periodic boundary conditions

A sample MD protocol Read the force fields data and parameters. Read the force fields data and parameters. Read the coordinates and the solvent molecules. Read the coordinates and the solvent molecules. Slightly minimize the coordinates (the created model may contain collisions), a few SD steps followed by some ABNR steps. Slightly minimize the coordinates (the created model may contain collisions), a few SD steps followed by some ABNR steps. Warm to the desired temperature (assign initial velocities). Warm to the desired temperature (assign initial velocities). Equilibrate the system. Equilibrate the system. Start the dynamics and save the trajectories every 1ps (trajectory=the collection of structures at any given time step). Start the dynamics and save the trajectories every 1ps (trajectory=the collection of structures at any given time step).

Why is minimization required? Most of the coordinates are obtained using X- ray diffraction or NMR. Most of the coordinates are obtained using X- ray diffraction or NMR. Those methods do not map the hydrogen atoms of the system. Those methods do not map the hydrogen atoms of the system. Those are added later using modeling programs (such as insight), which are not 100% accurate. Those are added later using modeling programs (such as insight), which are not 100% accurate. Minimization is therefore required to resolve the clashes that may “blow up” the energy function. Minimization is therefore required to resolve the clashes that may “blow up” the energy function.

Common minimization protocols First order algorithms: First order algorithms: Steepest descent Steepest descent Conjugated gradient Conjugated gradient Second order algorithms: Second order algorithms: Newton-Raphson Newton-Raphson Adopted basis Newton Raphson (ABNR) Adopted basis Newton Raphson (ABNR)

Steepest descent This is the simplest minimization method: The first directional derivative (gradient) of the potential is calculated and displacement is added to every coordinate in the opposite direction (the direction of the force). The first directional derivative (gradient) of the potential is calculated and displacement is added to every coordinate in the opposite direction (the direction of the force). The step is increased if the new conformation has a lower energy. The step is increased if the new conformation has a lower energy. Advantages: Simple and fast. Advantages: Simple and fast. Disadvantages: Inaccurate, usually does not converge. Disadvantages: Inaccurate, usually does not converge.

Conjugated gradient Uses first derivative information + information from previous steps – the weighted average of the current gradient and the previous step direction. Uses first derivative information + information from previous steps – the weighted average of the current gradient and the previous step direction. The weight factor is calculated from the ratio of the previous and current steps. The weight factor is calculated from the ratio of the previous and current steps. This method converges much better than SD. This method converges much better than SD.

Newton-Raphson algorithm Uses both first derivative (slope) and second (curvature) information. Uses both first derivative (slope) and second (curvature) information. In the one-dimensional case: In the one-dimensional case: In the multi-dimensional case – much more complicated (calculates the inverse of a hessian [curvature] matrix at each step) In the multi-dimensional case – much more complicated (calculates the inverse of a hessian [curvature] matrix at each step) Advantage: Accurate and converges well. Advantage: Accurate and converges well. Disadvantage: Computationally expensive, for convergence, should start near a minimum. Disadvantage: Computationally expensive, for convergence, should start near a minimum.

Adopted basis Newton Raphson (ABNR) An adaptation of the NR method that is especially suitable for large systems. An adaptation of the NR method that is especially suitable for large systems. Instead of using a full matrix, it uses a basis that represents the subspace in which the system made the most progress in the past. Instead of using a full matrix, it uses a basis that represents the subspace in which the system made the most progress in the past. Advantage: Second derivative information, convergence, faster than the regular NR method. Advantage: Second derivative information, convergence, faster than the regular NR method. Disadvantages: Still quite expensive, less accurate than NR. Disadvantages: Still quite expensive, less accurate than NR.

Assignment of initial velocities At the beginning the only information available is the desired temperature. Initial velocities are assigned randomly according to the Maxwell-Bolzmann distribution: At the beginning the only information available is the desired temperature. Initial velocities are assigned randomly according to the Maxwell-Bolzmann distribution: P v - the probability of finding a molecule with velocity between v and dv. Note that: 1. the velocity has x,y,z components. 2. The velocities exhibit a gaussian distribution

Bond and angle constraints (SHAKE algorithm) Constrain some bond lengths and/or angles to fixed values using a restraining force G i. Solve the equations once with no constraint force. Solve the equations once with no constraint force. Determine the magnitude of the force (using lagrange multipliers) and correct the positions accordingly. Determine the magnitude of the force (using lagrange multipliers) and correct the positions accordingly. Iteratively adjust the positions of the atoms until the constraints are satisfied. Iteratively adjust the positions of the atoms until the constraints are satisfied.

Equilibrating the system Velocity distribution may change during simulation, especially if the system is far from equilibrium. Perform a simulation, scaling the velocities occasionally to reach the desired temperature. Perform a simulation, scaling the velocities occasionally to reach the desired temperature. The system is at equilibrium if: The system is at equilibrium if: The quantities fluctuate around an average value. The quantities fluctuate around an average value. The average remains constant over time. The average remains constant over time.

The verlet integration method Taylor expansion about r(t): Combining the equation results in: Which is velocity independent. The error is of order δt 4 (the next expression of the series)

The verlet method (cont.) The velocities can be calculated using the derivation formula: Here the error is of order δt 2 Note – the time interval is in the order of 1fs. ( s)

The verlet algorithm Start with r(t) and r(t-δt) Start with r(t) and r(t-δt) Calculate a(t) from the Newton equation: Calculate a(t) from the Newton equation: a(t) = f i (t)/m i. a(t) = f i (t)/m i. Calculate r(t+δt) according to the aforementioned equation. Calculate r(t+δt) according to the aforementioned equation. Calculate v(t). Calculate v(t). Replace r(t-δt) with r and r with r(t+δt). Replace r(t-δt) with r and r with r(t+δt). Repeat as desired. Repeat as desired.

Amyloid fibril formation Associated with a large number of degenerative diseases such as Alzheimer’s, Parkinson’s etc. Associated with a large number of degenerative diseases such as Alzheimer’s, Parkinson’s etc. Associated with a structural change in the protein structure, resulting in the formation of stable fibrils. Associated with a structural change in the protein structure, resulting in the formation of stable fibrils. The fibrils are richer in β-sheets (although their tertiary arrangements are usually undetermined). The fibrils are richer in β-sheets (although their tertiary arrangements are usually undetermined). Amyloid forming proteins do not share sequence homology, but the fibrillar structures exhibit similar physicochemical and structural characteristics. Amyloid forming proteins do not share sequence homology, but the fibrillar structures exhibit similar physicochemical and structural characteristics.

The human calcitonin (hCT) A 32 amino acid polypeptide hormone, produced by the C-cells of the thyroid and involved in calcium homeostasis. A 32 amino acid polypeptide hormone, produced by the C-cells of the thyroid and involved in calcium homeostasis. Fibrillation of hCT was found to be associated with carcinoma of the thyroid. Fibrillation of hCT was found to be associated with carcinoma of the thyroid. Synthetic hCT can form amyloid fibrils in vitro with similar morphology to the deposits found in the thyroid. Synthetic hCT can form amyloid fibrils in vitro with similar morphology to the deposits found in the thyroid. The in vitro process is affected by the pH of the system. The in vitro process is affected by the pH of the system.

The structure of hCT In monomeric state, hCT has little ordered secondary structure in room temperature. In monomeric state, hCT has little ordered secondary structure in room temperature. Fibrillated hCT have both helical and sheet components. Fibrillated hCT have both helical and sheet components. In DMSO/H 2 O a short double stranded anti- parallel β-sheet is formed in the region of residues In DMSO/H 2 O a short double stranded anti- parallel β-sheet is formed in the region of residues Previous research indicated a critical role to residues Previous research indicated a critical role to residues

The sequence of hCT NH 2 -CGNLSTCMLGTYQDFNKFHTFPQTAIGVGAP-COOH

Experimental data regarding the fibril forming region The DFNKF area was found to form fibrils rich in anti-parallel β-sheets. The DFNKF area was found to form fibrils rich in anti-parallel β-sheets. The spectrum observed with the DFNK tetrapeptide is less typical of β-sheets, but may be interpreted as such. The spectrum observed with the DFNK tetrapeptide is less typical of β-sheets, but may be interpreted as such. The FNKF tetrapeptide exhibits a spectrum that is typical of a non-ordered structure. The FNKF tetrapeptide exhibits a spectrum that is typical of a non-ordered structure. The DFN tripeptide seems to be a mixture of β-sheet and non-ordered structure. The DFN tripeptide seems to be a mixture of β-sheet and non-ordered structure.

The effect of F→A mutation The DANKA mutation does not exhibit a typical spectrum of the β-sheet structure, although they exhibit a certain degree of order. This implies on the effect of the Phe aromatic residues in the fibrillation process.

Tested models Combinations of parallel/anti parallel within sheet and between sheets. So far – about 20 models. Combinations of parallel/anti parallel within sheet and between sheets. So far – about 20 models. Each model is simulated for 4ns. (each such simulation takes about 5 days on a powerful cluster…). Each model is simulated for 4ns. (each such simulation takes about 5 days on a powerful cluster…). The tested parameters for model stability: distance within/between sheets, aromatic interactions, HB contact conservation etc. The tested parameters for model stability: distance within/between sheets, aromatic interactions, HB contact conservation etc.

Topologically different models

An example of a model

Initial results (trajectory analysis) A model that’s totally unstable: Before: After:

Average intra-sheet distance analysis

Percentage of conserved H-bonds over time

Future work plans Test mutations once we focus on the correct model. Test mutations once we focus on the correct model. Make more analyses and find out what causes the fibril formation (suspicion: The aromatic ring π-stacking, salt bridges between the oppositely charged residues D and K) Make more analyses and find out what causes the fibril formation (suspicion: The aromatic ring π-stacking, salt bridges between the oppositely charged residues D and K) …??? …???