ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation.

Slides:



Advertisements
Similar presentations
Protein NMR terminology COSY-Correlation spectroscopy Gives experimental details of interaction between hydrogens connected via a covalent bond NOESY-Nuclear.
Advertisements

Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Applications and integration with experimental data Checking your results Validating your results Structure determination from powder data calculations.
CING Validation Tools for Biomolecular NMR Structures Jurgen F. Doreleijers & Geerten W. Vuister Protein Biophysics, IMM & CMBI Radboud University Nijmegen.
Power and weakness of data Power: data + software + bioinformatician = answer. Weakness: Data errors. Data poorly understood. Poor software. Never enough.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structure validation Everything that can go wrong, will go wrong, especially with things as complicated as protein structures. Everything that can go wrong,
Force Fields G Vriend It is all about time versus accuracy Quantum chemistry Approximations Force Fields Hybrid methods Self consistent fields.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Incorporating additional types of information in structure calculation: recent advances chemical shift potentials residual dipolar couplings.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Seminar series 2 Protein structure validation. In 't verleden ligt het heden; in 't nu, wat worden zal. The past: Linus Pauling ‘Inventor’ of helix and.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Thomas Blicher Center for Biological Sequence Analysis
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Lecture 9 Hidden Markov Models BioE 480 Sept 21, 2004.
©CMBI 2002 Homology modelling ? X-ray ? NMR ? Intro Proteins Modelling 8 Steps Detect Threading Alignment Template Side chain Indels Optimize Validate.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
IV. Protein Structure Prediction and Determination Methods of protein structure determination Critical assessment of structure prediction Homology modelling.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Structure validation Everything that can go wrong, will go wrong. Everything that could go wrong has gone wrong. Especially with something as complicated.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
A Probabilistic Approach to Protein Backbone Tracing in Electron Density Maps Frank DiMaio, Jude Shavlik Computer Sciences Department George Phillips Biochemistry.
Structure Representation and Coordinates Format Lecture 3 Structural Bioinformatics Dr. Avraham Samson
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Comparing Data from MD simulations and X-ray Crystallography What can we compare? 3D shapes (Scalar coupling constants, a.k.a. J-values, nuclear Overhauser.
Biomolecular Nuclear Magnetic Resonance Spectroscopy BASIC CONCEPTS OF NMR How does NMR work? Resonance assignment Structure determination 01/24/05 NMR.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid.
Secondary structure prediction
Biomolecular Nuclear Magnetic Resonance Spectroscopy FROM ASSIGNMENT TO STRUCTURE Sequential resonance assignment strategies NMR data for structure determination.
Copyright © 2007, BioXGEM Lab., Institute of Bioinformatics, NCTU All rights reserved. ARPPI: A knowledge-based model for protein-protein interactions.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Anatomy ± Physiology Minnesota Science Olympiad States 2011.
Doug Raiford Lesson 17.  Framework model  Secondary structure first  Assemble secondary structure segments  Hydrophobic collapse  Molten: compact.
SimBioSys Inc.© 2004http:// Conformational sampling in protein-ligand complex environment Zsolt Zsoldos SimBioSys Inc., © 2004 Contents:
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Seminar series 2 Protein structure validation. Structure validation Everything that can go wrong, will go wrong, especially with things as complicated.
Atomic structure model
X-ray detection xray/facilities.html.
Force Fields Summary. Force Fields 2 What is a Force Field ? A force field is a set of equations and parameters which when evaluated for a (molecular)
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
©CMBI 2002 Structure validation Murphy’s law: Everything that can go wrong, will go wrong, especially with things as complicated as protein structures.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Stony Brook Integrative Structural Biology Organization
PDBe Protein Interfaces, Surfaces and Assemblies
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
1. Pure Protein (0.3 mL, mM; ~ 10 mg)
Molecular Modeling By Rashmi Shrivastava Lecturer
Protein structure prediction.
Programme Last week’s quiz results + Summary
Structure validation Everything that can go wrong, will go wrong, especially with things as complicated as protein structures.
Christian X. Weichenberger, Manfred J. Sippl  Structure 
Presentation transcript:

ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

The plan for today: Gert Vriend: Introduction to validation Robbie Joosten: X-ray structure validation and improvement Jurgen Doreleijers:NMR structure validation (and improvement ) Bas Vroling (and you):YASARA All (and you):General validation practicals Split-up in groups:General validation issues X-ray specific issues NMR specific issues Continuation of validation practicals At the end:Overview of validation and related facilities And in-between we have coffee, lunch, tea, and whatever else they throw at us at any moment that anybody feels like it. Protein structure validation

Structure validation Everything that can go wrong, will go wrong, especially with things as complicated as protein structures.

What is real?

ATOM 1 N LEU ATOM 2 CA LEU ATOM 3 C LEU ATOM 4 O LEU ATOM 5 CB LEU ATOM 6 CG LEU ATOM 7 CD1 LEU ATOM 8 CD2 LEU

X-ray

‘FFT-inv’ FFT-inv And now move the atoms around till the calculated reflections best match the observed ones.

Multiple minima X-ray refinement / multiple minima

X-ray R-factor Error = Σ w.(obs-calc) 2 R-factor = Σ w.|obs-calc|

X-ray resolution

NMR data collection

NMR data NMR data consists of short inter-atomic distances between atoms. We call these NOEs. Most NOEs are between close neighbours in the sequence. Those hold little information. The ‘good’ NOEs are between atoms far away in the sequence. There are few of those, normally. NOEs are known with low precision. E.g. NOEs are binned , , and NMR can also measure some angles, and relative orientations. The latter, called RDCs are powerful.

NMR Q-factor Error = Σ NOE/RDC-violations + Energy term 2

NMR versus X-ray With X-ray you measure reflections. Each reflection holds information about each atom. With NMR you measure pair-wise distances, angles, and orientations. These all hold local information. X-ray requires crystals, and crystals cause/are artefacts. NMR is in solution, but provides much less precision.

NMR versus X-ray NMRX-ray ‘Error’ 1-2 Å Å Mobilityyesnot really Crystal artefactsnoyes Material needed20 mg1 mg Cost of hardware4 M Euronear infinite (share) Drug designnoalmost Better combine and use the best of both worlds.

Why validation ? Why does a sane (?) human being spend twenty years to search for millions of errors in the PDB?

Validation because: Everything we know about proteins comes from PDB files. Errors become less dangerous when you know about them. And, going back to the red thread through this series, if a template is wrong the model will be wrong.

What kind of errors can the software find? Administrative errors. Crystal-specific errors. NMR-specific errors. Really wrong things. Improbable things. Things worth looking at. Ad hoc things.

Smile or cry? A 5RXN 1.2 B 7GPB 2.9 C 1DLP 3.3 D 1BIW 2.5

Why? Simple, proteins are very complex.

X-ray specific

Little things hurt big

How bad is bad? X-ray

Check with force fields A force field is a set of parameters together with a set of rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future. In structure validation we often look at structures and count ‘things’ For example, we count that the number of buried hydrogen bond donors that do not make a hydrogen bond is 4.6+/-1.2 per 100 amino acids in well-solved proteins. So we call that normal, and now, using ΔG=-RTln(K), we can calculate the energy penalty for proteins with more than 4.6 unsatisfied buried unsatisfied hydrogen bonds.

Contact Probability

Contact probability box

One slide about homology modelling

His, Asn, Gln ‘flips’

Hydrogen bond network

Your best check:

How difficult can it be? 1CBQ 2.2 A

How difficult can it be?

Errors or discoveries? Buried histidine. Warning for buried histidine triggered biochemical follow -up and new mechanism for KH-module of Vigilin. (A. Pastore, 1VIG).

Acknowledgements: Elmar Krieger Sander Nabuurs Chris Spronk Maarten Hekkelman Rob Hooft Robbie Joosten