EBI is an Outstation of the European Molecular Biology Laboratory. Assessment of macromolecular interactions and identification of macromolecular assemblies.

Slides:



Advertisements
Similar presentations
Macromolecular Complexes in Crystals and Solutions CCP4 Study Weekend, Nottingham, UK, 7-8 Jan Research Complex at Harwell Eugene Krissinel CCP4,
Advertisements

Ligands and reversible binding. Ligands Kinetic experiments study the rate at which reactions happen.- how conc of reactant and product change as funct.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
EMBL-EBI MSDpisa a web service for studying Protein Interfaces, Surfaces and Assemblies Eugene Krissinel
Lecture 15: Regulation of Proteins 2: Allosteric Control of Hemoglobin Hemoglobin and Myoglobin Allosteric Transition in Hemoglobin Physiological Role.
Biochemistry 300 Introduction to Structural Biology Walter Chazin 5140 BIOSCI/MRBIII
Biochemistry Atoms, Elements, and Compounds Chemical Reactions
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Review of “Stability of Macromolecular Complexes” Dan Kulp Brooijmans, Sharp, Kuntz.
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Protein Interfaces, Surfaces and Assemblies
Protein Tertiary Structure Prediction
Molecular Modeling: The Computer is the Lab
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
Coordinate handling and exploitation An overview of coordinate functionality in CCP4 suite Coordinate functionality in REFMAC group of programs (A. Vaguine)
Evaluation of Structure Quality Using RCSB PDB Tools Kyle Burkhardt, Lead Data Annotator The RCSB PDB at Rutgers University.
Introduction to Macromolecular X-ray Crystallography Biochem 300 Borden Lacy Print and online resources: Introduction to Macromolecular X-ray Crystallography,
6 Energy and Energy Conversions Cells must acquire energy from their environment. Cells cannot make energy; energy is neither created nor destroyed, but.
CHEMISTRY. Composition of Matter Matter - Everything in universe is composed of matter Matter is anything that occupies space or has mass Mass – quantity.
Atoms, Elements, and Compounds- Chapter 6
6 Energy, Enzymes, and Metabolism. 6 Energy and Energy Conversions To physicists, energy represents the capacity to do work. To biochemists, energy represents.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Databank in Europe (PDBe)‏ An Introduction.
Biochemical Reactions Chemistry in Biology Chapter 6.
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes MSD Protein.
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
Question 1: Name, PDB codes Eric Martz (no research lab) Macromolecular visualization I chose*: 3onz HUMAN TETRAMERIC HEMOGLOBIN:
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Energy and Metabolism Chapter 8. Energy Metabolism All the chemical reactions carried out by the cell.
EMBL-EBI MSDpisa a web service for studying Protein Interfaces, Surfaces and Assemblies Eugene Krissinel
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Study of Pentacene clustering MAE 715 Project Report By: Krishna Iyengar.
STRUCTURAL BIOLOGY Martina Mijušković ETH Zürich, Switzerland.
EBI is an Outstation of the European Molecular Biology Laboratory. MSDchem and the chemistry of the wwPDB EMBO 22nd-26th September 2008 EMBL-EBI Hinxton.
Biochemistry - as science; biomolecules; metabolic ways. Structure of proteins, methods of its determination.
Atoms  Chemistry is the study of matter. 6.1 Atoms, Elements, and Compounds  Atoms are the building blocks of matter. Chapter 6 Chemistry in Biology.
EBI is an Outstation of the European Molecular Biology Laboratory. Quaternary Structure.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
Question 1: Name, PDB codes Eric Martz Keiichi Namba Macromolecular visualization I chose: 3onz HUMAN TETRAMERIC HEMOGLOBIN: PROXIMAL.
Enzyme mechanisms B. X-ray methods 1 Small angle X-ray scattering (SAX) Kinetic crystallography – slow reactions Kinetic crystallography – Laue method.
EBI is an Outstation of the European Molecular Biology Laboratory. Sanchayita Sen, Ph.D. PDB Depositions Validation & Structure Quality.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Deposition, Validation, Search and Analysis Services.
Macromolecular Structure Database Project EMSD Infra-structure Services for Europe To develop an autonomous structural database capability in Europe
Introduction. Zn 2+ homeostasis is regulated at the transcriptional level by the DNA-binding protein SmtB. Manipulation of Zn 2+ homeostasis could act.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
EM Maps and Models in EMDB/PDB. Growth of EM entries
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe-PISA a web based service for understanding Protein Interfaces, Surfaces and Assemblies.
Objective 7: TSWBAT recognize and give examples of four levels of protein conformation and relate them to denaturation.
Atomic structure model
Energy and Metabolism Chapter 8. Energy Metabolism All the chemical reactions carried out by the cell.
Using Technology to Study Cellular and Molecular Biology.
Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc Christof Francke In silico reconstruction of the metabolic.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
400x 1000x 400x Problem of the day: Are these prokaryotic or eukaryotic cells?
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Databank in Europe (PDBe)‏ An Introduction.
Chapter 6 Chemistry of Life.
PDBe Protein Interfaces, Surfaces and Assemblies
Question 1: Name, PDB codes
AO1- development of science knowledge- KS3
BIO201 SPRING 2018 Introduction to Biochemistry & Biotechnology
Chapter 2 CHEMISTRY OF LIFE.
Enzyme Kinetics & Protein Folding 9/7/2004
Enzyme-Substrate Complex
List a Carbohydrate Monomer
Presentation transcript:

EBI is an Outstation of the European Molecular Biology Laboratory. Assessment of macromolecular interactions and identification of macromolecular assemblies in crystalline state (PISA software) Eugene Krissinel Macromolecular Structure Database

Macromolecular Assemblies Complexes of protein, DNA/RNA chains and ligands, stable in native environment The way the chains assemble represents the [Protein] Quaternary Structure (PQS) Macromolecular assemblies are often the Biological Units, performing a certain biochemical function Biological and biochemical significance of macromolecular assemblies is truly immense

Biological Unit Example: Hemoglobin Iron-containing oxygen- transport tetrameric protein complex Oxygen gets bound to Fe +2 in heme ligand Binding and release of ligands induces structural changes PDB code: 1a00

Biological Units are Hard to Study Light / Neutron / X-ray / Small angle scatterings: mainly composition and multimeric state may be found. 3D shape may be guessed from mobility measurements. Very few quaternary structures have been identified experimentally. Electron microscopy: not a fantastic resolution and not applicable to all objects NMR is not good for big chains, even less so for protein assemblies.

Biological Units in the PDB But PQS are assigned to many entries in PDB! Most of those are PROBABLE Quaternary Structures. 1.Depositor’s say prevails. 2.Accept everything which passes formal validation checks. 3.No experimental evidence for PQS is required. 4.If a depositor does not know or does not care (60-80% of instances for PQS), the curator is to decide. 5.The curator may (EBI-MSD) or may not (Rutgers-RCSB, PDBj) use computing/modeling tools to assist the PQS annotation. The PDB “rules” are:

Asymmetric Unit vs. Biological Unit Crystal = translated Unit Cells More than 80% of macromolecular structures are solved by means of X-ray diffraction on crystals. It is reasonable to expect that Biological Units make construction blocks for the crystal. An X-ray diffraction experiment produces atomic coordinates of the Asymmetric Unit, which is stored as a PDB file. In general, neither Asymmetric Unit nor Unit Cell has any direct relation to Biological Unit. Biological Unit may be made of Unit Cell = all space symmetry group mates of ASU PDB file (ASU) a single ASU part of ASU several ASU several ASU parts Biological Unit

PDB entries and Biological Units PDB entry 1P30 A monomer? Biological unit 1P30 Homotrimer!

PDB entries and Biological Units PDB entry 2TBV A trimer? Biological Unit 2TBV 180-mer!

PDB entries and Biological Units PDB entry 1E94 ???????? 2 Biological Units in 1E94: A dodecamer and a hexamer!

Protein Crystallography in Simple Words ? no image or bad image 2 crystallisation 3 in crystal ?? good image but no associations in vivo 1

Significant interfaces? PQS EBI (Kim Henrick) Trends in Biochem. Sci. (1998) 23, 358 PITA EBI (Hannes Ponstingl) J. Appl. Cryst. (2003) 36, 1116

Significant interfaces or artefacts? “ No single parameter absolutely differentiates the interfaces from all other surface patches ” Jones, S. & Thornton, J.M. (1996) Principles of protein-protein interactions, Proc. Natl. Acad. Sci. USA, 93, Formation of N>2 -meric complexes is most probably a corporate process involving a set of interfaces. Therefore significance of an interface should not be detached from the context of protein complex “… the type of complexes need to be taken into account when characterizing interfaces between them. ” Jones, S. & Thornton, J.M., ibid.

Is there a Measure of Significance?

Chemistry’s say It is not so much properties of individual interfaces but rather chemical stability of protein complex in general that really matters Protein chains will most likely associate into largest complexes that are still stable A protein complex is stable if its free Gibbs energy of dissociation is positive:

Detection of Biological Units in Crystals 1.Enumerate all possible assemblies in crystal, subject to crystal structure: space symmetry group, geometry and composition of the Asymmetric Unit (Graph Theory used) 2.Evaluate assemblies for chemical stability 3.Leave only stable assemblies in the list and range them by chances to be a Biological Unit : Larger assemblies take preference Single-assembly solutions take preference Otherwise, assemblies with higher  G diss take preference Method Summary

Classification of protein complexes Assembly classification on the benchmark set of 218 structures published in Ponstingl, H., Kabir, T. and Thornton, J. (2003) Automatic inference of protein quaternary structures from crystals. J. Appl. Cryst. 36, homomers and 20 heteromers Fitted parameters: 1.Free energy of a H-bond : 2.Free energy of a salt bridge : 3.Constant entropy term : 4.Surface entropy factor : = 0.51 kcal/mol = 0.21 kcal/mol = 11.7 kcal/mol = 0.57·10 -3 kcal/(mol*Å 2 ) Classification error in  G diss : ± 5 kcal/mol

Classification of protein-DNA complexes Assembly classification on the benchmark set of 212 protein – DNA complexes published in Luscombe, N.M., Austin, S.E., Berman H.M. and Thornton, J.M. (2000) An overview of the structures of protein-DNA complexes. Genome Biol. 1, Classification error in : ± 5 kcal/mol 2mer3mer4mer5mer6mer10merOtherSumCorrect 2mer % 3mer % 4mer % 5mer % 6mer % 10mer % Total:21293%

Free energy of misclassifications

Misclassification of 1QEX Predicted: homohexamer Dissociates into 2 trimers  106 kcal/mol Biolgical unit: homotrimer Dissociates into 3 monomers  90 kcal/mol BACTERIOPHAGE T4 GENE PRODUCT 9 (GP9), THE TRIGGER OF TAIL CONTRACTION AND THE LONG TAIL FIBERS CONNECTOR

What is 1QEX? Rossmann M.G., Mesyanzhinov V.V., Arisaka F and Leiman P.G. (2004) The bacteriophage T4 DNA injection machine. Curr. Opinion Struct. Biol. 14: BACTERIOPHAGE T4 GENE PRODUCT 9 (GP9), THE TRIGGER OF TAIL CONTRACTION AND THE LONG TAIL FIBERS CONNECTOR

1QEX: nobody is perfect 1QEX hexamer Identified wrongly 1QEX trimer 1S2E trimer Correct mainchain tracing Identified correctly Wrong mainchain tracing! BACTERIOPHAGE T4 GENE PRODUCT 9 (GP9), THE TRIGGER OF TAIL CONTRACTION AND THE LONG TAIL FIBERS CONNECTOR

Misclassification of 1D3U TATA-BINDING PROTEIN / TRANSCRIPTION FACTOR Predicted: octamer Dissociates into 2 tetramers  20 kcal/mol Functional unit: tetramer

Misclassification of 1CRX CRE RECOMBINASE / DNA COMPLEX REACTION INTERMEDIATE Predicted: dodecamer Dissociates into 2 hexamers  28 kcal/mol Functional unit: trimer

Synaptic complex 1CRX Guo F., Gopaul D.N. and van Duyne G.D. (1997) Structure of Cre recombinase complexed with DNA in a site- specific recombination synapse. Nature 389: CRE RECOMBINASE / DNA COMPLEX REACTION INTERMEDIATE

Misclassification of 1TON TONIN Predicted: dimer Dissociates at  37 kcal/mol Biological unit: monomer Apparent dimerization is an artefact due to the presence of Zn +2 ions added to the buffer to aid crystallization. Removal Zn from the file results in  3 kcal/mol Fujinaga M., James M.N.G. (1997) Rat submaxillary gland serine protease, tonin structure solution and refinement at 1.8 Å resolution. J.Mol.Biol. 195:

Reasons to be imperfect 50% Crystal packing may introduce interactions that are not engaged in native environment Definition of Biological Unit is based on physiological function and may be to a certain degree subjective Interaction artifacts are often due to the addition of binding agents in order to aid crystallization Theoretical models of macromolecular complexation are simplified Experimental data are of a limited accuracy No explicit account for concentrations, temperature, pH and ionic strength

Web-server A new MSD-EBI tool for working with Protein Interfaces, Surfaces and Assemblies

Web-server

Conclusions Chemical-thermodynamical models for protein complex stability allow one to recover biological units from protein crystallography data at 80-90% success rate Considerable part of misclassifications is due to the difference of experimental and native environments and artificial interactions induced by crystal packing Functional significance of protein interfaces cannot be reliably inferred only from their properties. Due to entropy contribution and entangled interactions, interface function is also subject to protein complex composition and geometry Protein interface and assembly analysis software (PISA) is available for public use as a web-service from Acknowledgement. This work has been supported by research grant No. 721/B19544 from the Biotechnology and Biological Sciences Research Council (BBSRC) UK.