Toward high-resolution prediction and design of transmembrane helical protein structures P. Barth, J. Schonbrun and D. Baker PNAS Sep 28 2007 Tim Nugent.

Slides:



Advertisements
Similar presentations
Transmembrane Protein Topology Prediction Using Support Vector Machines Tim Nugent and David Jones Bioinformatics Group, Department of Computer Science,
Advertisements

Structural Classification and Prediction of Reentrant Regions in Alpha-Helical Transmembrane Proteins: Application to Complete Genomes Håkan Viklunda,
Protein Structure C483 Spring 2013.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Thermodynamics of Protein Folding
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
Protein Secondary Structure II Lecture 2/24/2003.
The Screened Coulomb Potential-Implicit Solvent Model (SCP-ISM) is used to study the alanine dipeptide in aqueous solution and the discrimination of native.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Thomas Blicher Center for Biological Sequence Analysis
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Protein “folding” occurs due to the intrinsic chemical/physical properties of the 1° structure “Unstructured” “Disordered” “Denatured” “Unfolded” “Structured”
Transmembrane proteins in the Protein Data Bank: identification and classification Gabor, E. Tusnady, Zsuzanna Dosztanyi and Istvan Simon Bioinformatics,
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
Department of Mechanical Engineering
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
A two-state homology model of the hERG K + channel: application to ligand binding Ramkumar Rajamani, Brett Tongue, Jian Li, Charles H. Reynolds J & J PRD.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Altman et al. JACS 2008, Presented By Swati Jain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Protein Structure BL
Computational Structure Prediction
Protein Structure and Properties
Volume 18, Issue 2, Pages (February 2010)
Structure and Dynamics of the Membrane-Bound Form of Pf1 Coat Protein: Implications of Structural Rearrangement for Virus Assembly  Sang Ho Park, Francesca.
Jing Han, Kristyna Pluhackova, Tsjerk A. Wassenaar, Rainer A. Böckmann 
Volume 23, Issue 12, Pages (December 2015)
Yang Zhang, Andrzej Kolinski, Jeffrey Skolnick  Biophysical Journal 
R. Elliot Murphy, Alexandra B. Samal, Jiri Vlach, Jamil S. Saad 
Rosetta: De Novo determination of protein structure
Giovanni Settanni, Antonino Cattaneo, Paolo Carloni 
Volume 18, Issue 11, Pages (November 2010)
AnchorDock: Blind and Flexible Anchor-Driven Peptide Docking
Homology Modeling.
Programme Last week’s quiz results + Summary
Liqun Zhang, Susmita Borthakur, Matthias Buck  Biophysical Journal 
Coarse-Grained Peptide Modeling Using a Systematic Multiscale Approach
Enhancing Structure Prediction and Design of Soluble and Membrane Proteins with Explicit Solvent-Protein Interactions  Jason K. Lai, Joaquin Ambia, Yumeng.
Volume 98, Issue 8, Pages (April 2010)
Monica Berrondo, Marc Ostermeier, Jeffrey J. Gray  Structure 
Volume 20, Issue 3, Pages (March 2012)
Ligand Binding to the Voltage-Gated Kv1
Collagen Stabilization at Atomic Level
Sundeep S. Deol, Peter J. Bond, Carmen Domene, Mark S.P. Sansom 
Cholesterol Modulates the Dimer Interface of the β2-Adrenergic Receptor via Cholesterol Occupancy Sites  Xavier Prasanna, Amitabha Chattopadhyay, Durba.
Kristen E. Norman, Hugh Nymeyer  Biophysical Journal 
Robust Driving Forces for Transmembrane Helix Packing
Feng Ding, Douglas Tsao, Huifen Nie, Nikolay V. Dokholyan  Structure 
Volume 95, Issue 7, Pages (October 2008)
The Three-Dimensional Structure of Proteins
Presentation transcript:

Toward high-resolution prediction and design of transmembrane helical protein structures P. Barth, J. Schonbrun and D. Baker PNAS Sep Tim Nugent – BUGF - 18/10/07

Introduction While membrane proteins comprise ~30% of a proteome, they are severely under- represented in structural databases (e.g. ~1% PDB) due to the problems associated with crystallisation. Given the biological significance of TM proteins, an understanding of their structures is an important target for theoretical prediction methods. This paper presents an all-atom physical model that describes intra-protein and protein-solvent interaction in the membrane environment. The model is evaluated by: Side chain packing Energy gaps between native and non-native structures Distortions in TM helices (helical kinks) Test set of 18 high-resolution crystal structures.

Overview of the Physical Model Van der Waals packing, hydrogen bonding, solvation and electrostatics interactions are thought to be the main forces that stabilise TM proteins. However, the magnitude and relative importance of these forces has never been clearly established and no current method has been shown to quantitatively capture the energetics involved. This model attempts to address this problem: it describes interactions between protein residues at atomic detail, whereas the water, hydrophobic core and lipid head groups are treated using continuum solvents models. Membrane described by 3 continuum phases: 2 isotropic (water and hydrophobic core of lipid bilayer) and 1 anisotropic (lipid head group region of the membrane). Hydrogen bonds are treated explicitly, including weak CH-O h-bonds and bifurcated h-bonds. The energy function is an extension of the ROSETTA full-atom potential developed for water soluble proteins structure prediction.

Force Field Lennard-Jones potential for modelling VDW forces. Backbone torsional term for local structural propensities. Knowledge-based pair interaction term for side chain interactions. 20 reference energies that control overall amino acid composition. Implicit atomic solvation term based on Lazaridis and Karplus model. Orientation-dependent h-bond term. H-bond and solvation energies modified for the anisotropic membrane environment. CH-O and bifurcated H-bonds modelled explicitly Thought to play an important role in helical distortions and stabilising polar residues in membrane proteins.

Side-chain Conformation Recovery Test The ability of the model to predict side chain conformations was assessed by simultaneous re-packing side chains on fixed backbones derived from 18 high- resolution crystal structures. Ch1 and Ch2 dihedral angles correct for 73% of buried positions. Recovery is higher in regions embedded in the hydrophobic core, consistent with a higher degree of packing in those regions compared to the lipid head group.

Amino Acid Recovery Test The energy function was tested in redesign experiments in which sequence space is searched for the combination of amino acids that minimises the structure's free energy. The method selected the native amino acid at >45% of buried and ~35% of all positions. These values compare well with those obtained for water soluble proteins. For both lipid and water soluble proteins, recovery is higher for non-polar than polar residues, consistent with the latter being often selected by nature for function rather than stability. The gradient in amino acid polarity between core and head group regions agrees with those observed in native proteins.

Native TMH Docking Test Monte Carlo simulations using fixed backbones but flexible side chains were used to dock single TMHs on to their templates to generate diverse sets of near-native and non-native conformations. Native and near-native were significantly lower in energy than non-native (Z score). Validates the energy function. Effect of the membrane hydrogen bond potential on the discrimination between native and nonnative decoys in native helix random docking calculations of Glycophorin A. Decoys scored with the ROSETTA hydrogen bond potential with (Left) or without (Right) the membrane-specific bifurcated/weak hydrogen bond potential are represented. The lowest-energy decoys are encircled in black.

Analysis of the Contribution of Individual Terms in the Model Solvation Solvation term dominated by free-energy cost of transferring polar groups and free energy gain of transferring non-polar groups from water into the bilayer. Removal of the solvation potential from the function led to a 25% decrease in total sequence recovery, and also loss of the polarity gradient. VDW and H-bonding Short-range VDW and h-bond interactions contribute to the discrimination between TMH interfaces that have smaller structural discrepancies to the native structures Addition of CH-O and bifurcated h-bond terms increased the energy gaps between native and non-native docked conformations by 31% on average. This highlights their role in stabilising tightly packed interfaces, e.g. Involving glycine zippers in glycerol channels and glycophorin A, or interfaces accompanying small polar residues. Incorporating CH-O and bifurcated h-bond terms improved recovery of polar residues (especially Thr and Ser) by 13%. Overall sequence recovery improved by 2%.

VDW and H-bonding

Sequence-Based Modelling of Distorted TMHs Most TMH distortions occur at Pro residues or at positions where kinks are initiated by Pro residues and later stabilised by tertiary interactions during evolution. The kink usually propagates 4 residues N-terminal to the residue responsible for the bend. They hypothesised that the native conformation of the distorted helix could be identified by modelling the chain away from the bend-induced hinges by ensembles of pairs of ideal helix fragments with a range of orientations. Ideal helix fragments docked to native protein templates - select pairs with the lowest non-local interaction energy with other TMHs. Control experiments where ideal helices for the entire TMH length were docked on protein templates confirmed that native tertiary interactions could not be recapitulated with ideal backbone geometries. Native conformations could also not be identified by individual helix fragments away from the bend.

Sequence-Based Modelling of Distorted TMHs

De novo prediction of interface-bound peptide structures Domains lying parallel to the bilayer are recurrent in membrane-embedded polypeptides. Structure prediction performed on the fd-coat protein – NMR structure consisting of 1 TMH and one interfacial helix. Course-grained ROSETTA structure was refined and relaxed with the energy function. Lowest energy structure had an rmsd of 2.4 Å. Model has characteristic hydrophobic and polar residues snorkelling in and out of the membrane. (A) fd-coat protein. (Left) Backbone superposition of the experimental structure determined by solid-state NMR (blue) and the lowest-energy decoy generated by ROSETTA (pink) starting from an extended chain. The rmsd over 30 C atoms is 2.4 Å. (Right) All-atom representation of the lowest-energy decoy generated by ROSETTA. The boundaries predicted by ROSETTA between the hydrophobic core and the interface regions of the membrane are represented with a black solid line.

Prediction of TMH oligomeric interfaces by docking Many functions of TMP are driven by oligomerisation. They docked randomly orientated glycophorin A molecules. Symmetry of the native homodimer wasn't enforced Docked structure has rmsd of 0.65 Å compared to the native dimer. (B) Glycophorin A. Isolated monomers were docked with the ROSETTA protein–protein docking protocol and the all-atom membrane force field. The superposition between the native (blue) and the lowest-energy predicted structure (pink) is represented. The rmsd over 45 C atoms is 0.65 Å.

De novo prediction of polytopic membrane structures To be observed experimentally, native structures must be significantly lower in free energy than non-native conformations. Course-grained ROSETTA structures were refined and relaxed with the energy function. The energies of the ab initio models were compared to native structures relaxed by the same protocols. Lowest energy models have near atomic resolution - rmsd of 2.1 Å over 111 atoms and 2.4 Å over 139 atoms for the 4-helix bundles BRD4 and VATP respectively. (C and D) Ab initio structure prediction of polytopic membrane proteins. Native polytopic membrane protein conformations define a narrow energy basin in the all-atom conformational energy landscape. When near-native topologies are generated at the coarse-grained level, all-atom relaxed decoys define a funnel toward the native basin and the lowest-energy predicted structures have near-atomic resolution structures. In the energy versus rmsd plots, nonnative (red points) (generated from sequence by the ROSETTA coarse-grained structure prediction mode) and native conformations (green points) were relaxed by sampling the conformational degrees of freedom of all backbone and side-chain atoms. Cartoons show superposition between the native (blue) and the lowest-energy predicted structure (pink). Boxed areas show regions where close to native side-chain packing arrangements were obtained. The rmsd values are 2.1 Å over 111 C atoms for BRD4 (C) and 2.4 Å over 139 C atoms for VATP (D).

Conclusions Model validated by a variety of in silico tests. 73% of side-chain rotamers and 35% of all native residues recovered in a diverse range of TMPs. Other important properties of TMP examined, e.g. Helix kinks and interfacial helices. Good recovery suggests that the solvent model captures the main solvation properties of the membrane. VDW and h-bond interactions are essential for stability and structural specificity of TMH bundles. Weak and bifurcated h-bonds important – demonstrated by increase in Z scores (measure of structural determination in structure prediction tests). Modelling TMH kinks using ideal helix fragments reduces the complexity of the search in conformational space. Near-atomic resolution ab initio structures produced for 3 TMPs ( residues).

Conclusions Model captures essential physical properties that govern solvation and stability of membrane proteins. More sophisticated model needed to predict the effects of membrane deformations and specific amino acid/lipid interactions that are involved in the regulation of membrane protein structures and functions. A more accurate treatment of electrostatics accounting for induced polarisation effects may be necessary to model functional properties involving networks of buried residues or water/ion-solvated regions in channels and transporters.