Computational Design of Protein Structures and Interfaces Brian Kuhlman University of North Carolina, Chapel Hill.

Slides:



Advertisements
Similar presentations
Toward high-resolution prediction and design of transmembrane helical protein structures P. Barth, J. Schonbrun and D. Baker PNAS Sep Tim Nugent.
Advertisements

Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Thermodynamics of Protein Folding
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
“Inverse Kinematics” The Loop Closure Problem in Biology Barak Raveh Dan Halperin Course in Structural Bioinformatics Spring 2006.
Thomas Blicher Center for Biological Sequence Analysis
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Protein Folding Protein Structure Prediction Protein Design
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
COMPARATIVE or HOMOLOGY MODELING
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Protein “folding” occurs due to the intrinsic chemical/physical properties of the 1° structure “Unstructured” “Disordered” “Denatured” “Unfolded” “Structured”
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Representations of Molecular Structure: Bonds Only.
De novo Protein Design Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera.
Structural biology should be computable! Protein structures determined by amino acid sequences Protein structures and complexes correspond to global free.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Department of Mechanical Engineering
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Classwork II: NJ tree using MEGA. 1.Go to CDD webpage and retrieve alignment of cd00157 in FASTA format. 2.Import this alignment into MEGA and convert.
Altman et al. JACS 2008, Presented By Swati Jain.
New Strategies for Protein Folding Joseph F. Danzer, Derek A. Debe, Matt J. Carlson, William A. Goddard III Materials and Process Simulation Center California.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Chemistry XXI Unit 3 How do we predict properties? M1. Analyzing Molecular Structure Predicting properties based on molecular structure. M4. Exploring.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
In silico Protein Design: Implementing Dead-End Elimination algorithm
7. Protein Design 1. Protein design 2 Structure prediction and design are inverse problems.
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
Enzyme Design on the OSG Florian Richter Baker Group, Dept. of Biochemistry University of Washington, Seattle.
Computational Structure Prediction
Protein Structure and Properties
Mark A Willis, Barney Bishop, Lynne Regan, Axel T Brunger  Structure 
X-Ray Structures of Myc-Max and Mad-Max Recognizing DNA
Rosetta: De Novo determination of protein structure
Volume 42, Issue 2, Pages (April 2011)
Protein structure prediction.
Scot A Wolfe, Elizabeth I Ramm, Carl O Pabo  Structure 
Volume 108, Issue 1, Pages (January 2002)
Fernando Corrêa, Jason Key, Brian Kuhlman, Kevin H. Gardner  Structure 
Volume 90, Issue 1, Pages (July 1997)
Volume 9, Issue 8, Pages (August 2001)
Crystal Structure of the p53 Core Domain Bound to a Full Consensus Site as a Self- Assembled Tetramer  Yongheng Chen, Raja Dey, Lin Chen  Structure  Volume.
Volume 20, Issue 6, Pages (June 2012)
David Jeruzalmi, Mike O'Donnell, John Kuriyan  Cell 
A Putative Mechanism for Downregulation of the Catalytic Activity of the EGF Receptor via Direct Contact between Its Kinase and C-Terminal Domains  Meytal.
Structural Basis of Caspase-7 Inhibition by XIAP
Volume 11, Issue 2, Pages (February 2003)
David Jeruzalmi, Mike O'Donnell, John Kuriyan  Cell 
Volume 87, Issue 7, Pages (December 1996)
Structure of CD94 Reveals a Novel C-Type Lectin Fold
Protein structure prediction
The Three-Dimensional Structure of Proteins
Jianchao Li, Haiyang Liu, Manmeet H. Raval, Jun Wan, Christopher M
Presentation transcript:

Computational Design of Protein Structures and Interfaces Brian Kuhlman University of North Carolina, Chapel Hill

Outline: Three Protein Design Stories Using Flexible Backbone Design for the Complete Redesign of a Protein Core Designing the Structure and Sequence of a Protein-Binding Peptide Design of Metal-Mediated Protein-Protein Interactions

Central problem of protein design: identifying amino acid sequences that will stabilize a target structure or interaction

Rosetta’s Full Atom Energy Function 1) Lennard-Jones potential (favors atom close, but not too close) 2) Lazaridis-Karplus implicit solvation model (penalizes buried polar atoms) 3) orientation dependent hydrogen bonding (allows buried polar atoms) 4) knowledge-based pair potential between charged amino acids 5) knowledge-based torsional preferences 6) amino acid references energies (unfolded state) (1) (3) (2) (4) (5)

Sequence Optimization Simulated Annealing start with a random sequence make a single amino acid replacement or rotamer substitution accept or reject move based on the Metropolis Criterion repeat many times decreasing the temperature as you go Results from 10 independent runs on a small glubular protein

The usefulness of backbone sampling when performing design Initial target structure is often not designable Backbone sampling

Backbone Sampling in Rosetta Monte Carlo Sampling of Internal Degrees of Freedom (phi,psi) - Fragment insertions (aggressive sampling) - Small random changes to phi/psi (refinement) Gradient-based minimization of backbone (and side chain) torsion angles Loop closure algorithms - cyclic coordinate descent - kinematic loop closure Docking - Monte Carlo sampling - Gradient-based minimization

Our Typical Strategy For Designing Novel Structures or Interactions Create Starting Model of Target Backbone Conformation Perform Sequence Optimization Backbone Optimization Evaluate Models with Rosetta Score and other Structure Quality Metrics Rosetta Energy Per Residue Trajectory Number RedDesign Relax Round 1 Green Design Relax Round 2 BlueDesign Relax Round 3 Average Rosetta Energy per Residue of Relaxed Crystal Structures = -2.5

Outline: Three Protein Design Stories Using Flexible Backbone Design for the Complete Redesign of a Protein Core Designing the Structure and Sequence of a Protein-Binding Peptide Design of Metal-Mediated Protein-Protein Interactions

Background: Protein Redesign with a Naturally Occurring Backbone Generally Recovers Sequences with High Identity to the Wild Type Sequence In the core it is typical to see greater than 50% sequence identity with the WT protein. Conclusions: Simulations are not sampling large regions of sequence space compatible with a given fold. ‘Memory’ of the native sequence makes the test less rigorous. Cyan: native tenascin Magenta: design model

A More Rigorous Test: The Complete Redesign of a Protein Core Model System: Four Helix Bundle, CheA phosphotransferase domain (pdb code: 1tqg). 37 core positions selected for flexible backbone redesign. The native amino acid was not allowed during the simulation.

Design Protocol: Flexible Backbone Redesign Iterative cycles (5) of sequence design and backbone refinement 10,000 independent trajectories performed 50 best scoring sequences were evaluated with a non- pairwise additive packing term and a secondary structure prediction server (jpred3)

Design Model Compared to the WT Structure Green: Design Model Salmon: WT crystal structure WT - L F V T Y L L T L L L I E A F A L L M A M L C L E I L A R L I G V I M V I Des - I V T L L I V D I V Y W K I Y L V M I T V V L I M L V M L I V K L V E L K Redesigned Positions

The CheA Redesign is Well-Folded and is Hyperthermophilic Temperature ( o C) GuHCl(M) Mean residue ellipticity Circular Dichroism Unfolding Experiments T m = ( o C) (extrapolated)  G f (20°C) = -19 kcal / mol 1 H- 15 N HSQC HN N15

Crystal Structure of CheA Redesign Resolution: 1.8 Å Close up: Helix 2 and 3

Crystal Structure Compared with the Design Model Green: Design Model, Cyan: Crystal Structure

Comparison: WT, X-Ray of Redesign and Redesign Model Salmon: WT Green: Redesign Model Cyan: X-Ray Redesign

Conclusions and Future Directions for CheA Redesign Demonstrates that sequence design can be combined with backbone sampling to more aggressively redesign proteins. Extreme thermostability can be achieved by remodeling a protein’s core. Why is the redesign stabilized? Possibilities: tighter packing, more favorable rotamers, stronger helical propensities, burial of more hydrophobic surface area, more dynamic, …

Outline: Three Protein Design Stories Using Flexible Backbone Design for the Complete Redesign of a Protein Core Designing the Structure and Sequence of a Protein-Binding Peptide Design of Metal-Mediated Protein-Protein Interactions

Designing a New Docked Conformation for a Protein- Binding Peptide WT GoLoco motif (blue) with WT G  i1 (green) Design goal: Change the sequence of GoLoco so the C-terminal residues of GoLoco adopt a helix when bound to G  i1. Deanne Sammond, Glenn Butterfoss

Designing Sequence and Structure at an Interface 1.Remove the portion of the structure to be remodeled 2.Build in a new backbone with the target conformation (fragment assembly) 3.Design a sequence for the new backbone 4.Refine the conformation of the designed residues 5.Iterate steps 3 and 4

Designing Sequence and Structure at an Interface 1.Remove the portion of the backbone to be remodeled 2.Build in a new backbone with the target conformation (fragment assembly) 3.Design a sequence for the new backbone 4.Refine the conformation of the designed residues 5.Iterate steps 3 and 4

Designing Sequence and Structure at an Interface 1.Remove the portion of the backbone to be remodeled 2.Build in a new backbone with the target conformation (fragment assembly) 3.Design a sequence for the new backbone 4.Refine the conformation of the designed residues 5.Iterate steps 3 and 4

Representative Starting Structures

Designing Sequence and Structure at an Interface 1.Remove the portion of the backbone to be remodeled 2.Build in a new backbone with the target conformation (fragment assembly) 3.Design a sequence for the new backbone 4.Refine the conformation of the designed residues 5.Iterate steps 3 and 4

Designing Sequence and Structure at an Interface 1.Remove the portion of the backbone to be remodeled 2.Build in a new backbone with the target conformation (fragment assembly) 3.Design a sequence for the new backbone 4.Refine the conformation of the designed residues 5.Iterate steps 3 and 4

Designing Sequence and Structure at an Interface 1.Remove the portion of the backbone to be remodeled 2.Build in a new backbone with the target conformation (fragment assembly) 3.Design a sequence for the new backbone 4.Refine the conformation of the designed residues 5.Iterate steps 3 and 4

Designing Sequence and Structure at an Interface From two thousand design trajectories, four designs were selected for experimental characterization. One bound with an affinity tighter than the truncated GoLoco peptide. Normalized Fluorescence Polarization G  i1 (  M) Binding curves for GoLoco Redesigns GL helix -4, K d = 810 nM Design: GL helix -4

Crystal Structure of the GoLoco Redesign Purple: design model, Salmon: crystal structure Dustin Bosch, Mischa Machius, David Siderovski

Outline: Three Protein Design Stories Using Flexible Backbone Design for the Complete Redesign of a Protein Core Designing the Structure and Sequence of a Protein-Binding Peptide Design of Metal-Mediated Protein-Protein Interactions

Pitfall #1: No binding! Pitfall #2: Incorrect binding orientation Metal coordination bonds are: enthalpically strong and geometrically constrained Metal binding can potentially addresse two major pitfalls of protein-protein interface design

33 Step 0: Choose scaffold proteins. Step 1: Design half zinc sites 1 and 2. Step 2: Generate symmetric complex, 2 flips. Step 3: Search rigid body alignments. filter 1 = zinc geometry filter 2 = backbone clashes Step 4: Symmetric design of interface sidechains, symmetric backbone minimization. Step 5: Score. Step 6: Visual inspection. Symmetric Metal Interface Design Protocol

34 Step 0: Choose scaffold proteins. Step 1: Design half zinc sites 1 and 2. Step 2: Generate symmetric complex, 2 flips. Step 3: Search rigid body alignments. filter 1 = zinc geometry filter 2 = backbone clashes Step 4: Symmetric design of interface sidechains, symmetric backbone minimization. Step 5: Score. Step 6: Visual inspection. Symmetric Metal Interface Design Protocol RosettaMatch – Geometric Hashing Algorithm Clarke and Yuan, 1995 Zanghellini et al., 2006

35 Step 0: Choose scaffold proteins. Step 1: Design half zinc sites 1 and 2. Step 2: Generate symmetric complex, 2 flips. Step 3: Search rigid body alignments. filter 1 = zinc geometry filter 2 = backbone clashes Step 4: Symmetric design of interface sidechains, symmetric backbone minimization. Step 5: Score. Step 6: Visual inspection. Symmetric Metal Interface Design Protocol

36 Step 0: Choose scaffold proteins. Step 1: Design half zinc sites 1 and 2. Step 2: Generate symmetric complex, 2 flips. Step 3: Search rigid body alignments. filter 1 = zinc geometry filter 2 = backbone clashes Step 4: Symmetric design of interface sidechains, symmetric backbone minimization. Step 5: Score. Step 6: Visual inspection. Symmetric Metal Interface Design Protocol

37 Step 0: Choose scaffold proteins. Step 1: Design half zinc sites 1 and 2. Step 2: Generate symmetric complex, 2 flips. Step 3: Search rigid body alignments. filter 1 = zinc geometry filter 2 = backbone clashes Step 4: Symmetric design of interface sidechains, symmetric backbone minimization. Step 5: Score. Step 6: Visual inspection. Symmetric Metal Interface Design Protocol

38 Step 0: Choose scaffold proteins. Step 1: Design half zinc sites 1 and 2. Step 2: Generate symmetric complex, 2 flips. Step 3: Search rigid body alignments. filter 1 = zinc geometry filter 2 = backbone clashes Step 4: Symmetric design of interface sidechains, symmetric backbone minimization. Step 5: Score. Step 6: Visual inspection. Symmetric Metal Interface Design Protocol

39 Step 0: Choose scaffold proteins. Step 1: Design half zinc sites 1 and 2. Step 2: Generate symmetric complex, 2 flips. Step 3: Search rigid body alignments. filter 1 = zinc geometry filter 2 = backbone clashes Step 4: Symmetric design of interface sidechains, symmetric backbone minimization. Step 5: Score. Step 6: Visual inspection. Symmetric Metal Interface Design Protocol dG bind dSASA dG bind /dSASA uns_hbond o

Representative Design Models

1YZMsym Forms a Dimer Analytical S75 gel filtration Analytical ultracentrifugation and multi-angle light scattering also confirm dimer formation. Model of 1YZMsym

Ellipticity (220 nm) T m ( o C) 1YZMsym 57 1YZMsym + cobalt (equamolar) 69 1YZMsym + zinc (equamolar) ~90 T m ( o C) 1YZM_wtHis 46 1YZM_wtHis + zinc 51 Zinc Binding stabilizes 1YZMsym Circular Dichroism (CD) thermal denaturation

[Titrant] (uM) 1YZMsym, 12 uM ZnSO 4, K d < 30 nM 1YZMsym, no ZnSO 4, K d = 3 uM Normalized Fluorescence Polarization Assay: Titration of 1YZMsym into a small amount of 1YZMsym labeled with a polarizable dye. Zinc Promotes Homodimer Formation Fluorescence Polarization Binding Assay +

Crystal Structure of 1YZMsym without Metal Green: 1YZMsym design model with zinc Cyan: 1YZMsym no metal crystal structure (1.2 Å resolution)

Crystal Structure of 1YZMsym with Cobalt Cyan: Crystal Structure with Cobalt Green: Design Model with Zinc

1YZMsym Cobalt: Octahedral Coordination

2A9Osym: monomer-dimer equilibrium when dilute MBP fusion  zinc promotes dimer 2Q0Vsym: dimer without zinc, high-order oligomer with zinc, poor expression 2D4Xsym: monomer 1RZ4sym: poor expression1G2Rsym: high-order oligomer2IL5sym: high-order oligomer Multiple ways to miss the design goal

Summary and Future Directions: Metal- Mediated Interface Design Metal binding can promote tight binding and allow specification of binding orientation Demonstrated creation of a symmetric interaction Next step – apply strategy to heterodimers

Acknowledgements Core Redesign Grant Murphy GoLoco Peptide Redesign Deanne Sammond Glenn Butterfoss Dustin Bosch (UNC Pharmacology) David Siderovski (UNC Pharmacology) Metal-Mediated Interface Design Bryan Der Ramesh Jha Steven Lewis Andrew Leaver-Fay (RosettaMatch) Mike Miley (UNC Center for Struct. Biol) Mischa Machius (UNC Center for Struct. Biol.) Ash Tripathy (UNC Mac-In-Fac)

The Challenge of Designing Hbond Networks WT: K d = 100 nM Triple mutant: K d > 20  M T519 S75 Q111 S78

1 3 2 Clarke and Yuan, 1995 Zanghellini et al., Å free Define zinc coordination geometry RosettaMatch algorithm 51 RosettaMatch: Designing a zinc binding site 109 o

1 3 Clarke and Yuan, 1995 Zanghellini et al., Å free 109 o Define zinc coordination geometry RosettaMatch algorithm o RosettaMatch: Designing a zinc binding site

Example of Failed Design: No Binding