Computational Structure-Based Redesign of Enzyme Activity Cheng-Yu Chen, Ivelin Georgiev, Amy C.Anderson, Bruce R.Donald A Different computational redesign.

Slides:



Advertisements
Similar presentations
Antibody Structure Prediction and the Use of Mutagenesis in Docking Arvind Sivasubramanian, Aroop Sircar, Eric Kim & Jeff Gray Johns Hopkins University,
Advertisements

ROTAMER OPTIMIZATION FOR PROTEIN DESIGN THROUGH MAP ESTIMATION AND PROBLEM-SIZE REDUCTION Hong, Lippow, Tidor, Lozano-Perez. JCC Presented by Kyle.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Protein Structure Prediction using ROSETTA
Crystallography -- lecture 21 Sidechain chi angles Rotamers Dead End Elimination Theorem Sidechain chi angles Rotamers Dead End Elimination Theorem.
Loop Refinement by Localized Sampling and Minimization (SLAM) Vageli Coutsias, Matt Jacobson, Michael Wester, Lan Hua.
Short fast history of protein design Site-directed mutagenesis -- protein engineering (J. Wells, 1980's) Coiled coils, helix bundles (W. DeGrado, 1980's-90's)
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
Embedding Computation in Studies of Protein Structure and Function Chris Bailey-Kellogg & lab & collaborators Shobha Xiaoduan Wei John Chris Fei.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Thomas Blicher Center for Biological Sequence Analysis
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
In double vision when drunk By Thomas Huber 23 November 2001 Alexandra Headland.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Protein sequencing and Mass Spectrometry. Sample Preparation Enzymatic Digestion (Trypsin) + Fractionation.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Comparative Protein Modeling Jason Wiscarson ( Lloyd Spaine ( Comparative or homology modeling, is a computational.
An Extended Dead-End Elimination Algorithm to Determine Gap-Free Lists of Low Energy States EDDA KLOPPMANN, G. MATTHIAS ULLMANN, TORSTEN BECKER.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
COMPARATIVE or HOMOLOGY MODELING
Computational protein design. Reasons to pursue the goal of protein design In medicine and industry, the ability to precisely engineer protein hormones.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Representations of Molecular Structure: Bonds Only.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
De novo Protein Design Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera.
In molecular switching, the recognition of an external signal such as ligand binding by one protein is coupled to the catalytic activity of a second protein.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Optimal resource assignment to maximize multistate network reliability for a computer network Yi-Kuei Lin, Cheng-Ta Yeh Advisor : Professor Frank Y. S.
Protein Design CS273: Final Project Charles Kou Crystal structure of top7 – A novel protein structure created with RosettaDesign.
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study George Chikenji*, Yoshimi Fujitsuka, and Shoji Takada*
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Doug Raiford Lesson 19.  Framework model  Secondary structure first  Assemble secondary structure segments  Hydrophobic collapse  Molten: compact.
Automating Steps in Protein Structure Determination by NMR CS April 13, 2009.
Rational Drug Design : HIV Integrase. A process for drug design which bases the design of the drug upon the structure of its protein target. 1.Structural.
Structure prediction: Homology modeling
Computational engineering of bionanostructures Ram Samudrala University of Washington How can we analyse, design, & engineer peptides capable of specific.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
03/27/09 NIH Roadmap Structure Determination of Membrane Proteins from Orientational Constraints Homayoun Valafar Roadmap Initiative Meeting 3/26/09-3/27/09.
MINRMS: an efficient algorithm for determining protein structure similarity using root-mean-squared-distance Andrew I. Jewett, Conrad C. Huang and Thomas.
Motif Search and RNA Structure Prediction Lesson 9.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Modelling proteomes Ram Samudrala University of Washington How does the genome of an organism specify its behaviour and characteristics?
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Protein Structures from A Statistical Perspective Jinfeng Zhang Department of Statistics Florida State University.
7. Protein Design 1. Protein design 2 Structure prediction and design are inverse problems.
Bioinformatics 2 -- lecture 20 Protein design -- the state of the art.
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
Computational Protein Redesign and the Non-Ribosomal Code
Ivelin Georgiev Bruce Donald Donald Lab Duke University
Dead-End Elimination for Protein Design with Flexible Rotamers
Volume 90, Issue 11, Pages (June 2006)
Rosetta: De Novo determination of protein structure
Volume 22, Issue 2, Pages (February 2014)
Protein structure prediction.
Volume 22, Issue 2, Pages (February 2014)
Presentation transcript:

Computational Structure-Based Redesign of Enzyme Activity Cheng-Yu Chen, Ivelin Georgiev, Amy C.Anderson, Bruce R.Donald A Different computational redesign strategy Yizhou Yin Mar06, 2009

- Protein design: straightforward design vs. Directed mutation De Novo vs. redesign - Computational structure-based redesign GMEC (global minimum energy confirmation) - ROSETTA (RosettaDesign, …) 1) Energy Function 2) Conformational Sampling

Simplified protocol of redesign using GMEC Generate sequence space: select residue position for mutation; define types of AA that are allowed in mutation Constraint: volume, steric filter, etc Backbone dependent library, side-chain conformation library, rotamer library, fragment library… Searching for global minimum energy conformation throughout the whole sequence and conformation space (multistep) Starting Structure Screen/filterRank Select Further refinement? Another iterative cycle? Other procedure? Experimental test

Ensemble-based protein redesign Backbone dependent library, side-chain conformation library, rotamer library, fragment library… Generate sequence space: select residue position for mutation (steric shell); define types of AA that are allowed in mutation Starting Structure, targeted substrate, cofactor Active site mutation Filters: sequence-space filter, k-point, volume filter K* algorithm: search and score Rank + Select Bolstering Mutation Self-Consistent Mean Field entropy-based method Experimental verification MinDEE/A* algorithm: search and score Experimental verification Multiple pruning methods

K* algorithm -For a given protein-substrate complex, K* computes a provably-accurate ε-approximation to the binding constant K A -K*= [Σexp(-E b /RT)] / [Σexp(-E l /RT)·Σexp(-E f /RT)] b ∈ B l ∈ L f ∈ F B, L, F are rotamer-based ensembles; E is the conformation energy -Several algorithms are used to prune the candidate sequences at different steps so that the searching in the sequence space will be more efficient.

For each allowable mutated sequence: Step1 Molecular ensemble is generated, then pruned by steric, volume filters. Step2 After constrained energy minimization, the conformation is enumerated by A*. Step3 The scores from step2 are used to compute there separate partition functions, which is then combined to compute K* score.

SCMF entropy-based method S i = - ∑p(a ︱ i) ln p(a ︱ i) a ∈ A i p(a ︱ i) = ∑ p(r ︱ i) r ∈ R a -Ai is the set of AA types allowed at position i; p(a) is the probability of having AA type a at i. Ra is the set of rotamers for AA type a and p(r) is the probability of having rotamer r for AA type a at i. -Higher entropy implies higher probability of multiple AA types, hence higher tolerance to mutation at position i.

Example of GrsA-PheA’s specificity switched from Phe to Leu -GrsA-PheA is the phenylalanine adenylation domain of the nonribosomal peptide synthetase (NRPS) enzyme gramicidin S synthetase A, whose cognate substrate is Phe.

-7 residues at the active site are allowed to mutate to (G, A, V, L, I, W, F, Y, M) -only sequences with up to two mutations were considered, give the number candidates: 1450 (6.44 x 10 ) -After pruning, the number of sequences evaluated by K*: 505 (1.12 x 10 ) -Top ten sequences were experimentally verified. -7 residues were selected by SCMF and were allowed to mutate to different subset of AA. -Up to 3-point mutations were considered.

Example of T278L/A301G

T278/A301G ≈512 fold switch in specificity from Phe to Leu V187L/T278L/A301G ≈2168 fold switch in specificity from Phe to Leu, 1/6 of the WTenzyem:WTsubstrate activity

ensemble based vs. non-ensemble based 1)searching for best conformation 2)Searching for best mutation with best conformation 3)Other redesign 4)Other than redesign structure-based design vs. other computational design/ evolution Comparison in efficiency, accuracy

-Will there be any better “hybrid” methods? -How to appropriately decide the sampling size based on the redesign methods? -Any other new strategy?