Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.

Slides:



Advertisements
Similar presentations
Toward high-resolution prediction and design of transmembrane helical protein structures P. Barth, J. Schonbrun and D. Baker PNAS Sep Tim Nugent.
Advertisements

3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
Protein Structure Prediction using ROSETTA
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
Hydrogen bonds in Rosetta: a phenomonological study Jack Snoeyink Dept. of Computer Science UNC Chapel Hill.
How to approximate complex physical and thermodynamic interactions? Employ rigid or flexible structures for ligand and receptor (Side-chains or Back-bone.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Solving NMR structures II: Calculation and evaluation The NMR ensemble Methods for calculating structures distance geometry, restrained molecular dynamics,
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
Forces inter-atomic interactions hydrophobic effect – driving force
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Thomas Blicher Center for Biological Sequence Analysis
Protein Tertiary Structure Prediction. Protein Structure Prediction & Alignment Protein structure Secondary structure Tertiary structure Structure prediction.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Computational Design of Protein Structures and Interfaces Brian Kuhlman University of North Carolina, Chapel Hill.
Protein Structural Prediction. Protein Structure is Hierarchical.
The Geometry of Biomolecular Solvation 1. Hydrophobicity Patrice Koehl Computer Science and Genome Center
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Macromolecular structure
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
Marcin Pacholczyk, Silesian University of Technology.
COMPARATIVE or HOMOLOGY MODELING
Structure and stability of globular proteins.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Protein “folding” occurs due to the intrinsic chemical/physical properties of the 1° structure “Unstructured” “Disordered” “Denatured” “Unfolded” “Structured”
4. Modeling of side chains 1. Protein Structure Prediction: – given: sequence of protein – predict: structure of protein Challenges: – conformation space.
2. Introduction to Rosetta and structural modeling Approaches for structural modeling of proteins The Rosetta framework and its prediction modes Cartesian.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Representations of Molecular Structure: Bonds Only.
Bioinformatics: Practical Application of Simulation and Data Mining Protein Folding I Prof. Corey O’Hern Department of Mechanical Engineering & Materials.
Rotamer Packing Problem: The algorithms Hugo Willy 26 May 2010.
Ab Initio Methods for Protein Structure Prediction CS882 Presentation, by Shuai C., Li.
Department of Mechanical Engineering
2. Introduction to Rosetta and structural modeling (From Ora Schueler-Furman) Approaches for structural modeling of proteins The Rosetta framework and.
Altman et al. JACS 2008, Presented By Swati Jain.
Covalent interactions non-covalent interactions + = structural stability of (bio)polymers in the operative molecular environment 1 Energy, entropy and.
New Strategies for Protein Folding Joseph F. Danzer, Derek A. Debe, Matt J. Carlson, William A. Goddard III Materials and Process Simulation Center California.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
. Protein Structure Prediction. Protein Structure u Amino-acid chains can fold to form 3-dimensional structures u Proteins are sequences that have (more.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Chapter 4.1: Overview of Protein Structure CHEM 7784 Biochemistry Professor Bensley.
Rosetta Steven Bitner. Objectives Introduction How Rosetta works How to get it How to install/use it.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Query sequence MTYKLILNGKTKGETTTEAVDAATAEKVFQYANDN GVDGEWTYTE Structure-Sequence alignment “Structure is better preserved than sequence” Me! Non-redundant.
ILO 1-Explain the chemical structure,classification, and properties of amino acids and how peptides are formed. 2-Describe the order of protein organization.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
In silico Protein Design: Implementing Dead-End Elimination algorithm
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
2. Introduction to Rosetta and structural modeling
Elon Yariv Graduate student in Prof. Nir Ben-Tal’s lab Department of Biochemistry and Molecular Biology, Tel Aviv University.
Computational Structure Prediction
Rong Chen Boston University
Protein Structure Prediction and Protein Homology modeling
Conformation Dependence of Backbone Geometry in Proteins
Yang Zhang, Andrzej Kolinski, Jeffrey Skolnick  Biophysical Journal 
Rosetta: De Novo determination of protein structure
Protein structure prediction.
Protein Homology Modelling
Accounting for side-chain flexibility in protein-ligand docking: 3D Interaction Homology as an approach of quantifying side-chain flexibility of Tyrosine.
Gydo C.P. van Zundert, Adrien S.J. Melquiond, Alexandre M.J.J. Bonvin 
Protein structure prediction
Presentation transcript:

Rosetta Energy Function Glenn Butterfoss

Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function Aggressively search conformational space 2. High resolution: Full atom More sophisticated energy function “Local” search of conformational (and sequence) space

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions)

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) In general … Weighted linear combination Energy = w 1 *term 1 + w 2 *term 2 + … Pair-wise decomposable Heavily trained on PDB statistics Discriminate “near native” vs “non native” No single low resolution score Several functions with different weights

Rosetta Energy Function 11 22 Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions)

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) d E CLASH BAD!! Evaluate between Centoids and Backbone Atoms

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) Pair-wise probability based on PDB statistics (electrostatics) aa = residue type d = centroid distance (binned, interpolated) s = sequence seperation (must be > 8 res )

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) neighbors within 10 Å of C  binned by : 0-3, 4,5, …, >30 also interpolated Probability of burial /exposure (solvation)

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) Optimize 2º orientation

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) N R1R1 R2R2 C O Represent protein as vectors of 2 residue “strands” sheet vector helix vector

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) Coordinate system v1v1 v2v2    r hb Scores selected to discriminate “near native structures for “non native”: Relative direction (  ) Relative H-bond orientation (hb) Distance (r, r  Number of sheets given number of strands Helix-Strand Packing

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) Used in earlier stages and for filtering Promote a compact fold

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) LTSDELKAQWNTSTLVRHQEAGASLTSDELKAQWNTSTLVRHQEAGAS set of non-redundant protein structures......

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions) NC + N C Fragment insertion Extended protein chain N C + Select a site Fragment insertion

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies In general … Weighted linear combination Energy = w 1 *term 1 + w 2 *term 2 + … Pair-wise decomposable Pre- tabulate energies Hybrid Statistical / MM-like score Weights trained for different applications

Rosetta Energy Function 11 22 High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies

experimental conformation rotamer Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies rotamers

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies 11 22 Dunbrack and Cohen library Based on PDB statistics Backbone dependent Additional rotamers from standard deviations of distributions

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies 11 22 ss = secondary structure Local backbone energy Also used in some centroid refinement

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies Fast pair-wise additive Penalize burial of polar residues Simple solvation model Lazaridius Karplus (standard)

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies Protein-DNA interactions: Generalized Born Protein-Ligand: Coulomb Simple solvation model (Special cases)

Rosetta Energy Function H O O High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies Geometric H-bond potential 2 angles, 1 distance Based on PDB statistics r H-bonding

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies r CHARMM radii Standard attractive potential Repulsive term linearized Note: command line options allow the repulsive term to be softened (radii reduced) VDW interactions

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies Probability of finding residue types at give in distance Defined by C  coordinates Electrostatics

Rosetta Energy Function High resolution: Atom Model full atom representation Energy function terms Rotamer (Dunbrack) Ramachandran Solvation (Lazaridius Karplus) Hydrogen bonding Lennard-Jones Pair (electrostatic) Reference energies Unique “cost” for designing in each residue type  G for bringing residue type into folded protein Optimized with sequence recovery trials of folded protein structures Correction for “folding”

Rosetta Energy Function Xavier Rosetta Community Thanks

Rosetta in Systems Biology Structure Prediction: Monte Carlo + Minimization search p(ΔE)? Energy

Rosetta in Systems Biology Protein Design: Protocol: Packing: Pre-tabulate table of all pair-wise rotamer energies Monte Carlo search through rotamer / sequence space With docking and backbone movement: Iterate packing with (as above) with backbone / rigid body movements Possibly apply restraints docking, rmsd, disulfide, …

Rosetta in Systems Biology Protein Design: Protocol: Filtering: Total energy Packing quality Avoid buried unsatisfied H-bonds (problem at interfaces)

Rosetta Energy Function Low resolution: Atom Model centroid reduction of side chains Energy function terms van der Waals repulsion “pair” terms (electrostatics) residue environment (prob of burial) 2º structure pairing terms (H-bonds) radius of gyration packing density Implicit terms fragments (local interactions)