Structure prediction: Homology modeling

Slides:



Advertisements
Similar presentations
Applications of Homology Modeling
Advertisements

PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Crystallography -- lecture 21 Sidechain chi angles Rotamers Dead End Elimination Theorem Sidechain chi angles Rotamers Dead End Elimination Theorem.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Protein Tertiary Structure Prediction
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Protein Fold recognition Morten Nielsen, Thomas Nordahl CBS, BioCentrum, DTU.
The 7 steps of Homology modeling. 1: Template recognition and initial alignment.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Thomas Blicher Center for Biological Sequence Analysis
Protein Fold recognition
©CMBI 2002 Homology modelling ? X-ray ? NMR ? Intro Proteins Modelling 8 Steps Detect Threading Alignment Template Side chain Indels Optimize Validate.
Homology Modeling comparative modeling vs. ab initio folding alignment (check gaps) threading loop building re-packing side-chains in core, DEE, SCWRL.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Applications of Homology Modeling Hanka Venselaar.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Protein Tertiary Structure Prediction Structural Bioinformatics.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Homology Modeling Seminar produced by Hanka Venselaar.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Computational Structure-Based Redesign of Enzyme Activity Cheng-Yu Chen, Ivelin Georgiev, Amy C.Anderson, Bruce R.Donald A Different computational redesign.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Comparative Protein Modeling Jason Wiscarson ( Lloyd Spaine ( Comparative or homology modeling, is a computational.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
CSCE555 Bioinformatics Lecture 18 Protein Tertiary Structure Prediction Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Protein Structure Prediction. Historical Perspective Protein Folding: From the Levinthal Paradox to Structure Prediction, Barry Honig, 1999 A personal.
Representations of Molecular Structure: Bonds Only.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
JM - 1 Introduction to Bioinformatics: Lecture XI Computational Protein Structure Prediction Jarek Meller Jarek Meller Division.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Homologue Clustering and Molecular Modeling L. Wang.
Homology Modeling 原理、流程,還有如何用該工具去預測三級結構 Lu Chih-Hao 1 1.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Modelling genome structure and function Ram Samudrala University of Washington.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Molecular modelling José R. Valverde CNB/CSIC © José R. Valverde, 2014 CC-BY-NC-SA.
PROTEIN MODELLING Presented by Sadhana S.
Modelling the rice proteome
Protein dynamics Folding/unfolding dynamics
Protein Structure Prediction
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Rosetta: De Novo determination of protein structure
Homology Modeling.
Protein structure prediction.
Homology modeling in short…
Presentation transcript:

Structure prediction: Homology modeling Lecture 8 Structural Bioinformatics Dr. Avraham Samson 81-871

Homology Modeling Presentation Fold recognition Model building Loop building Sidechain modeling Refinement Testing methods: the CASP experiment

Homology Modeling Presentation Fold recognition Model building Loop building Sidechain modeling Refinement Testing methods: the CASP experiment

Why do we need homology modeling ? To be compared with:

Structural Genomics project Aim to solve the structure of all proteins: this is too much work experimentally! Solve enough structures so that the remaining structures can be inferred from those experimental structures The number of experimental structures needed depend on our abilities to generate a model.

Structural Genomics Proteins with known structures Unknown proteins

Homology Modeling: why it works High sequence identity High structure similarity

Homology Modeling: How it works Find template Align target sequence with template Generate model: - add loops - add sidechains Refine model

Homology Modeling Presentation Fold recognition Model building Loop building Sidechain modeling Refinement Testing methods: the CASP experiment

Fold Recognition Homology modeling refers to the easy case when the template structure can be identified using BLAST alone. What to do when BLAST fails to identify a template? Use more sophisticated sequence methods Profile-based BLAST: PSIBLAST Hidden Markov Models (HMM) Use secondary structure prediction to guide the selection of a template, or to validate a template Use threading programs: sequence-structure alignments Use all of these methods! Meta-servers: http://bioinfo.pl/Meta

Fold Recognition Blast for PDB search Full homology modeling packages Profile based approach HMM Structure-derived profiles Fold recognition and Secondary structure prediction

Homology Modeling Presentation Fold recognition Model building Loop building Sidechain modeling Refinement Testing methods: the CASP experiment

Very short loops: Analytic Approach Wedemeyer, Scheraga J. Comput. Chem. 20, 819-844 (1999)

Medium loops: A database approach Scan database and search protein fragments with correct number of residues and correct end-to-end distances

Medium loops: A database approach cRMS (Ǻ) Method breaks down for loops larger than 9 Loop length

data clustered library Long loops: A fragment-based approach 1) Clustering Protein Fragments to Extract a Small Set of Representatives (a Library) data clustered library

Generating Loops Fragment library

Generating Loops Fragment library

Generating Loops Fragment library

Generating Loops Fragment library

Generating Loops Fragment library Threading

Long loops: A fragment-based approach Test cases: 20 loops for each loop length Methods: database search, and fragment building, with fragment libraries of size L <cRMS (Ǻ) Loop length

Loop building: Other methods Heuristic sampling (Monte Carlo, simulated annealing) Inverse kinematics Relaxation techniques Systematic sampling http://www.cs.ucdavis.edu/~koehl/BioEbook/loop_building.html

Homology Modeling Presentation Fold recognition Model building Loop building Sidechain modeling Refinement Testing methods: the CASP experiment

Self-Consistent Mean-Field Sampling P(J,2) P(J,1) P(J,3)

Self-Consistent Mean-Field Sampling P(i,2) P(i,1)+P(i,2)+P(i,3)=1 P(i,1) P(i,3)

Self-Consistent Mean-Field Sampling Multicopy Protein i,1 i,2 i,3 j,1 j,3 j,2

Self-Consistent Mean-Field Sampling Multicopy Protein Mean-Field Energy E(i,k) = U(i,k) + U(i,k,Backbone) i,1 i,2 i,3 j,1 j,3 j,2

Self-Consistent Mean-Field Sampling Multicopy Protein Mean-Field Energy E(i,k) = U(i,k) + U(i,k,Backbone) i,1 i,2 i,3 j,1 j,3 j,2 Update Cycle (Koehl and Delarue, J. Mol. Biol., 239:249-275 (1994))

Self-Consistent Mean-Field Sampling Another way is simply aligning the side chains as well

Dead End Elimination (DEE) Theorem There is a global minimum energy conformation (GMEC) for which there is a unique rotamer for each residue The energy of the system must be pairwise. Each residue i has a set of possible rotamers. The notation ir means residue i has the conformation described by rotamer r. The energy of any conformation C of the protein is given by: Note that:

Dead End Elimination (DEE) Theorem Consider two rotamers, ir and it, at residue i and the set of all other rotamer conformations {S} at all residues excluding i. If the pairwise energy between ir and js is higher than the pairwise energy between it and js, for all js in {S}, then ir cannot exist in the GMEC and is eliminated. Mathematically: If then ir does not belong to the GMEC

Dead End Elimination (DEE) Theorem This is impractical as it requires {S}. It can be simplified to: If then ir does not belong to the GMEC Iteratively eliminate high energy rotamers: proved to converge to GMEC Desmet, J, De Maeyer, M, Hazes, B, Lasters, I. Nature, 356:539-542 (1992)

Other methods for side-chain modeling Heuristics (Monte Carlo, Simulated Annealing) SCWRL (Dunbrack) Pruning techniques Mean field methods

Loop building + Sidechain Modeling: generalized SCMF Template Add multi-copies of candidate “loops” Add multi-copies of candidate side-chains Final model Koehl and Delarue. Nature Struct. Bio. 2, 163-170 (1995)

Homology Modeling Presentation Fold recognition Model building Loop building Sidechain modeling Refinement Testing methods: the CASP experiment

Refinement ? CASP6 assessors, homology modeling category: “We are forced to draw the disappointing conclusion that, similarly to what observed in previous editions of the experiment, no model resulted to be closer to the target structure than the template to any significant extent.” The consensus is not to refine the model, as refinement usually pulls the model away from the native structure!!

Homology Modeling: Practical guide Approach 2: Submit target sequence to automatic servers - Fully automatic: - 3D-Jigsaw : http://www.bmm.icnet.uk/servers/3djigsaw/ - EsyPred3D: http://www.fundp.ac.be/urbm/bioinfo/esypred/ - SwissModel: http://swissmodel.expasy.org//SWISS-MODEL.html - Fold recognition: - 3D-PSSM: http://www.sbg.bio.ic.ac.uk/~3dpssm/ - Useful sites: - Meta server: http://bioinfo.pl/Meta - PredictProtein: http://cubic.bioc.columbia.edu/predictprotein/

Comparative Modeling Server & Program COMPOSER http://www.tripos.com/sciTech/inSilicoDisc/bioInformatics/matchmaker.html MODELER http://salilab.org/modeler InsightII http://www.msi.com/ SYBYL http://www.tripos.com/