Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.

Slides:



Advertisements
Similar presentations
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
Advertisements

Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Protein Tertiary Structure Prediction
Structural bioinformatics
Space/Time Tradeoff and Heuristic Approaches in Pairwise Alignment.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Protein structure (Part 2 of 2).
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
The 7 steps of Homology modeling. 1: Template recognition and initial alignment.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Similar Sequence Similar Function Charles Yan Spring 2006.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Blast heuristics Morten Nielsen Department of Systems Biology, DTU.
Homology Modeling Seminar produced by Hanka Venselaar.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
COMPARATIVE or HOMOLOGY MODELING
Protein Sequence Alignment and Database Searching.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Protein Structure Prediction. Historical Perspective Protein Folding: From the Levinthal Paradox to Structure Prediction, Barry Honig, 1999 A personal.
Representations of Molecular Structure: Bonds Only.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Structure prediction: Homology modeling
New Strategies for Protein Folding Joseph F. Danzer, Derek A. Debe, Matt J. Carlson, William A. Goddard III Materials and Process Simulation Center California.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Homology modeling with SWISS-MODEL
Pairwise Local Alignment and Database Search Csc 487/687 Computing for Bioinformatics.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
Protein Structure Prediction Graham Wood Charlotte Deane.
Protein Homologue Clustering and Molecular Modeling L. Wang.
Homology Modeling 原理、流程,還有如何用該工具去預測三級結構 Lu Chih-Hao 1 1.
Query sequence MTYKLILNGKTKGETTTEAVDAATAEKVFQYANDN GVDGEWTYTE Structure-Sequence alignment “Structure is better preserved than sequence” Me! Non-redundant.
Comparative methods Basic logics: The 3D structure of the protein is deduced from: 1.Similarities between the protein and other proteins 2.Statistical.
Modelling genome structure and function Ram Samudrala University of Washington.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Protein dynamics Folding/unfolding dynamics
Protein dynamics Folding/unfolding dynamics
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Protein structure prediction.
Homology modeling in short…
Presentation transcript:

Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002

Protein Structure DNA Sequence  Protein Sequence  Structure  (Mis)function DNA Sequence  Protein Sequence  Structure  (Mis)function It is believed that all the information necessary to determine the structure of a protein is present in its primary sequence. It is believed that all the information necessary to determine the structure of a protein is present in its primary sequence.

Protein Folding Programs Protein folding is one of the biggest computational challenges Protein folding is one of the biggest computational challenges Different types of folding and structure predictions programs Different types of folding and structure predictions programs Simulations Simulations Homology Modeling Approaches Homology Modeling Approaches

Simulations Simulate the real behavior of proteins Simulate the real behavior of proteins High detail, short time scales High detail, short time scales 2 main simulation types 2 main simulation types Molecular Dynamics Molecular Dynamics Monte Carlo Monte Carlo

The Energy Function Calculate energies for each particle Calculate energies for each particle Since long range interactions important for each pair of particles the pair-wise interactions should be calculated Since long range interactions important for each pair of particles the pair-wise interactions should be calculated

Homology Modeling Template Selection and Fold Assignment Template Selection and Fold Assignment Target – Template Alignment Target – Template Alignment Model Building Model Building Loop Modeling Loop Modeling Sidechain Modeling Sidechain Modeling Model Evaluation Model Evaluation

Fold Assignment and Template Selection Identify all protein structures with sequences related to the target, then select templates Identify all protein structures with sequences related to the target, then select templates 3 main classes of comparison methods 3 main classes of comparison methods Compare the target sequence with each database sequence independently, pair-wise sequence – sequence comparison, BLAST and FASTA Compare the target sequence with each database sequence independently, pair-wise sequence – sequence comparison, BLAST and FASTA Multiple sequence comparisons to improve sensitivity, PSI-BLAST Multiple sequence comparisons to improve sensitivity, PSI-BLAST Threading or 3-D template matching methods Threading or 3-D template matching methods

Target – Template Alignment Most important step in Homology Modeling Most important step in Homology Modeling A specialized method should be used for alignment A specialized method should be used for alignment Over 40% identity the alignment is likely to be correct. Over 40% identity the alignment is likely to be correct. Regions of low local sequence similarity become common when overall sequence identity is under 40%. (Saqi et al., Protein Eng. 1999) Regions of low local sequence similarity become common when overall sequence identity is under 40%. (Saqi et al., Protein Eng. 1999) The alignment becomes difficult below 30% sequence identity. (Rost, Protein Eng. 1999) The alignment becomes difficult below 30% sequence identity. (Rost, Protein Eng. 1999)

Model Building Construct a 3-D model of the target sequence based on its alignment on template structures Construct a 3-D model of the target sequence based on its alignment on template structures Three different model building approaches Three different model building approaches Modeling by rigid body assembly Modeling by rigid body assembly Modeling by segment matching Modeling by segment matching Modeling by satisfaction of spatial restraints Modeling by satisfaction of spatial restraints Accuracies of these models are similar Accuracies of these models are similar Template selection and alignment have larger impact on the model Template selection and alignment have larger impact on the model

Swiss-MOD Web Server Screenshots from the Homology Modeling Server Swiss-Model Construct a framework using known protein structures Generate the location of the target amino acids on the framework If loop regions not determined, additional database search or short simulations

Procedure of the MODELLER program After obtaining restraints run a geometry optimization or real- space optimization to satisfy them

Errors in Homology Models a.Errors in sidechain packing b.Distortions and shifts in correctly aligned regions c.Errors in regions without a template

d. Errors due to misalignment e. Incorrect templates

Model Building Programs COMPOSERPwww-cryst.bioc.cam.ac.uk CONGENPwww.congenomics.com/congen/congen.html CPH modelsSwww.cbs.dtu.dk/services/CPHmodels/ DRAGONPwww.nimr.mrc.ac.uk/~mathbio/a-aszodi/dragon.html ICMPwww.molsoft.com InsightIIPwww.msi.com MODELLERPguitar.rockefeller.edu/modeller/modeller.html LOOKPwww.mag.com QUANTAPwww.msi.com SYBYLPwww.tripos.com SCWRLPwww.cmpharm.ucsf.edu/~bower/scrwl/scrwl.html SWISS-MODSwww.expasy.ch/swissmod WHAT IFPwww.sander.embl-heidelberg.de/whatif/

Applications

Critical Assessment of protein Structure Prediction (CASP) Venclovas et al. Proteins, 2001

Critical Assessment of protein Structure Prediction (CASP) Venclovas et al. Proteins, 2001

Conclusions Computer Simulations are powerful to show detailed motions but they cannot cover long enough time spans to simulate folding for large systems Computer Simulations are powerful to show detailed motions but they cannot cover long enough time spans to simulate folding for large systems Homology Modeling techniques can be successful if the target protein has a known fold Homology Modeling techniques can be successful if the target protein has a known fold The higher the sequence similarity the more likely the model will be successful The higher the sequence similarity the more likely the model will be successful With the implementation of better techniques the errors in fold assignment, alignment, and sidechain and loop modeling are decreasing With the implementation of better techniques the errors in fold assignment, alignment, and sidechain and loop modeling are decreasing Theoretically, if at least one member of every possible fold is known, it is possible to predict the structure of every coding sequence to within a certain accuracy Theoretically, if at least one member of every possible fold is known, it is possible to predict the structure of every coding sequence to within a certain accuracy