3-Dimensional Structure

Slides:



Advertisements
Similar presentations
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Advertisements

Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
Protein Structure, Databases and Structural Alignment
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Protein Tertiary Structure Prediction. Protein Structure Prediction & Alignment Protein structure Secondary structure Tertiary structure Structure prediction.
Energetics and kinetics of protein folding. Comparison to other self-assembling systems?
©CMBI 2002 Homology modelling ? X-ray ? NMR ? Intro Proteins Modelling 8 Steps Detect Threading Alignment Template Side chain Indels Optimize Validate.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Identification of Domains using Structural Data Niranjan Nagarajan Department of Computer Science Cornell University.
CISC667, F05, Lec16, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Phylogenetic Trees (III) Probabilistic methods.
Genetic Threading By J.Yadgari and A.Amir Published: special issue on Bioinformatics in Journal of Constraints, June 2001 Alexandre Tchourbanov University.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
CISC667, F05, Lec8, Liao CISC 667 Intro to Bioinformatics (Fall 2005) Multiple Sequence Alignment Scoring Dynamic Programming algorithms Heuristic algorithms.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Protein Structure Prediction and Analysis
Protein Structural Prediction. Protein Structure is Hierarchical.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
Protein Structure Prediction. Historical Perspective Protein Folding: From the Levinthal Paradox to Structure Prediction, Barry Honig, 1999 A personal.
Representations of Molecular Structure: Bonds Only.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Doug Raiford Lesson 19.  Framework model  Secondary structure first  Assemble secondary structure segments  Hydrophobic collapse  Molten: compact.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Approximation Algorithms For Protein Folding Prediction Giancarlo MAURI,Antonio PICCOLBONI and Giulio PAVESI Symposium on Discrete Algorithms, pp ,
Protein structure prediction Anttu Kurttio Ville Pietiläinen.
Protein Structure Prediction
New Strategies for Protein Folding Joseph F. Danzer, Derek A. Debe, Matt J. Carlson, William A. Goddard III Materials and Process Simulation Center California.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Ab-initio protein structure prediction ? Chen Keasar BGU Any educational usage of these slides is welcomed. Please acknowledge.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
CISC667, S07, Lec25, Liao1 CISC 467/667 Intro to Bioinformatics (Spring 2007) Review Session.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Chapter 14 Protein Structure Classification
Protein Structure Prediction and Protein Homology modeling
Protein Structure Prediction
Protein Structures.
Yang Zhang, Andrzej Kolinski, Jeffrey Skolnick  Biophysical Journal 
Rosetta: De Novo determination of protein structure
The Levinthal paradox: yesterday and today
Homology Modeling.
Protein structure prediction.
謝孫源 (Sun-Yuan Hsieh) 成功大學 電機資訊學院 資訊工程系
Protein structure prediction
Homology modeling in short…
Presentation transcript:

3-Dimensional Structure CISC 467/667 Intro to Bioinformatics (Spring 2007) Protein Structure Prediction 3-Dimensional Structure CISC667, S07, Lec21, Liao

Computational Methods for 3-D structures Foundation Anfinsen Hypothesis: A native state of protein corresponds to a free energy minimum. (This hypothesis was by Anfinsen based on some experimental findings for smaller molecules, 1973) Levinthal paradox: how can a protein fold to a native conformation so rapidly when the number of possible conformations is so huge? Experimental observation: proteins fold into native conformations in a few seconds If a local move by diffusion takes 10-11 seconds, it would take 1057 seconds to try ~ 5100 possible conformations for a protein of 100 AAs. NP-completeness Computational Methods for 3-D structures Comparative (find homologous proteins) Threading (recognize folds) Ab initio (Molecular dynamics) CISC667, S07, Lec21, Liao

Root Mean Square Deviation (RMSD) Credit: Chris Bystroff @ RPI CISC667, S07, Lec21, Liao

Comparative (homology based) modeling Identification of structurally conserved regions (using multiple alignment) Backbone construction Loop construction Side-chain restoration Structure verification and evaluation Structure refinement (energy minimization) CISC667, S07, Lec21, Liao

Credit: Chris Bystroff @ RPI CISC667, S07, Lec21, Liao

Credit: Chris Bystroff @ RPI CISC667, S07, Lec21, Liao

Credit: Chris Bystroff @ RPI CISC667, S07, Lec21, Liao

When two homologous proteins do not have high sequence similarity Protein Threading When two homologous proteins do not have high sequence similarity Structure of one protein, P1, is known The goal is to predict the structure for the other protein P2 Identify core regions for P1 Scoring a given alignment with a known structure (Contact potentials) CISC667, S07, Lec21, Liao

Threading: A schematic view CISC667, S07, Lec21, Liao

Lattice Models Simplifications HP model All residues have the same size Bond length is uniform Positions of residues are restricted to positions in a regular lattice (or grid). HP model Energy function Bi,j for a pair of residues wi and wj = 0 when the two residues do not have contact (i.e. not topological neighbors), or one residue is polar/hydrophilic and the other is hydrophobic, or both are polar/hydrophilic = -1 when two residues are topological neighbors, and both are hydrophobic. Note: Despite these simplifications, it is still NP complete to find an optimal conformation CISC667, S07, Lec21, Liao

E =  1 i+1 < j  n Bi,j (wi, wj ) 2-dimensional and 3-dimensional lattice Two residues wi and wj are connected neighbors when j = i+1 or i-1. Two residues wi and wj are topological neighbors when j  i+1 or i-1, and || wi - wj || = 1 A native state is a conformation that has the minimum contact energy E =  1 i+1 < j  n Bi,j (wi, wj ) where (wi, wj ) = 1 when wi and wj are topological neighbor, and =0 otherwise. The HP model approximates the hydrophobic force, which is not really a force, but rather an aggregate tendency for nonpolar residues to minimize their contact with the solvent. CISC667, S07, Lec21, Liao

Example HP model P H Peptide bond Topological contact 1 7 2 3 6 5 4 CISC667, S07, Lec21, Liao

The HP model by SSK (Sali, Shakhnovich, and Karplus, 1994) A 27-bead heteropolymer in a 3-d lattice. Contact energy normally distributed Possible conformation: 526 ~ 1018 Compact conformation: in a 3x3x3 cube there are 103346 distinct conformations Folding time t0 is short if energy gap is large Solved Levinthal paradox CISC667, S07, Lec21, Liao

CISC667, S07, Lec21, Liao

Local moves Corner move Impossible move Crankshaft move End move Corner move Impossible move Crankshaft move CISC667, S07, Lec21, Liao

Crossover to create new conformations 10 00 01 01 11 10 00 00 00 10 Credit: Iosif Vaisman @ GAU CISC667, S07, Lec21, Liao

Genetic algorithm Input Initialize population (randomly) P, the population, r: the fraction of population to be replaced, f, a fitness, ft, the fitness_threshold, m: the rate for mutation. Initialize population (randomly) Evaluate: for each h in P, compute Fitness(h) While [Maxh f(h)] < ft do Select Crossover Mutate Update P with the new generation Ps Evaluate: f(h) for all h  P Return the h in P that has the best fitness CISC667, S07, Lec21, Liao

Unger-Moult hybrid genetic algorithm initialize population P(t) of random coils best = argmax {F(x) | x in P(t)} repeat { pointwise mutation n = 0 while (n < P) { // genetic algorithm part select 2 chromosomes m, f produce child c by crossover of m, f ave = average( F(m), F(f)) if(F(c) >= ave) { place c in next generation; n++ } else { // Metropolis part z = random(0,1) if (z < e -(ave –F(c)/T ) { } }// end while update best check if stopping criterion is met CISC667, S07, Lec21, Liao

Ab initio approaches: CISC667, S07, Lec21, Liao Credit: Iosif Vaisman @ GMU CISC667, S07, Lec21, Liao

Credit: Iosif Vaisman @ GMU CISC667, S07, Lec21, Liao

Credit: Iosif Vaisman @ GMU CISC667, S07, Lec21, Liao

Credit: Iosif Vaisman @ GMU CISC667, S07, Lec21, Liao

Homology modeling programs http://www.expasy.ch/swissmod Resources Homology modeling programs http://www.expasy.ch/swissmod Threading: http://www.ncbi.nlm.nih.gov/Structure/RESEARCH/threading.shtml CISC667, S07, Lec21, Liao