Programme 8.00-8.20Last week’s quiz results + Summary 8.20-9.00Fold recognition 9.00-9.15Break 9.15-11.20Exercise: Modelling remote homologues 11.20-11.40Summary.

Slides:



Advertisements
Similar presentations
Review.
Advertisements

François Fages MPRI Bio-info 2007 Formal Biology of the Cell Protein structure prediction with constraint logic programming François Fages, Constraint.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Protein Structure Prediction using ROSETTA
Proteins Function and Structure.
Review: Amino Acid Side Chains Aliphatic- Ala, Val, Leu, Ile, Gly Polar- Ser, Thr, Cys, Met, [Tyr, Trp] Acidic (and conjugate amide)- Asp, Asn, Glu, Gln.
FUNDAMENTALS OF MOLECULAR BIOLOGY Introduction -Molecular Biology, Cell, Molecule, Chemical Bonding Macromolecule -Class -Chemical structure -Forms Important.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Fold Recognition Ole Lund, Assistant professor, CBS.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
Protein Fold recognition Morten Nielsen, Thomas Nordahl CBS, BioCentrum, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Thomas Blicher Center for Biological Sequence Analysis
Fold Recognition Ole Lund, Associate professor, CBS.
Protein Fold recognition
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Protein Structural Prediction. Protein Structure is Hierarchical.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Structural alignment Protein structure Every protein is defined by a unique sequence (primary structure) that folds into a unique.
Proteins account for more than 50% of the dry mass of most cells
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
©CMBI 2006 Amino Acids “ When you understand the amino acids, you understand everything ”
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Representations of Molecular Structure: Bonds Only.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Department of Mechanical Engineering
Secondary structure prediction
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Doug Raiford Lesson 19.  Framework model  Secondary structure first  Assemble secondary structure segments  Hydrophobic collapse  Molten: compact.
©CMBI 2009 Alignment & Secondary Structure You have learned about: Data & databases Tools Amino Acids Protein Structure Today we will discuss: Aligning.
Structure prediction: Homology modeling
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Proteins.
Chapter 3 Proteins.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Ab-initio protein structure prediction ? Chen Keasar BGU Any educational usage of these slides is welcomed. Please acknowledge.
Modelling genome structure and function Ram Samudrala University of Washington.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Prepared By: Syed Khaleelulla Hussaini. Outline Proteins DNA RNA Genetics and evolution The Sequence Matching Problem RNA Sequence Matching Complexity.
Protein Structure Visualisation
Computational Structure Prediction
Protein Structure and Properties
Protein Structure Prediction and Protein Homology modeling
Fig. 5-UN1  carbon Amino group Carboxyl group.
Rosetta: De Novo determination of protein structure
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
Homology Modeling.
Protein structure prediction.
Programme Last week’s quiz results + Summary
Protein Homology Modelling
Homology modeling in short…
Presentation transcript:

Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary & discussion Quiz 1

Feedback Persons 2

Homology Modelling Revisited 3

Why Do We Need Homology Modelling? Ab Initio protein folding (random sampling): –100 aa, 3 conf./residue gives approximately different overall conformations! Random sampling is NOT feasible, even if conformations can be sampled at picosecond ( sec) rates. –Levinthal’s paradox Do homology modelling instead. 4

How Is It Possible? The structure of a protein is uniquely determined by its amino acid sequence (but sequence is sometimes not enough): –prions –pH, ions, cofactors, chaperones Structure is conserved much longer than sequence in evolution. –Structure > Function > Sequence 5

How Is It Done? Identify template(s) –Initial alignment Improve alignment Backbone generation Loop modelling Side chains Refinement Validation  6

PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO GLU ALA ILE CYS PHE ASN VAL CYS ARG THR PRO GLU ALA ILE CYS From ”Professional Gambling” by Gert Vriend Improving the Alignment 7

Template Quality Selecting the best template is crucial! The best template may not be the one with the highest % id (best p-value…) –Template 1: 93% id, 3.5 Å resolution  –Template 2: 90% id, 1.5 Å resolution 8

Error Recovery Errors in the model can NOT be recovered at a later step –The alignment can not make up for a bad choice of template. –Loop modeling can not make up for a poor alignment. The step where the errors were introduced should be redone. 9

Validation Most programs will get the bond lengths and angles right. Model Rama. plot ~ template Rama. plot. –select a high quality template! Inside/outside distributions of polar and apolar residues. ✓ 10

Summary Successful homology modelling depends on the following: –Template quality –Alignment (add biological information) –Modelling program/procedure (use more than one) Always validate your final model! 11

Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary & discussion Quiz 12

Fold recognition and ab initio protein structure prediction by Pernille Andersen 13

Outline Threading and pair potentials Ab initio structure prediction methods Human intervention (what kind of knowledge can be used for alignment and selection of templates?) Meta-servers (the principle, 3d jury) Summary of take-home messages 14

Threading and pair potentials Compares a given sequence against known structures (folds) Potentials that describe tendencies observed in known protein structures Example: Pair potentials How normal is it to observe a pair of an alanine and a valine separated by 20 residues in the sequence and 3Å in space? (X) How normal is it to observe any pair of residues separated by 20 residues and 3Å in space? (Y) Potential: E= -log (X/Y) 15

Alignment score from structural fitness (pair potential) How well does K fit environment at P6? If P8 is acidic then fine, if P8 is basic then poor Potentials of mean force A T N L Y K E T L.. Deletions 16

Threading methods today Problem: No protein is average Interactions in proteins cannot only be described by pairs of amino acids The information in the potentials is partly captured with sequence profiles or HMMs Today mostly used in HYBRID approaches in combination with profile-profile based methods Potentials can be used to score models based on different templates or alignments HMM alignment, hhpred 17

Fold recognition models in CASP6 Two-high-scoring predictions by the top groups in FR/H (top) and FR/A (bottom). The assigned z-scores are given for the top predictions (center) as well as for two average predictions (right). G. Wang Assessment of fold recognition predictions in CASP6, Proteins 61, S7, Pages

Ab initio/ free modeling methods Aim is to find the fold of native protein by simulating the biological process of protein folding. A VERY DIFFICULT task because a protein chain can fold into millions of different conformations. Use it only when no detectable homologues can be found. Methods can also be useful for fold recognition in cases of extremely low homology (e.g. convergent evolution). 19

Fragment-based ab initio modelling Rosetta method of the Baker group: –Secondary structure prediction –Fragments library of 3 and 9 residues from known structures –Link fragments together, use only backbone and CB atoms –Contact/pair potential –Energy minimization techniques (Monte Carlo optimization) to calculate tertiary structure –Refine structure including side chains Das R, Baker D, Annu. Rev. Biochem :363–

Energy minimization The energy of the whole protein model is minimized to obtain the final model 21

Potentials for finding good models Potentials should make models more “native-like” van der Waal’s attractive/repulsive forces Pair potentials Contact number potentials Back bone torsion angle potential Solvation potentials Hydrogen bond potentials Side chain rotamer potentials Uroplatus Fimbriatus (gecko) 22

Problems with empirical potentials Fragments with correct local structure Nature’s potential Empirical potential 23

CASP6 & Ab Initio (new folds category) Excellent modelling Hardest target The Baker group ( #100) was among the top scoring 24

Human intervention The best groups in CASP use maximum knowledge of query proteins Specialists can help to find a correct template and correct alignments Knowledge of function Cysteines forming disulfide bridges or binding e.g. zinc molecules Proteolytic cleavage sites Other metal binding residues Antibody epitopes or escape mutants Ligand binding Results from CD or fluorescence experiments 25

Fold It: The Protein Folding Game Rosetta Energy Potentials Uses the HUMAN brain’s pattern recognition resources for finding the lowest energy fold Human intervention II 26

Meta-servers Democratic modeling –The highest scoring hit is often wrong –Many prediction methods have the correct fold among the top hits –If many different prediction methods all have the same fold among the top hits, this fold is probably correct Server 1 Template 1 -> Model 1 Template 2 -> Model 2 Template 3 -> Model 3 Server 2 Template 1 -> Model 1 Template 2 -> Model 2 Template 3 -> Model 3 Server 3 Template 2 -> Model 1 Template 2 -> Model 2 Template 3 -> Model 3 27

Example of a meta-server 3DJury –Inspired by Ab initio modeling methods Average of frequently obtained low energy structures is often closer to the native structure than the lowest energy structure –Find most abundant high scoring model in a list of prediction from several predictors 1.Use output from a set of servers 2.Superimpose all pairs of structures 3.Similarity score based on # of Cα pairs within 3.5Å –Similar methods developed by A. Elofsson (Pcons and D. Fischer (3D shotgun) 28

3DJury Because it is a meta- server it can be slow If queue is too long some servers are skipped Alternative conformations for a sequence are easily obtained 29

Take home messages Hybrid methods using both threading methods and profile-profile alignments are the best Use only Ab initio methods if necessary and know that the quality is really low! Try to use as much knowledge as possible for alignment and template selections in difficult cases Use meta-servers when you can TRY FOLDIT! 30

Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary & discussion Quiz 31