A new Approach to Structural Prediction of Proteins Heiko Schröder Bertil Schmidt Jiujiang Zhu School of Computer Engineering Nanyang Technological University.

Slides:



Advertisements
Similar presentations
Carbohydrates, Lipids, Proteins, and Nucleic Acids
Advertisements

PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Protein Structure Prediction
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
1 September, 2004 Chapter 5 Macromolecular Structure.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Mossbauer Spectroscopy in Biological Systems: Proceedings of a meeting held at Allerton House, Monticello, Illinois. Editors: J. T. P. DeBrunner and E.
Massively Parallel Solutions for Molecular Sequence Analysis Bertil Schmidt School of Computer Engineering, Nanyang Technological University, Singapore.
The Protein Data Bank (PDB)
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
1. Primary Structure: Polypeptide chain Polypeptide chain Amino acid monomers Peptide linkages Figure 3.6 The Four Levels of Protein Structure.
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
Protein Structures.
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Evolving Models of Biological Sequence Similarity Daniel P. Miranker The University of Texas at Austin [Chenetal98]
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Representations of Molecular Structure: Bonds Only.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Doshisha Univ. JapanGECCO2002 Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Takeshi YoshidaTomoyuki.
ProteinShop: A Tool for Protein Structure Prediction and Modeling Silvia Crivelli Computational Research Division Lawrence Berkeley National Laboratory.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
10/3/2003 Molecular and Cellular Modeling 10/3/2003 Introduction Objective: to construct a comprehensive simulation software system for the computational.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Structure Prediction
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Examining Protein Folding Process Simulation and.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Protein Structure  The structure of proteins can be described at 4 levels – primary, secondary, tertiary and quaternary.  Primary structure  The sequence.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Find the optimal alignment ? +. Optimal Alignment Find the highest number of atoms aligned with the lowest RMSD (Root Mean Squared Deviation) Find a balance.
NUS CS 5247 David Hsu Protein Motion. NUS CS 5247 David Hsu2 What is a protein?  Primary level - a sequence of alphabets (amino acid molecules)  Amino.
Structural organization of proteins
Four Levels of Protein Structure
Protein Structure BL
Computational Structure Prediction
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
3.11 Proteins are essential to the structures and activities of life
Protein Structure Prediction and Protein Homology modeling
Protein Synthesis.
Proteins Basic structure of an amino acid
. Nonpolar (hydrophobic) Nonpolar (hydrophobic) Amino Acid Side Chains
Introduction to Bioinformatics II
Study Question: What are enzymes?
Protein Structure Prediction
Protein Structures.
Prediction of protein structure
Protein Structure Chapter 14.
Protein structure prediction.
Proteins.
Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Doshisha University, Kyoto, Japan Takeshi Yoshida.
Protein structure prediction
Four Levels of Protein Structure
Presentation transcript:

A new Approach to Structural Prediction of Proteins Heiko Schröder Bertil Schmidt Jiujiang Zhu School of Computer Engineering Nanyang Technological University Singapore

Contents zProtein Structure zProtein Structure Prediction zApproach based on Local Protein Structure zRefinements zConclusions and Future Work

Protein Structure zProteins are large molecules composed of smaller molecules called amino acids zThere are 20 kinds of amino acids found in natural proteins zAll share a common structure R side chain carboxyl groupamine group alpha carbon (with attached hydrogen)

Protein Structure

From Primary to Tertiary Structure zA protein’s 3D shape is determined by its primary amino acid sequence (Anfinsen, 1963) zPredicting tertiary structure from amino acid sequence is an unsolved problem yDifficult to model the energies that stabilize a protein molecule yConformational search space is enormous

Prediction Methods zGiven an amino acid sequence: ysearch a set of known folds by aligning sequence and a template fold representative ypredict the fold that gets the best scoring alignment Target amino acid sequence Template Fold library YLAADTYK Template amino acid sequence FISSETCNMEPSSYVTGLIRKN Target/template Score: 7212

Prediction Methods zThis method is very effective when target and template have >30% sequence identity zApproximately 1/3 of protein sequences can be assigned folds and modeled this way zOur aim is to contribute to determine tertiary structures in case matching sequences cannot be found

Local structure and prediction zWhat is Local structure ? ydescribes environment of an amino acid yan amino acid’s relationship to neighbors zwe use this information to predict structure from primary sequence

Dihedral Angles zThe 6 atoms in each peptide unit lie in the same plane   and  free to rotate  The structure of a protein is almost totally determined, if all angles  and  are known

Idea of our Approach zStiff  free zlocal predictability  database of sub-chain structures zreduction of the number of degrees of freedom by 10, reduces the computation time significantly in combination with a global optimization algorithm (e.g. GA or SA) Side chains Back bone CC  and  C N

Classification of Dihedral Angles Selected PDB structures Dihedral angle extraction Histogram for each amino acids pair stiff multiple flexible

Classification of Dihedral Angles  ALA-ALA Frequency   LEU-ARG Frequency   GLY-ILE Stiff multiple flexible

Classification of Dihedral Angles Selected PDB structures Dihedral angle extraction Histogram for each amino acids pair stiff multiple flexible zStiff angles: determine mean value zMultiple angles: determine sequence of mean values, one for each peak in decreasing order of these peaks zFlexible angles: determine mean value and mark as flexible

Prediction based on Classification zGiven a sequence of amino acids, find the subsequence in which all angles are of type stiff zpredict structure of these subsequences, using the mean values of the corresponding histograms

Prediction based on Classification zPart of a protein predicted with this method (backbone of a helix, original structure on the left, predicted structure on the right) zSuccessfully predicted certain stiff structures of subsequences up to the length of 15

Refinement of the method zFor multiple angles: yconsider sequences of length 3 or 4:  extract sequences (C,A,B,D) and determine the histogram of angles  and  related to the peptide chain between A and B  if histogram for  for amino acids (A,B) is multiple, check if angle for (A,B,C,D) is stiff xwith longer subsequences the occurrences of these sequences drops dramatically

Refinement of the method zFor multiple angles: yif an amino acid sequence has only a small number of multiple edges, it is possible to try all combinations of possible peaks ymany combinations lead to collisions in part of the protein, and thus can be eliminated

Conclusion and Future Work zPresented a method to predict stiff structures of subsequences up to the certain length zPresented a refinement of the method to handle multiple angles zhow to handle flexible angles ? zUsing the local prediction as an input for a global optimization method, e.g. based on Simulated Annealing