1 Chapter 7 Protein and RNA Structure Prediction 暨南大學資訊工程學系 黃光璿 2004/05/24.

Slides:



Advertisements
Similar presentations
1 Amino acid and proteins Ghollam-Reza Moshtaghi-Kashanian Biochemistry Department Medical School Kerman University of Medical sciences.
Advertisements

1 Chapter 1 Molecular Biology and Biological Chemistry 暨南大學資訊工程學系 黃光璿 (HUANG, Guan-Shieng) 2004/02/23.
Protein Structure C483 Spring 2013.
Pages 42 to 46.  Chemical composition  Carbon  Hydrogen  Oxygen  Nitrogen  Sulfur (sometimes)  Monomer/Building Block  Amino Acids (20 different.
1 Chapter 2 Data Searches and Pairwise Alignments 暨南大學資訊工程學系 黃光璿 2004/03/08.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
1 Chapter 8Proteomics 暨南大學資訊工程學系 黃光璿 2004/06/07 2 proteome  the sum total of an organism’s proteins genome  the sum total of an organism’s genetic.
1 Chapter 3 Substitution Patterns 暨南大學資訊工程學系 黃光璿 (HUANG, Guan-Shieng) 2004/03/22.
1 蛋白質簡介 暨南大學資訊工程系 2003/05/06. 2 蛋白質是由 20 種胺基酸所組成.
Polypeptides – a quick review A protein is a polymer consisting of several amino acids (a polypeptide) Each protein has a unique 3-D shape or Conformation.
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
1. Primary Structure: Polypeptide chain Polypeptide chain Amino acid monomers Peptide linkages Figure 3.6 The Four Levels of Protein Structure.
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
Proteins Structures Primary Structure.
AMINO ACIDS AND PROTEINS
Protein Structures.
Proteins: Levels of Protein Structure Conformation of Peptide Group
Protein Structural Prediction. Protein Structure is Hierarchical.
1 Multiple Sequence Alignment 暨南大學資訊工程學系 黃光璿 2004/05/31.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
CAP5510 – Bioinformatics Protein Structures
Proteins: Secondary Structure Alpha Helix
Proteins: Amino Acid Chains DNA Polymerase from E. coli Standard amino acid backbone: Carboxylic acid group, amino group, the alpha hydrogen and an R group.
3.2 Proteins Mini Lecture Radjewski. Major functions of proteins: Enzymes—catalytic proteins Defensive proteins (e.g., antibodies) Hormonal and regulatory.
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
Amino acids and proteins … for AS Biology. Amino acids Proteins are macromolecules consisting of long unbranched chains of amino acids. All amino acids.
BIOL 200 (Section 921) Lecture # 2, June 20, 2006 Reading for lecture 2: Essential Cell Biology (ECB) 2nd edition. Chap 2 pp 55-56, 58-64, 74-75; Chap.
Department of Mechanical Engineering
Doug Raiford Lesson 19.  Framework model  Secondary structure first  Assemble secondary structure segments  Hydrophobic collapse  Molten: compact.
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Operone lac Principles of protein structure and function Function is derived from structure Structure is derived from amino acid sequence Different.
Mrs. Einstein Research in Molecular Biology. Importance of proteins for cell function: Proteins are the end product of the central dogma YOU are your.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure 1 Primary and Secondary Structure.
Protein Structure (Foundation Block) What are proteins? Four levels of structure (primary, secondary, tertiary, quaternary) Protein folding and stability.
Protein structure and function Part - I
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
1 / 45 Chapter 1 Amino Acids to Proteins 1.1 Protein Composition 1.2 Protein Conformations 1.3 Protein Structure and Function: A Few Examples 1.4 The Dynamics.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Chapter 3. Protein structure and function. Proteins are the most versatile macromolecules in living systems. serve crucial functions in essentially all.
Biological-Engineering for Beginners Biochemistry II: Proteins Leigh Casadaban and Alina Gatowski July 26, 2009.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Proteins Dr. Sumbul Fatma Clinical Chemistry Unit Department of Pathology Tel
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Proteins Biochemistry Unit 1. What You Need to Know! How to recognize protein by its structural formula The cellular function of proteins The four structural.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Proteins Structure Predictions Structural Bioinformatics.
Tymoczko • Berg • Stryer © 2015 W. H. Freeman and Company
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
Structure and Function
Structural organization of proteins
Mir Ishruna Muniyat. Primary structure (Amino acid sequence) ↓ Secondary structure ( α -helix, β -sheet ) ↓ Tertiary structure ( Three-dimensional.
Four Levels of Protein Structure
Nucleic Acids & Proteins
Protein Structure BL
Principles of protein structure and stability.
Lecture 5 Protein Structure.
Conformationally changed Stability
. Nonpolar (hydrophobic) Nonpolar (hydrophobic) Amino Acid Side Chains
Introduction to Bioinformatics II
The Chemistry of Life Proteins
Protein Structures.
Conformationally changed Stability
Fig 3.13 Reproduced from: Biochemistry by T.A. Brown, ISBN: © Scion Publishing Ltd, 2017.
Protein structure prediction
The Three-Dimensional Structure of Proteins
Four Levels of Protein Structure
Presentation transcript:

1 Chapter 7 Protein and RNA Structure Prediction 暨南大學資訊工程學系 黃光璿 2004/05/24

2 Proteins Built from a repertoire of 20 amino acids

3

4 7.1 Amino Acids

5 胺基酸 中心碳 胺基( NH 2 ) COOH 氫( H ) 側鏈( side chain, R )

6 同分異構物

7

8

9 Fig. 7.2

10

11

12

13 pH, pK a, and pI pH  -log [H + ] pK a  = pH ~ half of the amino acid residues will dissociate ( 釋放出 H + ). pI  = pH, isoelectric point for protein

Polypeptide Composition

15

Secondary Structure

Backbone Flexibility

18 Conformation of Polypeptide Chain

19 Ramachandran Plot N: 藍 C: 黑 O: 紅 H: 白

20 二級結構( Secondary Structure ) Alpha helix

21 Beta sheet

22

23 Beta turn

24 Loop

Accuracy of Prediction Computational methods  neural network  discrete-state models  hidden Markov models  nearest neighbor classification  evolutionary computation

26 PHD, Predator  structure prediction algorithms  accuracies in the range 70% ~ 75%

Chou-Fasman Method

28 Identifying Alpha Helices 1. Find all regions where four out of six have P(a)> Extend the regions until four with P(a) < 100 in both directions. 3. If ΣP(a) > ΣP(b) and the stretch >5, then it is identified as a helix.

29 Identifying Beta Sheets 1. Find all regions where four out of six have P(b)> Extend the regions until four with P(b) < 100 in both directions. 3. If ΣP(b) > ΣP(a) and the average value of P(b) over the stretch >100, then it is identified as a helix.

30 Resolving Overlapping Regions 1. Identified as helix if ΣP(a) > ΣP(b), as sheet if ΣP(b) > ΣP(a) over the overlapping regions.

31 Identifying Turns 1. Let P(t) = f(i)xf(i+1)xf(i+2)xf(i+3) for each position i. 2. Identify as a turn if 1. P(t) > ; 2. The average of P(turn) over the four residues > 100; 3. ΣP(a) ΣP(b) over the four residues.

GOR Method on a window of 17 residues

Tertiary and Quaternary Structure

34 三級結構( Tertiary Structure ) 折疊成立體的形狀

35 四級結構( Quaternary Structure ) 數個三級結構結合成具 有功能的大分子 人類的血球蛋白

36 Driving Forces for Folding electrostatic forces hydrogen bonds van der Waals forces disulfide bonds solvent interactions

Hydrophobicity ( 疏水性 ) hydrophobic collapse  Tend to keep polar, charged residues on the surface.  The class of membrane-integral proteins is an exception.

38 sickle-cell anemia ( 鐮狀細胞性貧血 )  human hemoglobin: 2 alpha & 2 beta globins  charged glutamic acid residue  hydrophobic valine residues

Disulfide Bonds

40

41

Active Structures vs Most Stable Structures Natural selection favors proteins that are both active and robust.

43 Levinthal Paradox in residues, each assume 3 different conformations  ~ 5x10 47 possibilities  Suppose it takes s for one trial. Proteins fold by progressive stabilization of intermediates rather than by random search.

Algorithms for Modeling Protein Folding Lattice Models Off-Lattice Models

Lattice Models Reduce the search space and make computing tractable.  Minimize free energy conformation

46 HP-model hydrophobic-polar model  Scoring is based on hydrophobic contacts.  Maximize the H-to-H contacts. Fig. 7.8

47

Off-Lattice Models Use RMSD (root mean square deviation) to measure the accuracy. Determine Φ and Ψin the allowable region of the Ramachandran plot.

Energy Functions and Optimization Problems  The exact forces that drive the folding process are not well understood.  It is too computationally expensive.

50 Summary model representation scoring function search (optimization)  (V. Pande, Stanford)

Structure Prediction very high accuracy  < 3.0 Å

Comparative Modeling Also called homology modeling Rely on the robustness of the folding code

53 1. Identify a set of protein structures related to the target protein. 2. Align the sequence of the target with the sequence of the template. 3. Construct the model. 4. Model the loop. 5. Model the side chains. 6. Evaluate the model.

Threading Given  a conformation and  a protein sequence, measure its favorability.

Predicting RNA Secondary Structures

56 Nearest Neighbor Energy Rules Zuker’s Mfold program

57 Why study RNA secondary structures? For understanding of  gene regulation  expression of protein products

58 參考資料及圖片出處 1. Fundamental Concepts of Bioinformatics Dan E. Krane and Michael L. Raymer, Benjamin/Cummings, Fundamental Concepts of Bioinformatics 2. Biochemistry, by J. M. Berg, J. L. Tymoczko, and L. Stryer, Fith Edition, Biochemistry