Presentation is loading. Please wait.

Presentation is loading. Please wait.

Thomas Huber Computational Biology and Bioinformatics Environment ComBinE Department of Mathematics The University of Queensland.

Similar presentations


Presentation on theme: "Thomas Huber Computational Biology and Bioinformatics Environment ComBinE Department of Mathematics The University of Queensland."— Presentation transcript:

1 Thomas Huber huber@maths.uq.edu.au Computational Biology and Bioinformatics Environment ComBinE Department of Mathematics The University of Queensland huber@maths.uq.edu.au Protein Scoring Functions: Essential Tools or Fancy Fad?

2 Why do we (still) care about Protein Structures/Prediction? Academic curiosity? –Understanding how nature works Urgency of prediction –  10 4 structures are determined insignificant compared to all proteins –sequencing = fast & cheap –structure determination = hard & expensive Transistors in Intel processors TrEMBL sequences (computer annotated) SwissProt sequences (annotated) structures in PDB

3 What would we like to be able to predict? What is a protein’s structure? –Does a sequence adopt a known fold? Fold recognition –Does a sequence adopt a new fold? New fold prediction (dream of structural genomics) How stable is a protein –Thermodynamic stability What is a protein’s function? –Functional annotation

4 Three basic choices in molecular modelling Representation –Which degrees of freedom are treated explicitly Scoring –Which scoring function (force field) Searching –Which method to search or sample conformational space Two Linages of Protein Structure Prediction The physicist’s approach –Thermodynamics: Structures with low energy are more likely The biologist’s approach –Similar sequences  similar structures

5 Fragment Scoring Proteins are decomposed into overlapping fragments of 7 residues Each fragment is described by Amino acid specific local structure Non-specific environment Fragments are clustered and a statistical model for each cluster is built Total score =  fragment scores

6 Finding Remote Homologues with sausage 572 sequence-structure pairs Structures are similar (FSSP) > 70% structurally aligned < 20% sequence identity

7 RNA-dependent RNA Polymerases

8 A Real Case Example RNA-dependent RNA polymerases Dengue virus Bacteriophage  6

9 Testing/Breaking the Scoring Designed  -sheet (Serrano) –12 residues –Forms stable  -sheet at room temperature

10 Another Uniquely Folded Mini-Protein Villin head-piece (36 residues) –High thermodynamic stability (T m >70º) –Folds autonmously

11 A Uniquely Folded Mini- Protein Zinc finger analoge (Mayo) –28 residues –thermodynamic stable (T m  25º)

12 Trimer Stability Nitrogen regulation proteins –2 protein (PII (GlnB) and GlnK) –112 residues –sequence: 67% identities, 82% positives –structure: 0.7Å RMSD –trimeric –Dr S. Vasudevan: hetero-trimers

13 Hetero-trimer Stability What is the most/least stable trimer Why use a low resolution force field? –Structures differ (0.7Å RMSD) –Side chains are hard to optimise Calculation: –GlnB 3 > GlnB 2 -GlnK > GlnB-GlnK 2 > GlnK 3 Experiment: –GlnB 3 > GlnB 2 -GlnK > GlnB-GlnK 2 > GlnK 3 GlnK GlnB

14 People sausage –Andrew Torda (RSC) –Oliver Martin (RSC) GlnB/GlnK, RdR polymerases –Subhash Vasudevan (JCU) Sausage and Cassandra freely available http://rsc.anu.edu.au/~torda huber@maths.uq.edu.au Increasing urgency for in-silico proteomics Good force fields = essential for success –Different tasks (may) require different scoring schemes Summary


Download ppt "Thomas Huber Computational Biology and Bioinformatics Environment ComBinE Department of Mathematics The University of Queensland."

Similar presentations


Ads by Google