Thomas Huber Computational Biology and Bioinformatics Environment ComBinE Department of Mathematics The University of Queensland.

Slides:



Advertisements
Similar presentations
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Advertisements

Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Protein Structure Prediction using ROSETTA
Protein Structure Prediction: On the Cusp between Futility and Necessity? Thomas Huber Supercomputer Facility Australian National University Canberra
Chemotaxis Pathway How can physics help? Davi Ortega.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
A Grid implementation of the sliding window algorithm for protein similarity searches facilitates whole proteome analysis on continuously updated databases.
Structural bioinformatics
Intro to Bioinformatics Summary. What did we learn Pairwise alignment – Local and Global Alignments When? How ? Tools : for local blast2seq, for global.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Protein Structure, Databases and Structural Alignment
Bayesian Classification of Protein Data Thomas Huber Computational Biology and Bioinformatics Environment ComBinE Department of Mathematics.
Sequence order independent structural alignment Joe Dundas, Andrew Binkowski, Bhaskar DasGupta, Jie Liang Department of Bioengineering/Bioinformatics,
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
In double vision when drunk By Thomas Huber 23 November 2001 Alexandra Headland.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
Bioinformatics (3 lectures) Why bother about proteins/prediction What is bioinformatics Protein databases Making use of database information –Predictions.
Queensland Parallel Supercomputing Foundation 1. Professor Mark Ragan (Institute for Molecular Bioscience) 2. Dr Thomas Huber (Department of Mathematics)
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Sequence Analysis - Overview Raja Mazumder Senior Protein Scientist, PIR Assistant Professor, Department of Biochemistry and Molecular Biology.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Structural alignment Protein structure Every protein is defined by a unique sequence (primary structure) that folds into a unique.
Bioinformatics.
Development of Bioinformatics and its application on Biotechnology
COMPARATIVE or HOMOLOGY MODELING
Fast Search Protein Structure Prediction Algorithm for Almost Perfect Matches1 By Jayakumar Rudhrasenan S Primary Supervisor: Prof. Heiko Schroder.
Representations of Molecular Structure: Bonds Only.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
De novo Protein Design Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera.
Computer Matchmaking in the Protein Sequence/Structure Universe Thomas Huber Supercomputer Facility Australian National University Canberra
BIOINFORMATICS IN BIOCHEMISTRY Bioinformatics– a field at the interface of molecular biology, computer science, and mathematics Bioinformatics focuses.
Department of Mechanical Engineering
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Structure prediction: Homology modeling
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
November 18, 2000ICTCM 2000 Introductory Biological Sequence Analysis Through Spreadsheets Stephen J. Merrill Sandra E. Merrill Marquette University Milwaukee,
Central dogma: the story of life RNA DNA Protein.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Bioinformatics Project BB201 Metabolism A.Nasser
MINRMS: an efficient algorithm for determining protein structure similarity using root-mean-squared-distance Andrew I. Jewett, Conrad C. Huang and Thomas.
Query sequence MTYKLILNGKTKGETTTEAVDAATAEKVFQYANDN GVDGEWTYTE Structure-Sequence alignment “Structure is better preserved than sequence” Me! Non-redundant.
Modelling proteins and proteomes using Linux clusters Ram Samudrala University of Washington.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Modelling genome structure and function Ram Samudrala University of Washington.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Computational Structure Prediction
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
Protein Structure Prediction and Protein Homology modeling
Determine protein structure from amino acid sequence

Protein dynamics Folding/unfolding dynamics
Ligand Docking to MHC Class I Molecules
Molecular Modeling By Rashmi Shrivastava Lecturer
Large Time Scale Molecular Paths Using Least Action.
Homology Modeling.
Presentation transcript:

Thomas Huber Computational Biology and Bioinformatics Environment ComBinE Department of Mathematics The University of Queensland Protein Scoring Functions: Essential Tools or Fancy Fad?

Why do we (still) care about Protein Structures/Prediction? Academic curiosity? –Understanding how nature works Urgency of prediction –  10 4 structures are determined insignificant compared to all proteins –sequencing = fast & cheap –structure determination = hard & expensive Transistors in Intel processors TrEMBL sequences (computer annotated) SwissProt sequences (annotated) structures in PDB

What would we like to be able to predict? What is a protein’s structure? –Does a sequence adopt a known fold? Fold recognition –Does a sequence adopt a new fold? New fold prediction (dream of structural genomics) How stable is a protein –Thermodynamic stability What is a protein’s function? –Functional annotation

Three basic choices in molecular modelling Representation –Which degrees of freedom are treated explicitly Scoring –Which scoring function (force field) Searching –Which method to search or sample conformational space Two Linages of Protein Structure Prediction The physicist’s approach –Thermodynamics: Structures with low energy are more likely The biologist’s approach –Similar sequences  similar structures

Fragment Scoring Proteins are decomposed into overlapping fragments of 7 residues Each fragment is described by Amino acid specific local structure Non-specific environment Fragments are clustered and a statistical model for each cluster is built Total score =  fragment scores

Finding Remote Homologues with sausage 572 sequence-structure pairs Structures are similar (FSSP) > 70% structurally aligned < 20% sequence identity

RNA-dependent RNA Polymerases

A Real Case Example RNA-dependent RNA polymerases Dengue virus Bacteriophage  6

Testing/Breaking the Scoring Designed  -sheet (Serrano) –12 residues –Forms stable  -sheet at room temperature

Another Uniquely Folded Mini-Protein Villin head-piece (36 residues) –High thermodynamic stability (T m >70º) –Folds autonmously

A Uniquely Folded Mini- Protein Zinc finger analoge (Mayo) –28 residues –thermodynamic stable (T m  25º)

Trimer Stability Nitrogen regulation proteins –2 protein (PII (GlnB) and GlnK) –112 residues –sequence: 67% identities, 82% positives –structure: 0.7Å RMSD –trimeric –Dr S. Vasudevan: hetero-trimers

Hetero-trimer Stability What is the most/least stable trimer Why use a low resolution force field? –Structures differ (0.7Å RMSD) –Side chains are hard to optimise Calculation: –GlnB 3 > GlnB 2 -GlnK > GlnB-GlnK 2 > GlnK 3 Experiment: –GlnB 3 > GlnB 2 -GlnK > GlnB-GlnK 2 > GlnK 3 GlnK GlnB

People sausage –Andrew Torda (RSC) –Oliver Martin (RSC) GlnB/GlnK, RdR polymerases –Subhash Vasudevan (JCU) Sausage and Cassandra freely available Increasing urgency for in-silico proteomics Good force fields = essential for success –Different tasks (may) require different scoring schemes Summary