MODELLING PROTEOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?

Slides:



Advertisements
Similar presentations
Protein Structure Prediction using ROSETTA
Advertisements

Structural bioinformatics
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
Protein Interactions and Disease Audry Kang 7/15/2013.
Proteomics Understanding Proteins in the Postgenomic Era.
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Elements of Molecular Biology All living things are made of cells All living things are made of cells Prokaryote, Eukaryote Prokaryote, Eukaryote.
Protein Tertiary Structure Prediction
NOVEL PARADIGMS FOR DRUG DISCOVERY SHOTGUN COMPUTATIONAL MULTITARGET SCREENING RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON PIONEER AWARD.
Modelling, comparison, and analysis of proteomes Ram Samudrala University of Washington.
Structural Bioinformatics R. Sowdhamini National Centre for Biological Sciences Tata Institute of Fundamental Research Bangalore, INDIA.
Does gene order matter? Cis-regulatory elements, proteins, and messengers are integrated into biological circuits. Does gene location in the genome affect.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
Consensus RAPDF rTAD Refinement Successes & Failures Jeremy Horst Ram Samudrala’s CompBio Group University of Washington.
Lecture 10 – protein structure prediction. A protein sequence.
Modelling proteomes An integrated computational framework for systems biology research Ram Samudrala University of Washington How does the genome of an.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
-A cell is an organization of millions of molecules -Proper communication between these molecules is essential to the normal functioning of the cell -To.
COMPUTATIONAL VACCINE DESIGN RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How can we design vaccines based on conformational epitopes and.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Samudrala group - overall research areas CASP6 prediction for T Å C α RMSD for all 70 residues CASP6 prediction for T Å C α RMSD for all.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
An Integrated Computational Framework for Systems Biology Ram Samudrala University of Washington How does the genome of an organism specify its behaviour.
SHOTGUN STRUCTURAL PROTEOMICS RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON Given a heterogeneous mixture of proteins, how can we determine.
INTERACTOMICS RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON NIH DIRECTOR’S PIONEER AWARD 2010 How does the genome of an organism specify its.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
NOVEL PARADIGMS FOR DRUG DISCOVERY SHOTGUN COMPUTATIONAL MULTITARGET SCREENING RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON NIH DIRECTOR’S.
Computational engineering of bionanostructures Ram Samudrala University of Washington How can we analyse, design, & engineer peptides capable of specific.
Central dogma: the story of life RNA DNA Protein.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Modelling proteins and proteomes using Linux clusters Ram Samudrala University of Washington.
THERAPUETIC DISCOVERY BY MODELLING INTERACTOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its.
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
Bioinformatics and Computational Biology
COMPUTATIONAL ENGINEERING OF BIONANOSTRUCTURES
MODELLING INTERACTOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?
Modelling proteomes Ram Samudrala Department of Microbiology How does the genome of an organism specify its behaviour and characteristics?
COMPUTATIONAL BIOLOGY IN DRUG DISCOVERY RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How can we computationally screen compounds against.
NOVEL PARADIGMS FOR DRUG DISCOVERY
Modelling proteomes Ram Samudrala University of Washington How does the genome of an organism specify its behaviour and characteristics?
MODELLING INTERACTOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?
Modelling proteins and proteomes using Linux clusters Ram Samudrala University of Washington.
SHOTGUN STRUCTURAL PROTEOMICS RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON Given a heterogeneous mixture of proteins, how can we determine.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Discovery of Therapeutics to Improve Quality of Life Ram Samudrala University of Washington.
Modelling proteomes Ram Samudrala University of Washington.
Modelling proteomes: Application to understanding HIV disease progression Ram Samudrala Department of Microbiology University of Washington How does the.
COMPUTATIONAL ENGINEERING OF BIONANOSTRUCTURES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How can we design peptides and proteins capable.
Structure/function studies of HIV proteins HIV gp120 V3 loop modelling using de novo approaches HIV protease-inhibitor binding energy prediction.
Modelling genome structure and function Ram Samudrala University of Washington.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Modelling proteomes Ram Samudrala University of Washington How does the genome of an organism specify its behaviour and characteristics?
MODELLING PROTEOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Bioinformatics Overview
How does the genome of an organism
University of Washington
MODELLING INTERACTOMES
Modelling the rice proteome
MODELLING INTERACTOMES
University of Washington
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
How does the genome of an organism
University of Washington
NOVEL PARADIGMS FOR DRUG DISCOVERY
Presentation transcript:

MODELLING PROTEOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?

PROTEOME ~60,000 in human ~60,000 in rice ~4500 in bacteria Several thousand distinct sequence families

STRUCTURE A few thousand distinct structural folds

FUNCTION Tens of thousands of functions

EXPRESSION Different expression patterns based on time and location

INTERACTION Interaction and expression are interdependent with structure and function

PROTEIN FOLDING …-CTA-AAA-GAA-GGT-GTT-AGC-AAG-GTT-… Gene …-L-K-E-G-V-S-K-D-… One amino acid Protein sequence Unfolded protein Native biologically relevant state Spontaneous self-organisation (~1 second) Not unique Mobile Inactive Expanded Irregular

PROTEIN FOLDING …-L-K-E-G-V-S-K-D-… …-CTA-AAA-GAA-GGT-GTT-AGC-AAG-GTT-… One amino acid Gene Protein sequence Unfolded protein Native biologically relevant state Spontaneous self-organisation (~1 second) Unique shape Precisely ordered Stable/functional Globular/compact Helices and sheets Not unique Mobile Inactive Expanded Irregular

STRUCTURE 0246 ACCURACY Experiment (X-ray, NMR) Computation (de novo) Computation (template-based) Hybrid (Iterative Bayesian interpretation of noisy NMR data with structure simulations) One distance constraint for every six residues One distance constraint for every ten residues C α RMSD

DE NOVO MODELLING Astronomically large number of conformations 5 states/100 residues = = SELECT Hard to design functions that are not fooled by non-native conformations (“decoys”) Sample conformational space such that native-like conformations are found

DE NOVO MODELLING GENERATE Make random moves to optimise what is observed in known structures …… Find the most protein-like structures MINIMISE …… FILTER Filter based on all-atom pairwise interactions, bad contacts compactness, secondary structure, consensus of generated conformations

TEMPLATE-BASED MODELLING KDHPFGFAVPTKNPDGTMNLMNWECAIP KDPPAGIGAPQDN----QNIMLWNAVIP ** * * * * * * * ** …… SCAN ALIGN Refine using constraints derived from multiple templates Build initial models from multiple templates using minimum perturbation Construct nonconserved side and main chains using graph theory and semfold

STRUCTURE 0.5 Å C α RMSD for 173 residues (60% identity) T0290 – peptidyl-prolyl isomerase from H. sapiens T0364 – hypothetical from P. putida 5.3 Å C α RMSD for 153 residues (11% identity) T0332 – methyltransferase from H. sapiens 2.0 Å C α RMSD for 159 residues (23% identity) T0288 – PRKCA-binding from H. sapiens 2.2 Å C α RMSD for 93 residues (25% identity) Liu/Hong-Hung/Ngan

HYBRID MODELLING Hong-Hung

FUNCTION Wang/Cheng Ion binding energy prediction with a correlation of 0.7 Calcium ions predicted to < 0.05 Å RMSD in 130 cases Meta-functional signature for DXS model from M. tuberculosis Meta-functional signature accuracy

INTERACTION McDermott/Wichadakul/Staley/Horst/Manocheewa/Jenwitheesuk/Bernard BtubA/BtubB interolog model from P. dejongeii (35% identity to eukaryotic tubulins) Transcription factor bound to DNA promoter regulog model from S. cerevisiae Prediction of binding energies of HIV protease mutants and inhibitors using docking with dynamics

SYSTEMS McDermott/Wichadakul Example predicted protein interaction network from M. tuberculosis (107 proteins with 762 unique interactions) In sum, we can predict functions for more than 50% of a proteome, approximately ten million protein-protein and protein-DNA interactions with an expected accuracy of 50%. Utility in identifying function, essential proteins, and host pathogen interactions Proteins PPIs TRIs H. sapiens 26,741 17, ,807 1,045,622 S. cerevisiae 5,801 5, ,505 2,456 O.sativa (6) 125,568 19, , ,990 E. coli 4, ,980 54,619

SYSTEMS McDermott/Wichadakul Combining protein-protein and protein-DNA interaction networks to determine regulatory circuits

INFRASTRUCTURE Guerquin/Frazier ~500,000 molecules over 50+proteomes served using a 1.2 TB PostgreSQL database and a sophisticated AJAX webapplication and XML-RPC API

INFRASTRUCTURE Guerquin/Frazier

INFRASTRUCTURE Chang/Rashid

APPLICATION: DRUG DISCOVERY HSV KHSVCMV Jenwitheesuk

APPLICATION: DRUG DISCOVERY HSV KHSV CMV Computionally predicted broad spectrum human herpesvirus protease inhibitors is effective in vitro against members from all three classes and is comparable or better than anti-herpes drugs HSV Our protease inhibitor acts synergistically with acylovir (a nucleoside analogue that inhibits replication) and it is less likely to lead to resistant strains compared to acylovir Lagunoff

APPLICATION: NANOTECHNOLOGY Oren/Sarikaya/Tamerler

FUTURE Structural genomics Functional genomics + Computational biology + MODELLING PROTEIN AND PROTEOME STRUCTURE FUNCTION AT THE ATOMIC LEVEL IS NECESSARY TO UNDERSTAND THE RELATIONSHIPS BETWEEN SINGLE MOLECULES, SYSTEMS, PATHWAYS, CELLS, AND ORGANISMS

ACKNOWLEDGEMENTS Baishali Chanda Brady Bernard Chuck Mader David Nickle Ersin Emre Oren Ekachai Jenwitheesuk Gong Cheng Imran Rashid Jeremy Horst Ling-Hong Hung Michal Guerquin Rob Brasier Rosalia Tungaraza Shing-Chung Ngan Siriphan Manocheewa Somsak Phattarasukol Stewart Moughon Tianyun Liu Vania Wang Weerayuth Kittichotirat Zach Frazier Kristina Montgomery, Program Manager Current group members: Aaron Chang Duncan Milburn Jason McDermott Kai Wang Marissa LaMadrid Past group members: Funding agencies: National Institutes of Health National Science Foundation Searle Scholars Program Puget Sound Partners in Global Health UW Advanced Technology Initiative Washington Research Foundation UW TGIF James Staley Mehmet Sarikaya/Candan Tamerler Michael Lagunoff Roger Bumgarner Wesley Van Voorhis Collaborators:

MOTIVATION FOR DETERMINING PROTEIN STRUCTURE The functions necessary for life are undertaken by proteins. Protein function is mediated by protein three-dimensional structure. Knowing protein structure at high resolution will enable us to: Determine and understand molecular function. Understand substrate and ligand binding. Devise intelligent mutagenesis and biochemical experiments to understand biological function. Design therapeutics rationally. Design novel proteins. Knowing the structures of all proteins encoded by an organism’s genome will enable us to understand complex pathways and systems, and ultimately organismal behaviour and evolution. Applications in the area of medicine, nanotechnology, and biological computing.

s(d ab ) for contacts AO AN AC... YOH AO AN AC … YOH 167 X167 contacts distance bins known structures atom-atom contacts AO AN AC... YOH AO AN AC … YOH candidate structure atom-atom contacts AO AN AC... YOH AO AN AC … YOH NxN contacts ALL-ATOM SCORING FUNCTION

CRITICAL ASSESSMENT OF STRUCTURE PREDICTION Bias towards known structures Pre-CASPCASP Blind prediction

Oxidoreductase TransferaseHydrolaseLigaseLyase STRUCTURE TO FUNCTION? TIM barrel proteins experimental structures

INTEROLOG MODELLING Interacting protein database Protein a Protein b Experimentally determined interaction Target proteome Protein A Predicted interaction Protein B 85% 90% Assign confidence based on similarity and strength of interaction Paradigm is the use of homology to transfer information across organisms; not limited to yeast, fly, and worm Consensus of interactions helps with confidence assignments

E. coli INTERACTIONS McDermott

M. tuberculosis INTERACTIONS McDermott

C. elegans INTERACTIONS McDermott

H. sapiens INTERACTIONS McDermott

Network-based annotation for C. elegans McDermott

Articulation points KEY PROTEINS IN ANTHRAX

HOST PATHOGEN INTERACTIONS McDermott