Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.

Slides:



Advertisements
Similar presentations
Ideal Parent Structure Learning School of Engineering & Computer Science The Hebrew University, Jerusalem, Israel Gal Elidan with Iftach Nachman and Nir.
Advertisements

Modelling and Identification of dynamical gene interactions Ronald Westra, Ralf Peeters Systems Theory Group Department of Mathematics Maastricht University.
An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.
Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Simulation of Prokaryotic Genetic Circuits Jonny Wells and Jimmy Bai.
. Context-Specific Bayesian Clustering for Gene Expression Data Yoseph Barash Nir Friedman School of Computer Science & Engineering Hebrew University.
Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
From Sequence to Expression: A Probabilistic Framework Eran Segal (Stanford) Joint work with: Yoseph Barash (Hebrew U.) Itamar Simon (Whitehead Inst.)
Learning Module Networks Eran Segal Stanford University Joint work with: Dana Pe’er (Hebrew U.) Daphne Koller (Stanford) Aviv Regev (Harvard) Nir Friedman.
Open Day 2006 From Expression, Through Annotation, to Function Ohad Manor & Tali Goren.
Work Process Using Enrich Load biological data Check enrichment of crossed data sets Extract statistically significant results Multiple hypothesis correction.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Functional annotation and network reconstruction through cross-platform integration of microarray data X. J. Zhou et al
Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Speaker: Zhu YANG 6 th step, 2006.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Finding Transcription Modules from large gene-expression data sets Ned Wingreen – Molecular Biology Morten Kloster, Chao Tang – NEC Laboratories America.
6. Gene Regulatory Networks
Ab initio motif finding
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Bryan Heck Tong Ihn Lee et al Transcriptional Regulatory Networks in Saccharomyces cerevisiae.
Assigning Numbers to the Arrows Parameterizing a Gene Regulation Network by using Accurate Expression Kinetics.
Epistasis Analysis Using Microarrays Chris Workman.
Structure Learning for Inferring a Biological Pathway Charles Vaske Stuart Lab.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Inferring Cellular Networks Using Probabilistic Graphical Models Jianlin Cheng, PhD University of Missouri 2009.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
What Is a Gene Network?. Gene Regulatory Systems “Programs built into the DNA of every animal.” Eric H. Davidson.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Using Bayesian Networks to Analyze Expression Data By Friedman Nir, Linial Michal, Nachman Iftach, Pe'er Dana (2000) Presented by Nikolaos Aravanis Lysimachos.
Metabolic pathway alteration, regulation and control (5) -- Simulation of metabolic network Xi Wang 02/07/2013 Spring 2013 BsysE 595 Biosystems Engineering.
Modeling and identification of biological networks Esa Pitkänen Seminar on Computational Systems Biology Department of Computer Science University.
Uri Alon’s lab 10/02. Network of transcriptional interactions in E. coli Thieffry, Collado-Vides, 1998 Shen-Orr, Alon, Nature Genetics 2002.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Reconstruction of Transcriptional Regulatory Networks
Summary Various mathematical models describing gene regulatory networks as well as algorithms for network reconstruction from experimental data have been.
Combinatorial State Equations and Gene Regulation Jay Raol and Steven J. Cox Computational and Applied Mathematics Rice University.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Analysis of the yeast transcriptional regulatory network.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Steady-state Analysis of Gene Regulatory Networks via G-networks Intelligent Systems & Networks Group Dept. Electrical and Electronic Engineering Haseong.
Introduction to biological molecular networks
A Yeast Synthetic Network for In Vivo Assessment of Reverse-Engineering and Modeling Approaches Cantone, I., Marucci, L., Iorio, F., Ricci, M., Belcastro,
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Gaussian Process Networks Nir Friedman and Iftach Nachman UAI-2K.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Constrained graph structure learning by integrating diverse data types
EQTLs.
Learning gene regulatory networks in Arabidopsis thaliana
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
Ingenuity Knowledge Base
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
1 Department of Engineering, 2 Department of Mathematics,
Schedule for the Afternoon
From Steady State to Kinetics
CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.
Dynamic regulatory map and static network for yeast response to AA starvation. Dynamic regulatory map and static network for yeast response to AA starvation.
Presentation transcript:

Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University

Goal: Reconstruct Cellular Networks Biocarta. u Structure u Function u Dynamics Conditions Genes Common approach: Interaction Networks Different semantics for networks u Boolean, probabilistic, differential equations, …

A Major Assumption… mRNA tr. rate protein active protein mRNA mRNA degradation TF G G G Activation signal Hidden mRNA Observed

Realistic Regulation Modeling u Model the closest connection u Active protein levels are not measured u Transcript rates are computed from expression data and mRNA decay rates u Realistic biochemical model of transcription rates TF G G Hidden Observed proteinmRNA mRNA tr. rate active protein Activation signal mRNA degradation Hidden Observed

OnOff Modeling Transcription Rate Simplest case: one activator G TF mRNA transcripts G TF On [McAdams & Arkin, 1997; Ronen et al, 2002] P( ) Avg rate = + P( )

Modeling Transcription Rate Steady state equations: G TF Concentration of free promoters Concentration of bound promoters Concentration of TF dd bb

Modeling Transcription Rate G TF dd bb  = 1  = 4  = 20  = 250 TF activity Transcription rate TF activity Time  = 1  = 4  = 20  = 250 Trans rate Time

[Buchler et al, 2003; Setty et al, 2002] General Two Regulator Function TF 2 TF 1 G P(State) a b c d 11 X 22 G TF Similar models for other modes of binding: u Competitive binding u Cooperative binding

P(State) General Two Regulator Function TF 2 TF 1 G  b = 0  a = 0  c = 0  d =1  b = 1  c = 1 bb aa cc dd X X X X = Average Rate Rate “AND” gate “OR” gate a b c d [Buchler et al, 2003; Setty et al, 2002] Avg rate = function of TF concentrations Few parameters:  Affinity parameters  Rate parameters

Models of Regulatory Networks Regulators (activity) Target Genes (trans. rate) G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 TF 3 G5G5 G6G6 G7G7 Noise Observed rates ? Predicted rates TF activity Time Trans rate Time

Learning Learning From Data Transcription rates Expression data mRNA decay rates Kinetic parameters G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 TF 2 + Gradient ascent

TF 1 TF 2 G4G4 TF 1 G3G3 G2G2 G1G1 Learning Cell Cycle Experiment Transcription rates Expression data mRNA decay rates Kinetic parameters + Biological Databases [YPD] ChIP location [Lee et. al] 7 regulators & 141 target genes Cell cycle gene expression [Spellman et. al] + mRNA decay rates [Wang et al] Transcription rates

M/G1 G1 S S/G2 G2/M predictionsinput parameters Cell Cycle Experiment 17x141 = 2397 Data points 466 parameters 17x7 = 119 Regulator activity values

G1G2G1G2 FKH1 FKH2 G1G2G1G2 SWI5 ACE2 Regulator Activity Profiles u When are they active? Known biology: u SWI4 & MBP1: mid-late G1 u FHK1: S/G2 u FKH2: G2/M u SWI5: M/G1 G1G2G1G2 MBP1 SWI4 Reconstructed activity profiles match direct experimental knowledge

Regulator Activity Profiles u When are they active? u Could we reconstruct these from mRNA profiles? Known biology: u SWI5 is transcriptionally regulated u MCM1 is not Regulator’s own mRNA is not sufficient to reconstruct activity levels mRNA profile SWI5 Activity mRNA MCM1 Activity mRNA

Regulator Activity Profiles u When are they active? u Could we reconstruct these from mRNA profiles? u Could we reconstruct these from target’s transcription rate? Avg target rate

M/G1 G1 S S/G2 G2/M input predictions Cell Cycle Experiment How well are we doing? residue

Model Learning ab initio Learning Transcription rates Learning Expression data mRNA decay rates Kinetic parameters G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 TF 2 + Big assumption: u Network topology is given u Unrealistic, even for well understood systems + Challenge: Reconstruct network topology?  Number of regulators  Their joint effect on target genes

How Do We Learn Structure? Standard approach: hill climbing search G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 G4G4 TF 2 TF 1 G3G3 G2G2 G1G G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 TF 3 Problem: Scoring structures is costly  Requires non-linear parameter optimization  Impractical on real data

Pred(G|TF,Y) Ideal regulator Time Pred(G|TF) TF G Y Step 1: Compute optimal hypothetical regulator Time regulators Step 2: Search for “similar” regulator TF 1 TF 2 TF 3 TF 4 Activity level Target Profile Ideal Regulator Method Goal: Consider adding edges Idea: Score only promising candidates

Parent(s) activity Predicted(G|TF,TF 2 ) Time regulators TF 1 TF 2 TF 3 TF 4 Step 3: Add new parent and optimize parameters Time Step 1: Compute optimal hypothetical regulator Step 2: Search for “similar” regulator Pred(G|TF,Y) Ideal regulator Y Target Profile TF G TF 2 Crucial point: Choice of similarity measure u Principled approach see [Nachman et al UAI04] u Provides approximation to Δlikelihood Ideal Regulator Method Goal: Consider adding edges Idea: Score only promising candidates

New regulator: “centroid” of selected ideal regulators Adding New Regulator Ideal regulators Idea: Introduce hidden regulator for genes with similar ideal regulator TF new G1G1 G2G2 G4G4 G1G1 G2G2 G3G3 G4G4 G5G5 Y1Y1 Y2Y2 Y3Y3 Y4Y4 Y5Y5 Time

M/G1 G1 S S/G2 G2/M Input rates Curated prior knowledge 466 params ab initio from scratch 461 params Ab initio Structure Learning

Input rates Curated prior knowledge 466 params ab initio from scratch 461 params M/G1 G1 S S/G2 G2/M Ab initio Structure Learning

H2 SWI5 H4 SWI4 Significant target overlap & correlated activity Significant target overlap & weak correlation H1 MBP1 H3 FKH2 curated ab initio target genes regulators Regulators: ab initio vs. curated H1 H2H4H3H5H6 H7 SWI4MBP1ACE2FKH1SWI5MCM1FKH2

curated ab initio target genes regulators u Significant agreement with “known” topology  Both in structure & dynamics u Improved predictions Regulators: ab initio vs. curated SWI4MBP1ACE2FKH1SWI5MCM1FKH2 H1 H2H4H3H5H6 H7

Model Learning Conclusions Kinetic parameters G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 TF Transcription rates Network (prior knowledge) G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 u Realistic model, based on first principles u Learning procedure  Reconstruct unobserved activity profiles  Reconstruct network topology u Insights into  Structure & Dynamics  Function

Future Directions u Prior knowledge u ChIP location u Cis-regulatory elements External perturbations Internal feedback G4G4 TF 2 TF 1 G3G3 G2G2 G1G1 TF 3 G5G5 G6G6 G7G7