Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.

Slides:



Advertisements
Similar presentations
Bayesian inference Lee Harrison York Neuroimaging Centre 01 / 05 / 2009.
Advertisements

Yinyin Yuan and Chang-Tsun Li Computer Science Department
Modelling and Identification of dynamical gene interactions Ronald Westra, Ralf Peeters Systems Theory Group Department of Mathematics Maastricht University.
Microarray Data Analysis (Lecture for CS397-CXZ Algorithms in Bioinformatics) March 19, 2004 ChengXiang Zhai Department of Computer Science University.
Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.
Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Computer vision: models, learning and inference Chapter 18 Models for style and identity.
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Model-based clustering of gene expression data Ka Yee Yeung 1,Chris Fraley 2, Alejandro Murua 3, Adrian E. Raftery 2, and Walter L. Ruzzo 1 1 Department.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
27/06/2005ISMB 2005 GenXHC: A Probabilistic Generative Model for Cross- hybridization Compensation in High-density Genome-wide Microarray Data Joint work.
From Sequence to Expression: A Probabilistic Framework Eran Segal (Stanford) Joint work with: Yoseph Barash (Hebrew U.) Itamar Simon (Whitehead Inst.)
04/02/2006RECOMB 2006 Detecting MicroRNA Targets by Linking Sequence, MicroRNA and Gene Expression Data Joint work with Quaid Morris (2) and Brendan Frey.
1 Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University joint work with Tzu-Kuo Huang, Le.
Bayesian Robust Principal Component Analysis Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University January 21, 2011 Reading Group (Xinghao.
Visual Recognition Tutorial
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Yeast Dataset Analysis Hongli Li Final Project Computer Science Department UMASS Lowell.
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
6. Gene Regulatory Networks
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Bryan Heck Tong Ihn Lee et al Transcriptional Regulatory Networks in Saccharomyces cerevisiae.
Combining the strengths of UMIST and The Victoria University of Manchester Propagating Measurement Uncertainty in Microarray Data Analysis Magnus Rattray.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
TF Infer A Tool for Probabilistic Inference of Transcription Factor Activities H.M. Shahzad Asif Institute of Adaptive and Neural Computation School of.
TF Infer A Tool for Probabilistic Inference of Transcription Factor Activities H.M. Shahzad Asif Machine Learning Group Department of Computer Science.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Learning regulatory networks from postgenomic data and prior knowledge Dirk Husmeier 1) Biomathematics & Statistics Scotland 2) Centre for Systems Biology.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Summary Various mathematical models describing gene regulatory networks as well as algorithms for network reconstruction from experimental data have been.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Analysis of the yeast transcriptional regulatory network.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Inferring gene regulatory networks with non-stationary dynamic Bayesian networks Dirk Husmeier Frank Dondelinger Sophie Lebre Biomathematics & Statistics.
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis Kei Hashimoto, Yoshihiko Nankaku, and Keiichi.
Introduction to biological molecular networks
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
Bayesian Speech Synthesis Framework Integrating Training and Synthesis Processes Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda Nagoya Institute.
Definitions Transcriptional Modules (TM) are groups of co-regulated genes and transcription factors regulating their expression –Basic building blocks.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Gaussian Process Networks Nir Friedman and Iftach Nachman UAI-2K.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Outline Historical note about Bayes’ rule Bayesian updating for probability density functions –Salary offer estimate Coin trials example Reading material:
A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation Yee W. Teh, David Newman and Max Welling Published on NIPS 2006 Discussion.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Incorporating graph priors in Bayesian networks
Learning gene regulatory networks in Arabidopsis thaliana
Probabilistic Robotics
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
Christopher A. Penfold Vicky Buchanan-Wollaston Katherine J. Denby And
1 Department of Engineering, 2 Department of Mathematics,
Predictive distributions
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Analyzing Time Series Gene Expression Data
FUNCTIONAL ANNOTATION OF REGULATORY PATHWAYS
Single-Cell Analysis of Growth in Budding Yeast and Bacteria Reveals a Common Size Regulation Strategy  Ilya Soifer, Lydia Robert, Ariel Amir  Current.
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Topic Models in Text Processing
Inferring Cellular Processes from Coexpressing Genes
Presentation transcript:

Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray

Talk plan Biological problem Genome-wide inference of regulatory activity Reducing the network

Biological networks Living cells contain thousands of genes These interact in complex ways forming dynamical networks (pathways) We would like to model different processes as (approximately) independent pathways Can we infer pathways from gene-expression data?

Transcriptional networks Some attempts at inferring regulatory networks (as opposed to pathways) from expression Hierarchical clustering (Eisen et al ‘95, Spellman et al ‘98) Dynamic Bayesian Networks (Friedman et al ’00) Need very large amounts of data to achieve good results (Husmeier 2003)

Adding location data Recent experimental techniques (ChIP) allow to measure binding of transcription factors (TFs) to target genes in vivo This can be viewed as a direct measurement of the wiring of the regulatory network The data is very noisy, and many binding events do not result in regulation

Regulatory network of S. Cerevisiae Data from Lee et al, Science (2002)

Inferring regulatory activity We wish to integrate location data and expression data This will allow us to infer which regulatory relations are confirmed by the expression data We use probabilistic models as they can handle more naturally the noise inherent in the system

Genome-wide inference Assume we have gene-expression measurements for N genes and we know the wiring of the regulatory network TF1 g1 g2gN gk TFd S 11 S 21 S k1 S 2d S Nd Placing suitable prior assumptions on the TF concentrations and the regulatory strengths, we can obtain posterior estimates from the data

Probabilistic model We use a (log) linear model of regulation We place a state-space-model (SSM) prior distribution on the TF concentrations We place a normal prior on the regulatory strengths G.Sanguinetti, N.D. Lawrence and M. Rattray, Bioinformatics, in press.

Estimating the parameters Exact inference in this model is intractable Approximate inference is performed using a variational EM algorithm This exploits Jensen’s inequality to get a bound on the log likelihood Under a factorization assumption on the approximating distribution q, the E-step becomes exactly solvable via fixed point equations.

Results The left hand picture shows the expression level of ACE2 in the yeast cell cycle, the middle shows the inferred protein concentration and right shows the significance of the activities. Location data from Lee et al (2002), microarray data from Spellman et al (1998)

Finding submodules Knowledge of the posterior distribution of the regulatory intensities allows to determine which regulations are significant We can use this information to identify approximately independent submodules of the regulatory network These can be used a starting point for fine- grained modelling of small gene networks.

Flood-fill One way to reduce the density of the regulatory network is to threshold based on regulatory strength or significance level One can then recover submodules by exhaustive search (similar to the flood- fill algorithm in physics) + Conceptually easy, exhaustive - Arbitrariness in setting threshold

Spectral clustering Based on the observation that presence of clusters implies approximately block diagonal structure for affinity matrix Cluster identification is reduced to an eigenvalue problem

Spectral submodules Use linear affinity matrix Spectral decomposition yields very coherent submodules Not exhaustive

Future directions Extend the modelling of transcriptional regulation to incorporate nonlinear and combinatorial effects Model submodules in a detailed way (cf. Lawrence, Sanguinetti and Rattray, NIPS06) Identify submodules in a semi-supervised fashion