Recovering Temporally Rewiring Networks: A Model-based Approach

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

CS188: Computational Models of Human Behavior
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
1 Some Comments on Sebastiani et al Nature Genetics 37(4)2005.
Le Song Joint work with Mladen Kolar and Eric Xing KELLER: Estimating Time Evolving Interactions Between Genes.
Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Modeling Malware Spreading Dynamics Michele Garetto (Politecnico di Torino – Italy) Weibo Gong (University of Massachusetts – Amherst – MA) Don Towsley.
Undirected Probabilistic Graphical Models (Markov Nets) (Slides from Sam Roweis)
An Introduction to Variational Methods for Graphical Models.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
1 Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University joint work with Tzu-Kuo Huang, Le.
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Yanxin Shi 1, Fan Guo 1, Wei Wu 2, Eric P. Xing 1 GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH Data RECOMB 2007 Presentation.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases Fan Guo, Lei Li, Eric Xing, Christos Faloutsos Carnegie Mellon University {fanguo, leili,
6. Gene Regulatory Networks
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
1 Uniform Sampling from the Web via Random Walks Ziv Bar-Yossef Alexander Berg Steve Chien Jittat Fakcharoenphol Dror Weitz University of California at.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Ahsanul Haque *, Swarup Chandra *, Latifur Khan * and Michael Baron + * Department of Computer Science, University of Texas at Dallas + Department of Mathematical.
Learning regulatory networks from postgenomic data and prior knowledge Dirk Husmeier 1) Biomathematics & Statistics Scotland 2) Centre for Systems Biology.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Markov Random Fields Probabilistic Models for Images
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Inferring gene regulatory networks with non-stationary dynamic Bayesian networks Dirk Husmeier Frank Dondelinger Sophie Lebre Biomathematics & Statistics.
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
Lecture 2: Statistical learning primer for biologists
Expectation-Maximization (EM) Algorithm & Monte Carlo Sampling for Inference and Approximation.
1 Param. Learning (MLE) Structure Learning The Good Graphical Models – Carlos Guestrin Carnegie Mellon University October 1 st, 2008 Readings: K&F:
Dynamic Networks: How Networks Change with Time? Vahid Mirjalili CSE 891.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
Introduction to ERGM/p* model Kayo Fujimoto, Ph.D. Based on presentation slides by Nosh Contractor and Mengxiao Zhu.
Graphical Models for Segmenting and Labeling Sequence Data Manoj Kumar Chinnakotla NLP-AI Seminar.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Incorporating graph priors in Bayesian networks
Multi-task learning approaches to modeling context-specific networks
Dynamics and context-specificity in biological networks
Dynamical Statistical Shape Priors for Level Set Based Tracking
Bucket Renormalization for Approximate Inference
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18
1 Department of Engineering, 2 Department of Mathematics,
Haim Kaplan and Uri Zwick
Synthesis of MCMC and Belief Propagation
Learning Markov Networks
1 Department of Engineering, 2 Department of Mathematics,
Estimating Networks With Jumps
1 Department of Engineering, 2 Department of Mathematics,
Instructors: Fei Fang (This Lecture) and Dave Touretzky
An Introduction to Variational Methods for Graphical Models
Dynamics and context-specificity in biological networks
General Gibbs Distribution
Markov Random Fields Presented by: Vladan Radosavljevic.
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Expectation-Maximization & Belief Propagation
Parameter Learning 2 Structure Learning 1: The good
Readings: K&F: 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7 Markov networks, Factor graphs, and an unified view Start approximate inference If we are lucky… Graphical.
Approximate Inference by Sampling
Readings: K&F: 11.3, 11.5 Yedidia et al. paper from the class website
Markov Networks.
Jana Diesner, PhD Associate Professor, UIUC
Sequential Learning with Dependency Nets
CSC 578 Neural Networks and Deep Learning
Presentation transcript:

Recovering Temporally Rewiring Networks: A Model-based Approach Fan Guo, Steve Hanneke, Wenjie Fu, Eric P. Xing School of Computer Science, Carnegie Mellon University 11/10/2018 ICML 2007 Presentation

Social Networks Physicist Collaborations High School Dating The Internet 11/10/2018 All the images are from http://www-personal.umich.edu/~mejn/networks/. That page includes original citations.

Biological Networks Model for the Yeast cell cycle transcriptional regulatory network Fig. 4 from (T.I. Lee et al., Science 298, 799-804, 25 Oct 2002) Protein-Protein Interaction Network in S. cerevisiae Fig. 1 from (H. Jeong et al., Nature 411, 41-42, 3 May 2001) 11/10/2018 The small image is from http://www.raiks.de/img/dyna_title_zoom.jpg

When interactions are hidden… Infer the hidden network topology from node attribute observations. Methods: Optimizing a score function; Information-theoretic approaches; Model-based approach … Most of them pool the data together to infer a static network topology. 11/10/2018 ICML 2007 Presentation

And changing over time Network topologies and functions are not static: Social networks can grow as we know more friends Biological networks rewire under different conditions Fig. 1b from Genomic analysis of regulatory network dynamics reveals large topological changes N. M. Luscombe, et al. Nature 431, 308-312, 16 September 2004 11/10/2018 ICML 2007 Presentation

Overview Network topologies and functions are not always static. We propose probabilistic models and algorithms for recovering latent network topologies that are changing over time from node attribute observations. 11/10/2018 ICML 2007 Presentation

Rewiring Networks of Genes Networks rewire over discrete timesteps 11/10/2018 Part of the image is modified from Fig. 3b (E. Segal et al., Nature Genetics 34, 166-176, June 2003).

The Graphical Model Transition Model Emission Model 11/10/2018 ICML 2007 Presentation

Technical Challenges Latent network structures are of higher dimensions than observed node attributes How to place constraints on the latent space? Limited evidence per timestep How to share the information across time? 11/10/2018 ICML 2007 Presentation

Energy Based Conditional Probablities Energy-based conditional probability model (recall Markov random fields…) Energy-based model is easier to analysis, but even the design of approximate inference algorithm can be hard. 11/10/2018 11/10/2018 ICML 2007 Presentation ICML 2007 Presentation 10

Transition Model Based on our previous work on discrete temporal network models in the ICML’06 SNA-Workshop. Model network rewiring as a Markov process. An expressive framework using energy-based local probabilities (based on ERGM): Features of choice: (Density) (Edge Stability) (Transitivity) 11/10/2018 11/10/2018 ICML 2007 Presentation ICML 2007 Presentation 11

Emission Model in General Given the network topology, how to generate the binary node attributes? Another energy-based conditional model: All features are pairwise which induces an undirected graph corresponding to the time-specific network topology; Additional information shared over time is represented by a matrix of parameters Λ; The design of feature function Φ is application-specific. 11/10/2018 11/10/2018 ICML 2007 Presentation ICML 2007 Presentation 12

Design of Features for Gene Expression The feature function If no edge between i and j, Φ equals 0; Otherwise the sign of Φ depends on Λij and the empirical correlation of xi, xj at time t. 11/10/2018 ICML 2007 Presentation

Graphical Structure Revisit Hidden rewiring networks Initial network to define the prior on A1 Time-invariant parameters dictating the direction of pairwise correlation in the example 11/10/2018 ICML 2007 Presentation

Inference A natural approach to infer the hidden networks A1:T is Gibbs sampling: To evaluate the log-odds Conditional probabilities in a Markov blanket Tractable transition model; the partition function is the product of per edge terms Computation is straightforward Given the graphical structure, run variable elimination algorithms, works well for small graphs 11/10/2018 ICML 2007 Presentation

Parameter Estimation Grid search is very helpful, although Monte Carlo EM can be implemented. Trade-off between the transition model and emission model: Larger θ : better fit of the rewiring processes; Larger η : better fit of the observations. 11/10/2018 ICML 2007 Presentation

Results from Simulation Data generated from the proposed model. Starting from a network (A0) of 10 nodes and 14 edges. The length of the time series T = 50. Compare three approaches using F1 score: avg: averaged network from “ground truth” (approx. upper bounds the performance of any static network inference algorithm) htERG: infer timestep-specific networks sERG: the static counterpart of the proposed algorithm Study the “edge-switching events” 11/10/2018 ICML 2007 Presentation

Varying Parameter Values F1 scores on different parameter settings (varying ) 11/10/2018 ICML 2007 Presentation

Varying the Amount of Data F1 scores on different number of examples 11/10/2018 ICML 2007 Presentation

Capturing Edge Switching Summary on capturing edge switching in networks Three cases studied: offset, false positive, missing (false negative) mean and rms of offset timesteps 11/10/2018 ICML 2007 Presentation

Results on Drosophila Data The proposed model was applied to infer the muscle development sub-network (Zhao et al., 2006) on Drosophila lifecycle gene expression data (Arbeitman et al., 2002). 11 genes, 66 timesteps over 4 development stages Further biological experiments are necessary for verification. Network in (Zhao et al. 2006) Embryonic Larval Pupal & Adult 11/10/2018 ICML 2007 Presentation

Summary A new class of probabilistic models to address the problem of recoving hidden, time-dependent network topologies and an example in a biological context. An example of employing energy-based model to define meaningful features and simplify parameterization. Future work Larger-scale network analysis (100+?) Developing emission models for richer context 11/10/2018 ICML 2007 Presentation

Acknowledgement Yanxin Shi CMU Wentao Zhao Texas A&M University Hetunandan Kamisetty CMU 11/10/2018 ICML 2007 Presentation

Thank You! 11/10/2018 ICML 2007 Presentation