Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis Dynamic Structural Equation Models for Tracking Cascades over Social Networks Acknowledgments:

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Autonomic Scaling of Cloud Computing Resources
An Interactive-Voting Based Map Matching Algorithm
Distributed Nuclear Norm Minimization for Matrix Completion
Maximizing the Spread of Influence through a Social Network
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
Modeling and Analysis of Random Walk Search Algorithms in P2P Networks Nabhendra Bisnik, Alhussein Abouzeid ECSE, Rensselaer Polytechnic Institute.
Bayesian Nonparametric Matrix Factorization for Recorded Music Reading Group Presenter: Shujie Hou Cognitive Radio Institute Friday, October 15, 2010 Authors:
1 Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University of Minnesota Acknowledgment: AFOSR MURI grant no. FA
University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks Taehyong Kim Computer Science and Engineering State University.
Copyright 2006, Data Mining Research Laboratory An Event-based Framework for Characterizing the Evolutionary Behavior of Interaction Graphs Sitaram Asur,
Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation James Foulds 1, Levi Boyles 1, Christopher DuBois 2 Padhraic Smyth.
Statistical Estimation of High Dimensional Covariance Matrices – a sampling from Prof. Hero’s research group Ted Tsiligkaridis SPEECS Friday, Sept. 9,
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Blogosphere  What is blogosphere?  Why do we need to study Blog-space or Blogosphere?
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
INFERRING NETWORKS OF DIFFUSION AND INFLUENCE Presented by Alicia Frame Paper by Manuel Gomez-Rodriguez, Jure Leskovec, and Andreas Kraus.
Randomized Cuts for 3D Mesh Analysis
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Models of Influence in Online Social Networks
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Social Network Analysis via Factor Graph Model
Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis A Proximal Gradient Algorithm for Tracking Cascades over Networks Acknowledgments: NSF ECCS Grant.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
1 Unveiling Anomalies in Large-scale Networks via Sparsity and Low Rank Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University.
1 1 MPI for Intelligent Systems 2 Stanford University Manuel Gomez Rodriguez 1,2 David Balduzzi 1 Bernhard Schölkopf 1 UNCOVERING THE TEMPORAL DYNAMICS.
1 Exact Recovery of Low-Rank Plus Compressed Sparse Matrices Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University of Minnesota.
Contagion in Networks Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Information Spread and Information Maximization in Social Networks Xie Yiran 5.28.
1 Sparsity Control for Robust Principal Component Analysis Gonzalo Mateos and Georgios B. Giannakis ECE Department, University of Minnesota Acknowledgments:
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
ACM International Conference on Information and Knowledge Management (CIKM) Analysis of Physical Activity Propagation in a Health Social Network.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Manuel Gomez Rodriguez Structure and Dynamics of Information Pathways in On-line Media W ORKSHOP M ENORCA, MPI FOR I NTELLIGENT S YSTEMS.
Multi-area Nonlinear State Estimation using Distributed Semidefinite Programming Hao Zhu October 15, 2012 Acknowledgements: Prof. G.
B. Baingana, E. Dall’Anese, G. Mateos and G. B. Giannakis Acknowledgments: NSF Grants , , , , ARO W911NF
Network Lasso: Clustering and Optimization in Large Graphs
Rank Minimization for Subspace Tracking from Incomplete Data
Manuel Gomez Rodriguez Bernhard Schölkopf I NFLUENCE M AXIMIZATION IN C ONTINUOUS T IME D IFFUSION N ETWORKS , ICML ‘12.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
DM-MEETING Bijaya Adhikari OUTLINE From Micro to Macro: Uncovering and Predicting Information Cascading Process with Behavioral Dynamics 
1 Consensus-Based Distributed Least-Mean Square Algorithm Using Wireless Ad Hoc Networks Gonzalo Mateos, Ioannis Schizas and Georgios B. Giannakis ECE.
A Latent Social Approach to YouTube Popularity Prediction Amandianeze Nwana Prof. Salman Avestimehr Prof. Tsuhan Chen.
Inferring gene regulatory networks from multiple microarray datasets (Wang 2006) Tiffany Ko ELE571 Spring 2009.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
1 1 MPI for Intelligent Systems 2 Stanford University Manuel Gomez Rodriguez 1,2 Bernhard Schölkopf 1 S UBMODULAR I NFERENCE OF D IFFUSION NETWORKS FROM.
Markov Networks: Theory and Applications Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
1 Patterns of Cascading Behavior in Large Blog Graphs Jure Leskoves, Mary McGlohon, Christos Faloutsos, Natalie Glance, Matthew Hurst SDM 2007 Date:2008/8/21.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Biao Wang 1, Ge Chen 1, Luoyi Fu 1, Li Song 1, Xinbing Wang 1, Xue Liu 2 1 Shanghai Jiao Tong University 2 McGill University
Paper Presentation Social influence based clustering of heterogeneous information networks Qiwei Bao & Siqi Huang.
Introduction to several works and Some Ideas Songcan Chen
Inferring Networks of Diffusion and Influence
Multi-task learning approaches to modeling context-specific networks
DM-Group Meeting Liangzhe Chen, Nov
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.
Bucket Renormalization for Approximate Inference
Effective Social Network Quarantine with Minimal Isolation Costs
Mixture of Mutually Exciting Processes for Viral Diffusion
Estimating Networks With Jumps
Recursively Adapted Radial Basis Function Networks and its Relationship to Resource Allocating Networks and Online Kernel Learning Weifeng Liu, Puskal.
Signal Processing on Graphs: Performance of Graph Structure Estimation
Malik Magdon-Ismail, Konstantin Mertsalov, Mark Goldberg
--WWW 2010, Hongji Bao, Edward Y. Chang
Presentation transcript:

Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis Dynamic Structural Equation Models for Tracking Cascades over Social Networks Acknowledgments: NSF ECCS Grant No and NSF AST Grant No December 17, 2013

Context and motivation 2 Popular news stories Infectious diseases Buying patterns Propagate in cascades over social networks Network topologies: Unobservable, dynamic, sparse Topology inference vital: Viral advertising, healthcare policy B. Baingana, G. Mateos, and G. B. Giannakis, ``Dynamic structural equation models for social network topology inference,'' IEEE J. of Selected Topics in Signal Processing, 2013 (arXiv: [cs.SI]) Goal: track unobservable time-varying network topology from cascade traces Contagions

Contributions in context 3  Contributions  Dynamic SEM for tracking slowly-varying sparse networks  Accounting for external influences – Identifiability [Bazerque-Baingana-GG’13]  ADMM-based topology inference algorithm  Related work  Static, undirected networks e.g., [Meinshausen-Buhlmann’06], [Friedman et al’07]  MLE-based dynamic network inference [Rodriguez-Leskovec’13]  Time-invariant sparse SEM for gene network inference [Cai-Bazerque-GG’13]  Structural equation models (SEM): [Goldberger’72]  Statistical framework for modeling causal interactions (endo/exogenous effects)  Used in economics, psychometrics, social sciences, genetics… [Pearl’09] J. Pearl, Causality: Models, Reasoning, and Inference, 2 nd Ed., Cambridge Univ. Press, 2009

Cascades over dynamic networks 4  Example: N = 16 websites, C = 2 news event, T = 2 days  Unknown (asymmetric) adjacency matrices  N-node directed, dynamic network, C cascades observed over Event #1 Event #2  Cascade infection times depend on:  Causal interactions among nodes (topological influences)  Susceptibility to infection (non-topological influences)

Model and problem statement 5  Captures (directed) topological and external influences Problem statement:  Data: Infection time of node i by contagion c during interval t : external influence un-modeled dynamics Dynamic SEM

Exponentially-weighted LS criterion 6  Structural spatio-temporal properties  Slowly time-varying topology  Sparse edge connectivity,  Sparsity-promoting exponentially-weighted least-squares (LS) estimator (P1)  Edge sparsity encouraged by -norm regularization with  Tracking dynamic topologies possible if

Topology-tracking algorithm 7  Alternating-direction method of multipliers (ADMM), e.g., [Bertsekas-Tsitsiklis’89]  Each time interval (P2) Acquire new data Recursively update data sample (cross-)correlations Solve (P2) using ADMM  Attractive features  Provably convergent, close-form updates (unconstrained LS and soft-thresholding)  Fixed computational cost and memory storage requirement per

ADMM iterations 8  Sequential data terms:,, can be updated recursively: denotes row i of

Simulation setup  Kronecker graph [Leskovec et al’10]: N = 64, seed graph  cascades,,  Non-zero edge weights varied for   Uniform random selection from  Non-smooth edge weight variation 9

Simulation results  Algorithm parameters   Initialization    Error performance 10

The rise of Kim Jong-un t = 10 weeks t = 40 weeks  Web mentions of “Kim Jong-un” tracked from March’11 to Feb.’12  N = 360 websites, C = 466 cascades, T = 45 weeks 11 Data: SNAP’s “Web and blog datasets” Kim Jong-un – Supreme leader of N. Korea Increased media frenzy following Kim Jong-un’s ascent to power in 2011

LinkedIn goes public  Tracking phrase “Reid Hoffman” between March’11 and Feb.’12  N = 125 websites, C = 85 cascades, T = 41 weeks t = 5 weeks t = 30 weeks 12 Data: SNAP’s “Web and blog datasets” US sites  Datasets include other interesting “memes”: “Amy Winehouse”, “Syria”, “Wikileaks”,….

Conclusions 13  Dynamic SEM for modeling node infection times due to cascades  Topological influences and external sources of information diffusion  Accounts for edge sparsity typical of social networks  ADMM algorithm for tracking slowly-varying network topologies  Corroborating tests with synthetic and real cascades of online social media  Key events manifested as network connectivity changes Thank You!  Ongoing and future research  Identifiabiality of sparse and dynamic SEMs  Statistical model consistency tied to  Large-scale MapReduce/GraphLab implementations  Kernel extensions for network topology forecasting

ADMM closed-form updates 14  Update with equality constraints:,  :  Update by soft-thresholding operator