Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis A Proximal Gradient Algorithm for Tracking Cascades over Networks Acknowledgments: NSF ECCS Grant.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

An Interactive-Voting Based Map Matching Algorithm
Accelerated, Parallel and PROXimal coordinate descent IPAM February 2014 APPROX Peter Richtárik (Joint work with Olivier Fercoq - arXiv: )
1 Closed-Form MSE Performance of the Distributed LMS Algorithm Gonzalo Mateos, Ioannis Schizas and Georgios B. Giannakis ECE Department, University of.
Distributed Nuclear Norm Minimization for Matrix Completion
CHAPTER 3 CHAPTER 3 R ECURSIVE E STIMATION FOR L INEAR M ODELS Organization of chapter in ISSO –Linear models Relationship between least-squares and mean-square.
Link Analysis: PageRank
Carnegie Mellon Distributed Inference: High Dimensional Consensus TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis Dynamic Structural Equation Models for Tracking Cascades over Social Networks Acknowledgments:
1 Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University of Minnesota Acknowledgment: AFOSR MURI grant no. FA
Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation James Foulds 1, Levi Boyles 1, Christopher DuBois 2 Padhraic Smyth.
Cascading Behavior in Large Blog Graphs Patterns and a Model Leskovec et al. (SDM 2007)
Statistical Estimation of High Dimensional Covariance Matrices – a sampling from Prof. Hero’s research group Ted Tsiligkaridis SPEECS Friday, Sept. 9,
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
INFERRING NETWORKS OF DIFFUSION AND INFLUENCE Presented by Alicia Frame Paper by Manuel Gomez-Rodriguez, Jure Leskovec, and Andreas Kraus.
Randomized Cuts for 3D Mesh Analysis
Research at Intel Distributed Localization of Modular Robot Ensembles Robotics: Science and Systems 25 June 2008 Stanislav Funiak, Michael Ashley-Rollman.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Sparsity-Aware Adaptive Algorithms Based on Alternating Optimization and Shrinkage Rodrigo C. de Lamare* + and Raimundo Sampaio-Neto * + Communications.
Models of Influence in Online Social Networks
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Sebastian Schelter, Venu Satuluri, Reza Zadeh
Social Network Analysis via Factor Graph Model
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
1 Unveiling Anomalies in Large-scale Networks via Sparsity and Low Rank Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University.
1 1 MPI for Intelligent Systems 2 Stanford University Manuel Gomez Rodriguez 1,2 David Balduzzi 1 Bernhard Schölkopf 1 UNCOVERING THE TEMPORAL DYNAMICS.
1 Exact Recovery of Low-Rank Plus Compressed Sparse Matrices Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University of Minnesota.
Adaptive CSMA under the SINR Model: Fast convergence using the Bethe Approximation Krishna Jagannathan IIT Madras (Joint work with) Peruru Subrahmanya.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
1 Sparsity Control for Robust Principal Component Analysis Gonzalo Mateos and Georgios B. Giannakis ECE Department, University of Minnesota Acknowledgments:
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
A New Hybrid Wireless Sensor Network Localization System Ahmed A. Ahmed, Hongchi Shi, and Yi Shang Department of Computer Science University of Missouri-Columbia.
Manuel Gomez Rodriguez Structure and Dynamics of Information Pathways in On-line Media W ORKSHOP M ENORCA, MPI FOR I NTELLIGENT S YSTEMS.
High-dimensional Error Analysis of Regularized M-Estimators Ehsan AbbasiChristos ThrampoulidisBabak Hassibi Allerton Conference Wednesday September 30,
Multi-area Nonlinear State Estimation using Distributed Semidefinite Programming Hao Zhu October 15, 2012 Acknowledgements: Prof. G.
B. Baingana, E. Dall’Anese, G. Mateos and G. B. Giannakis Acknowledgments: NSF Grants , , , , ARO W911NF
Network Lasso: Clustering and Optimization in Large Graphs
Rank Minimization for Subspace Tracking from Incomplete Data
Manuel Gomez Rodriguez Bernhard Schölkopf I NFLUENCE M AXIMIZATION IN C ONTINUOUS T IME D IFFUSION N ETWORKS , ICML ‘12.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
1 Consensus-Based Distributed Least-Mean Square Algorithm Using Wireless Ad Hoc Networks Gonzalo Mateos, Ioannis Schizas and Georgios B. Giannakis ECE.
A Latent Social Approach to YouTube Popularity Prediction Amandianeze Nwana Prof. Salman Avestimehr Prof. Tsuhan Chen.
Inferring gene regulatory networks from multiple microarray datasets (Wang 2006) Tiffany Ko ELE571 Spring 2009.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Unsupervised Streaming Feature Selection in Social Media
1 1 MPI for Intelligent Systems 2 Stanford University Manuel Gomez Rodriguez 1,2 Bernhard Schölkopf 1 S UBMODULAR I NFERENCE OF D IFFUSION NETWORKS FROM.
1 Patterns of Cascading Behavior in Large Blog Graphs Jure Leskoves, Mary McGlohon, Christos Faloutsos, Natalie Glance, Matthew Hurst SDM 2007 Date:2008/8/21.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Perfect recall: Every decision node observes all earlier decision nodes and their parents (along a “temporal” order) Sum-max-sum rule (dynamical programming):
Biao Wang 1, Ge Chen 1, Luoyi Fu 1, Li Song 1, Xinbing Wang 1, Xue Liu 2 1 Shanghai Jiao Tong University 2 McGill University
Introduction to several works and Some Ideas Songcan Chen
Information Theory for Mobile Ad-Hoc Networks (ITMANET): The FLoWS Project A Distributed Newton Method for Network Optimization Ali Jadbabaie and Asu Ozdaglar.
Inferring Networks of Diffusion and Influence
Multi-task learning approaches to modeling context-specific networks
Jinbo Bi Joint work with Jiangwen Sun, Jin Lu, and Tingyang Xu
DM-Group Meeting Liangzhe Chen, Nov
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.
Bucket Renormalization for Approximate Inference
Latent Space Model for Road Networks to Predict Time-Varying Traffic
Effective Social Network Quarantine with Minimal Isolation Costs
Mixture of Mutually Exciting Processes for Viral Diffusion
Estimating Networks With Jumps
Recursively Adapted Radial Basis Function Networks and its Relationship to Resource Allocating Networks and Online Kernel Learning Weifeng Liu, Puskal.
A Dynamic System Analysis of Simultaneous Recurrent Neural Network
Signal Processing on Graphs: Performance of Graph Structure Estimation
Malik Magdon-Ismail, Konstantin Mertsalov, Mark Goldberg
Presentation transcript:

Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis A Proximal Gradient Algorithm for Tracking Cascades over Networks Acknowledgments: NSF ECCS Grant No and NSF AST Grant No May 8, 2014 Florence, Italy

Context and motivation 2 Popular news stories Infectious diseases Buying patterns Propagate in cascades over social networks Network topologies: Unobservable, dynamic, sparse Topology inference vital: Viral advertising, healthcare policy B. Baingana, G. Mateos, and G. B. Giannakis, ``A proximal-gradient algorithm for tracking cascades over social networks,'' IEEE J. of Selected Topics in Signal Processing, Aug (arXiv: [cs.SI]). Goal: track unobservable time-varying network topology from cascade traces Contagions

Contributions in context 3  Contributions  Dynamic SEM for tracking slowly-varying sparse networks  Accounting for external influences – Identifiability [Bazerque-Baingana-GG’13]  First-order topology inference algorithm  Related work  Static, undirected networks e.g., [Meinshausen-Buhlmann’06], [Friedman et al’07]  MLE-based dynamic network inference [Rodriguez-Leskovec’13]  Time-invariant sparse SEM for gene network inference [Cai-Bazerque-GG’13]  Structural equation models (SEM) [Goldberger’72]  Statistical framework for modeling relational interactions (endo/exogenous effects)  Used in economics, psychometrics, social sciences, genetics… [Pearl’09] D. Kaplan, Structural Equation Modeling: Foundations and Extensions, 2 nd Ed., Sage, 2009.

Cascades over dynamic networks 4  Example: N = 16 websites, C = 2 news events, T = 2 days  Unknown (asymmetric) adjacency matrices  N-node directed, dynamic network, C cascades observed over Event #1 Event #2  Node infection times depend on:  Causal interactions among nodes (topological influences)  Susceptibility to infection (non-topological influences)

Model and problem statement 5  Captures (directed) topological and external influences Problem statement:  Data: Infection time of node i by contagion c during interval t : external influence un-modeled dynamics Dynamic SEM

Exponentially-weighted LS criterion 6  Structural spatio-temporal properties  Slowly time-varying topology  Sparse edge connectivity,  Sparsity-promoting exponentially-weighted least-squares (LS) estimator (P1)  Edge sparsity encouraged by -norm regularization with  Tracking dynamic topologies possible if

7  Attractive features  Provably convergent, closed-form updates (unconstrained LS and soft-thresholding)  Fixed computational cost and memory storage requirement per  Scales to large datasets  Let (P2) gradient descent Solvable by soft-thresholding operator [cf. Lasso]  Iterative shrinkage-thresholding algorithm (ISTA) [Parikh-Boyd’13]  Ideal for composite convex + non-smooth cost Topology-tracking algorithm γ -γ

8  Sequential data terms in : row i of recursive updates  Each time interval Recursively update Acquire new data Solve (P2) using (F)ISTA Recursive updates

Simulation setup  Kronecker graph [Leskovec et al’10]: N = 64, seed graph  cascades,,  Non-zero edge weights varied for   Uniform random selection from  Non-smooth edge weight variation 9

Simulation results  Error performance 10  Algorithm parameters 

The rise of Kim Jong-un t = 10 weeks t = 40 weeks  Web mentions of “Kim Jong-un” tracked from March’11 to Feb.’12  N = 360 websites, C = 466 cascades, T = 45 weeks 11 Data: SNAP’s “Web and blog datasets” Kim Jong-un – Supreme leader of N. Korea Increased media frenzy following Kim Jong-un’s ascent to power in 2011

LinkedIn goes public  Tracking phrase “Reid Hoffman” between March’11 and Feb.’12  N = 125 websites, C = 85 cascades, T = 41 weeks t = 5 weeks t = 30 weeks 12 Data: SNAP’s “Web and blog datasets” US sites  Datasets include other interesting “memes”: “Amy Winehouse”, “Syria”, “Wikileaks”,….

Conclusions 13  Dynamic SEM for modeling node infection times due to cascades  Topological influences and external sources of information diffusion  Accounts for edge sparsity typical of social networks  Proximal gradient algorithm for tracking slowly-varying network topologies  Corroborating tests with synthetic and real cascades of online social media  Key events manifested as network connectivity changes Thank You!  Ongoing and future research  Dynamical models with memory  Identifiabiality of sparse and dynamic SEMs  Statistical model consistency tied to  Large-scale MapReduce/GraphLab implementations  Kernel extensions for network topology forecasting

14 Recursive Updates Parallelizable ISTA iterations

ADMM iterations 15  Sequential data terms:,, can be updated recursively: denotes row i of

ADMM closed-form updates 16  Update with equality constraints:,  :  Update by soft-thresholding operator

17 a1) edge sparsity: a2) sparse changes: a3) error-free DSEM: Goal: under a1)-a3), establish conditions on to uniquely identify  Preliminary result (static SEM) If, with and diagonal matrix and i), ii) non-zero entries of are drawn from a continuous distribution, and iii) Kruskal rank, then and can be uniquely determined. J. A. Bazerque, B. Baingana, and G. B. Giannakis, "Identifiability of sparse structural equation models for directed, cyclic, and time-varying networks," Proc. of Global Conf. on Signal and Info. Processing, Austin, TX, December 3-5, Outlook: Indentifiability of DSEMs