Spotting Culprits in Epidemics: How many and Which ones? B. Aditya Prakash Virginia Tech Jilles Vreeken University of Antwerp Christos Faloutsos Carnegie.

Slides:



Advertisements
Similar presentations
A Moving Strategy for Mobile Sink in Secure Data Collection Zhou Sha.
Advertisements

R 0 and other reproduction numbers for households models MRC Centre for Outbreak analysis and modelling, Department of Infectious Disease Epidemiology.
On the Vulnerability of Large Graphs
CMU SCS I2.2 Large Scale Information Network Processing INARC 1 Overview Goal: scalable algorithms to find patterns and anomalies on graphs 1. Mining Large.
FUNNEL: Automatic Mining of Spatially Coevolving Epidemics Yasuko Matsubara, Yasushi Sakurai (Kumamoto University) Willem G. van Panhuis (University of.
School of Computer Science Carnegie Mellon University Duke University DeltaCon: A Principled Massive- Graph Similarity Function Danai Koutra Joshua T.
Minimizing Seed Set for Viral Marketing Cheng Long & Raymond Chi-Wing Wong Presented by: Cheng Long 20-August-2011.
Understanding Cancer-based Networks in Twitter using Social Network Analysis Dhiraj Murthy Daniela Oliveira Alexander Gross Social Network Innovation Lab.
Interacting Viruses: Can Both Survive? Alex Beutel, B. Aditya Prakash, Roni Rosenfeld, Christos Faloutsos Carnegie Mellon University, USA KDD 2012, Beijing.
Mauro Sozio and Aristides Gionis Presented By:
DAVA: Distributing Vaccines over Networks under Prior Information
In Search of Influential Event Organizers in Online Social Networks
© 2012 IBM Corporation IBM Research Gelling, and Melting, Large Graphs by Edge Manipulation Joint Work by Hanghang Tong (IBM) B. Aditya Prakash (Virginia.
Graph Laplacian Regularization for Large-Scale Semidefinite Programming Kilian Weinberger et al. NIPS 2006 presented by Aggeliki Tsoli.
KAIST Adaptive Triangular Deployment Algorithm for Unattended Mobile Sensor Networks Suho Yang (September 4, 2008) Ming Ma, Yuanyuan Yang IEEE Transactions.
Endend endend Carnegie Mellon University Korea Advanced Institute of Science and Technology VoG: Summarizing and Understanding Large Graphs Danai Koutra.
Scalable Vaccine Distribution in Large Graphs given Uncertain Data Yao Zhang, B. Aditya Prakash Department of Computer Science Virginia Tech CIKM, Shanghai,
Lecture 21: Spectral Clustering
CMU SCS Mining Billion-node Graphs Christos Faloutsos CMU.
1 Epidemic Spreading in Real Networks: an Eigenvalue Viewpoint Yang Wang Deepayan Chakrabarti Chenxi Wang Christos Faloutsos.
Neighborhood Formation and Anomaly Detection in Bipartite Graphs Jimeng Sun Huiming Qu Deepayan Chakrabarti Christos Faloutsos Speaker: Jimeng Sun.
3D Position Determination Hasti AhleHagh Professor. W.R. Michalson.
Fully Automatic Cross-Associations Deepayan Chakrabarti (CMU) Spiros Papadimitriou (CMU) Dharmendra Modha (IBM) Christos Faloutsos (CMU and IBM)
Virus Propagation Modeling Chenxi Wang, Christos Faloutsos Yang Wang, Deepayan Chakrabarti Carnegie Mellon University Center for Computer.
Centrality Measures These measure a nodes importance or prominence in the network. The more central a node is in a network the more significant it is to.
The community-search problem and how to plan a successful cocktail party Mauro SozioAris Gionis Max Planck Institute, Germany Yahoo! Research, Barcelona.
U. Michigan participation in EDIN Lada Adamic, PI E 2.1 fractional immunization of networks E 2.1 time series analysis approach to correlating structure.
1 Fully Automatic Cross-Associations Deepayan Chakrabarti (CMU) Spiros Papadimitriou (CMU) Dharmendra Modha (IBM) Christos Faloutsos (CMU and IBM)
Elliot Anshelevich Department of Computer Science Interests: Design and analysis of algorithms, especially for large decentralized networks. Strategic.
DIDS part II The Return of dIDS 2/12 CIS GrIDS Graph based intrusion detection system for large networks. Analyzes network activity on networks.
Structure based Data De-anonymization of Social Networks and Mobility Traces Shouling Ji, Weiqing Li, and Raheem Beyah Georgia Institute of Technology.
Models of Influence in Online Social Networks
CMU SCS Big (graph) data analytics Christos Faloutsos CMU.
Online Social Networks and Media Epidemics and Influence.
1 A(n) (extremely) brief/crude introduction to minimum description length principle jdu
Structure Preserving Embedding Blake Shaw, Tony Jebara ICML 2009 (Best Student Paper nominee) Presented by Feng Chen.
Understanding and Managing Cascades on Large Graphs B. Aditya Prakash Computer Science Virginia Tech. CS Seminar 11/30/2012.
V5 Epidemics on networks
Introduction to Job Shop Scheduling Problem Qianjun Xu Oct. 30, 2001.
Directed-Graph Epidemiological Models of Computer Viruses Presented by: (Kelvin) Weiguo Jin “… (we) adapt the techniques of mathematical epidemiology to.
Interacting Viruses in Networks: Can Both Survive? Authors: Alex Beutel, B. Aditya Prakash, Roni Rosenfeld, and Christos Faloutsos Presented by: Zachary.
Winner-takes-all: Competing Viruses or Ideas on fair-play Networks B. Aditya Prakash, Alex Beutel, Roni Rosenfeld, Christos Faloutsos Carnegie Mellon University,
On Graphs Supporting Greedy Forwarding for Directional Wireless Networks W. Si, B. Scholz, G. Mao, R. Boreli, et al. University of Western Sydney National.
EVENT DETECTION IN TIME SERIES OF MOBILE COMMUNICATION GRAPHS
Hidden Hazards: Finding Missing Nodes in Large Graph Epidemics Shashidhar Sundareisan Virginia Tech Jilles Vreeken Max Planck Institute B. Aditya Prakash.
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
Online Social Networks and Media
ECML-PKDD 2010, Barcelona, Spain B. Aditya Prakash*, Hanghang Tong* ^, Nicholas Valler+, Michalis Faloutsos+, Christos Faloutsos* * Carnegie Mellon University,
Measuring Behavioral Trust in Social Networks
 Probability in Propagation. Transmission Rates  Models discussed so far assume a 100% transmission rate to susceptible individuals (e.g. Firefighter.
Du, Faloutsos, Wang, Akoglu Large Human Communication Networks Patterns and a Utility-Driven Generator Nan Du 1,2, Christos Faloutsos 2, Bai Wang 1, Leman.
CS 1944: Sophomore Seminar Big Data and Machine Learning B. Aditya Prakash Assistant Professor Nov 3, 2015.
Center-Piece Subgraphs: Problem definition and Fast Solutions Hanghang Tong Christos Faloutsos Carnegie Mellon University.
Community Detection based on Distance Dynamics Reporter: Yi Liu Student ID: Department of Computer Science and Engineering Shanghai Jiao Tong.
Leveraging Information Theory for Mining Graphs and Sequences: From Propagation to Segmentation B. Aditya Prakash Computer Science Virginia Tech. ITA Workshop,
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
Arizona State University Fast Eigen-Functions Tracking on Dynamic Graphs Chen Chen and Hanghang Tong - 1 -
Observability-Based Approach to Optimal Design of Networks Atiye Alaeddini Spring 2016.
Wenyu Zhang From Social Network Group
MEIKE: Influence-based Communities in Networks
Passing Networks in Hockey RIT Analytics Conference 2015 S Burtch
Edge Weight Prediction in Weighted Signed Networks
Introduction Secondary Users (SUs) Primary Users (PUs)
Approximating the Community Structure of the Long Tail
Cost-effective Outbreak Detection in Networks
Automatic Segmentation of Data Sequences
Properties and applications of spectra for networks
Viral Marketing over Social Networks
Navigation and Propagation in Networks
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Presentation transcript:

Spotting Culprits in Epidemics: How many and Which ones? B. Aditya Prakash Virginia Tech Jilles Vreeken University of Antwerp Christos Faloutsos Carnegie Mellon University IEEE ICDM Brussels December 11, 2012

Contagions Social collaboration Information Diffusion Viral Marketing Epidemiology and Public Health Cyber Security Human mobility Games and Virtual Worlds Ecology Localized effects: riots…

Virus Propagation Susceptible-Infected (SI) Model [AJPH 2007] CDC data: Visualization of the first 35 tuberculosis (TB) patients and their 1039 contacts Diseases over contact networks Prakash, Vreeken, Faloutsos 2012 β

Outline Motivation---Introduction Problem Definition Intuition MDL Experiments Conclusion Prakash, Vreeken, Faloutsos 2012

Culprits: Problem definition 2-d grid Q: Who started it? Prakash, Vreeken, Faloutsos 2012

Culprits: Problem definition Prakash, Vreeken, Faloutsos 2012 Prior work: [Lappas et al. 2010, Shah et al. 2011] 2-d grid Q: Who started it?

Outline Motivation---Introduction Problem Definition Intuition MDL Experiments Conclusion Prakash, Vreeken, Faloutsos 2012

Culprits: Exoneration Prakash, Vreeken, Faloutsos 2012

Culprits: Exoneration Prakash, Vreeken, Faloutsos 2012

Who are the culprits Two-part solution – use MDL for number of seeds – for a given number: exoneration = centrality + penalty Running time = – linear! (in edges and nodes) Prakash, Vreeken, Faloutsos 2012 NetSleuth

Outline Motivation---Introduction Problem Definition Intuition MDL – Construction – Opitimization Experiments Conclusion Prakash, Vreeken, Faloutsos 2012

Modeling using MDL Minimum Description Length Principle == Induction by compression Related to Bayesian approaches MDL = Model + Data Model – Scoring the seed-set Number of possible |S|- sized sets En-coding integer |S| Prakash, Vreeken, Faloutsos 2012

Modeling using MDL Data: Propagation Ripples Original Graph Infected Snapshot Ripple R2 Ripple R1 Prakash, Vreeken, Faloutsos 2012

Modeling using MDL Ripple cost Total MDL cost How the ‘frontier’ advances How long is the ripple Ripple R Prakash, Vreeken, Faloutsos 2012

Outline Motivation---Introduction Problem Definition Intuition MDL – Construction – Opitimization Experiments Conclusion Prakash, Vreeken, Faloutsos 2012

How to optimize the score? Two-step process – Given k, quickly identify high-quality set – Given these nodes, optimize the ripple R Prakash, Vreeken, Faloutsos 2012

Optimizing the score High-quality k-seed-set – Exoneration Best single seed: – Smallest eigenvector of Laplacian sub-matrix – Analyze a Constrained SI epidemic Exonerate neighbors Repeat Prakash, Vreeken, Faloutsos 2012

Optimizing the score Optimizing R – Get the MLE ripple! Finally use MDL score to tell us the best set NetSleuth: Linear running time in nodes and edges Prakash, Vreeken, Faloutsos 2012 Ripple R

Outline Motivation---Introduction Problem Definition Intuition MDL Experiments Conclusion Prakash, Vreeken, Faloutsos 2012

Experiments Evaluation functions: – MDL based – Overlap based (JD == Jaccard distance) Closer to 1 the better Prakash, Vreeken, Faloutsos 2012 How far are they?

Experiments: # of Seeds One SeedTwo Seeds Three Seeds

Experiments: Quality (MDL and JD) Prakash, Vreeken, Faloutsos 2012 Ideal = 1 One SeedTwo Seeds Three Seeds

Experiments: Quality (Jaccard Scores) Prakash, Vreeken, Faloutsos 2012 Closer to diagonal, the better True NetSleuth One SeedTwo Seeds Three Seeds

Experiments: Scalability Prakash, Vreeken, Faloutsos 2012

Outline Motivation---Introduction Problem Definition Intuition MDL Experiments Conclusion Prakash, Vreeken, Faloutsos 2012

Conclusion Given: Graph and Infections Find: Best ‘Culprits’ Two-part solution – use MDL for number of seeds – for a given number: exoneration = centrality + penalty NetSleuth: – Linear running time in nodes and edges Prakash, Vreeken, Faloutsos 2012

B. Aditya Prakash Any Questions? Prakash, Vreeken, Faloutsos 2012