DM-MEETING Bijaya Adhikari 11.11.2015. OUTLINE From Micro to Macro: Uncovering and Predicting Information Cascading Process with Behavioral Dynamics 


DM-MEETING Bijaya Adhikari

OUTLINE 1. From Micro to Macro: Uncovering and Predicting Information Cascading Process with Behavioral Dynamics (Yu et al.) 2. Graph Summarization with Quality Guarantees (Riondato et al.)

FROM MICRO TO MACRO: UNCOVERING AND PREDICTING INFORMATION CASCADING PROCESS WITH BEHAVIORAL DYNAMICS

MOTIVATION Can we predict cascades in a network? Are they predictable at all? If so, given the early stage of an information cascade, can we predict its cumulative size at any later time?

KEY IDEA When a node is involved in a cascade, so are some of its offspring. If the dynamics of these node-level sub-cascades can be modelled accurately, the whole cascade can be predicted by an additive function of the local sub-cascades. The approach examines the micro mechanism of a cascade by decomposing it into multiple local (one-hop) sub-cascades and uses them to predict the cascading process.
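The additive decomposition can be illustrated with a small sketch. It is not the authors' predictor: it only shows, on observed data, that the size of a cascade tree equals the root plus the sum of the one-hop sub-cascade sizes. The event format (node, parent, time) and the function name cascade_size are assumptions made for this example.

```python
def cascade_size(events, t):
    """Cascade size at time t as a sum of one-hop sub-cascades.

    events: list of (node, parent, time) tuples; the root has parent None.
    This tuple layout is a hypothetical format chosen for the sketch.
    """
    infected = [e for e in events if e[2] <= t]

    # One-hop sub-cascade of a node = its children infected by time t.
    sub_cascade = {}
    for node, parent, time in infected:
        if parent is not None:
            sub_cascade[parent] = sub_cascade.get(parent, 0) + 1

    # Every non-root node is counted once, as a child of its parent.
    roots = sum(1 for _, parent, _ in infected if parent is None)
    return roots + sum(sub_cascade.values())


events = [("a", None, 0), ("b", "a", 1), ("c", "a", 2), ("d", "b", 3)]
print(cascade_size(events, 2))
```

A predictive model would replace the observed child counts with a per-node estimate of how each sub-cascade grows over time.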

ILLUSTRATION

EXAMPLE Comparison of predictions made from observations at various times against the true cascade (red).

BEHAVIORAL DYNAMICS The behavioral dynamics of a node capture the cumulative number of its infected descendants after it becomes infected. Because the cumulative size varies from cascade to cascade, the survival rate is used instead.

PARAMETERIZING BEHAVIORAL DYNAMICS The KS statistic shows that the Weibull distribution is the most adequate for parameterizing behavioral dynamics. [Formulas: PDF, survival function, hazard function] Source:
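For reference, the three Weibull quantities named on this slide can be written as short functions. This is a minimal sketch using the standard scale/shape parameterization (scale lam, shape k); the slide's own notation is not preserved in the transcript, so the parameter names here are assumptions.

```python
import math


def weibull_pdf(t, lam, k):
    """Density f(t) = (k/lam) * (t/lam)**(k-1) * exp(-(t/lam)**k), for t > 0."""
    return (k / lam) * (t / lam) ** (k - 1) * math.exp(-((t / lam) ** k))


def weibull_survival(t, lam, k):
    """Survival S(t) = exp(-(t/lam)**k): probability the event occurs after t."""
    return math.exp(-((t / lam) ** k))


def weibull_hazard(t, lam, k):
    """Hazard h(t) = f(t) / S(t) = (k/lam) * (t/lam)**(k-1), for t > 0."""
    return (k / lam) * (t / lam) ** (k - 1)


# With k = 1 the Weibull reduces to an exponential with constant hazard 1/lam.
print(weibull_hazard(3.0, 2.0, 1.0))
```

A shape k below 1 gives a hazard that decreases over time, and k above 1 gives an increasing hazard.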

COVARIATES OF BEHAVIORAL FEATURES Some nodes have few or no sub-cascades, and the parameters learned from data are difficult to interpret (Twitter-like data).

WHY CAN WE INFER CASCADES FROM EARLY STAGES? Minor dominance and early-stage dominance.

FORMAL STATEMENT

SURVIVAL ANALYSIS

NETWORKED WEIBULL REGRESSION (NEWER) MODEL Fit a Weibull distribution to the survival time of node i.
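Fitting a Weibull distribution to survival times usually means minimizing a negative log-likelihood. The sketch below shows only that plain objective for one node, assuming fully observed (uncensored) survival times and the standard scale/shape parameterization. It does not reproduce the NEWER model itself, whose regression on network features and regularization term are not given in this transcript. The function name weibull_nll is hypothetical.

```python
import math


def weibull_nll(times, lam, k):
    """Negative log-likelihood of positive survival times under Weibull(lam, k).

    Uses log f(t) = log(k/lam) + (k-1)*log(t/lam) - (t/lam)**k.
    Assumes every time is observed exactly (no censoring).
    """
    total = 0.0
    for t in times:
        total += math.log(k / lam) + (k - 1) * math.log(t / lam) - (t / lam) ** k
    return -total


times = [1.0, 2.0, 3.0]
# A scale close to the data gives a lower NLL than a badly mismatched one.
print(weibull_nll(times, 2.0, 2.0) < weibull_nll(times, 100.0, 2.0))
```

Any optimizer could minimize this over lam and k; coordinate descent would alternate between updating one parameter while holding the other fixed.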

REGULARIZED NLL FOR NEWER Optimize F by coordinate descent

EFFICIENT CASCADE PREDICTION

SAMPLING MODEL Estimate the cascade dynamically so that changes are monitored. The sub-cascade generated by a node is zero if no other node is involved. The temporal size counter and the final death rate do not change, but the death rate increases over time. This causes a relative error rate of […]. Therefore the cascade size can be dynamically estimated within some error bound.

EXPERIMENTS: CASCADE SIZE PREDICTION

EXPERIMENTS: OUTBREAK TIME PREDICTION

GRAPH SUMMARIZATION WITH QUALITY GUARANTEES

MOTIVATION As graphs grow, analyzing, visualizing, and mining them becomes computationally challenging. Large networks do not fit in memory, and accessing disk makes computation even slower. Can we find a lossy, concise representation of a large graph that fits into main memory?

DEFINITION Given a graph G = (V, E) and an integer k, a k-summary S of G is a complete weighted undirected graph. The vertices of S are called supernodes, and there are superedges between them. Each superedge is weighted by the density of edges between the vertex sets V_i and V_j of the two supernodes, where A_G is the adjacency matrix of the original graph.
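The superedge weights can be sketched as follows. The density formula itself is missing from the transcript, so this example assumes the density of a pair of supernodes is the sum of the adjacency entries A_G(u, v) over u in V_i and v in V_j, divided by |V_i| * |V_j|, and that diagonal blocks are treated the same way. The function name density_matrix is hypothetical.

```python
def density_matrix(adj, parts):
    """k x k matrix of edge densities between the parts of a partition.

    adj:   n x n adjacency matrix (list of lists of 0/1).
    parts: list of k disjoint vertex lists covering 0..n-1.
    Assumed density: sum of adj[u][v] over the block, divided by block size.
    """
    k = len(parts)
    dens = [[0.0] * k for _ in range(k)]
    for i, vi in enumerate(parts):
        for j, vj in enumerate(parts):
            edges = sum(adj[u][v] for u in vi for v in vj)
            dens[i][j] = edges / (len(vi) * len(vj))
    return dens


# Path graph 0-1-2-3 summarized with supernodes {0, 1} and {2, 3}.
adj = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
print(density_matrix(adj, [[0, 1], [2, 3]]))
```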

DEFINITION Density matrix. The density matrix can be lifted to an n×n matrix, where s(v) is the supernode in S that contains vertex v of the original graph.
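Lifting can be sketched as a lookup: entry (u, v) of the lifted matrix takes the density of the superedge between s(u) and s(v). The formula is not preserved in the transcript, so this reading is an assumption, as are the function name lift and the representation of s as a list mapping each vertex to its supernode index.

```python
def lift(dens, s):
    """Expand a k x k density matrix to n x n.

    dens: k x k density matrix.
    s:    list of length n; s[v] is the index of the supernode containing v.
    """
    n = len(s)
    return [[dens[s[u]][s[v]] for v in range(n)] for u in range(n)]


dens = [[0.5, 0.25], [0.25, 0.5]]
s = [0, 0, 1, 1]  # vertices 0, 1 in supernode 0; vertices 2, 3 in supernode 1
for row in lift(dens, s):
    print(row)
```

The lifted matrix has the same shape as the adjacency matrix, which is what makes an entrywise comparison between the two possible.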

EXAMPLE

PROBLEM DEFINITION

L_p RECONSTRUCTION ERROR
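The slide's formula is missing from the transcript. A minimal sketch, assuming the error is the entrywise l_p norm of the difference between the adjacency matrix and the lifted density matrix, could look like this. The function name lp_error is hypothetical.

```python
def lp_error(a, b, p):
    """Entrywise l_p distance between two equally sized matrices.

    Computes (sum over all entries of |a - b|**p) ** (1/p).
    """
    total = sum(
        abs(x - y) ** p
        for row_a, row_b in zip(a, b)
        for x, y in zip(row_a, row_b)
    )
    return total ** (1.0 / p)


adjacency = [[0, 1], [1, 0]]
lifted = [[0.5, 0.5], [0.5, 0.5]]
print(lp_error(adjacency, lifted, 1), lp_error(adjacency, lifted, 2))
```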

THE BEST MATRIX FOR A GIVEN PARTITION Given a k-partition P, an n×n matrix M is P-constant if the S_i × S_j submatrix of M is constant for all i and j between 1 and k. It is shown that finding a P-constant matrix that represents the graph with some guaranteed quality reduces to the k-means problem with the l_2 metric (k-median with the l_1 metric).
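For a fixed partition, each block can be fitted independently. The sketch below relies on a standard fact rather than on anything stated on the slide: the constant that minimizes the sum of squared deviations from a set of values is their mean, and a constant that minimizes the sum of absolute deviations is their median. The function names are hypothetical.

```python
from statistics import mean, median


def best_constant(block_values, p):
    """Constant that best fits the entries of one block under l_1 or l_2."""
    if p == 2:
        return mean(block_values)    # minimizes the sum of squared deviations
    if p == 1:
        return median(block_values)  # minimizes the sum of absolute deviations
    raise ValueError("only p = 1 and p = 2 are handled in this sketch")


def block_cost(block_values, c, p):
    """Sum of |x - c|**p over the block (the p-th power of its l_p error)."""
    return sum(abs(x - c) ** p for x in block_values)


block = [0, 0, 0, 1]  # adjacency entries of one S_i x S_j block, flattened
print(best_constant(block, 2), best_constant(block, 1))
```

For 0/1 adjacency entries, the mean of a block is the fraction of ones in it.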

EXPERIMENTS: RECONSTRUCTION ERROR

EXPERIMENTS: SUMMARIZATION