Human-centered Machine Learning

Slides:



Advertisements
Similar presentations
Topologies of Complex Networks Functions vs. Structures Lun Li Advisor: John C. Doyle Co-advisor: Steven H. Low Collaborators: David Alderson (NPS) Walter.
Advertisements

Network analysis Sushmita Roy BMI/CS 576
1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
Copula Regression By Rahul A. Parsa Drake University &
Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.
Sampling distributions of alleles under models of neutral evolution.
Advanced Topics in Data Mining Special focus: Social Networks.
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.
Cascading Behavior in Large Blog Graphs Patterns and a Model Leskovec et al. (SDM 2007)
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.
Measurement and Evolution of Online Social Networks Review of paper by Ophir Gaathon Analysis of Social Information Networks COMS , Spring 2011,
Models of Influence in Online Social Networks
Diffusion in Social and Information Networks Part II W ORLD W IDE W EB 2015, F LORENCE MPI for Software SystemsGeorgia Institute of Technology Le Song.
Theory of Decision Time Dynamics, with Applications to Memory.
Information Networks Power Laws and Network Models Lecture 3.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
1 1 MPI for Intelligent Systems 2 Stanford University Manuel Gomez Rodriguez 1,2 David Balduzzi 1 Bernhard Schölkopf 1 UNCOVERING THE TEMPORAL DYNAMICS.
Weighted Graphs and Disconnected Components Patterns and a Generator IDB Lab 현근수 In KDD 08. Mary McGlohon, Leman Akoglu, Christos Faloutsos.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
Annealing Paths for the Evaluation of Topic Models James Foulds Padhraic Smyth Department of Computer Science University of California, Irvine* *James.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
Manuel Gomez Rodriguez Structure and Dynamics of Information Pathways in On-line Media W ORKSHOP M ENORCA, MPI FOR I NTELLIGENT S YSTEMS.
Manuel Gomez Rodriguez Bernhard Schölkopf I NFLUENCE M AXIMIZATION IN C ONTINUOUS T IME D IFFUSION N ETWORKS , ICML ‘12.
5. Maximum Likelihood –II Prof. Yuille. Stat 231. Fall 2004.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
DM-MEETING Bijaya Adhikari OUTLINE From Micro to Macro: Uncovering and Predicting Information Cascading Process with Behavioral Dynamics 
1 1 MPI for Intelligent Systems 2 Stanford University Manuel Gomez Rodriguez 1,2 Bernhard Schölkopf 1 S UBMODULAR I NFERENCE OF D IFFUSION NETWORKS FROM.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Biao Wang 1, Ge Chen 1, Luoyi Fu 1, Li Song 1, Xinbing Wang 1, Xue Liu 2 1 Shanghai Jiao Tong University 2 McGill University
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
 DM-Group Meeting Liangzhe Chen, Oct Papers to be present  RSC: Mining and Modeling Temporal Activity in Social Media  KDD’15  A. F. Costa,
Machine learning for Dynamic Social Network Analysis
Lecture 1.31 Criteria for optimal reception of radio signals.
Inferring Networks of Diffusion and Influence
Machine learning for Dynamic Social Network Analysis
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Manuel Gomez Rodriguez
Machine learning for Dynamic Social Network Analysis
Probability Theory and Parameter Estimation I
HyperNetworks Engın denız usta
Topics In Social Computing (67810)
A Study of Group-Tree Matching in Large Scale Group Communications
Biological networks CS 5263 Bioinformatics.
Manuel Gomez Rodriguez
Link Prediction & Content
Special Topics In Scientific Computing
A Latent Space Approach to Dynamic Embedding of Co-occurrence Data
Lecture 13 Network evolution
10701 / Machine Learning Today: - Cross validation,
Pattern Recognition and Machine Learning
Boltzmann Machine (BM) (§6.4)
CS246: Latent Dirichlet Analysis
Topic models for corpora and for graphs
Topic Models in Text Processing
Parametric Methods Berlin Chen, 2005 References:
Lecture 21 Network evolution
复杂网络可控性 研究进展 汪秉宏 2014 北京 网络科学论坛.
Modelling and Searching Networks Lecture 6 – PA models
Human-centered Machine Learning
Human-centered Machine Learning
Advanced Topics in Data Mining Special focus: Social Networks
Human-centered Machine Learning
Human-centered Machine Learning
Discrete Mathematics and its Applications Lecture 6 – PA models
Human-centered Machine Learning
Advanced Topics in Data Mining Special focus: Social Networks
GhostLink: Latent Network Inference for Influence-aware Recommendation
Human-centered Machine Learning
Presentation transcript:

Human-centered Machine Learning Information propagation with Temporal Point Processes Human-centered Machine Learning http://courses.mpi-sws.org/hcml-ws18/

Information cascades: Terminating point process models Outline of tutorial

An example: information cascade Christine means D follows S Bob 3.00pm 3.25pm Beth 3.27pm Joe David 4.15pm Friggeri et al., 2014 They can have an impact in the off-line world

Information cascade representation We represent an information cascade using terminating temporal point processes: N1(t) Sharing event: N2(t) N3(t) User Time Cascade N4(t) N5(t)

Information cascade intensity Source (given, not modeled) N1(t) N2(t) N3(t) Shares / follow-ups (modeled) N4(t) N5(t) Memory Share only once Previous message by user v Influence from user v on user u 5 [Gomez-Rodriguez et al., ICML 2011]

Model inference from multiple cascades Conditional intensities Cascade log-likelihood Maximum likelihood approach to find model parameters! Sum up log-likelihoods of multiple cascades! Theorem. For any choice of parametric memory, the maximum likelihood problem is convex in B. [Gomez-Rodriguez et al., ICML 2011]

Topic-sensitive intensities Topic-modulated influence: LDA weight for topic l [Du et al., AISTATS 2013]

[Gomez-Rodriguez et al., WSDM 2013] Dynamic influence In some cases, influence change over time: #greece retweets Propagation over networks with variable influence T [Gomez-Rodriguez et al., WSDM 2013]

Memetracker [Leskovec et al., KDD ’09]

Insights I: real world events Youtube video: http://youtu.be/hBeaSfRCU4c

Insights II: dynamic clusters Youtube video: http://youtu.be/hBeaSfRCU4c

Beyond information cascades: Nonterminating point process models Outline of tutorial

Recurrent events: beyond cascades Up to this point, we have assumed we can map each event to a cascade In general, especially in social networks: Difficult to distinguish cascades in event data Most cascades are single nodes (or forests) no likes no comments no shares

Recurrent events representation We represent shares using nonterminating temporal point processes: N1(t) Recurrent event: N2(t) N3(t) User Time N4(t) N5(t) [Farajtabar et al., NIPS 2014]

Recurrent events intensity Memory Cascade sources! Hawkes process User’s intensity Messages on her own initiative Previous messages by user v Influence from user v on user u [De et al., NIPS 2016]

Information cascades and network evolution: Nonterminating point process models Outline of tutorial

Beyond information cascades (II) Recent empirical studies [Antoniades and Dovrolis, Myers & Leskovec] show that information cascades also change the structure of social networks: Information propagation triggers new links Cascade 1 Source 1 hour later

Co-evolution as interwoven point processes (I) We model user’s retweet and link events as nonterminating and terminating counting processes: Alice Bob retweets (is exposed to) Alice NBob, Alice (t) Eva t1 t2 t3 t4 Charly ABob, Alice(t) Bob follows Alice Bob t'1 Key idea Both counting processes have memory and depend on each other 30 min later

Co-evolution as interwoven point processes (II) We characterize retweet and link counting processes using their respective conditional intensities: Changes on retweets in [t, t+dt] They are coupled through the histories History of retweets and links up to t Instantaneous rates or intensities Changes on links in [t, t+dt]

Intensity for information propagation Tweets on her own initiative Time-varying network topology Node u does not need to follow s! Propagation of peer influence over the network NEva,Alice(t) Alice David NCharly,Alice(t) Eva Charly γBob, Alice (t) Bob

Intensity for network evolution Ensures a link is created only once Influence of retweet intensity on the link creation Links on her own initiative Alice Bob retweets (is exposed to) Alice NBob, Alice (t) Eva Charly Bob’s risk of following Alice λBob, Alice(t) Bob

Model inference from historical data Find optimal parameters using maximum likelihood estimation (MLE): For the choice of information propagation and link intensities, the MLE problem above is parallelizable & convex.

Retweet and link coevolution Cross-covariance for two users Average cross-covariance vs model parameters The fitted model generate link and information diffusion events that coevolve similarly (in terms of cross-covariance) as real events.

Model checking https://en.wikipedia.org/wiki/Q-Q_plot Link events Retweet events The quantiles of the intensity integrals computed using the fitted intensities match the quantiles of the unit-rate exponential distribution

Information diffusion prediction Average rank Success probability The model beats the predictions given by a standard Hawkes process

Network properties & cascade patterns Can the model generate realistic macroscopic static and temporal network patterns and information cascades? Network Cascades Degree distributions Cascade size distribution Network diameter Cascade depth distribution Level of triadic closure Cascade structure

Degree distributions Poisson, α = 0 Power-law, α = 0.2 The higher the parameter α (or β), the closer the degree distribution is to a power-law

Small (shrinking) diameters Small connected components merge Diameter shrinks Our model generate networks with small shrinking (or flattening) diameter over time, as observed empirically.

Clustering coefficient We can generate networks with different levels of triadic closure, as observed empirically

Different type of networks Erdos-Renyi, β = 0 Scale-free network, β = 0.8 Our model allows us to generate networks with very different structure

Cascade patterns: size As α (or β) increases, longer cascades become more seldom.

Cascade patterns: depth As α (or β) increases, deeper cascades are more seldom, as observed in real cascade data.

Cascade patterns: structure The structure of the generated cascades becomes more realistic as α (or β) increases.