Machine learning for Dynamic Social Network Analysis

Slides:



Advertisements
Similar presentations
Learning more about Facebook and Twitter. Introduction  What we’ve covered in the Social Media webinar series so far  Agenda for this call Facebook.
Advertisements

Influence and Passivity in Social Media Daniel M. Romero, Wojciech Galuba, Sitaram Asur, and Bernardo A. Huberman Social Computing Lab, HP Labs.
Preference Elicitation Partial-revelation VCG mechanism for Combinatorial Auctions and Eliciting Non-price Preferences in Combinatorial Auctions.
Goal Programming How do you find optimal solutions to the following?
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The FIR Adaptive Filter The LMS Adaptive Filter Stability and Convergence.
280 SYSTEM IDENTIFICATION The System Identification Problem is to estimate a model of a system based on input-output data. Basic Configuration continuous.
Dynamic Spectrum Management: Optimization, game and equilibrium Tom Luo (Yinyu Ye) December 18, WINE 2008.
Network Bandwidth Allocation (and Stability) In Three Acts.
An Adaptive Multi-Objective Scheduling Selection Framework For Continuous Query Processing Timothy M. Sutherland Bradford Pielech Yali Zhu Luping Ding.
Collaborative Filtering Matrix Factorization Approach
Diffusion in Social and Information Networks Part II W ORLD W IDE W EB 2015, F LORENCE MPI for Software SystemsGeorgia Institute of Technology Le Song.
MAKING COMPLEX DEClSlONS
Introduction to Adaptive Digital Filters Algorithms
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Generalized Semi- Markov Processes (GSMP). Summary Some Definitions The Poisson Process Properties of the Poisson Process  Interarrival times  Memoryless.
ECE 466/658: Performance Evaluation and Simulation Introduction Instructor: Christos Panayiotou.
Investment Performance Measurement, Risk Tolerance and Optimal Portfolio Choice Marek Musiela, BNP Paribas, London.
Manuel Gomez Rodriguez Bernhard Schölkopf I NFLUENCE M AXIMIZATION IN C ONTINUOUS T IME D IFFUSION N ETWORKS , ICML ‘12.
Unsupervised Streaming Feature Selection in Social Media
Control of Dynamic Discrete-Event Systems Lenko Grigorov Master’s Thesis, QU supervisor: Dr. Karen Rudie.
Machine learning for Dynamic Social Network Analysis
Manuel Gomez Rodriguez
Learning Deep Generative Models by Ruslan Salakhutdinov
Modeling and Simulation Dr. Mohammad Kilani
Part 7 - Chapter 25.
Machine learning for Dynamic Social Network Analysis
Variational filtering in generated coordinates of motion
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
OVERVIEW Impact of Modelling and simulation in Mechatronics system
Making complex decisions
Manuel Gomez Rodriguez
DM-Group Meeting Liangzhe Chen, Nov
Center for Complexity in Business, R. Smith School of Business
Intelligent Information System Lab
Lecture 4 Exposition of the Lagrange Method
Mathematical Descriptions of Systems
Analysis of Link Reversal Routing Algorithms
Morgan Bruns1, Chris Paredis1, and Scott Ferson2
"Playing Atari with deep reinforcement learning."
Queueing Theory Carey Williamson Department of Computer Science
The Importance of Integration
Announcements Homework 3 due today (grace period through Friday)
Algorithm An algorithm is a finite set of steps required to solve a problem. An algorithm must have following properties: Input: An algorithm must have.
Collaborative Filtering Matrix Factorization Approach
Throughput-Optimal Broadcast in Dynamic Wireless Networks
Master Thesis Presentation
Part 7 - Chapter 25.
Summarizing Data by Statistics
Emre O. Neftci  iScience  Volume 5, Pages (July 2018) DOI: /j.isci
13. Acting under Uncertainty Wolfram Burgard and Bernhard Nebel
Optimal Control and Reachability with Competing Inputs
Goal Programming How do you find optimal solutions to the following?
Social Media Day of Thanks
Javad Ghaderi, Tianxiong Ji and R. Srikant
Linköping University, IDA, ESLAB
CMSC 471 – Fall 2011 Class #25 – Tuesday, November 29
Maximizing Your Social Media
Dr. Arslan Ornek DETERMINISTIC OPTIMIZATION MODELS
Chapter 4 . Trajectory planning and Inverse kinematics
Human-centered Machine Learning
Human-centered Machine Learning
Human-centered Machine Learning
Human-centered Machine Learning
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Area Coverage Problem Optimization by (local) Search
Human-centered Machine Learning
Markov Decision Processes
Markov Decision Processes
Reinforcement Learning
Human-centered Machine Learning
Presentation transcript:

Machine learning for Dynamic Social Network Analysis Applications: Control Manuel Gomez Rodriguez Max Planck Institute for Software Systems IJCAI Tutorial, August 2017

Outline of the Seminar Next Representation: Temporal Point processes 1. Intensity function 2. Basic building blocks 3. Superposition 4. Marks and SDEs with jumps Applications: Models 1. Information propagation 2. Information reliability 3. Knowledge acquisition Applications: Control Next 1. Activity shaping 2. When-to-post Slides/references: learning.mpi-sws.org/ijcai-2017-tutorial

Applications: Control 1. Activity shaping 2. When-to-post Outline of tutorial

Can we steer users’ activity in a social network in general? Activity shaping Can we steer users’ activity in a social network in general? Why this goal?

Activity shaping vs influence maximization Related to Influence Maximization Problem Activity shaping is a generalization of influence maximization One time the same piece of information It is only about maximizing adoption Influence Maximization Fixed incentive Activity Shaping Variable incentive Multiple times multiple pieces, recurrent! Many different activity shaping tasks

[Farajtabar et al., NIPS 2014] Event representation We represent messages using nonterminating temporal point processes: N1(t) Recurrent event: N2(t) N3(t) User Time N4(t) N5(t) [Farajtabar et al., NIPS 2014]

Messages on her own initiative Influence from user ui on user u Events intensity N1(t) N2(t) N3(t) N4(t) N5(t) Exogenous activity Memory Hawkes process User’s intensity Messages on her own initiative Influence from user ui on user u 7 [Farajtabar et al., NIPS 2014]

a few users to produce a given level of overall users’ activity Activity shaping… how? Incentivize a few users to produce a given level of overall users’ activity Exogenous activity Endogenous activity

Activity shaping… what is it? Activity Shaping: Find exogenous activity that results in a desired average overall activity at a given time: Average with respect to the history of events up to t! [Farajtabar et al., NIPS 2014]

Exogenous intensity & average overall intensity How do they relate? Convolution Surprisingly… linearly: matrix that depends on and non negative kernel influence matrix [Farajtabar et al., NIPS 2014]

Exact Relation If the memory g(t) is exponential: Matrix exponentials Corollary exogenous intensity is constant [Farajtabar et al., NIPS 2014]

Does it really work in practice? [Farajtabar et al., NIPS 2014]

Activity shaping optimization framework Once we know that we can find to satisfy many different goals: Activity Shaping Problem We can solve this problem efficiently for a large family of utilities! Utility (Goal) Budget Cost for incentivizing [Farajtabar et al., NIPS 2014]

Capped activity maximization (CAM) If our goal is maximizing the overall number of events across a social network: Max feasible activity per user [Farajtabar et al., NIPS 2014]

Minimax activity shaping (MMASH) If our goal is make the user with the minimum activity as active as possible: [Farajtabar et al., NIPS 2014]

Least-squares activity shaping (LSASH) If our goal is to achieve a pre-specified level of activity for each user or group of users: [Farajtabar et al., NIPS 2014]

Capped activity maximization: results +34,000 more events per month than best heuristic for 2,000 Twitter users +10% more events than best heuristic [Farajtabar et al., NIPS 2014]

Applications: Control 1. Activity shaping 2. When-to-post Outline of tutorial

Social media as a broadcasting platform Everybody can build, reach and broadcast information to their own audience Broadcasted content Audience reaction

Attention is scarce Social media users follow many broadcasters Twitter feed Instagram feed Older posts Older posts

What are the best times to post? Can we design an algorithm that tell us when to post to achieve high visibility?

Representation of broadcasters and feeds Broadcasters’ posts as a counting process N(t) Users’ feeds as sum of counting processes M(t) N1(t) M1(t) t M(t) = AT N(t) N2(t) t … Mn(t) t … Nn(t) t

Broadcasting and feeds intensities M(t) t t Broadcaster intensity function (tweets / hour) Feed intensity function (tweets / hour) Given a broadcaster i and her followers Feed due to other broadcasters

Definition of visibility function Visibility of broadcaster i at follower j Position of the highest ranked tweet by broadcaster i in follower j’s wall M(t) rij(t) = 0 rij(t’) = 4 rij(t’’) = 0 In general, the visibility depends on the feed ranking mechanism! t Feed ranking …. . Older tweets Ranked stories Post by broadcaster u Post by other broadcasters

Optimal control of temporal point processes Formulate the when-to-post problem as a novel stochastic optimal control problem (of independent interest) Visibility and feed dynamics Optimizing visibility Experiments System of stochastic equations with jumps Optimal control of jumps Twitter

Visibility dynamics in a FIFO feed (I) New tweets Reverse chronological order M(t) Older tweets Rank at t+dt Other broadcasters post a story and broadcaster i does not post Broadcaster i posts a story and other broadcasters do not post Nobody posts a story rij(t)=2 rij(t+dt) = 3 rij(t)=2 rij(t+dt) =0 rij(t)=2 rij(t+dt)=2 Follower’s wall … … … … … … [Zarezade et al., WSDM 2017]

Visibility dynamics in a FIFO feed (II) Zero-one law Stochastic differential equation (SDE) with jumps Broadcaster i posts a story Other broadcasters posts a story Our Goal: Optimize rij(t) over time, so that it is small, by controlling dNi(t) through the intensity μi(t) [Zarezade et al., WSDM 2017]

Feed dynamics We consider a general intensity: Deterministic arbitrary intensity Stochastic self-excitation (e.g. Hawkes, inhomogeneous Poisson) Jump stochastic differential equation (SDE) [Zarezade et al., WSDM 2017]

The when-to-post problem … Terminal penalty Nondecreasing loss Optimization problem Dynamics defined by Jump SDEs [Zarezade et al., WSDM 2017]

Bellman’s Principle of Optimality Lemma. The optimal cost-to-go satisfies Bellman’s Principle of Optimality t Hamilton-Jacobi-Bellman (HJB) equation Partial differential equation in J (with respect to r, λ and t) [Zarezade et al., WSDM 2017]

Solving the HJB equation Consider a quadratic loss Favors some periods of times (e.g., times in which the follower is online) Trade-offs visibility and number of broadcasted posts We propose and then show that the optimal intensity is: It only depends on the current visibility! [Zarezade et al., WSDM 2017]

The RedQueen algorithm Consider s(t) = s u*(t) = (s/q)1/2 r(t) How do we sample the next time? r(t) Superposition principle t1 t2 t3 t4 t Δi exp( (s/q)1/2 ) t1 + Δ1 t2 + Δ2 t3 + Δ3 t4 + Δ4 mini ti + Δi It only requires sampling M(tf) times! [Zarezade et al., WSDM 2017]

The RedQueen algorithm RedQueen can be implemented in a few lines of code! [Zarezade et al., WSDM 2017]

When-to-post for multiple followers Consider n followers and a quadratic loss: We can easily adapt the efficient sampling algorithm to multiple followers! Favors some periods of times (e.g., times in which the follower is online) Trade-offs visibility and number of broadcasted posts Then, we can show that the optimal intensity is: It only depends on the current visibilities! [Zarezade et al., WSDM 2017]

Novelty in the problem formulation The problem formulation is unique in two key technical aspects: I. The control signal is a conditional intensity Previous work: time-varying real vector II. The jumps are doubly stochastic Previous work: memory-less jumps [Zarezade et al., WSDM 2017]

Case study: one broadcaster Significance: followers’ retweets per weekday Average position over time Broadcaster’s posts RedQueen True posts 40% lower! [Zarezade et al., WSDM 2017]

Post by other broadcasters Evaluation metrics Position over time Time at the top Post by broadcaster Post by other broadcasters r(t1) = 0 r(t2) = 1 r(t3) = 0 r(t4) = 1 r(t5) = 2 r(t6) = 0 Follower’s wall … … … … … … Position over time = 0x(t2 – t1) + 1x(t3 – t2) + 0x(t4 – t3) + 1x(t5 – t4) + 2x(t6 – t5) Time at the top = (t2 – t1) + 0 + (t4 – t3) + 0 + 0 [Zarezade et al., WSDM 2017]

broadcasters’ true posts Position over time broadcasters’ true posts average across users Better It achieves (i) 0.28x lower average position, in average, than the broadcasters’ true posts and (ii) lower average position for 100% of the users. [Zarezade et al., WSDM 2017]

broadcasters’ true posts Time at the top average across users Better broadcasters’ true posts It achieves (i) 3.5x higher time at the top, in average, than the broadcasters’ true posts and (ii) higher time at the top for 99.1% of the users. [Zarezade et al., WSDM 2017]

Representation: Temporal Point processes 1. Intensity function 2. Basic building blocks 3. Superposition 4. Marks and SDEs with jumps Applications: Models 1. Information propagation 2. Information reliability 3. Knowledge acquisition Applications: Control 1. Activity shaping 2. When-to-post Slides/references: learning.mpi-sws.org/ijcai-2017-tutorial