CS 599: Social Media Analysis University of Southern California1 Information Diffusion Kristina Lerman University of Southern California.

Slides:



Advertisements
Similar presentations
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Advertisements

Spread of Influence through a Social Network Adapted from :
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Strong and Weak Ties Chapter 3, from D. Easley and J. Kleinberg book.
Modeling Malware Spreading Dynamics Michele Garetto (Politecnico di Torino – Italy) Weibo Gong (University of Massachusetts – Amherst – MA) Don Towsley.
Patterns of Influence in a Recommendation Network Jure Leskovec, CMU Ajit Singh, CMU Jon Kleinberg, Cornell School of Computer Science Carnegie Mellon.
Efficient Control of Epidemics over Random Networks Marc Lelarge (INRIA-ENS) SIGMETRICS 09.
Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis Dynamic Structural Equation Models for Tracking Cascades over Social Networks Acknowledgments:
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks Taehyong Kim Computer Science and Engineering State University.
Flickr Information propagation in the Flickr social network Meeyoung Cha Max Planck Institute for Software Systems With Alan Mislove.
By: Roma Mohibullah Shahrukh Qureshi
Masoud Valafar †, Reza Rejaie †, Walter Willinger ‡ † University of Oregon ‡ AT&T Labs-Research WOSN’09 Barcelona, Spain Beyond Friendship Graphs: A Study.
1 Epidemic Spreading in Real Networks: an Eigenvalue Viewpoint Yang Wang Deepayan Chakrabarti Chenxi Wang Christos Faloutsos.
Cascading Behavior in Large Blog Graphs Patterns and a Model Leskovec et al. (SDM 2007)
INFERRING NETWORKS OF DIFFUSION AND INFLUENCE Presented by Alicia Frame Paper by Manuel Gomez-Rodriguez, Jure Leskovec, and Andreas Kraus.
Social Networks: Advertising, Pricing and All That Zvi Topol & Itai Yarom.
Information Diffusion in Social Media
Computer Science 1 Web as a graph Anna Karpovsky.
THREE ESSENTIAL FOCUSES IN MOBILE MARKETING By Eric Koeck Center website:
Universal Behavior in a Generalized Model of Contagion Peter S. Dodds Duncan J. Watts Columbia University.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
How to Expand Your School’s Online Reach using Facebook, Blogs and Twitter.
Models of Influence in Online Social Networks
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Epidemic spreading in complex networks: from populations to the Internet Maziar Nekovee, BT Research Y. Moreno, A. Paceco (U. Zaragoza) A. Vespignani (LPT-
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
1 Worm Modeling and Defense Cliff C. Zou, Don Towsley, Weibo Gong Univ. Massachusetts, Amherst.
Online Social Networks and Media Epidemics and Influence.
Web 2.0 for Businesses How You Can Use Social Media to Bring in Money & Promote Your Brand Kimberly L. Sanberg Director of Online Strategy, Ignitus presentation.
University of California at Santa Barbara Christo Wilson, Bryce Boe, Alessandra Sala, Krishna P. N. Puttaswamy, and Ben Zhao.
Authors: Xu Cheng, Haitao Li, Jiangchuan Liu School of Computing Science, Simon Fraser University, British Columbia, Canada. Speaker : 童耀民 MA1G0222.
+ Offline Optimal Ads Allocation in SNS Advertising Hui Miao, Peixin Gao.
Knowing Your Facebook From Your Flickr Dan O’ Neill – -
V5 Epidemics on networks
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network author: Meeyoung Cha Alan Mislove Krishna P. Gummadi From Saarbrucken,
School of Computer Science Carnegie Mellon University 1 The dynamics of viral marketing Jure Leskovec, Carnegie Mellon University Lada Adamic, University.
Winner-takes-all: Competing Viruses or Ideas on fair-play Networks B. Aditya Prakash, Alex Beutel, Roni Rosenfeld, Christos Faloutsos Carnegie Mellon University,
Jing (Selena) He and Hisham M. Haddad Department of Computer Science, Kennesaw State University Shouling Ji, Xiaojing Liao, and Raheem Beyah School of.
Online Social Networks and Media
Once posted, other YouTube users can post comments about the video, post their YouTube video responses, or rate the video. Videos that are uploaded to.
Online Social Networks and Media
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
Tracking Critical-Mass Outbreaks in Social Contagions (FA DEF)
Fostering Faith in a Digital Age: challenge and opportunity Terri Martinson Elton.
Individual Project by Nora-Marie Myers May 3, 2011 Social Media Communities in the Media King 5 Seattle The Huffington Post.
March 3, 2009 Network Analysis Valerie Cardenas Nicolson Assistant Adjunct Professor Department of Radiology and Biomedical Imaging.
CS 590 Term Project Epidemic model on Facebook
A Latent Social Approach to YouTube Popularity Prediction Amandianeze Nwana Prof. Salman Avestimehr Prof. Tsuhan Chen.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Steffen Staab 1WeST Web Science & Technologies University of Koblenz ▪ Landau, Germany Network Theory and Dynamic Systems Cascading.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Biao Wang 1, Ge Chen 1, Luoyi Fu 1, Li Song 1, Xinbing Wang 1, Xue Liu 2 1 Shanghai Jiao Tong University 2 McGill University
Social Media & Social Networking 101 Canadian Society of Safety Engineering (CSSE)
How Social Media Changed The World Of Event Planning By Olivia Burke.
Contagion in Networks Networked Life NETS 112 Fall 2015 Prof. Michael Kearns.
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
What Stops Social Epidemics?
Social Media Marketing: A Strategic Approach, 2e
How do I influence people on Facebook and Twitter?
Networked Life NETS 112 Fall 2018 Prof. Michael Kearns
Effective Social Network Quarantine with Minimal Isolation Costs
Networked Life NETS 112 Fall 2017 Prof. Michael Kearns
Networked Life NETS 112 Fall 2016 Prof. Michael Kearns
Diffusion in Networks Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale 1/17/2019.
SLIDE DECK 5: Informed Citizenship.
Networked Life NETS 112 Fall 2019 Prof. Michael Kearns
Presentation transcript:

CS 599: Social Media Analysis University of Southern California1 Information Diffusion Kristina Lerman University of Southern California

Information diffusion on Twitter follower graph

Diffusion on networks The spread of disease, ideas, behaviors, … on a network can be described as a contagion process where an active node (infected/informed/adopted) activates its non-active neighbors with some probability –… creates a cascade on a network How large do cascades become? What determines their growth?

Gangnam style "Gangnam Style" became the first YouTube video to reach one billion views As of May 31, 2014, the music video has been viewed over two billion times  almost 13,000 man-years!

Ebola outbreak

Studying diffusion: data Large-scale data about contagion processes is now available –YouTube views [Crane & Sornette 2008] –Flickr favorites [Cha, Mislove & Gummadi, 2009] –Twitter retweets [Ghosh & Lerman, 2011] –Facebook likes [Dow, Adamic & Frigerri, 2014] Challenges –Volume of the data Storing and processing data –Complexity How does the “whole” depend on its “parts” Networks add to complexity

Studying diffusion: methods Analytic models –Model cascading behavior, e.g., differential equation Solve model under different conditions Simulations –Implement a model to synthetically recreate the process Empirical studies –Does observations of real-world data agree with model and simulations results?

Cascading Behavior in Complex Socio-technical Networks (Borge-Holthoefer et al.) Research questions –How can global cascades occur on sparse networks? –What affects cascade growth? Network topology How node is activated by an active neighbor Properties of the diffusing item? –How can cascades be characterized? Models of diffusion on networks –Threshold model –Epidemic models –Complex contagion Empirical data allow testing of models

Threshold model (Watts 2002) Each node has some “infection threshold”  i –Node becomes infected if fraction of infected neighbors is more than threshold infected exposed 11 22 33 44 Exposure response function infection prob. number infected neighbors 1 ikiiki 

Threshold model (Watts 2002) Under some conditions, global cascades can start from a few “infected” seeds –Network topology and individual thresholds interact in cascading behavior

Epidemic models Infected nodes propagate contagion to susceptible neighbors with probability  (transmissibility or virality of contagion) infected exposed Exposure response function infection prob. number infected neighbors 1

Epidemic models Epidemic threshold  : –For  < , localized cascades (epidemic dies out) –For  > , global cascades Epidemic threshold depends on topology only: largest eigenvalue of adjacency matrix of the network –True for any network Num. infected nodes N Epidemic threshold 0 Transmissibility, 

Complex contagion Virus can propagate with a single exposure. Spread of behaviors requires multiple exposures. Non-monotonic exposure response Exposure response function infection prob. number infected neighbors 1 infected exposed

Characterizing cascades Connected tree-like subgraph. Typically star-like Size related to centrality

Seeding large outbreaks How to select seeds that will initiate large outbreaks? –Influence maximization Are some network positions better at triggering large outbreaks? –Being a hub is sufficient but not necessary “Million follower fallacy” (Cha et al) –“hub fire wall” – epidemics die out when reaching a hub

Epidemic Spreading on Real Networks: An Eigenvalue Viewpoint [Wang et al, 2003] Research questions –How do epidemic cascades on a real network? –Does an epidemic threshold exist for a given network? Contributions –Model how epidemics propagate on a network –Propagation depends on network topology epidemic threshold is related to the largest eigenvalue of the adjacency graph describing the network

Homogeneous mixing model Homogeneous mixing –Each node interacts with every other node Infection rate  : a node infects neighbor with probability  Curing rate  : infected node is cured with probability  infected exposed cured

Homogeneous mixing model Homogeneous mixing –Each node interacts with every other node Infection rate  : a node infects neighbor with probability  Curing rate  : infected node is cured with probability  infected exposed cured

Homogeneous mixing model Homogeneous mixing –Each node interacts with every other node Infection rate  : a node infects neighbor with probability  Curing rate  : infected node is cured with probability  infected exposed cured

Homogeneous mixing model Homogeneous mixing –Each node interacts with every other node Infection rate  : a node infects neighbor with probability  Curing rate  : infected node is cured with probability  infected exposed cured

Homogeneous mixing: epidemic threshold Infection rate  : node infects neighbor with probability  Curing rate  : node is cured with probability  –Number of infected nodes: N inf = (1-  /  )N –Epidemic threshold: critical value of  /  =1/ beyond which N inf  N, but below N inf  0 infected exposed cured

Epidemics on networks Homogeneous mixing model is a good approximation of virus propagation in a population where contact among individuals is homogeneous, i.e., each individual is equally likely to encounter another –Public spaces: airports, shopping centers, … –Schools –Public transportation But, social interactions are usually structured – what role does network structure play in epidemic spread? –How does the size of cascades depend on network properties?

Model of epidemic cascades on a network

Simulations on real and synthetic graphs Simulate epidemics on –Real-world networks –Scale-free graphs (power law degree distribution) –Random graphs (Poisson degree distribution) Results are the same as homogeneous mixing model Simulations steps –Start with a set of randomly chosen infected nodes –At each time step Infected node attempts to infect each neighbor (probability  ) An infected node is cured (probability  ) –Continue until number of infected nodes no longer changes

Simulation results on real-world network Simulations on 10,900 node Oregon network graph, with =5.72,  =0.14  /  =1.75  /  =0.58 Cascade size vs time

Epidemic threshold

Epidemic threshold and cascade growth  /  =0.4  /  =0.2  /  =0.13  /  =0.1  /  =0.06

Epidemic threshold and cascade size Num. infected nodes N Epidemic threshold 0 Effective Transmissibility, 

Summary A variety of models proposed to explain cascading behavior on networks –Some models explain the relationship between properties of the network and properties of cascades, e.g., epidemic threshold depends on the eigenvalue of the adjacency matrix of the graph –Some models can produce global cascades What does data say?

The Structure of Online Diffusion Networks SHARAD GOEL, Yahoo! Research DUNCAN J. WATTS, Yahoo! Research DANIEL G. GOLDSTEIN, Yahoo! Research

“A relatively small number of seeds can trigger a relatively large number of adoptions via some, usually multistep, diffusion process”

How often How much Is it worth it

Findings Most cascades small and shallow Most adoptions lie in such cascades. Rare for adoptions to result from chains of referrals

Yahoo! Kindness one month period in 2010, Yahoo!’s philanthropic arm launched a website (kindness.yahoo.com) 59,000 users adopted the campaign 7 Different Sources

Zync a plug-in for Yahoo! Messenger, an instant messaging (IM) application, that allows pairs of users to watch videos synchronously while sending instant messages to one another. 7 Different Sources

The Secretary Game Players are encouraged to share the game’s URL with at least three other people with an explanation that the game designers are seeking the world’s best players. 7 Different Sources

Twitter News Stories. 80,000 news stories posted on the Twitter during November 2011, where the original article was distributed by one of five popular news sites: The New York Times, CNN, MSNBC, Yahoo! News, and The Huffington Post. Tweeted  Adopted 7 Different Sources

Twitter Videos 540,000 YouTube videos posted on Twitter during November 2011 Tweeted  Adopted 7 Different Sources

Friend Sense third-party Facebook application that queried respondents about their political views as well as their beliefs about their friends’ political views 7 Different Sources

Yahoo! Voice paid service launched in 2004 that allows users to make voice- over-IP calls to phones through Yahoo! Messenger. 1.8 million users purchased voice credits, who are defined as adopters 7 Different Sources

Data Sources Varied Cost Nature of the network Incentive Timescale

d

“ The usual intuition regarding heavy-tailed distributions, however, is that large events, although rare, are sufficiently large to dominate certain key properties of the corresponding system.”

Authors point of view Diffusion on online social networks does not really follow epidemic models. Researchers should focus on sub-critical process.

Authors point of view What accounts for sudden popularity of some YouTube videos or products like Gmail and Facebook? Mass Media and traditional advertisement.

MapReduce parallel computation framework Tree Canonicalization Implementation Details (Time Permitting)

Questions?! Comments! Thanks For Listening