A Latent Social Approach to YouTube Popularity Prediction. Amandianeze Nwana, Prof. Salman Avestimehr, Prof. Tsuhan Chen


Statistics: Up to 60% of all videos are watched through YouTube. Source: YouTube Traffic Characterization: A View From the Edge; P. Gill, Arlitt, Li, Mahanti

Statistics: Campus View. The most popular videos globally typically account for only about 1% of the videos viewed on campus daily. The correlation coefficient between global popularity and local popularity is too low. Source: YouTube Traffic Characterization: A View From the Edge; P. Gill, Arlitt, Li, Mahanti

Typical Request Patterns. Source: UMass Amherst YouTube trace dataset (1 week). Figure annotations: “Romnesia speech” goes viral; “Conventional approach catches them too late”; “Conventional approach catches them”. Panel labels: Sports Highlights, Music Videos

Main Idea: “lol... did you see that video?” Requests are correlated in time (and space) because of some hidden social contagion process

Main Idea: Can a record of the transactions reveal information about the network structure and graph? Can the network structure and a record of the transactions predict future trends?

Traditional Caching (diagram labels): Gateway Router, YouTube Server, Local Network, Requests, Cache, Response

Predictive (Social) Caching (diagram labels): Gateway Router, Transactions, Requests, Cache, Latent Network

Goal

Challenge (diagram labels): Gateway Router, Transactions, Latent Network

Estimating the Social Network: Mathematical Epidemiology

Mathematical Epidemiology: Compartmental Models

Diffusion Model: 2 Stages
– Stage 1: Decide who gets infected by whom, independently
– Stage 2: “Decide” the time of infection (observed symptoms)
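
The two stages above can be illustrated with a small simulation. This is only a sketch under assumptions that do not appear in the slides: the influence matrix `A` (with `A[i][j]` read as the probability that user i infects user j), Pareto-distributed delays with density exponent `alpha`, and the function name `simulate_cascade` are all hypothetical. As a simplification, a node is infected by the first neighbour that succeeds in breadth-first order, not necessarily the earliest in time.

```python
import random


def simulate_cascade(A, seed_node, alpha=2.5, t_min=1.0, rng=None):
    """Simulate one cascade over a hypothetical influence matrix A.

    A[i][j] is treated as the probability that node i infects node j.
    Returns a dict mapping each infected node to its infection time.
    """
    rng = rng or random.Random()
    n = len(A)
    times = {seed_node: 0.0}
    frontier = [seed_node]
    while frontier:
        i = frontier.pop(0)
        for j in range(n):
            if j == i or j in times:
                continue
            # Stage 1: decide independently whether i infects j.
            if rng.random() < A[i][j]:
                # Stage 2: draw the infection delay from a power law.
                # paretovariate(a) has density ~ x^-(a+1), so a = alpha - 1.
                delay = t_min * rng.paretovariate(alpha - 1.0)
                times[j] = times[i] + delay
                frontier.append(j)
    return times


if __name__ == "__main__":
    # Chain 0 -> 1 -> 2 with certain transmission.
    A = [[0.0, 1.0, 0.0],
         [0.0, 0.0, 1.0],
         [0.0, 0.0, 0.0]]
    print(simulate_cascade(A, seed_node=0, rng=random.Random(0)))
```

In the caching setting, only the infection times (the request timestamps) would be observed; who infected whom stays hidden, which is what makes the network latent.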

Latent Social Network Inference (figure: network nodes annotated with observed times t=8, t=1, t=3, t=6, t=4, t=2, and t=6; t=10)

Latent Social Network Inference

Inference Steps (Maximum Likelihood Estimation)
Inference occurs in two stages:
– Stage 1: Given the transactions, fit the inter-arrival times to a power law
– Stage 2: Given the estimated power law and the transactions, find the influence matrix that maximizes the likelihood of the observed transactions
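
Stage 1 can be sketched as follows. The slides do not say which estimator was used, so this example assumes the standard closed-form maximum likelihood estimate for a continuous power law p(x) ∝ x^(-alpha) on x >= x_min, namely alpha = 1 + n / sum(ln(x_i / x_min)). The function names and the choice of `x_min` are hypothetical, and Stage 2 (estimating the influence matrix) is not shown.

```python
import math


def inter_arrival_times(timestamps):
    """Gaps between consecutive request timestamps for one video."""
    ts = sorted(timestamps)
    return [later - earlier for earlier, later in zip(ts, ts[1:])]


def fit_power_law_exponent(inter_arrivals, x_min):
    """Closed-form MLE of alpha for p(x) ~ x^(-alpha), x >= x_min.

    Samples below x_min are ignored (assumed outside the power-law regime).
    """
    tail = [x for x in inter_arrivals if x >= x_min]
    if not tail:
        raise ValueError("no inter-arrival times at or above x_min")
    log_sum = sum(math.log(x / x_min) for x in tail)
    if log_sum <= 0.0:
        raise ValueError("all samples equal x_min; exponent is undefined")
    return 1.0 + len(tail) / log_sum


if __name__ == "__main__":
    gaps = inter_arrival_times([0.0, 1.5, 2.5, 7.0, 8.2, 20.0])
    print(gaps)
    print(fit_power_law_exponent(gaps, x_min=1.0))
```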

… back to caching
– We now have the social graph over the network of users
– We need a video relevance function to assign relevance scores to videos
– Rank the videos according to relevance scores and store the top K videos in the cache
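
The ranking step above can be sketched as a top-K selection. The relevance scores here are placeholder numbers, and the names `select_cache_contents` and `hit_rate` are hypothetical; the slides do not describe the actual implementation.

```python
import heapq


def select_cache_contents(relevance, k):
    """Return the ids of the k videos with the highest relevance scores."""
    return heapq.nlargest(k, relevance, key=relevance.get)


def hit_rate(cache, requests):
    """Fraction of requests that are served from the cache."""
    if not requests:
        return 0.0
    cached = set(cache)
    hits = sum(1 for video in requests if video in cached)
    return hits / len(requests)


if __name__ == "__main__":
    scores = {"a": 0.2, "b": 0.9, "c": 0.5}
    cache = select_cache_contents(scores, k=2)
    print(cache)
    print(hit_rate(cache, ["b", "a", "c", "b"]))
```

Using `heapq.nlargest` avoids sorting the whole catalogue when K is much smaller than the number of candidate videos.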

Video Relevance: Combine the temporal score with the social score.
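
The slides do not state how the two scores are combined. One simple possibility, shown purely as an assumption, is a convex combination with a tunable `weight`; the function name `combined_relevance` and the per-video score dictionaries are hypothetical.

```python
def combined_relevance(temporal, social, weight=0.5):
    """Blend per-video temporal and social scores.

    ASSUMPTION: a convex combination; the original talk may combine the
    scores differently. Videos missing from one dict score 0.0 there.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be in [0, 1]")
    videos = set(temporal) | set(social)
    return {
        v: weight * temporal.get(v, 0.0) + (1.0 - weight) * social.get(v, 0.0)
        for v in videos
    }


if __name__ == "__main__":
    temporal = {"a": 1.0}
    social = {"a": 0.0, "b": 1.0}
    print(combined_relevance(temporal, social, weight=0.5))
```

With `weight=1.0` this reduces to a purely temporal ranking and with `weight=0.0` to a purely social one, which makes the two extremes easy to compare against the blend.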

Model Deficiencies
In reality, the requests cannot all be modeled completely by diffusion processes:
– Influence external to the network (news sites, aggregators, etc.)
– User preferences/tastes
Insufficient data leads to many isolated vertices:
– On our dataset, 60% of users are isolated vertices
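
A quick way to measure the isolated-vertex problem on an estimated influence matrix is sketched below. The matrix layout (`A[i][j]` as the influence of user i on user j), the `threshold` parameter, and the function name are assumptions made for illustration.

```python
def isolated_fraction(A, threshold=0.0):
    """Fraction of nodes with no incoming or outgoing influence above threshold."""
    n = len(A)
    if n == 0:
        return 0.0
    isolated = 0
    for i in range(n):
        has_edge = any(
            j != i and (A[i][j] > threshold or A[j][i] > threshold)
            for j in range(n)
        )
        if not has_edge:
            isolated += 1
    return isolated / n


if __name__ == "__main__":
    # Node 0 influences node 1; node 2 has no edges at all.
    A = [[0.0, 0.4, 0.0],
         [0.0, 0.0, 0.0],
         [0.0, 0.0, 0.0]]
    print(isolated_fraction(A))
```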

Results
Table 1: Percentage improvements of algorithms using all users
Comparison              % Improve
Inter-Arrival/CRF       11.6
Combined/Inter-Arrival  13.2
Fig. 1: Average hit rate for all approaches over different cache sizes
Fig. 2: Cache size comparison between purely social and baseline using all users
Few useful cascades lead to many isolated nodes

Results: Connected Users
Table 2: Percentage improvements of algorithms without isolated nodes
Comparison    % Improve
Temporal/CRF  15.6
Combined/CRF  21.1
Fig. 3: Average hit rate for all approaches over different cache sizes without isolated nodes
Fig. 4: Cache size comparison between purely social and baseline without isolated nodes

Future Directions
– Explore other epidemic models
– On-the-fly update of nodes and edges
– Graph clustering into different communities and influence groups
– User recommendations using the social graph
– Object detection and tagging via Twitter