Viral Video Style: A Closer Look at Viral Videos on YouTube Lu Jiang, Yajie Miao, Yi Yang, Zhenzhong Lan, Alexander G. Hauptmann School of Computer Science,

Slides:



Advertisements
Similar presentations
LECTURE 1: COURSE INTRODUCTION Xiaowei Yang. Roadmap Why should you take the course? Who should take this course? Course organization Course work Grading.
Advertisements

What is Primary Research and How do I get Started?
Empirical Evaluation of Defect Projection Models for Widely-deployed Production Software Systems FSE 2004 Paul Li, Mary Shaw, Jim Herbsleb Institute for.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Language and Computation Group 18 th November 2011.
Part One Introduction.
© newsfutures 1 Prediction Markets: Collective Work Prediction Markets Collective Work Emile Servan-Schreiber, PhD NewsFutures.
Vote Calibration in Community Question-Answering Systems Bee-Chung Chen (LinkedIn), Anirban Dasgupta (Yahoo! Labs), Xuanhui Wang (Facebook), Jie Yang (Google)
Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.
WindMine: Fast and Effective Mining of Web-click Sequences SDM 2011Y. Sakurai et al.1 Yasushi Sakurai (NTT) Lei Li (Carnegie Mellon Univ.) Yasuko Matsubara.
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
Student: Hsu-Yung Cheng Advisor: Jenq-Neng Hwang, Professor
7 Reasons to Use Magazines. Advertising in magazines is still one of the most effective ways of building brands at the right time. They engage millions.
Work Session on the Communication of Statistics Geneva, June 2014 Maria Jesus Vinuesa INE-Spain Statistical Pills.
First of all….. it’s never too late to start. “blogger” is not “viral” is not “avatar” is not.
April 11, 2008 Data Mining Competition 2008 The 4 th Annual Business Intelligence Symposium Hualin Wang Manager of Advanced.
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
More popular than Arab newspapers.  Do you use social networking sites like Facebook?  Which is the best social networking site?  What do you use these.
Suspended Accounts in Retrospect: An Analysis of Twitter Spam Kurt Thomas, Chris Grier, Vern Paxson, Dawn Song University of California, Berkeley International.
Carnegie Mellon School of Computer Science Modeling Website Popularity Competition in the Attention-Activity Marketplace Bruno Ribeiro Christos Faloutsos.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Poking Facebook: Characterization of OSN Applications Minas Gjoka, Michael Sirivianos, Athina Markopoulou, Xiaowei Yang University of California, Irvine.
The Website Shootout A presentation prepared by
Where will I collect this information from?  Is Social Networking Safe?  Do you use a social networking site like Facebook or MySpace? Have you ever.
The Marketing Research Project. Purposes of the Project 1.Give you practical experience at conducting a marketing research project. 2.Examine some factors.
The Literature Search and Background of the Problem.
Keller and Ozment (1999)  Problems of driver turnover  Costs $3,000 to $12,000 per driver  Shipper effect  SCM impact  Tested solutions  Pay raise.
Page 1 Online Aggregation for Large MapReduce Jobs Niketan Pansare, Vinayak Borkar, Chris Jermaine, Tyson Condie VLDB 2011 IDS Fall Seminar
School of Computer Science Carnegie Mellon University 1 The dynamics of viral marketing Jure Leskovec, Carnegie Mellon University Lada Adamic, University.
Keys to Successful Marketing  Must understand and meet customer needs and wants  To meet customer needs, marketers must collect information.
Chapter 10 Analyzing Content: Historical, Secondary, and Content Analysis, and Crime Mapping.
YZUCSE SYSLAB A Study of Web Search Engine Bias and its Assessment Ing-Xiang Chen and Cheng-Zen Yang Dept. of Computer Science and Engineering Yuan Ze.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
CYBER CRIMES PREVENTIONS AND PROTECTIONS Presenters: Masroor Manzoor Chandio Hira Farooq Qureshi Submitted to SIR ABDUL MALIK ABBASI SINDH MADRESA TUL.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
By Diana Imankulova 3 rd year Advertising and PR.
Factors Affecting Youth Awareness of Anti-Tobacco Media Messages Komal Kochhar, M.B.B.S., M.H.A. Terrell W. Zollinger, Dr.P.H. Robert M. Saywell, Jr.,
Analysing Clickstream Data: From Anomaly Detection to Visitor Profiling Peter I. Hofgesang Wojtek Kowalczyk ECML/PKDD Discovery.
2 Shocking Stats and Amazing Statistics 3 Americans spend more time on social media than any other major Internet activity, including .
ACM 4063 Communication Research
School of Computer Science 1 Information Extraction with HMM Structures Learned by Stochastic Optimization Dayne Freitag and Andrew McCallum Presented.
Jointly Modeling Topics, Events and User Interests on Twitter Qiming DiaoJing Jiang School of Information Systems Singapore Management University.
RADIOSONDE TEMPERATURE BIAS ESTIMATION USING A VARIATIONAL APPROACH Marco Milan Vienna 19/04/2012.
Understanding and Predicting Interestingness of Videos Yu-Gang Jiang, Yanran Wang, Rui Feng, Hanfang Yang, Yingbin Zheng, Xiangyang Xue School of Computer.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
Chapter 17 Writing the Research Report. Public Disclosure of Results Culmination of the research process Options for disclosure –Journal article –Thesis.
Scope of the Journal The International Journal of Sports Medicine (IJSM) provides a forum for the publication of papers dealing with basic or applied information.
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
YouTube Starring: YOU. Background YouTube (2005) is a video-sharing website on which users can upload, share, and view videos. Used Adobe Flash video.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
YouTube was founded on February 14, 2005 by three PayPal employees; Chad Hurley, Steve Chen, and Jawed Karim. YouTube was originally supposed.
2005/09/13 A Probabilistic Model for Retrospective News Event Detection Zhiwei Li, Bin Wang*, Mingjing Li, Wei-Ying Ma University of Science and Technology.
Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida Universidade Federal de Minas Gerais Belo Horizonte, Brazil ACSAC 2010 Fabricio.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
Get Inspired: MARKETING WITH VIDEO ACROSS THE CUSTOMER LIFECYCLE.
© 2014 Networking for Information Communications and Energy Lab. How do I viralize a YouTube video and tip a Groupon deal Prof. Hongseok Kim Sogang University,
Marketing Research.
Delving Deep into Personal Photo and Video Search
Computational Campaign Coverage with PollyVote.com
Chapter 1 Introduction.
The Literature Search and Background of the Problem
Radio Analytics Advertising Analytics Dashboard Presentation
Presidential Primaries Profile
How to Write a Research Proposal?
Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu
What Makes Conversations Interesting
Topic: Media.
A Network Science Approach to Fake News Detection on Social Media
Presentation transcript:

Viral Video Style: A Closer Look at Viral Videos on YouTube Lu Jiang, Yajie Miao, Yi Yang, Zhenzhong Lan, Alexander G. Hauptmann School of Computer Science, Carnegie Mellon University

Outline  Introduction  CMU Viral Video Dataset  Statistical Characteristics  Peak Day Prediction  Conclusions

Outline  Introduction  CMU Viral Video Dataset  Statistical Characteristics  Peak Day Prediction  Conclusions

What is a viral video? A viral video is a video that becomes popular through the process of (most often) Internet sharing through social media. Gangnam Style

What is a viral video? A viral video is a video that becomes popular through the process of (most often) Internet sharing through social media. Gangnam Style Charlie bit my finger Missing pilot MH370

Social Validity Viral videos have been having a profound impact on many aspects of society. Politics: – Pro-Obama video “Yes we can” went viral (10 million views) in 2008 US presidential election [Broxton 2013]. – Obama Style and Mitt Romney Style went viral (30 million views in the month of Election Day), and peaked on Election Day.

Social Validity Viral videos have been having a profound impact on many aspects of society. Politics: – Pro-Obama video “Yes we can” went viral (10 million views) in 2008 US presidential election [Broxton 2013]. – We found that Obama Style and Mitt Romney Style went viral (30 million views in the month of Election Day), and peaked on Election Day.

Social Validity

Social Validity cont. Viral videos have been having a profound impact on many aspects of society. Financial marketing: – Old Spice’s campaign went viral and improved the brand’s popularity among young customers[West et al 2011]. – Psy’s commercial deals has amounted to 4.6 million dollars from Gangnam Style.

Social Validity cont. Viral videos have been having a profound impact on many aspects of society. Financial marketing: – Old Spice’s campaign went viral and improved the brand’s popularity among young customers[West et al 2011]. – Psy’s commercial deals has amounted to 4.6 million dollars from Gangnam Style.

Motivation Existing studies are conducted on: – Small set (tens of videos)  biased observations? – Large-scale Google set  confidential! A relatively large and public dataset on viral videos would be conducive. Solution: CMU Viral Video Dataset. Beware the content are hilarious!

Outline  Introduction  CMU Viral Video Dataset  Statistical Characteristics  Peak Day Prediction  Conclusions

CMU Viral Video Dataset By far the largest public viral video dataset. Time’s list was the largest dataset (50 videos). Videos are manually selected by experts – Editors from Time Magazine, YouTube and the viral video review episodes. Statistics: 446 viral videos, 294 quality 19,260 background videos. 10+ million subscribers!

CMU Viral Video Dataset Cont. For a video, it includes: – Thumbnail – Video and user metadata – Insight data: historical views, likes, dislikes, etc. – Social data: #Inlinks, daily tweeter metions (pending) – Near duplicate videos (automatic detection + manual inspection).

CMU Viral Video Dataset Cont. For a video, it includes: – Thumbnail – Video and user metadata – Insight data: historical views, likes, dislikes, etc. – Social data: #Inlinks, daily tweeter mentions (pending) – Near duplicate videos (automatic detection + manual inspection).

CMU Viral Video Dataset Cont. For a video, it includes: – Thumbnail – Video and user metadata – Insight data: historical views, likes, dislikes, etc. – Social data: #Inlinks, daily tweeter mentions (pending) – Near duplicate videos (automatic detection + manual inspection).

CMU Viral Video Dataset Cont. For a video, it includes: – Thumbnail – Video and user metadata – Insight data: historical views, likes, dislikes, etc. – Social data: #Inlinks, daily tweeter mentions (pending) – Near duplicate videos (automatic detection + manual inspection).

Outline  Introduction  CMU Viral Video Dataset  Statistical Characteristics  Peak Day Prediction  Conclusions

Statistical Characteristics Observations agree with existing studies including the study on Google’s dataset – Short title, Short duration. Less biased.

Observation I The days-to-peak and the lifespan of viral videos decrease over time. Days-to-peak Lifespan

Observation II The popularity of the uploader is a more substantial factor that affects the popularity than the upload time. Upload time is believed to be the most important factor in background videos[Borghol et al. 2012], coined as First-mover advantage. An example: official music videos

Outline  Introduction  CMU Viral Video Dataset  Statistical Characteristics  Peak Day Prediction  Conclusions

Peak Day Prediction Forecast when a video can get its peak view based on its historical view pattern. ???

Peak Day Prediction Forecast when a video can get its peak view based on its historical daily view pattern. ???

Peak Day Prediction Forecast the date a video can get its peak view based on its daily view pattern. ???

Peak Day Prediction Forecast when a video can get its peak view based on its historical daily view pattern. True peak date

Peak Day Prediction Forecast the date a video can get its peak view based on its daily view pattern. This application is significant in supporting and driving the design of various services: – Advertising agencies: determine timing and estimate cost – YouTube: recommendation – Companies/Politicians: respond viral campaigns

HMM Model Model daily views using HMM model: Two types of states: – hibernating  less views – active  more views h

HMM Model Model daily views using HMM model: Two types of states: – hibernating  less views – active  more views h h

HMM Model Model daily views using HMM model: Two types of states: – hibernating  less views – active  more views h h a1a1

HMM Model Model daily views using HMM model: Two types of states: – hibernating  less views – active  more views h h a2a2 a1a1

HMM Model Model daily views using HMM model: Two types of states: – hibernating  less views – active  more views h h a2a2 a1a1 a3a3 h h

HMM Model Model daily views using HMM model: Two types of states: – hibernating  less views – active  more views Novel modifications: Incorporate metadata in the prediction. Other work only use the pure view count [Pinto et al. 2013]. Smooth transition probability by a Gaussian prior.

Experimental Results Considering metadata in peak day prediction is instrumental.

Result cont. Early warning system for viral videos. Detect viral videos and forecast their peak dates.

Outline  Introduction  CMU Viral Video Dataset  Statistical Characteristics  Peak Day Prediction  Conclusions

Conclusions A few messages to take away from this talk: – CMU Viral Video Dataset is by far the largest open dataset for viral videos study. – This paper discovers several interesting characteristics about viral videos. – This paper proposes a novel method to forecast the peak day for viral videos. The preliminary results look promising.

References T. West. Going viral: Factors that lead videos to become internet phenomena. Elon Journal of Undergraduate Research, pages 76–84, T. Broxton, Y. Interian, J. Vaver, and M. Wattenhofer. Catching a viral video. Journal of Intelligent Information Systems, 40(2):241–259, Y. Borghol, S. Ardon, N. Carlsson, D. Eager, and A. Mahanti. The untold story of the clones: content-agnostic factors that impact youtube video popularity. In SIGKDD, pages 1186–1194, H. Pinto, J. M. Almeida, and M. A. Gon¸calves. Using early view patterns to predict the popularity of youtube videos. In WSDM, pages 365–374, 2013.