Popularity Prediction of Facebook Videos for Higher Quality Streaming

Slides:



Advertisements
Similar presentations
A Survey of Web Cache Replacement Strategies Stefan Podlipnig, Laszlo Boszormenyl University Klagenfurt ACM Computing Surveys, December 2003 Presenter:
Advertisements

Overcoming Limitations of Sampling for Agrregation Queries Surajit ChaudhuriMicrosoft Research Gautam DasMicrosoft Research Mayur DatarStanford University.
Thread Criticality Predictors for Dynamic Performance, Power, and Resource Management in Chip Multiprocessors Abhishek Bhattacharjee Margaret Martonosi.
What should you Cache? A Global Analysis on YouTube Related Video Caching Dilip Kumar Krishnappa, Michael Zink and Carsten Griwodz NOSSDAV 2013.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Multimedia Streaming in Dynamic Peer-to-Peer Systems and Mobile Wireless.
Analysis of Using Broadcast and Proxy for Streaming Layered Encoded Videos Wilson, Wing-Fai Poon and Kwok-Tung Lo.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Proxy Cache Management for Fine-Grained Scalable Video Streaming Jiangchuan Liu, Xiaowen Chu, and Jianliang Xu INFOCOM 2004.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Performance Evaluation
Video Staging: A Proxy-Server-Based Approach to End-to-End Video Delivery over Wide-Area Networks Zhi-Li Zhang, Yuewei Wang, David H. C. Du, Dongli Su.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
RIPQ: Advanced Photo Caching on Flash for Facebook Linpeng Tang (Princeton) Qi Huang (Cornell & Facebook) Wyatt Lloyd (USC & Facebook) Sanjeev Kumar (Facebook)
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 34 – Media Server (Part 3) Klara Nahrstedt Spring 2012.
Niranjan Balasubramanian Aruna Balasubramanian Arun Venkataramani University of Massachusetts Amherst Energy Consumption in Mobile Phones: A Measurement.
By Huang et al., SOSP 2013 An Analysis of Facebook Photo Caching Presented by Phuong Nguyen Some animations and figures are borrowed from the original.
Walter Hop Web-shop Order Prediction Using Machine Learning Master’s Thesis Computational Economics.
Qi Alfred Chen University of Michigan
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Alex Shye, Berkin Ozisikyilmaz, Arindam Mallik, Gokhan Memik, Peter A. Dinda, Robert P. Dick, and Alok N. Choudhary Northwestern University, EECS International.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Wei Gao1 and Qinghua Li2 1The University of Tennessee, Knoxville
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
Web Prefetching Between Low-Bandwidth Clients and Proxies : Potential and Performance Li Fan, Pei Cao and Wei Lin Quinn Jacobson (University of Wisconsin-Madsion)
Distributing Layered Encoded Video through Caches Authors: Jussi Kangasharju Felix HartantoMartin Reisslein Keith W. Ross Proceedings of IEEE Infocom 2001,
Uncovering the Multicore Processor Bottlenecks Server Design Summit Shay Gal-On Director of Technology, EEMBC.
« Performance of Compressed Inverted List Caching in Search Engines » Proceedings of the International World Wide Web Conference Commitee, Beijing 2008)
Multiple Instance Real Boosting with Aggregation Functions Hossein Hajimirsadeghi and Greg Mori School of Computing Science Simon Fraser University International.
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
Evaluating Content Management Techniques for Web Proxy Caches Martin Arlitt, Ludmila Cherkasova, John Dilley, Rich Friedrich and Tai Jin Proceeding on.
Serving Photos at Scaaale: Caching and Storage An Analysis of Facebook Photo Caching. Huang et al. Finding a Needle in a Haystack. Beaver et al. Vlad Niculae.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Large-Scale Monitoring of DHT Traffic Ghulam Memon – University of Oregon Reza Rejaie – University of Oregon Yang Guo – Corporate Research, Thomson Daniel.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
If you have a transaction processing system, John Meisenbacher
A Practical Performance Analysis of Stream Reuse Techniques in Peer-to-Peer VoD Systems Leonardo B. Pinho and Claudio L. Amorim Parallel Computing Laboratory.
Introduction to Machine Learning, its potential usage in network area,
SketchVisor: Robust Network Measurement for Software Packet Processing
Information Retrieval in Practice
Ensemble Classifiers.
Data Transformation: Normalization
Introduction to Load Balancing:
Chilimbi, et al. (2014) Microsoft Research
Pytheas: Enabling Data-Driven Quality of Experience Optimization Using Group-Based Exploration-Exploitation Junchen Jiang (CMU) Shijie Sun (Tsinghua Univ.)
Edinburgh Napier University
A Study of Group-Tree Matching in Large Scale Group Communications
Steve Ko Computer Sciences and Engineering University at Buffalo
CFA: A Practical Prediction System for Video Quality Optimization
PA an Coordinated Memory Caching for Parallel Jobs
An Analysis of Facebook photo Caching
Personalizing Search on Shared Devices
COS 518: Advanced Computer Systems Lecture 9 Michael Freedman
Be Fast, Cheap and in Control
Steve Ko Computer Sciences and Engineering University at Buffalo
08/03/14 Energy Consumption in Mobile Phones: A Measurement Study and Implications for Network Applications REF:Balasubramanian, Niranjan, Aruna Balasubramanian,
Degree-aware Hybrid Graph Traversal on FPGA-HMC Platform
Design of Hierarchical Classifiers for Efficient and Accurate Pattern Classification M N S S K Pavan Kumar Advisor : Dr. C. V. Jawahar.
Christophe Dubach, Timothy M. Jones and Michael F.P. O’Boyle
Zipf-Distributions & Caching
Department of Electrical Engineering
LHD: Improving Cache Hit Rate By Maximizing Hit Density
Chavit Denninnart, Mohsen Amini Salehi and Xiangbo Li
Model Combination.
CherryPick: Adaptively Unearthing the Best
Challenges with developing a Commercial P2P System
Zeqi Lai, Y. Charlie Hu, Yong Cui, Linhui Sun, Ningwei Dai
Physics-guided machine learning for milling stability:
Human-centered Machine Learning
Scheduling of Regular Tasks in Linux
Presentation transcript:

Popularity Prediction of Facebook Videos for Higher Quality Streaming Linpeng Tang Qi Huang , Amit Puntambekar Ymir Vigfusson , Wyatt Lloyd , Kai Li ♭ † ∗ ‡ ∗ † ‡ ♭

Videos are Central to Facebook 8 billion views per day 9-year old singing on America’s Got Talent 44M views Black bear roaming in Princeton 3.8K views Small shop making frozen yogurt 122 views

Workflow of Videos on Facebook Streaming Video Engine Original Encoded Upload Backend Storage CDN ABR Streaming ABR streams the best quality version of the video that fits! Intensive processing needed to create multiple video versions for ABR streaming

Better Video Streaming from More Processing Better compression at the same quality QuickFire: 20% size reduction using 20X computation More users can view the high quality versions Alice Bob

Better Video Streaming from More Processing Better compression at the same quality QuickFire: 20% size reduction using 20X computation More users can view the high quality versions Alice Bob Better Compression

How to apply QuickFire for FB videos Infeasible to encode all videos with QuickFire Increase by 20X the already large processing fleet High skew in popularity Reap most benefit with modest processing?

Opportunity: High Skew in Popularity Access logs of 1 million videos randomly sampled by ID Watch time: total time users spent watching a video

Opportunity: High Skew in Popularity We can serve most watch time even with a small fraction of videos encoded with QuickFire Can we predict these videos for more processing? 80%+ watch time

CHESS Video Prediction System Popularity prediction is important for higher quality streaming Direct encoding on videos with the largest benefit Goal of CHESS video prediction system Identify videos with highest future watch time Maximize watch-time ratio with budgeted processing

CHESS Video Prediction System Streaming Video Engine Predicted Popular Videos CHESS-VPS Backend Storage CDN Social signals Facebook Graph Serving System Access logs

CHESS Video Prediction System Streaming Video Engine Predicted Popular Videos Original QuickFire Encoded CHESS-VPS Backend Storage CDN Social signals Facebook Graph Serving System Access logs

CHESS Video Prediction System Streaming Video Engine Predicted Popular Videos Original QuickFire Encoded CHESS-VPS Backend Storage CDN Social signals Facebook Graph Serving System Access logs Serving QuickFire-encoded versions!

Requirements of CHESS-VPS Handle working set of ~80 million videos Generate new predictions every few minutes Requires a new prediction algorithm: CHESS!

CHESS Key Insights Efficiently model influence of past accesses as the basis for scalable prediction Combine multiple predictors to boost accuracy

Efficiently model past access influence Self exciting process A past access makes future accesses more probable, i.e. provides some influence on future popularity Influence of past accesses

Efficiently model past access influence Self exciting process A past access makes future accesses more probable, i.e. provides some influence on future popularity Prediction: sum up total future influence of all past accesses now Total future influence

Efficiently model past access influence Influence modeled with kernel function Power-law kernel used by prior works Provides high accuracy Scan all past accesses, O(N) time/space not scalable

Efficiently model past access influence Influence modeled with kernel function Power-law kernel used by prior works Key insight: use exponential kernel for scalability

Efficiently model past access influence Self exciting process with the exponential kernel Current Access Watch-time + Exponential Decay x Previous Prediction

Efficiently model past access influence Single exponential kernel is less accurate than power-law kernel 10% lower watch time ratio O(1) space/time to maintain Single exponential kernel is less accurate yet scalable

Combining Efficient Features in a Model Key insight: maintain multiple exponential kernels O(1) space/time Exponential kernels Actual access pattern Modeled by Combining multiple exponential kernels is as accurate as a power-law kernel

Combining Efficient Features in a Model Raw features Past access watch-time Multiple Kernels Future Popularity Neural Network likes comments shares owner likes video age Directly-used Features Social signals further boosts accuracy

CHESS Video Prediction System Worker1 Worker2 Worker3 Worker4 Prediction workers Aggr Aggregated top videos Shard1 Shard2 Shard3 Shard4 Access logs Streaming Model NN Models Client

Evaluation What is the accuracy of CHESS? How do our design decisions on CHESS affect its accuracy and resource consumption? What is CHESS’s impact on video processing and watch time ratio of QuickFire?

Evaluation What is the accuracy of CHESS? How do our design decisions on CHESS affect its accuracy and resource consumption? What is CHESS’s impact on video processing and watch time ratio of QuickFire?

Metrics Watch time ratio Processing time Ratio of watch time from better encoded videos Directly proportional to benefits of better encoding Processing time

Metrics Watch time ratio Ratio of watch time from better encoded videos Directly proportional to benefits of better encoding Processing time (infeasible to encode all videos) Video length processing time Video length ratio ≈ computation overhead

CHESS is Accurate Vary video length ratio (proxy for processing overhead) Observe watch time ratio of better encoded videos

CHESS is Accurate Initial(1d): initial watch time up to 1 day after upload

CHESS is Accurate Initial(1d): initial watch time up to 1 day after upload SESIMIC: handcrafted power-law kernel

CHESS is Accurate Initial(1d): initial watch time up to 1 day after upload SESIMIC: handcrafted power-law kernel CHESS provides higher accuracy than even the non-scalable state of the art

CHESS Reduces Encoding Processing Predict on whole Facebook video workload in real-time Sample 0.5% videos for actual encoding CHESS reduces CPU by 3x (54% to 17%) for 80% watch time ratio

Related Work Popularity Prediction Video QoE Optimization Caching Hawkes'71, Crane'08, Szabo'10, Cheng'14, SEISMIC'15 CHESS is scalable and accurate Video QoE Optimization Liu'12, Aaron'15, Huang'15, Jiang'16, QuickFire'16 Optimize encoding with access feedback Caching LFU‘93, LRU’94, SLRU‘94, GDS’97, GDSF‘98, MQ’01 Identify hot items to improve efficiency

Conclusion Popularity prediction can direct encoding for higher quality streaming CHESS: first scalable and accurate popularity predictor Model influence of past accesses with O(1) time/space Combine multiple kernels & social signals to boost accuracy Evaluation on Facebook video workload More accurate than non-scalable state of the art method Serve 80% user watch time with 3x reduction in processing