Watching Television Over an IP Network & TV-Watching Behavior Research presented by Weiping He.

Slides:



Advertisements
Similar presentations
High Resolution studies
Advertisements

Quantitative Methods Topic 5 Probability Distributions
Introductory Mathematics & Statistics for Business
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
STATISTICS HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering.
Usage-Based DHCP Lease- Time Optmization Manas Khadilkar, Nick Feamster, Russ Clark, Matt Sanders Georgia Tech.
Submission Page 1 January 2002 doc.: IEEE 802.RR-02/018A-d1 Andrew Myles, Cisco Systems Report of ad hoc group relating to DFS and JPT5G proposal Andrew.
1 Banking Services for Everyone? Barriers to Bank Access and Use Around the World Thorsten Beck Asli Demirgüç-Kunt Maria Soledad Martinez Peria The World.
Advanced Technology Laboratories page 1 Network Performance Monitoring at Small Time Scales Dina Papagiannaki, Rene Cruz, Christophe Diot.
2008 MTA NYC Travel Survey Summary of Selected Results June 22, 2009.
Sampling Research Questions
Outliers. Do Now Bill Gates makes $100 thousand a year. He s in a room with 9 teachers, 4 of whom make $40k, 3 make $45k, and 2 make $55k a year. What.
Status Report: Evaluation of Private Sector Data in Minneapolis Shawn Turner Texas Transportation.
Sampling Distributions and Estimators
Measuring Online Copyright Infringement Justin Le Patourel Copyright evidence seminar, Friday 13 th September 2013.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
What is Event History Analysis?
Is it true that university students sleep late into the morning and even into the afternoon? Suppose we want to find out what time university students.
1 Questionnaire design Module 3 Session 3. 2 Overview (of Session) This session starts by introducing some aspects that need to be considered when designing.
SADC Course in Statistics Introduction to Non- Parametric Methods (Session 19)
SADC Course in Statistics Tests for Variances (Session 11)
Demand Resource Operable Capacity Analysis – Assumptions for FCA 5.
CHAPTER 11: Sampling Distributions
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edwards University.
Tablet ownership and behaviour 1 Results from YouGov survey conducted 28 th – 31 st December 2012.
What is Event History Analysis?
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Modeling Grid Job Time Properties Lovro.
Akamai Media Analytics: Driving Engagement
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
Introduction Queuing is the study of waiting lines, or queues.
Elementary Statistics
Non-Parametric Statistics
Samples The means of these samples
Digital Addressability Understanding the Changing TV Consumption Habits.
Broadcast on Demand in BFWA networks (IST EMBRACE) John R Norbury.
Media Use in Indonesia Findings from the 2012 International Audience Research Project.
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 16 Goodness-of-Fit Tests and Contingency Tables
Taming User-Generated Content in Mobile Networks via Drop Zones Ionut Trestian Supranamaya Ranjan Aleksandar Kuzmanovic Antonio Nucci Northwestern University.
1 Sizing the Streaming Media Cluster Solution for a Given Workload Lucy Cherkasova and Wenting Tang HPLabs.
Oil & Gas Final Sample Analysis April 27, Background Information TXU ED provided a list of ESI IDs with SIC codes indicating Oil & Gas (8,583)
IP Multicast Information management 2 Groep T Leuven – Information department 2/14 Agenda •Why IP Multicast ? •Multicast fundamentals •Intradomain.
THE CENTRAL LIMIT THEOREM
Copyright © Cengage Learning. All rights reserved. 7 Probability.
Chapter 4 Inference About Process Quality
Hours Listening To Music In A Week! David Burgueño, Nestor Garcia, Rodrigo Martinez.
Quantitative Analysis (Statistics Week 8)
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Comparing Two Population Parameters
Addition 1’s to 20.
McGraw-Hill, Bluman, 7th ed., Chapter 9
Detecting Spam Zombies by Monitoring Outgoing Messages Zhenhai Duan Department of Computer Science Florida State University.
Week 1.
Chapter 15: Quantitatve Methods in Health Care Management Yasar A. Ozcan 1 Chapter 15. Simulation.
Traffic Measurements and End User Behavior in the Acreo National Broadband Test Bed Andreas Aurelius, Adnan Mahmood, Anders Berntson, Gunnar Arvidsson,
1 McGill University Department of Civil Engineering and Applied Mechanics Montreal, Quebec, Canada.
Modeling User Activities in a Large IPTV System Tongqing Qiu, Jun (Jim) Xu (Georgia Tech) Zihui Ge, Seungjoon Lee, Jia Wang, Qi Zhao (AT&T Lab – Research)
A Hierarchical Characterization of a Live Streaming Media Workload E. Veloso, V. Almeida W. Meira, A. Bestavros, S. Jin Proceedings of Internet Measurement.
A Hierarchical Characterization of a Live Streaming Media Workload IEEE/ACM Trans. Networking, Feb Eveline Veloso, Virg í lio Almeida, Wagner Meira,
CHAPTER 6 Statistical Analysis of Experimental Data
The total number of hours viewed by this target group this week is 6.58 hours Infosys+ data sample Arabs 4 th July 2015 – Rank sort according to TRP Top.
Can Internet Video-on-Demand Be Profitable? SIGCOMM 2007 Cheng Huang (Microsoft Research), Jin Li (Microsoft Research), Keith W. Ross (Polytechnic University)
Add image. 3 “ Content is NOT king ” today 3 40 analog cable digital cable Internet 100 infinite broadcast Time Number of TV channels.
Chapter 4 Control Charts for Measurements with Subgrouping (for One Variable)
Normal Probabilities Plot Section 7.6. Normal Plots 2 A normal probability plot is a scatter plot of the (normal score*, observation) pairs. A substantially.
Television Bureau of Advertising Video Advertising in a Multi-Screen World 1.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
Modelling IPTV Services
Modelling IPTV Services
Presumptions Subgroups (samples) of data are formed.
Presentation transcript:

Watching Television Over an IP Network & TV-Watching Behavior Research presented by Weiping He

Already Known: Television has been a dominant and pervasive mass media since 1950s New media (# of channels, video signal) IPTV

Still Unknown or Incompletely known: Ingrained TV viewing habits (Monitoring devices at individual homes) Nielsen Media Research long-standing research effort to estimate TV viewing behaviors through monitoring and surveys.

New Weapon to Explore the Unknown: IPTV: Enable us to monitor user behavior and network usage of an entire network; More visibility on TV viewing activities; Large user base;

IPTV Service Architecture Components: DSLAM, STB, home gateway IPTV channel switching logs Record the ICMP messages Timestamp in units of seconds IP address of the DSLAM IP address of the set-top-box (STB) IP address of the multiple group (channel) Multiple option of join or leave

The studies in this paper First in-depth analysis of IPTV workloads based on network traces from one of the worlds largest IPTV systems. 250,000 households, over 6-month period Characterize the properties of aggregate viewing sessions channel popularity dynamics geographical locality channel switching behaviors browsing pattern user arrival and departure pattern

Elements about the experiment: Channel groups/genre Free, mixed, kids, docu, local, cine, sports, music, news, audio, rest. Assumption on user modes Surfing Viewing Away Note: Different thresholds can be used according to particular experiment environment and requirement.

Trace Collection Collection of IPTV channel switching logs from backbone provider. Record the ICMP messages on the channel changes of 250,000 users. Process the logs/data Pre-process the log by excluding non-video multicast groups; Chronologically sort IGMP join messages; Analysis the data;

Perspectives for Observation High-level viewing characteristics Channel popularity and dynamics Geographical locality Factors that affect channel changes Switching from one channel to another User arrival and departure patterns Section 4 Section 5

High-level viewing characteristics Number of simultaneous online users Session characteristics Attention span Time spent on each genre return

Number of simultaneous online users Friday and Saturday have the lowest evening peaks within the week On weekends: # of viewers ramps up # of distinct viewers +5% total time spent on TV +30% backback

Session Characteristics Average household per day: 2.54 hours and 6.3 distinct channels; Average length of each online session: 1.2 hours Median=8sMean=14.8min The frequency of a TV watching duration increases from 1-4 sec; The graph after 4-sec mark follows a power-law-distribution back

Attention span Two steps for channel selections: a. browsing content to decide whether to continue or stop streaming b. Switching through multiple channel for repeated browsing, until a desired channel is found 50 th percentile values range from 6 to 11 seconds; 90 th and 95 th percentile values range from 3 to 21 minutes. back

Time spent on each genre There might be some significant difference from those reported by sampling statistics for US population, in term of channel genre population backback Genrefreemixedkidsdoculocalcine Viewing prob. Num. channels 38.6% % % 7 6.6% % % 6 GenresportsMusicnewsaudioresttotal Viewing prob. Num. channels 3.8% 8 2.3% % % % % 150 Table 2: Breakdown of popularity across genre (probability of a viewer watching each genre) *Genre categorized the rest includes ppv, satellite, and promotional channels.

Channel popularity and dynamics (1) The top 10% of channels account for nearly 80% of viewers; (Pareto principle or rule) This is consistent across different times of the day, regardless of the changing of viewer base over the course of a day. o Calculate the effective number of viewers by the fraction of time a user spent on each channel over a minute period. (Zipf-like distribution)

Channel popularity and dynamics (2) The average viewer shares are similar to that shown in channel popularity; The graph shows significant fluctuation across the day. Dissimilarity coefficient ξ = 1 ρ 2, ξ greater than 0.1 is considered to have substantial changes in ranks. return

Geographical locality Locality across regions Locality across DSLAMs return

Locality across regions The most popular genres are similar across regions: free, mixed, and kids channels are consistently popular; Users in some regions watch more local channels than those in other regions. back

Locality across DSLAMs back return

Factors that affect channel changes Genre clearly affects the likelihood and frequency of channel changes; Potential factors: the time of day and program popularity; return

Switching from one channel to another Linear vs nonlinear (EPG) Normalized average probability of channel changes between every pair of channels; Examine the influence of channel change patterns on viewing.

Switching from one channel to another (cont.) Several interesting channel switching habits in 1 st case: 1. Over 60% of channel changes are linear; 2. Certain genres show a distinctive pattern of non-linear channel changes within the genres, e.g., free, sports, and kids; 3. The pattern of linear channel changes continues through the less popular channels like music, satellite, and audio; 4. The remaining 18% of channel changes are non-linear across different genres. Distinctive difference between the two cases: The consecutive viewing of the same channel in the 2 nd case accounts for 17% of all viewing instances; Non-linear viewing patterns in the 2 nd case accounts for 67% of viewing instances. In summary: Viewers tend to continue watching the same channel even after switching for some time and with high probability. return

User arrival and departure patterns Arrival and departure rates Inter arrival and departure times return

Arrival and departure rates The arrival and departure rates are similar on average. Several observations: First, the arrival and departure rates vary over the day. Second, user departure patterns show consecutive spikes. Third, the user arrival is much less time-correlated than the departure. back

Inter arrival and departure times Both median CDF of inter-arrival and inter-departure is 0.07 (the same rate); The arrival rate varies over time and the arrival process is not stationary over the course of a day; backback returnreturn

Implications of findings: Existing and future IPTV systems; Design of the open Internet TV distribution systems; Other emerging/potential applications.