Quantifying Skype User Satisfaction Carol K. L. Wong 19 March, 2007 CSC7221.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Brief introduction on Logistic Regression
Forecasting Using the Simple Linear Regression Model and Correlation
Doc.: IEEE /0604r1 Submission May 2014 Slide 1 Modeling and Evaluating Variable Bit rate Video Steaming for ax Date: Authors:
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Departments of Medicine and Biostatistics
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
HSRP 734: Advanced Statistical Methods July 24, 2008.
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast Mohamed Hafeeda, Ahsan Habib et al. Presented By: Abhishek Gupta.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Statistical Methods Chichang Jou Tamkang University.
Available bandwidth measurement as simple as running wget D. Antoniades, M. Athanatos, A. Papadogiannakis, P. Markatos Institute of Computer Science (ICS),
Performance metrics and configuration strategies for group network communication Tom Z. J. FU Dah Ming Chiu John C. S. Lui.
Part II – TIME SERIES ANALYSIS C2 Simple Time Series Methods & Moving Averages © Angel A. Juan & Carles Serrat - UPC 2007/2008.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Today Concepts underlying inferential statistics
Correlation and Regression Analysis
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Model Checking in the Proportional Hazard model
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Chapter 2: The Research Enterprise in Psychology
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Correlation and Regression
Chapter 15 Correlation and Regression
Topic 5 Statistical inference: point and interval estimate
The Research Enterprise in Psychology. The Scientific Method: Terminology Operational definitions are used to clarify precisely what is meant by each.
1 G Lect 10a G Lecture 10a Revisited Example: Okazaki’s inferences from a survey Inferences on correlation Correlation: Power and effect.
Assessing Survival: Cox Proportional Hazards Model
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
User Study Evaluation Human-Computer Interaction.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Chapter 2 The Research Enterprise in Psychology. Table of Contents The Scientific Approach: A Search for Laws Basic assumption: events are governed by.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Network Instruments VoIP Analysis. VoIP Basics  What is VoIP?  Packetized voice traffic sent over an IP network  Competes with other traffic on the.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Wide-scale Botnet Detection and Characterization Anestis Karasaridis, Brian Rexroad, David Hoeflin In First Workshop on Hot Topics in Understanding Botnets,
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
MARKETING RESEARCH CHAPTER 18 :Correlation and Regression.
Wireless communications and mobile computing conference, p.p , July 2011.
Chapter 10 Verification and Validation of Simulation Models
Lecture 12: Cox Proportional Hazards Model
1 Capacity Dimensioning Based on Traffic Measurement in the Internet Kazumine Osaka University Shingo Ata (Osaka City Univ.)
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Chapter 2 The Research Enterprise in Psychology. Table of Contents The Scientific Approach: A Search for Laws Basic assumption: events are governed by.
Correlation & Regression Analysis
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Logistic Regression Analysis Gerrit Rooks
Development of a QoE Model Himadeepa Karlapudi 03/07/03.
Business Research Methods
CORRELATION ANALYSIS.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Chapter 15 Inference for Regression. How is this similar to what we have done in the past few chapters?  We have been using statistics to estimate parameters.
Estimating standard error using bootstrap
26134 Business Statistics Week 5 Tutorial
12 Inferential Analysis.
Chapter 10 Verification and Validation of Simulation Models
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Correlation and Regression
12 Inferential Analysis.
Product moment correlation
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Quantifying Skype User Satisfaction Carol K. L. Wong 19 March, 2007 CSC7221

Skype a P2P Internet telephony network > 2 million Skype downloads ~ 85 millions users worldwide From Wikipedia.org

Skype’s Performance Q: Is Skype providing a good enough voice phone service to the users?

Comparison of Proposed & Existing Methods Speech quality measures Proposed User Satisfaction Index To quantifySpeech qualityUser satisfaction Built uponSubjective mean opinion score (MOS) Call duration PredictorsDistortion of signalsQoS factors: the bit rate, network latency, network delay variations, packet loss

Methodology Collect Skype VoIP sessions and their network parameters Analysis of Call Duration and propose an objective index, the User Satisfaction Index (USI), to quantify the level of user satisfaction Validate USI by an independent set of metrics that quantify the interactivity and smoothness of a conversation.

Trace Collection Collect Skype VoIP sessions and their network parameters.  Present the network setup and filtering method used in the traffic capture stage.  Introduce the algorithm for extracting VoIP sessions from packet traces  Strategy to sample path characteristics.  Summarize the collected VoIP sessions.

Network Setup

Capturing Skype Traffic Use 2-phase filtering to identify Skype VoIP sessions: filter and store possible Skype traffic on the disk. apply an off-line identification algorithm on the capture packet traces to extract actual Skype sessions.

Detect Possible Skype Traffic Known properties of Skype clients: dynamic port number chosen randomly when the application is installed and can be configured by users – “Skype port” In the login process, submits HTTP requests to a well-known server, ui.skype.com

Heuristic to Detect Skype Hosts and their Skype Ports treat sender for each HTTP request sent to ui.skype.com as a Skype host choose the port number used most frequently for outgoing UDP packets sent from that host within the next 10s as the Skype port. classify all peers that have bi-directional communication with the Skype port as Skype hosts. maintained a table of identified Skype hosts and their respective Skype ports, and recorded all traffic sent from or to these (host, port) pairs.

Identification of VoIP Sessions regard An active flow as a valid VoIP session if  The flow’s duration > 10s.  The average packet rate is within a reasonable range, (10, 100) pkt/s.  The average packet size is within (30,300) bytes.  The EWMA of the packet size process must be within (35, 500) bytes all the time.

Relayed Session Merge a pair of flow into a relayed session if  The flows’ start and finish time are close to each other with errors < 30s;  The ratio of their average packet rates < 1.5; and  Their packet arrival processes are positively correlated with a coefficient > 0.5.

Path Characteristics Measurement RTT and their jitters  send out ICMP and traceroute-like probe packets to measure paths’ RTT while capturing Skype traffic  used

Collected VoIP Sessions Category Calls HostsCens. TCP DurationBit Rate (mean/std) Avg. RTT (mean/std) Direct % (6.43, 10.42) min Kbps / Kpbs ms/ ms Relayed %(3.12,5.58) min Kbps / Kpbs ms/ ms Total %(5.17,7.70) min Kbps / Kpbs ms/ ms

Methodology Collect Skype VoIP sessions and their network parameters Analysis of Call Duration and propose an objective index, USI, to quantify the level of user satisfaction Validate USI by an independent set of metrics that quantify the interactivity and smoothness of a conversation.

Analysis of Call Duration Develop a model to describe the relationship between call duration and QoS factors. propose an objective index, the User Satisfaction Index (USI) to quantify the level of user satisfaction. validate USI by voice interactivity measures.

Survival Analysis With proper transformation, the relationships of session time and predictors can be described well by the Cox Proportional Hazards model (Cox Model) in survival analysis.

Survival Curves for Sessions with Different Bit Rate Levels

GroupMedian (min) Bit ratesLast > 40 min < 25 Kbps3% > 35 Kbps30% The log-rank test strongly suggests that call duration varies with different levels of bit rates.

Relation of the bit rate with call duration The trend of median duration shows a strong, consistent, positive, correlation with the bit rate.

Effect of Network Conditions Network conditions are also considered to be one of the primary factors that affect voice quality. the fluctuations in the data rate observed at the receiver should reflect network delay variations to some extent. used –  jitter to denote the standard deviation of the bit rate, and  packet rate jitter, or pr.jitter, to denote the standard deviation of the packet rate.

Effect of Round-Trip Times divided sessions into 3 equal-sized groups based on their RTTs, and compare their lifetime patterns with the estimated survival functions. the 3 group differ significantly GroupMedian duration of sessions (min) RTTs > 270 ms4 RTTs: ms5.2 RTTs < 80 ms11

Effect of Jitter Jitter has a much higher correlation with call duration than RTT.

GroupMedian session time (min) Jitter > 2 Kbps3 1 < Jitter < 2 Kbps11 Jitter < 1 Kbps21 These groups differs statistically

QoS related to Call Duration most of the QoS factors they defined, including:  the source rate,  RTT, and  jitter are related to call duration.

Collinearity Given that the bit rate & jitter are significantly correlated, true source of user dissatisfaction is unclear. Use the Cox model and treat QoS factors, e.g. the bit rate, as risk factors or covariates; i.e. as variables that can cause failures.  The hazard function of each session is decided completely by a baseline hazard function and the risk factors related to that session.

Collinearity 7 factors - bit rate (br),packet rate (pr),jitter, pr.jitter, packet size (pktsize), and round trip time (rtt) brprjitterpr.jitterpktsizertt br* pr+++* jitter++-*+++ pr.jitter * pktsize++++ * rtt--- * +/- : positive or negative correlation collinearity is computed by Kendall’s  statistic (Pearson’s product moment statistic yields similar results)

Collinearity the bit rate, packet rate, and packet size are strongly interrelated; jitter and packet rate jitter are strongly interrelated. the bit rate, jitter, and RTT are retained in the model

Cox Model define the risk factors of a session as a risk vector Z h(t|Z) = h 0 (t) exp(  t Z) = h 0 (t)exp(  p k=1  k Z k ) h(t|Z) - the hazard rate at time t for a session with risk vector Z; h 0 (t) - the baseline hazard function computed during the regression process;  = (  1,…,  p ) t - the coefficient vector that corresponds to the impact of risk factors. Z p is the p th factor of the session

The Cox model 2 sessions with risk vectors Z and Z’, the hazard ratio: h(t|Z)/ h(t|Z’) = exp(  p k=1  k Z k –  k Z’ k ) is time-independent constant Hence, the validity of the model relies on the assumption – the hazard rates for any 2 sessions must be in proportion all the time.

Sampling of QoS Factors In the regression modeling, we use a scalar value for each risk factor to capture user perceived quality. Divide the original series s into sub-series of length w, from which network conditions are sampled. Choose one of the min, average and max measures taken from sampled QoS factors having length [|s|/w] depending on their ability to describe the user perceived experience during a call.

Evaluation evaluate all kinds of measures and window sizes by:  fitting the extracted QoS factors into the Cox model and  comparing the model’s log-likelihood, i.e. an indicator of goodness-of-fit. Finally, the max bit rate and min jitter are chosen, both sampled with a window of 30s.

Model Fitting the Cox model assumes a linear relationship between the covariates and the hazard function the impact of the covariates on the hazard functions with the following equation: This corresponds to a Poisson regression model if h 0 (s) is known. t i - the censoring status of session i, f(Z) – the estimated functional form of the covariate Z.

The influence of the bit rate is not proportional to its magnitude - scale transformation.

The RTT factor has an approximate linear impact.

Jitter factor

Verification Employ a more generalized Cox model that allows time-dependent coefficients to check the proportional hazard assumption by hypothesis tests. After adjustment, none of covariates reject the linearity hypothesis at  = 0.1, the transformed variables have an approximate linear impact on the hazard functions. Use the Cox and Snell residuals r i (for session i) to assess the overall goodness-of-fit of the model. Except for a few sessions that have unusual call duration, most sessions fit the model very well.

Model Interpretation VariableCoef,e Coef Std. Err.zP > |z| br.log e+00 jitter.log e+00 rtt e-02 define the factors’ relative weights as their contribution to the risk score, i.e.,  t Z.  - coeff

Relative Influence of Difference QoS for each session The degrees of user dissatisfaction caused by the bit rate, jitter and round-trip time are 46%:53%:1%.

Conclusion Possible to improve user satisfaction by fine tuning the bit rate used. As the use of relaying does not seriously degrade user experience, higher round-trip times do not impact on users very much. Jitters have much more impact on user perception. The choice of relay node should focus more on network conditions, i.e., the level of congestion, rather than rely on network latency.

User Satisfaction Index (USI) As the risk score  t Z represents the levels of instantaneous hang up probability, it can be seen as a measure of user intolerance. Accordingly, define the USI of a session as its minus risk score: USI = -  t Z = 2.15xlog(bit rate) – 1.55xlog(jitter) – 0.36xRTT where the bit rate, jitter, and RTTs are sampled using a 2-level sampling approach

The prediction is based on the median USI for each group. y-axis is logarithmic to make the short duration groups clearer.

Advantages of USI over Other Objective Sound Quality Measures USI’s parameters are readily accessible:  the 1st and 2nd moment of the packet counting process can be obtained by simply counting the number and bytes of arrival packets  the round-trip times. Usually available in peer-to-peer applications for overlay network construction and path selection. developed the USI based on passive measurement rather than subjective surveys, it can also capture sub-conscious reactions of participants, which may not be accessible through surveys.

Methodology Collect Skype VoIP sessions and their network parameters Analysis of Call Duration and propose an objective index, the User Satisfaction Index (USI), to quantify the level of user satisfaction Validate USI by an independent set of metrics that quantify the interactivity and smoothness of a conversation.

Validation Results of the validation tests using a set of independent measures derived from user interactivities show a strong correlation between the call durations and user interactivities. This suggests that the USI based on call duration is significantly representative of Skype user satisfaction.