Measurement and Modelling of the Temporal Dependence in Packet loss Maya Yajnik, Sue Moon, Jim Kurose, Don Towsley Department of Computer Science University.

Slides:



Advertisements
Similar presentations
Autocorrelation Functions and ARIMA Modelling
Advertisements

Time Series Analysis Topics in Machine Learning Fall 2011 School of Electrical Engineering and Computer Science.
Outline input analysis input analyzer of ARENA parameter estimation
NUS.SOC.CS5248 Ooi Wei Tsang 1 Adaptive Playout. NUS.SOC.CS5248 Ooi Wei Tsang 2 You are Here Network Encoder Sender Middlebox Receiver Decoder.
1 The Output Analyzer Separate application, also accessible via Tools menu in Arena Reads binary files saved by Arena Various kinds of output-data display,
Introduction Data and simula- tion methodology VaR models and estimation results Estimation perfor- mance analysis Conclusions Appendix Doctoral School.
Lo Presti 1 Network Tomography Francesco Lo Presti Dipartimento di Informatica - Università dell’Aquila.
Simulation Where real stuff starts. ToC 1.What, transience, stationarity 2.How, discrete event, recurrence 3.Accuracy of output 4.Monte Carlo 5.Random.
CMPT 855Module Network Traffic Self-Similarity Carey Williamson Department of Computer Science University of Saskatchewan.
Descriptive statistics Experiment  Data  Sample Statistics Sample mean Sample variance Normalize sample variance by N-1 Standard deviation goes as square-root.
 Don Towsley 2000 Network Tomography for the Internet: Open Problems D. Towsley U. Massachusetts.
Server-based Inference of Internet Performance V. N. Padmanabhan, L. Qiu, and H. Wang.
1 Chap 5 Sums of Random Variables and Long-Term Averages Many problems involve the counting of number of occurrences of events, computation of arithmetic.
Statistical analysis and modeling of neural data Lecture 6 Bijan Pesaran 24 Sept, 2007.
Simulation Modeling and Analysis
On the Constancy of Internet Path Properties Yin Zhang, Nick Duffield AT&T Labs Vern Paxson, Scott Shenker ACIRI Internet Measurement Workshop 2001 Presented.
Statistics. Overview 1. Confidence interval for the mean 2. Comparing means of 2 sampled populations (or treatments): t-test 3. Determining the strength.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.
TAPAS: A Research Paradigm for the Modeling, Prediction and Analysis of Non-stationary Network Behavior Almudena Konrad PhD Candidate at UC Berkeley
Network Tomography through End- End Multicast Measurements D. Towsley U. Massachusetts collaborators: R. Caceres, N. Duffield, F. Lo Presti (AT&T) T. Bu,
Overview of STAT 270 Ch 1-9 of Devore + Various Applications.
Improving the Accuracy of Continuous Aggregates & Mining Queries Under Load Shedding Yan-Nei Law* and Carlo Zaniolo Computer Science Dept. UCLA * Bioinformatics.
THE TITLE OF YOUR PAPER Your Name Communication Networks Laboratory School of Engineering Science Simon Fraser University.
Analysis of Simulation Input.. Simulation Machine n Simulation can be considered as an Engine with input and output as follows: Simulation Engine Input.
1 Simulation Modeling and Analysis Pseudo-Random Numbers.
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
1 Terminating Statistical Analysis By Dr. Jason Merrick.
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
Verification & Validation
CS380y Junior Thesis1 The Performance of TCP/IP over Bluetooth Chris Snow Supervisors: Serguei Primak, Electrical Engineering Hanan Lutfiyya, Computer.
Chapter 5 Statistical Models in Simulation
Monte Carlo Simulation CWR 6536 Stochastic Subsurface Hydrology.
SIMULATION AND ANALYSIS OF LOSS IN IP NETWORKS
TFRC: TCP Friendly Rate Control using TCP Equation Based Congestion Model CS 218 W 2003 Oct 29, 2003.
#1 EC 485: Time Series Analysis in a Nut Shell. #2 Data Preparation: 1)Plot data and examine for stationarity 2)Examine ACF for stationarity 3)If not.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Measurement and Modeling of Packet Loss in the Internet Maya Yajnik.
Hung X. Nguyen and Matthew Roughan The University of Adelaide, Australia SAIL: Statistically Accurate Internet Loss Measurements.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Definitions Random Signal Analysis (Review) Discrete Random Signals Random.
Internet Performance Measurements and Measurement Techniques Jim Kurose Department of Computer Science University of Massachusetts/Amherst
Detection of Routing Loops and Analysis of Its Causes Sue Moon Dept. of Computer Science KAIST Joint work with Urs Hengartner, Ashwin Sridharan, Richard.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
Randomness Test Fall 2012 By Yaohang Li, Ph.D.. Review Last Class –Random Number Generation –Uniform Distribution This Class –Test of Randomness –Chi.
Expectation. Let X denote a discrete random variable with probability function p(x) (probability density function f(x) if X is continuous) then the expected.
1 Network Tomography Using Passive End-to-End Measurements Venkata N. Padmanabhan Lili Qiu Helen J. Wang Microsoft Research DIMACS’2002.
Overview G. Jogesh Babu. Overview of Astrostatistics A brief description of modern astronomy & astrophysics. Many statistical concepts have their roots.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
Dynamic Models, Autocorrelation and Forecasting ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
The Box-Jenkins (ARIMA) Methodology
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
ECE4270 Fundamentals of DSP Lecture 8 Discrete-Time Random Signals I School of Electrical and Computer Engineering Center for Signal and Image Processing.
Lo Presti 1 Ne X tworking’03 June 23-25,2003, Chania, Crete, Greece The First COST-IST(EU)-NSF(USA) Workshop on EXCHANGES & TRENDS IN N ETWORKING Network.
1 Network Tomography Using Passive End-to-End Measurements Lili Qiu Joint work with Venkata N. Padmanabhan and Helen J. Wang.
Accurate WiFi Packet Delivery Rate Estimation and Applications Owais Khan and Lili Qiu. The University of Texas at Austin 1 Infocom 2016, San Francisco.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
How many iterations in the Gibbs sampler? Adrian E. Raftery and Steven Lewis (September, 1991) Duke University Machine Learning Group Presented by Iulian.
Bayesian Estimation and Confidence Intervals Lecture XXII.
Random signals Honza Černocký, ÚPGM.
Basic MC/Defn/Short Answer/Application Cumulative
Chapter 5 Statistical Models in Simulation
Estimating the Value of a Parameter Using Confidence Intervals
CPSC 641: Network Traffic Self-Similarity
P2.5 Sensitivity of Surface Air Temperature Analyses to Background and Observation Errors Daniel Tyndall and John Horel Department.
Presentation transcript:

Measurement and Modelling of the Temporal Dependence in Packet loss Maya Yajnik, Sue Moon, Jim Kurose, Don Towsley Department of Computer Science University of Massachusetts Amherst

Temporal dependence in end-end loss Questions –what is the time correlation of loss events? –what are good models? Applications: –FEC adjustment for audio, video, data –on-line loss estimation –performance analysis –simulation

Overview Measurement Analysis –stationarity –data representations –temporal dependence Modelling Summary

Measurement Methodology Collect point-point, multicast traces of periodically generated probes period: 20ms, 40ms, 80ms, 160ms source: Univ. of Mass. Amherst Destinations:, Atlanta, Los Angeles, Seattle, St. Louis, Stockholm 128 hours of data

Analysis Stationarity of traces –look for change in avg., variance over trace –remove non- stationary sections RESULT: 76 hours of data

Data Representations binary time series –no loss: 0, no loss: 1 –eg. { } interleaved sequences of good run lengths, loss run lengths –eg.{ } {3,5} {2,1} good loss {{{{

Correlation Timescale goal: Time interval between packets, at and beyond which, loss events are independent methodology: –autocorrelation fn. 95% confidence bounds –chi square test

Correlation Timescales Less than 1000ms over all traces

Run lengths question –are they independent? methodology –autocorrelation fns., crosscorrelation fn. answer –160ms independent –20ms,40ms dependent good runs

question –how are they distributed? –geometrically ? methology –chi-square goodness-of-fit test good run length distribution loss run length distribution

Modelling question –what is a good model for characterizing loss process? models –Bernoulli –2-state –k-th order Markov chain models prob. of loss/ no loss depends only on k previous events

Loss Models previously used: –Bernoulli loss: independent loss, single parameter –2-state loss model: prob. of loss/no loss depends only on the previous event 2 parameters

Markov chain models prob. of loss/no loss depends on k previous events number of states = 2 k Bernoulli, 2-state special cases use maximum likelihood estimates of parameters

Models Q: what is the order of the Markov process ? –the lag beyond which the loss events are independent. –correlation timescale = (order + 1 ) x sampling interval Q: accuracy of commonly used models? –order 0  Bernoulli model accurate: 14 / 76 hr –order 1  2-state model accurate: 20 / 76 hr

Summary collected/ analyzed 128 hours of loss data correlation timescale : 1000ms Markov chain models of k-th order Bernoulli model accurate 14 / 76 2-state model accurate 20 / 76 sliding window better than exp. smoothing

Ongoing and future work richer collection of data, better analysis of stationarity more parsimonious models by state aggregation application of models to performance analysis and on-line estimation