Self-Similar through High-Variability: Statistical Analysis of Ethernet LAN Traffic at the Source Level Walter Willinger, Murad S. Taqqu, Robert Sherman,

Slides:



Advertisements
Similar presentations
Random Processes Introduction (2)
Advertisements

Internet Measurement Conference 2003 Source-Level IP Packet Bursts: Causes and Effects Hao Jiang Constantinos Dovrolis (hjiang,
Research Directions Mark Crovella Boston University Computer Science.
Estimation and identification of long-range dependence in Internet traffic Thomas Karagiannis University of California,
Doc.: IEEE /1216r1 Submission November 2009 BroadcomSlide 1 Internet Traffic Modeling Date: Authors: NameAffiliationsAddressPhone .
Copyright © 2005 Department of Computer Science CPSC 641 Winter Self-Similar Network Traffic The original paper on network traffic self-similarity.
Network and Service Assurance Laboratory Analysis of self-similar Traffic Using Multiplexer & Demultiplexer Loaded with Heterogeneous ON/OFF Sources Huai.
2014 Examples of Traffic. Video Video Traffic (High Definition) –30 frames per second –Frame format: 1920x1080 pixels –24 bits per pixel  Required rate:
STAT 497 APPLIED TIME SERIES ANALYSIS
2  Something “feels the same” regardless of scale 4 What is that???
1 Self-Similar Ethernet LAN Traffic Carey Williamson University of Calgary.
CMPT 855Module Network Traffic Self-Similarity Carey Williamson Department of Computer Science University of Saskatchewan.
On the Self-Similar Nature of Ethernet Traffic - Leland, et. Al Presented by Sumitra Ganesh.
1 17. Long Term Trends and Hurst Phenomena From ancient times the Nile river region has been known for its peculiar long-term behavior: long periods of.
Statistics & Modeling By Yan Gao. Terms of measured data Terms used in describing data –For example: “mean of a dataset” –An objectively measurable quantity.
A gentle introduction to fluid and diffusion limits for queues Presented by: Varun Gupta April 12, 2006.
SUMS OF RANDOM VARIABLES Changfei Chen. Sums of Random Variables Let be a sequence of random variables, and let be their sum:
Small scale analysis of data traffic models B. D’Auria - Eurandom joint work with S. Resnick - Cornell University.
無線區域網路中自我相似交通流量之 成因與效能評估 The origin and performance impact of self- similar traffic for wireless local area networks 報 告 者:林 文 祺 指導教授:柯 開 維 博士.
Data Basics. Data Matrix Many datasets can be represented as a data matrix. Rows corresponding to entities Columns represents attributes. N: size of the.
A Nonstationary Poisson View of Internet Traffic T. Karagiannis, M. Molle, M. Faloutsos University of California, Riverside A. Broido University of California,
Self-Similarity in Network Traffic Kevin Henkener 5/29/2002.
1 Interesting Links
1 Using A Multiscale Approach to Characterize Workload Dynamics Characterize Workload Dynamics Tao Li June 4, 2005 Dept. of Electrical.
Origins of Long Range Dependence Myths and Legends Aleksandar Kuzmanovic 01/08/2001.
CS 6401 Network Traffic Characteristics Outline Motivation Self-similarity Ethernet traffic WAN traffic Web traffic.
Internet Traffic Modeling Poisson Model vs. Self-Similar Model By Srividhya Chandrasekaran Dept of CS University of Houston.
Self-Similar Traffic COMP5416 Advanced Network Technologies.
Self-Similarity of Network Traffic Presented by Wei Lu Supervised by Niclas Meier 05/
1 Chapters 9 Self-SimilarTraffic. Chapter 9 – Self-Similar Traffic 2 Introduction- Motivation Validity of the queuing models we have studied depends on.
References for M/G/1 Input Process
Simulation Output Analysis
The horseshoe estimator for sparse signals CARLOS M. CARVALHO NICHOLAS G. POLSON JAMES G. SCOTT Biometrika (2010) Presented by Eric Wang 10/14/2010.
1 More about the Sampling Distribution of the Sample Mean and introduction to the t-distribution Presentation 3.
 1  Outline  stages and topics in simulation  generation of random variates.
Limits and the Law of Large Numbers Lecture XIII.
“A non parametric estimate of performance in queueing models with long-range correlation, with applications to telecommunication” Pier Luigi Conti, Università.
Exploiting Clustering Techniques for Web Session Inference A.Bianco, G. Mardente, M. Mellia, M.Munafò, L. Muscariello (Politecnico di Torino)
1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence.
The Story of Wavelets.
1 FARIMA(p,d,q) Model and Application n FARIMA Models -- fractional autoregressive integrated moving average n Generating FARIMA Processes n Traffic Modeling.
0 K. Salah 2. Review of Probability and Statistics Refs: Law & Kelton, Chapter 4.
COMPSAC'14 - N. Larrieu /07/ How to generate realistic network traffic? Antoine VARET and Nicolas LARRIEU COMPSAC – Vasteras – July the 23.
Link Dimensioning for Fractional Brownian Input Chen Jiongze PhD student, Electronic Engineering Department, City University of Hong Kong Supported by.
The Dirichlet Labeling Process for Functional Data Analysis XuanLong Nguyen & Alan E. Gelfand Duke University Machine Learning Group Presented by Lu Ren.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
1 Self Similar Traffic. 2 Self Similarity The idea is that something looks the same when viewed from different degrees of “magnification” or different.
Statistical-Time Access Fairness Index of One-Bit Feedback Fair Scheduler Fumio Ishizaki Dept. of Systems Design and Engineering Nanzan University, Japan.
Week 21 Stochastic Process - Introduction Stochastic processes are processes that proceed randomly in time. Rather than consider fixed random variables.
A Nonstationary Poisson View of Internet Traffic Thomas Karagiannis joint work with Mart Molle, Michalis Faloutsos, Andre Broido.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
1 Probability and Statistical Inference (9th Edition) Chapter 5 (Part 2/2) Distributions of Functions of Random Variables November 25, 2015.
Performance Evaluation of Long Range Dependent Queues Performance Evaluation of Long Range Dependent Queues Chen Jiongze Supervisor: Moshe ZukermanCo-Supervisor:
1 EE571 PART 3 Random Processes Huseyin Bilgekul Eeng571 Probability and astochastic Processes Department of Electrical and Electronic Engineering Eastern.
Risk Analysis Workshop April 14, 2004 HT, LRD and MF in teletraffic1 Heavy tails, long memory and multifractals in teletraffic modelling István Maricza.
Joint Moments and Joint Characteristic Functions.
Notices of the AMS, September Internet traffic Standard Poisson models don’t capture long-range correlations. Poisson Measured “bursty” on all time.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
1 Interesting Links. On the Self-Similar Nature of Ethernet Traffic Will E. Leland, Walter Willinger and Daniel V. Wilson BELLCORE Murad S. Taqqu BU Analysis.
Stochastic Process - Introduction
Internet Traffic Modeling
Interesting Links.
Minimal Envelopes.
Notices of the AMS, September 1998
Mark E. Crovella and Azer Bestavros Computer Science Dept,
STOCHASTIC HYDROLOGY Random Processes
Presented by Chun Zhang 2/14/2003
Javad Ghaderi, Tianxiong Ji and R. Srikant
CPSC 641: Network Traffic Self-Similarity
Presentation transcript:

Self-Similar through High-Variability: Statistical Analysis of Ethernet LAN Traffic at the Source Level Walter Willinger, Murad S. Taqqu, Robert Sherman, Daniel V. Wilson Bellcore, Boston University SIGCOMM’95

Outline Introduction Self-similarity through high-variability Ethernet LAN traffic measurements at the source level Implications of the Noah Effect in practice Conclusion

Introduction Actual traffic exhibits correlations over a wide range of time scales (i.e. has long- range dependence). Traditional traffic models focus on a very limited range of time scales and are thus short-range dependent in nature.

Introduction Two problems that cause the resistance toward self-similar traffic modeling What is a physical “explanation” for the observed self-similar nature of measured traffic from today’s packet networks? What is the impact of self-similarity on network and protocol design and performance analysis?

Introduction The superposition of many ON/OFF sources whose ON-periods and OFF-periods exhibit the Noah Effect produces aggregate network traffic that features the Joseph Effect. Noah Effect: high variability or infinite variance Joseph Effect: self-similar or long-range dependent

Self-Similarity through High-Variability Idealized ON/OFF model An ON-period can be followed by an ON- period and an OFF-period can succeed another OFF-period. The distributions of the ON and OFF times may vary.

Idealized ON/OFF Model Reward sequence {W( l ), l = 0,1,2,…} {W( l )} is a 0/1-valued discrete time stochastic process. W( l ) = 1 or 0 depends on whether or not there is a packet at time l. {W( l )} consists of a sequence of 1’s (“ON- periods”) and 0’s (“OFF-periods”)

Idealized ON/OFF Model The lengths of the ON- and OFF-periods are i.i.d. positive random variables, denoted U k, k = 1,2,… Let S k = S 0 + U 2 + … + U k, k  0 be the corresponding renewal times.

Idealized ON/OFF Model Suppose there are M i.i.d. sources The m th source has its own reward sequence {W( l ), l  0} Superposition reward (“packet load”) b: non-overlapping time blocks j: the aggregation block number

Idealized ON/OFF Model Suppose that U has a hyperbolic tail distribution, as M  and b  , adequately normalized is fractional Gaussian noise, which is self-similar with Hurst parameter ½  H <1

Idealized ON/OFF Model Property (1) is the infinite variance syndrome or the Noah Effect.   2 implies E(U 2 ) =   > 1 ensures that E(U) < , and that S 0 is not infinite

Idealized ON/OFF Model Theorem 1. For large enough source Number M and Block aggregation size b, the cumulative load behaves statistically as where and. More precisely, where L lim means convergence in the sense of the finite- dimensional distributions (convergence in law)

Ethernet LAN Traffic Measurements at the Source Level Location Bellcore Morristown Research and Engineering Center The first set The busy hour of the August 1989 Ethernet LAN measurements About 105 sources, 748 active source-destination pairs 95% of the traffic was internal The second set 9 day-long measurement period in December 1994 About 3,500 sources, 10,000 active pairs Measurements are made up entirely of remote traffic

Textured Plots of Packet Arrival Times

Checking for the Noah Effect Complementary distribution plots Hill’s estimate Let U 1, U 2,…, U n denote the observed ON-(or OFF-)periods and write U (1)  U (2)  …  U (n) for the corresponding order statistics

A Robustness Property of the Noah Effect As far as the Noah Effect is concerned, it does not matter how the OFF-periods have been defined. The similar investigation of sensitivity of the ON- period distributions to the choice of threshold value reveals the same appealing robustness feature of the Noah Effect.

Self-Similarity and the Noah Effect: 1989 Traffic Traces 181(out of 748) source-destination pairs generated more than 93% of all the packets are considered. The data at the source-destination level are consistent with ON/OFF modeling assumption Noah Effect for the distribution of ON/OFF-periods  -values for the ON- and OFF-periods may be different.

Self-Similarity and the Noah Effect: 1994 Traffic Traces Non-Mbone traffic 300 (out of 10,000) pairs responsible for 83% of the traffic are considered. Self-similarity property of the aggregate packet stream is mainly due to the relative strong presence of the Noah Effect in the OFF-periods.

Self-Similarity and the Noah Effect: 1994 Traffic Traces Mbone traffic Only an analysis of the aggregate packet stream is performed. The strong intensity of the Joseph Effect become obvious only after aggregation levels beyond 100ms. There is no Noah Effect for ON-periods. Reason: The use of unsophisticated compression algorithms resulted in packets bursts separated by comparatively large idle periods.

Traffic Modeling and Generation Although network traffic is intrinsically complex, parsimonious modeling is still possible. Estimating a single parameter  (intensity of the Noah Effect) is enough.

Performance and Protocol Analysis The queue length distribution Traditional (Markovian) traffic: decreases exponentially fast Self-similar traffic: decreases much more slowly Protocol design should be expected to take into account knowledge about network traffic such as the presence or absence of the Noah Effect.

Conclusion The presence of the Noah Effect in measured Ethernet LAN traffic is confirmed. The superposition of many ON/OFF models with Noah Effect results in aggregate packet streams that are consistent with measured network traffic, and exhibits the self-similar or fractal properties.