Self-Similar Traffic COMP5416 Advanced Network Technologies.


Courtesy of Ashish Gupta (cs.northwestern.edu)

Why Self-Similarity?
Measured trace data are not consistent with queueing models.

The Classic Paper
"On the Self-Similar Nature of Ethernet Traffic", Will E. Leland, Walter Willinger and Daniel V. Wilson (Bellcore), Murad S. Taqqu (Boston University).

Overview
- What is self-similarity?
- Ethernet traffic is self-similar
- Sources of self-similarity
- Implications of self-similarity

Intuition of Self-Similarity
Something "feels the same" regardless of scale.

Stochastic Objects
For stochastic objects such as time series, self-similarity is meant in the distributional sense: statistical properties such as the mean, variance and correlation are preserved (up to rescaling) across scales.

Pictorial View of Self-Similarity (figure)

Why is Self-Similarity Important?
- Some network packet traffic has been identified as self-similar.
- Traffic models based on the Poisson distribution (and similar) do not take the self-similar nature of traffic into account.
- This leads to inaccurate modeling of network traffic.
- Is self-similarity relevant in every case? This remains a hot research area.

Problems with Current Models
A Poisson process
- appears bursty when observed on a fine time scale;
- flattens (smooths) to white noise when aggregated on a coarse time scale.
A self-similar (fractal) process
- maintains its bursty character when aggregated over a wide range of time scales.

Pictorial View of Current Modeling (figure)

Consequences of Self-Similarity
- Traffic has similar statistical properties over a range of timescales: milliseconds, seconds, minutes, hours, days.
- Merging traffic (as in a statistical multiplexer) does NOT smooth it.
- Diagram: bursty data streams → aggregation → bursty aggregate streams.

Side-by-side View (figure)

Definitions and Properties
Long-Range Dependence
- The autocorrelation $R_X(t_1, t_2) = E[X(t_1) X(t_2)]$ decays slowly.
Hurst Parameter
- Developed by Harold Hurst (1965) from studies of Nile River flooding over an 800-year period.
- H is a measure of "burstiness", also considered a measure of self-similarity.
- $0.5 < H < 1.0$

Continuous-Time Definition: Hurst Parameter
The process $x(t)$ is self-similar with parameter $H$ if it has the same statistical properties as the process $a^{-H} x(at)$ for any real $a > 0$.
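As a worked consequence of this definition (a sketch that assumes the variance of $x(t)$ is finite, which the slide does not state, and that uses `\overset` and `\operatorname` from amsmath), choosing $a = 1/t$ shows how the variance scales with time:

```latex
\begin{align*}
x(t) &\overset{d}{=} a^{-H}\, x(at) && \text{for all } a > 0 \\
x(t) &\overset{d}{=} t^{H}\, x(1)   && \text{taking } a = 1/t,\ t > 0 \\
\operatorname{Var}[x(t)] &= t^{2H}\, \operatorname{Var}[x(1)]
\end{align*}
```

A larger $H$ therefore means the variability grows faster with the length of the observation window.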

Discrete-Time Definition
- $X = (X_t : t = 0, 1, 2, \ldots)$ is a random process defined at discrete points in time.
- Let $X^{(m)} = \{X_k^{(m)}\}$ denote the new process obtained by averaging the original series $X$ over non-overlapping sub-blocks of size $m$.
- Example: if $X^{(1)} = 4, 12, 34, 2, -6, 18, 21, 35$, then $X^{(2)} = 8, 18, 6, 28$ and $X^{(4)} = 13, 17$.
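The block-averaging step can be sketched in a few lines of Python. The function name `aggregate` is chosen here for illustration and does not come from the slides; incomplete trailing blocks are simply dropped, which is one possible convention.

```python
def aggregate(series, m):
    """Average `series` over consecutive non-overlapping blocks of size m.

    Trailing samples that do not fill a complete block are dropped
    (an assumption; the slides only show series whose length divides evenly).
    """
    if m < 1:
        raise ValueError("block size m must be at least 1")
    n_blocks = len(series) // m
    return [sum(series[i * m:(i + 1) * m]) / m for i in range(n_blocks)]


# The example series from the slide.
x = [4, 12, 34, 2, -6, 18, 21, 35]
print(aggregate(x, 2))  # [8.0, 18.0, 6.0, 28.0]
print(aggregate(x, 4))  # [13.0, 17.0]
```

The output reproduces the slide's values for block sizes 2 and 4.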

Auto-correlation Definition
- X is exactly self-similar if the aggregated processes have the same autocorrelation structure as X, i.e. $r^{(m)}(k) = r(k)$ for $k \ge 0$ and all $m = 1, 2, \ldots$
- X is asymptotically self-similar if the above holds in the limit: $r^{(m)}(k) \to r(k)$ as $m \to \infty$.
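To compare autocorrelation structures in practice, one needs an estimate of r(k) from data. The following is a minimal sketch of the usual sample autocorrelation; the function name `autocorrelation` and the biased normalisation (dividing by the lag-0 sum of squares) are choices made here, not something the slides specify.

```python
def autocorrelation(series, k):
    """Sample autocorrelation of `series` at lag k (biased estimator)."""
    n = len(series)
    if not 0 <= k < n:
        raise ValueError("lag k must satisfy 0 <= k < len(series)")
    mean = sum(series) / n
    total = sum((v - mean) ** 2 for v in series)
    if total == 0:
        raise ValueError("autocorrelation is undefined for a constant series")
    cov = sum((series[t] - mean) * (series[t + k] - mean) for t in range(n - k))
    return cov / total


# A strictly alternating series is almost perfectly anti-correlated at lag 1.
alternating = [1, -1] * 50
print(autocorrelation(alternating, 0))
print(autocorrelation(alternating, 1))
```

Applying the same estimator to a block-averaged copy of a trace gives an empirical check of whether the aggregated process keeps the autocorrelation structure of the original.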

Self-Similarity in Traffic Measurement (Ⅱ): Network Traffic (figure)

Auto-correlation
- Most striking feature of self-similarity: the correlation structures of the aggregated processes do not degenerate as $m \to \infty$.
- This is in contrast to traditional models, where the correlation structures of the aggregated processes degenerate, i.e. $r^{(m)}(k) \to 0$ as $m \to \infty$ for $k = 1, 2, 3, \ldots$

Long Range Dependence
- Processes with long-range dependence are characterized by an autocorrelation function that decays hyperbolically as k increases: $r(k) \sim c\,k^{-\beta}$ with $0 < \beta < 1$.
- Important property: $\sum_{k} r(k) = \infty$. This is also called non-summability of correlation.

Recap
Self-similarity manifests itself in several equivalent fashions:
- Non-degenerate autocorrelations
- Slowly decaying variance
- Long-range dependence
- Hurst effect
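The "slowly decaying variance" property suggests a simple way to estimate H, often called the variance-time method. The sketch below assumes the standard relation Var(X^(m)) ~ m^(-β) with H = 1 - β/2, which is not spelled out in the slides; all function names are hypothetical. For independent samples the variance of block averages falls like 1/m, so the estimate should come out near 0.5, while a self-similar trace would give a value closer to 1.

```python
import math
import random


def aggregate(series, m):
    """Average `series` over non-overlapping blocks of size m."""
    n_blocks = len(series) // m
    return [sum(series[i * m:(i + 1) * m]) / m for i in range(n_blocks)]


def variance(values):
    """Population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def hurst_variance_time(series, block_sizes):
    """Estimate H from the slope of log Var(X^(m)) against log m.

    Assumes Var(X^(m)) ~ m^(-beta), so slope = -beta and H = 1 + slope / 2.
    """
    xs = [math.log(m) for m in block_sizes]
    ys = [math.log(variance(aggregate(series, m))) for m in block_sizes]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    # Ordinary least-squares slope of ys against xs.
    slope = (sum((a - mx) * (b - my) for a, b in zip(xs, ys))
             / sum((a - mx) ** 2 for a in xs))
    return 1 + slope / 2


# Independent Gaussian samples stand in for a "traditional" short-range model.
rng = random.Random(1)
iid = [rng.gauss(0, 1) for _ in range(20000)]
h_iid = hurst_variance_time(iid, [1, 2, 4, 8, 16, 32, 64])
print(round(h_iid, 2))
```

The estimator is crude: it is sensitive to the chosen block sizes and to non-stationarity, so it is best treated as a first diagnostic rather than a definitive measurement.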

The Famous Data
- Leland and Wilson collected hundreds of millions of Ethernet packets without loss, with recorded timestamps accurate to within 100 µs.
- The data were collected from several Ethernet LANs at the Bellcore Morristown Research and Engineering Center at different times over the course of approximately 4 years.

Plots Showing Self-Similarity (Ⅱ) (figure)
Higher traffic, higher H! Panels: high, mid and low traffic. Ranges shown on the plot: 1.3%-10.4%, 3.4%-18.4%, 5.0%-30.7%.

Crucial Findings
- Ethernet LAN traffic is statistically self-similar.
- H is the degree of self-similarity.
- H is a function of utilization.
- H is a measure of "burstiness".
- Models like Poisson are not able to capture self-similarity.
- As the number of Ethernet users increases, the resulting aggregate traffic becomes burstier instead of smoother!

Discussions
How to explain self-similarity?
- Heavy-tailed file sizes
How would this impact existing performance?
- Limited effectiveness of buffering
- Effectiveness of FEC (forward error correction): error control for data transmission in which the sender adds redundant data to its messages, allowing the receiver to detect and correct errors without needing to ask the sender to retransmit

Explaining Self-Similarity
- The superposition of many ON/OFF sources whose ON-periods and OFF-periods exhibit the Noah Effect produces aggregate network traffic that features the Joseph Effect.
- ON/OFF source models are also known as packet train models.
- Noah Effect: high variability or infinite variance.
- Joseph Effect: self-similar or long-range dependent traffic.
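The ON/OFF construction can be sketched as a small simulation. Everything specific here is an assumption made for illustration rather than taken from the slides or the paper: time is divided into unit slots, each source emits one unit per slot while ON, ON and OFF durations are both drawn from a Pareto distribution with the same shape `alpha` and minimum 1 (rounded up to whole slots), and each source starts ON or OFF with equal probability.

```python
import math
import random


def pareto_duration(rng, alpha):
    """Draw a Pareto(alpha) duration with minimum 1, rounded up to whole slots."""
    u = 1.0 - rng.random()  # uniform on (0, 1], avoids raising 0 to a negative power
    return math.ceil(u ** (-1.0 / alpha))


def on_off_aggregate(n_sources, n_slots, alpha, seed=0):
    """Return the number of active (ON) sources in each time slot."""
    rng = random.Random(seed)
    counts = [0] * n_slots
    for _ in range(n_sources):
        t = 0
        on = rng.random() < 0.5  # random initial state
        while t < n_slots:
            duration = pareto_duration(rng, alpha)
            if on:
                for slot in range(t, min(t + duration, n_slots)):
                    counts[slot] += 1
            t += duration
            on = not on  # alternate between ON and OFF periods
    return counts


# 1 < alpha < 2 gives finite mean but infinite variance (the Noah Effect).
traffic = on_off_aggregate(n_sources=20, n_slots=500, alpha=1.5, seed=1)
print(traffic[:20])
```

Feeding the resulting series into a block-averaging or Hurst-estimation routine is one way to observe that the aggregate stays bursty over many time scales.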

The Noah Effect
- The Noah Effect is the essential point of departure from traditional to self-similar traffic modeling.
- It results in highly variable ON and OFF periods: train lengths and inter-train distances can be very large with non-negligible probability.
- Infinite Variance Syndrome: many naturally occurring phenomena can be well described by infinite-variance distributions.
- Heavy-tailed distributions, characterized by the parameter $\alpha$.

Traditional Models
- Traditional traffic models are finite-variance ON/OFF source models.
- The superposition of such sources behaves like white noise, with only short-range correlations.

The Heavy-Tail Distribution
- A distribution is said to be heavy-tailed if $P[U > u] \sim c\,u^{-\alpha}$ as $u \to \infty$, with $1 < \alpha < 2$.   (1)
- Property (1) is the infinite variance syndrome, or the Noah Effect: $\alpha < 2$ implies $E(U^2) = \infty$, while $\alpha > 1$ ensures that $E(U) < \infty$.
- The asymptotic shape of the distribution is hyperbolic.
- The simplest heavy-tailed distribution is the Pareto distribution.
- Example: the sizes of files transferred from a web server are heavy-tailed. A large number of small files are transferred but, crucially, the number of very large files transferred remains significant.
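A Pareto distribution with minimum 1 has the exact tail P[U > u] = u^(-α) for u ≥ 1, so it is a convenient way to see the hyperbolic tail numerically. The sketch below draws samples by inverse-transform sampling and compares the empirical tail with the theoretical one; the function names, the shape value 1.5 and the thresholds are illustrative choices, not values from the slides.

```python
import random


def pareto_sample(rng, alpha, minimum=1.0):
    """Draw one Pareto(alpha) sample by inverting the tail P[U > u] = (minimum/u)^alpha."""
    u = 1.0 - rng.random()  # uniform on (0, 1]
    return minimum * u ** (-1.0 / alpha)


def empirical_tail(samples, threshold):
    """Fraction of samples strictly greater than `threshold`."""
    return sum(1 for s in samples if s > threshold) / len(samples)


rng = random.Random(42)
alpha = 1.5  # 1 < alpha < 2: finite mean, infinite variance
samples = [pareto_sample(rng, alpha) for _ in range(100_000)]

for threshold in (10.0, 100.0):
    # Empirical tail next to the theoretical value threshold^(-alpha).
    print(threshold, empirical_tail(samples, threshold), threshold ** -alpha)
```

Even at 100 times the minimum, a non-negligible share of samples remains, which is the "very large files remain significant" behaviour described in the example above.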

Important Findings
- Most surprising result: the Noah Effect is extremely widespread, regardless of source machine (file server or client machine).
- Explanations:
  - Hyperbolic tail behavior of the sizes of stored files
  - Pareto-like tail behavior of UNIX process run times
  - Human-computer interactions occur over a wide range of timescales
- Although network traffic is intrinsically complex, parsimonious modeling is still possible: estimating a single parameter $\alpha$ (the intensity of the Noah Effect) is enough.

An Example: File Size Distribution on a Win2000 Machine (figure)

Impact of Self-Similarity (figure)

Conclusion
- The presence of the Noah Effect in measured Ethernet LAN traffic is confirmed.
- The superposition of many ON/OFF sources with the Noah Effect results in aggregate packet streams that are consistent with measured network traffic and exhibit self-similar or fractal properties.
- Self-similarity in packetised data networks is attributed to the distribution of file sizes, human interactions and/or Ethernet dynamics.
- This work spawned further research throughout the networking community.

Self-Similarity and Long-Range Dependence in Networks
- Vern Paxson and Sally Floyd, "Wide-Area Traffic: The Failure of Poisson Modeling".
- Mark E. Crovella and Azer Bestavros, "Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes". Shows that self-similarity in Web traffic can be explained by the underlying distribution of transferred document sizes, the effects of caching and user preference in file transfer, the effect of user "think time", and the superimposition of many such transfers in a local area network.
- A. Feldmann, A. C. Gilbert, W. Willinger, and T. G. Kurtz, "The Changing Nature of Network Traffic: Scaling Phenomena".
- Mark Garrett and Walter Willinger, "Analysis, Modeling and Generation of Self-Similar VBR Video Traffic". Shows that the marginal bandwidth distribution can be described as heavy-tailed and that the video sequence itself is long-range dependent and can be modeled using a self-similar process. Presents a new source model for VBR video traffic and describes how it may be used to generate VBR traffic synthetically.

Heavy-Tailed Distributions in Network Traffic
- Gordon Irlam, "Unix File Size Survey".
- Will Leland and Teun Ott, "Load-balancing Heuristics and Process Behavior".
- Mor Harchol-Balter and Allen Downey, "Exploiting Process Lifetime Distributions for Dynamic Load Balancing".
- Carlos Cunha, Azer Bestavros, Mark Crovella, "Characteristics of WWW Client-based Traces". Presents some of the first Web client measurements ever made. It characterizes traces taken using an instrumented version of Mosaic in a university computer lab and shows that a number of Web properties can be modeled using heavy-tailed distributions. These properties include document size, user requests for a document, and document popularity.