CS 6401 Network Traffic Characteristics Outline Motivation Self-similarity Ethernet traffic WAN traffic Web traffic.

Slides:



Advertisements
Similar presentations
Estimation and identification of long-range dependence in Internet traffic Thomas Karagiannis University of California,
Advertisements

Doc.: IEEE /1216r1 Submission November 2009 BroadcomSlide 1 Internet Traffic Modeling Date: Authors: NameAffiliationsAddressPhone .
Copyright © 2005 Department of Computer Science CPSC 641 Winter Self-Similarity in WAN Traffic A subsequent paper established the presence of network.
Copyright © 2005 Department of Computer Science CPSC 641 Winter Self-Similar Network Traffic The original paper on network traffic self-similarity.
Lesson 17: Models for Traffic Sources Giovanni Giambene Queuing Theory and Telecommunications: Networks and Applications 2nd edition, Springer All rights.
Network and Service Assurance Laboratory Analysis of self-similar Traffic Using Multiplexer & Demultiplexer Loaded with Heterogeneous ON/OFF Sources Huai.
Toyota InfoTechnology Center U.S.A, Inc. 1 Mixture Models of End-host Network Traffic John Mark Agosta, Jaideep Chandrashekar, Mark Crovella, Nina Taft.
2014 Examples of Traffic. Video Video Traffic (High Definition) –30 frames per second –Frame format: 1920x1080 pixels –24 bits per pixel  Required rate:
Finding Self-similarity in People Opportunistic Networks Ling-Jyh Chen, Yung-Chih Chen, Paruvelli Sreedevi, Kuan-Ta Chen Chen-Hung Yu, Hao Chu.
Computer Science Generating Streaming Access Workload for Performance Evaluation Shudong Jin 3nd Year Ph.D. Student (Advisor: Azer Bestavros)
2  Something “feels the same” regardless of scale 4 What is that???
Chen Chu South China University of Technology. 1. Self-Similar process and Multi-fractal process There are 3 different definitions for self-similar process.
1 Network Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
1 Self-Similar Wide Area Network Traffic Carey Williamson University of Calgary.
1 Self-Similar Ethernet LAN Traffic Carey Williamson University of Calgary.
Model Fitting Jean-Yves Le Boudec 0. Contents 1 Virus Infection Data We would like to capture the growth of infected hosts (explanatory model) An exponential.
CMPT 855Module Network Traffic Self-Similarity Carey Williamson Department of Computer Science University of Saskatchewan.
On the Self-Similar Nature of Ethernet Traffic - Leland, et. Al Presented by Sumitra Ganesh.
October 14, 2002MASCOTS Workload Characterization in Web Caching Hierarchies Guangwei Bai Carey Williamson Department of Computer Science University.
Finding Self-similarity in Opportunistic People Networks Yung-Chih Chen 1 Ling-Jyh Chen 1, Yung-Chih Chen 1, Tony Sun 2 Paruvelli Sreedevi 1, Kuan-Ta Chen.
On the Constancy of Internet Path Properties Yin Zhang, Nick Duffield AT&T Labs Vern Paxson, Scott Shenker ACIRI Internet Measurement Workshop 2001 Presented.
Small scale analysis of data traffic models B. D’Auria - Eurandom joint work with S. Resnick - Cornell University.
無線區域網路中自我相似交通流量之 成因與效能評估 The origin and performance impact of self- similar traffic for wireless local area networks 報 告 者:林 文 祺 指導教授:柯 開 維 博士.
A Hierarchical Characterization of a Live Streaming Media Workload E. Veloso, V. Almeida W. Meira, A. Bestavros, S. Jin Proceedings of Internet Measurement.
Network Traffic Measurement and Modeling CSCI 780, Fall 2005.
A Nonstationary Poisson View of Internet Traffic T. Karagiannis, M. Molle, M. Faloutsos University of California, Riverside A. Broido University of California,
A Hierarchical Characterization of a Live Streaming Media Workload IEEE/ACM Trans. Networking, Feb Eveline Veloso, Virg í lio Almeida, Wagner Meira,
Self-Similarity in Network Traffic Kevin Henkener 5/29/2002.
1 Interesting Links
Variance of Aggregated Web Traffic Robert Morris MIT Laboratory for Computer Science IEEE INFOCOM 2000’
Copyright © 2005 Department of Computer Science CPSC 641 Winter Network Traffic Measurement A focus of networking research for 20+ years Collect.
CSE 561 – Traffic Models David Wetherall Spring 2000.
Origins of Long Range Dependence Myths and Legends Aleksandar Kuzmanovic 01/08/2001.
Self-Similar through High-Variability: Statistical Analysis of Ethernet LAN Traffic at the Source Level Walter Willinger, Murad S. Taqqu, Robert Sherman,
Internet Traffic Modeling Poisson Model vs. Self-Similar Model By Srividhya Chandrasekaran Dept of CS University of Houston.
Self-Similar Traffic COMP5416 Advanced Network Technologies.
Self-Similarity of Network Traffic Presented by Wei Lu Supervised by Niclas Meier 05/
1 Network Simulation and Testing Polly Huang EE NTU
Applications of Poisson Process
1 Chapters 9 Self-SimilarTraffic. Chapter 9 – Self-Similar Traffic 2 Introduction- Motivation Validity of the queuing models we have studied depends on.
References for M/G/1 Input Process
Network Traffic Modeling Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, March 6.
Traffic Modeling.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Dr. Nawaporn Wisitpongphan. What do you need in order to conduct a PhD Research? A lot of reading in order to Find a PhD thesis topic Find out what other.
1 FARIMA(p,d,q) Model and Application n FARIMA Models -- fractional autoregressive integrated moving average n Generating FARIMA Processes n Traffic Modeling.
COMPSAC'14 - N. Larrieu /07/ How to generate realistic network traffic? Antoine VARET and Nicolas LARRIEU COMPSAC – Vasteras – July the 23.
1 Self Similar Traffic. 2 Self Similarity The idea is that something looks the same when viewed from different degrees of “magnification” or different.
A Nonstationary Poisson View of Internet Traffic Thomas Karagiannis joint work with Mart Molle, Michalis Faloutsos, Andre Broido.
CS 6401 Network Performance Measurement and Analysis Outline Measurement Tools and Techniques Workload generation Analysis Basic statistics Queuing models.
Measurement in the Internet Measurement in the Internet Paul Barford University of Wisconsin - Madison Spring, 2001.
Risk Analysis Workshop April 14, 2004 HT, LRD and MF in teletraffic1 Heavy tails, long memory and multifractals in teletraffic modelling István Maricza.
1 CS 268: Lecture 14 Internet Measurements Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer Sciences.
1 Hailuoto Workshop A Statistician ’ s Adventures in Internetland J. S. Marron Department of Statistics and Operations Research University of North Carolina.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
Notices of the AMS, September Internet traffic Standard Poisson models don’t capture long-range correlations. Poisson Measured “bursty” on all time.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VIII Web Performance Modeling (Book, Chapter 10)
1 Interesting Links. On the Self-Similar Nature of Ethernet Traffic Will E. Leland, Walter Willinger and Daniel V. Wilson BELLCORE Murad S. Taqqu BU Analysis.
Queuing Theory and Traffic Analysis
CMPE 252A: Computer Networks
Network Performance Measurement and Analysis
Internet Traffic Modeling
Interesting Links.
Minimal Envelopes.
Evaluation of Load Balancing Algorithms and Internet Traffic Modeling for Performance Analysis By Arthur L. Blais.
Notices of the AMS, September 1998
Mark E. Crovella and Azer Bestavros Computer Science Dept,
Presented by Chun Zhang 2/14/2003
Network Traffic Modeling
CPSC 641: Network Traffic Self-Similarity
Presentation transcript:

CS 6401 Network Traffic Characteristics Outline Motivation Self-similarity Ethernet traffic WAN traffic Web traffic

CS 6402 Motivation for Network Traffic Study Understanding network traffic behavior is essential for all aspects of network design and operation –Component design –Protocol design –Provisioning –Management –Modeling and simulation

CS 6403 Literature and Today’s Focus W. Leland, M. Taqqu, W. Willinger, D. Wilson, On the Self-Similar Nature of Ethernet Traffic, IEEE/ACM TON, –Baker Award winner V. Paxson, S. Floyd, Wide-Area Traffic: The Failure of Poisson Modeling, IEEE/ACM TON, M. Crovella, A. Bestavros, Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes, IEEE/ACM TON, 1997.

CS 6404 In the Past… Traffic modeling in the world of telephony was the basis for initial network models –Assumed Poisson arrival process –Assumed Poisson call duration –Well established queuing literature based on these assumptions –Enabled very successful engineering of telephone networks “Engineering for Mother’s Day”

CS 6405 The Story Begins with Measurement In 1989, Leland and Wilson begin taking high resolution traffic traces at Bellcore –Ethernet traffic from a large research lab –100  sec time stamps –Packet length, status, 60 bytes of data –Mostly IP traffic (a little NFS) –Four data sets over three year period –Over 100m packets in traces –Traces considered representative of normal use

CS 6406 Fractals

CS 6407 The packet count picture tells all A Poisson process –When observed on a fine time scale will appear bursty –When aggregated on a coarse time scale will flatten (smooth) to white noise A Self-Similar (fractal) process –When aggregated over wide range of time scales will maintain its bursty characteristic

CS 6408 Self-similarity manifests itself in several equivalent fashions: –Slowly decaying variance –Long range dependence –Non-degenerate autocorrelations –Hurst effect Self-similarity: manifestations

CS 6409 Definition of Self-Similarity Self-similar processes are the simplest way to model processes with long-range dependence – correlations that persist (do not degenerate) across large time scales The autocorrelation function r(k) of a process (statistical measure of the relationship, if any, between a random variable and itself, at different time lags)with long-range dependence is not summable: –  r(k) = inf. – r(k)  k -  as k  inf. for 0 <  < 1 Autocorrelation function follows a power law Slower decay than exponential process –Power spectrum is hyperbolic rising to inf. at freq. 0 –If  r(k) < inf. then you have short-range dependence

CS Self-Similarity contd. Consider a zero-mean stationary time series X = (X t ;t = 1,2,3,…), we define the m-aggregated series X (m) = (X k (m) ;k = 1,2,3,…) by summing X over blocks of size m. We say X is H-self-similar if for all positive m, X (m) has the same distribution as X rescaled by m H. If X is H-self-similar, it has the same autocorrelation function r(k) as the series X (m) for all m. This is actually distributional self-similarity. Degree of self-similarity is expressed as the speed of decay of series autocorrelation function using the Hurst parameter –H = 1 -  /2 –For SS series with LRD, ½ < H < 1 –Degree of SS and LRD increases as H  1

CS Graphical Tests for Self-Similarity Variance-time plots –Relies on slowly decaying variance of self-similar series –The variance of X (m) is plotted versus m on log-log plot –Slope (-  greater than –1 is indicative of SS R/S plots –Relies on rescaled range (R/S)statistic growing like a power law with H as a function of number of points n plotted. –The plot of R/S versus n on log-log has slope which estimates H Periodogram plot –Relies on the slope of the power spectrum of the series as frequency approaches zero –The periodogram slope is a straight line with slope  – 1 close to the origin

CS Graphical test examples – VT plot

CS Graphical test example – R/S plot

CS Graphical test examples - Periodogram

CS Non-Graphical Self-Similarity Test Whittle’s MLE Procedure –Provides confidence intervals for estimation of H –Requires an underlying stochastic process for estimate Typical examples are FGN and FARIMA FGN assumes no SRD

CS Analysis of Ethernet Traffic Analysis of traffic logs from perspective of packets/time unit found H to be between 0.8 and –Aggregations over many orders of magnitude –Effects seem to increase over time –Initial looks at external traffic pointed to similar behavior Paper also discusses engineering implications of these results –Burstiness –Synthetic traffic generation

CS Major Results of LTWW94 First use of VERY large measurements in network research Very high degree of statistical rigor brought to bare on the problem Blew away prior notions of network traffic behavior –Ethernet packet traffic is self-similar Led to ON/OFF model of network traffic [WTSW97]

CS What about wide area traffic? Paxson and Floyd evaluated 24 traces of wide-area network traffic –Traces included both Bellcore traces and five other sites taken between ’89 and ‘95 –Focus was on both packet and session behavior TELNET and FTP were applications considered –Millions of packets and sessions analyzed

CS TCP Connection Interarrivals The behavior analyzed was TCP connection start times –Dominated by diurnal traffic cycle –A simple statistical test was developed to assess accuracy of Poisson assumption Exponential distribution of interarrivals Independence of interarrivals –TELNET and FTP connection interarrivals are well modeled by a Poisson process Evaluation over 1 hour and 10 minute periods –Other applications (NNTP, SMTP, WWW, FTP DATA) are not well modeled by Poisson

CS TELNET Packet Interarrivals The interarrival times of TELNET originator’s packets was analyzed. –Process was shown to be heavy-tailed P[X > x] ~ x -  as x  inf. and 0 <  < 2 Simplest heavy-tailed distribution is the Pareto which is hyperbolic over its entire range –p(x) =  k  x – ,  k > 0, x >=k –If  < 2, the distribution has infinite variance – If  < 1, the distribution has infinite mean –It’s all about the tail! –Variance-Time plots indicate self-similarity

CS TELNET Session Size (packets) Size of TELNET session measured by number of originator packets transferred –Log-normal distribution was good model for session size in packets –Log-extreme has been used to model session size in bytes in prior work Putting this together with model for arrival processes results in a well fitting model for TELNET traffic

CS FTPDATA Analysis FTPDATA refers to data transferred after FTP session start –Packet arrivals within a connection are not treated –Spacing between DATA connections is shown to be heavy tailed Bimodal (due to mget) and can be approximated by log-normal distribution –Bytes transferred Very heavy tailed characteristic Most bytes transferred are contained in a few transfers

CS Self-Similarity of WAN Traffic Variance-time plots for packet arrivals for all applications indicate WAN traffic is consistent with self-similarity –The authors were not able to develop a single Hurst parameter to characterize WAN traffic

CS Major Results of PF95 Verify that TCP session arrivals are well modeled by a Poisson process Showed that a number of WAN characteristics were well modeled by heavy tailed distributions Establish that packet arrival process for two typical applications (TELNET, FTP) as well as aggregate traffic is self-similar Provide further statistical methods for generating self- similar traffic

CS What about WWW traffic? Crovella and Bestavros analyze WWW logs collected at clients over a 1.5 month period –First WWW client study –Instrumented MOSAIC ~600 students ~130K files transferred ~2.7GB data transferred

CS Self-Similar Aspects of Web traffic One difficulty in the analysis was finding stationary, busy periods –A number of candidate hours were found All four tests for self-similarity were employed – 0.7 < H < 0.8

CS Explaining Self-Similarity Consider a set of processes which are either ON or OFF –The distribution of ON and OFF times are heavy tailed  1  2  –The aggregation of these processes leads to a self- similar process H = (3 - min  1  2  )/2 [WTSW97] So, how do we get heavy tailed ON or OFF times?

CS Heavy Tailed ON Times and File Sizes Analysis of client logs showed that ON times were, in fact, heavy tailed –  ~ 1.2 –Over about 3 orders of magnitude This lead to the analysis of underlying file sizes –  ~ 1.1 –Over about 4 orders of magnitude –Similar to FTP traffic Files available from UNIX file systems are typically heavy tailed

CS Heavy Tailed OFF times Analysis of OFF times showed that they are also heavy tailed –  ~ 1.5 Distinction between Active and Passive OFF times –Inter vs. Intra click OFF times Thus, ON times are more likely to be cause of self-similarity

CS Major Results of CB97 Established that WWW traffic was self-similar Modeled a number of different WWW characteristics (focus on the tail) Provide an explanation for self-similarity of WWW traffic based on underlying file size distribution

CS Where are we now? There is no mechanistic model for Internet traffic –Topology? –Routing? People want to blame the protocols for observed behavior Multiresolution analysis may provide a means for better models Many people (vendors) chose to ignore self-similarity Lots of opportunity!!