Network Traffic Modeling Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, March 6.

Slides:



Advertisements
Similar presentations
A Measurement Study of Peer-to-Peer File Sharing Systems Presented by Cristina Abad.
Advertisements

Doc.: IEEE /1216r1 Submission November 2009 BroadcomSlide 1 Internet Traffic Modeling Date: Authors: NameAffiliationsAddressPhone .
The Effects of Wide-Area Conditions on WWW Server Performance Erich Nahum, Marcel Rosu, Srini Seshan, Jussara Almeida IBM T.J. Watson Research Center,
Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.
Network and Service Assurance Laboratory Analysis of self-similar Traffic Using Multiplexer & Demultiplexer Loaded with Heterogeneous ON/OFF Sources Huai.
Finding Self-similarity in People Opportunistic Networks Ling-Jyh Chen, Yung-Chih Chen, Paruvelli Sreedevi, Kuan-Ta Chen Chen-Hung Yu, Hao Chu.
Computer Science Generating Streaming Access Workload for Performance Evaluation Shudong Jin 3nd Year Ph.D. Student (Advisor: Azer Bestavros)
2  Something “feels the same” regardless of scale 4 What is that???
Adapted from Menascé & Almeida.1 Workload Characterization for the Web.
2005/2/23 HUT T Characterizing Web Workload of Mobile Clients Chuang Yu Juha Raitio.
1 10 Web Workload Characterization Web Protocols and Practice.
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
1 Network Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
1 Self-Similar Ethernet LAN Traffic Carey Williamson University of Calgary.
CMPT 855Module Network Traffic Self-Similarity Carey Williamson Department of Computer Science University of Saskatchewan.
On the Self-Similar Nature of Ethernet Traffic - Leland, et. Al Presented by Sumitra Ganesh.
1 CPSC : Project Brainstorming Session Carey Williamson Department of Computer Science University of Calgary.
October 14, 2002MASCOTS Workload Characterization in Web Caching Hierarchies Guangwei Bai Carey Williamson Department of Computer Science University.
1 Web Performance Modeling Chapter New Phenomena in the Internet and WWW Self-similarity - a self-similar process looks bursty across several time.
ISCSI Performance in Integrated LAN/SAN Environment Li Yin U.C. Berkeley.
A Hierarchical Characterization of a Live Streaming Media Workload E. Veloso, V. Almeida W. Meira, A. Bestavros, S. Jin Proceedings of Internet Measurement.
Hardware-based Load Generation for Testing Servers Lorenzo Orecchia Madhur Tulsiani CS 252 Spring 2006 Final Project Presentation May 1, 2006.
Network Traffic Measurement and Modeling CSCI 780, Fall 2005.
Self-Similarity in Network Traffic Kevin Henkener 5/29/2002.
Copyright © 2005 Department of Computer Science CPSC 641 Winter Network Traffic Measurement A focus of networking research for 20+ years Collect.
CSE 561 – Traffic Models David Wetherall Spring 2000.
Web Caching Robert Grimm New York University. Before We Get Started  Illustrating Results  Type Theory 101.
Copyright © 2005 Department of Computer Science CPSC 641 Winter LAN Traffic Measurements Some of the first network traffic measurement papers were.
Connection Admission Control Schemes for Self-Similar Traffic Yanping Wang Carey Williamson University of Saskatchewan.
CS 6401 Network Traffic Characteristics Outline Motivation Self-similarity Ethernet traffic WAN traffic Web traffic.
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
Web Caching and Content Delivery. Caching for a Better Web Performance is a major concern in the Web Proxy caching is the most widely used method to improve.
Self-Similar Traffic COMP5416 Advanced Network Technologies.
Self-Similarity of Network Traffic Presented by Wei Lu Supervised by Niclas Meier 05/
1 Chapters 9 Self-SimilarTraffic. Chapter 9 – Self-Similar Traffic 2 Introduction- Motivation Validity of the queuing models we have studied depends on.
Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.
1 CS 425 Distributed Systems Fall 2011 Slides by Indranil Gupta Measurement Studies All Slides © IG Acknowledgments: Jay Patel.
Spatio-Temporal Modeling of Traffic Workload in a Campus WLAN Felix Hernandez-Campos 3 Merkouris Karaliopoulos 2 Maria Papadopouli 1,2,3 Haipeng Shen 2.
1 Self Similar Traffic. 2 Self Similarity The idea is that something looks the same when viewed from different degrees of “magnification” or different.
Multiplicative Wavelet Traffic Model and pathChirp: Efficient Available Bandwidth Estimation Vinay Ribeiro.
Empirical Quantification of Opportunities for Content Adaptation in Web Servers Michael Gopshtein and Dror Feitelson School of Engineering and Computer.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web.
Self-generated Self-similar Traffic Péter Hága Péter Pollner Gábor Simon István Csabai Gábor Vattay.
Burst Metric In packet-based networks Initial Considerations for IPPM burst metric Tuesday, March 21, 2006.
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
Risk Analysis Workshop April 14, 2004 HT, LRD and MF in teletraffic1 Heavy tails, long memory and multifractals in teletraffic modelling István Maricza.
1 Apricot2001 Effectiveness of VLAN Chan Wai Kok Faculty of Information Technology Salim Beg Faculty of Engineering.
#16 Application Measurement Presentation by Bobin John.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
Notices of the AMS, September Internet traffic Standard Poisson models don’t capture long-range correlations. Poisson Measured “bursty” on all time.
System Simulation (CAP 4800) May 30, of xx Notes on Barford SURGE paper Ken Christensen Department of Computer Science and Engineering College of.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VIII Web Performance Modeling (Book, Chapter 10)
1 Interesting Links. On the Self-Similar Nature of Ethernet Traffic Will E. Leland, Walter Willinger and Daniel V. Wilson BELLCORE Murad S. Taqqu BU Analysis.
1 Tuning RED for Web Traffic SIGCOMM 2000 Paper by M. Christiansen, K. Jeffray, D. Ott, F.D. Smith, UNC – Chapel Hill CS 590 F Fall 2000 Paper presentation.
An IP-based multimedia traffic generator
Chapter 6 The Transport Layer.
Internet Traffic Modeling
CPSC 641: Network Measurement
Evaluation of Load Balancing Algorithms and Internet Traffic Modeling for Performance Analysis By Arthur L. Blais.
Notices of the AMS, September 1998
Self-similar Distributions
Mark E. Crovella and Azer Bestavros Computer Science Dept,
Presented by Chun Zhang 2/14/2003
Network Traffic Modeling
Performance Evaluation of Computer Networks
Performance Evaluation of Computer Networks
CPSC 641: Network Measurement
The Transport Layer Chapter 6.
CPSC 641: Network Traffic Self-Similarity
Presentation transcript:

Network Traffic Modeling Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, March 6

03/06/2002CSE581, Winter 2002 | Punit Shah 2 Papers Generating Representative Web Workloads for Networks and Server performance Evaluation –Paul Bardford, Mark Crovells. Comp Sci Department, Boston University. Self-Similarity in WWW traffic: Evidence and possible cause –Mark Crovells, Azer Bestavros. Comp Sci Department, Boston University. On the Self-Similar Nature of Ethernet Traffic –Will Leland et al. IEEE members. Funded by Boston University.

03/06/2002CSE581, Winter 2002 | Punit Shah 3 Traffic modeling Understand a nature of the network traffic – Establish a traffic pattern –Characteristics, metrics varies by the network stack layer

03/06/2002CSE581, Winter 2002 | Punit Shah 4 Why to model a traffic ? Understand behavior of the servers, network etc. in workload conditions. –Capacity management –infrastructure planning –Performance improvement –Design of the software and services –Testing and Validation Developing a simulators (work load generators), e.g. ns (CMU), SURGE, SpecWeb96 and many commercially available simulators.

03/06/2002CSE581, Winter 2002 | Punit Shah 5 Model Parameters Application layer (HTTP) –server file size distribution –request size distribution (file size + protocol headers) –temporal locality (caching) etc. Data Link layer (Ethernet) –packets per second –mean time between two consecutive packets –bandwidth utilization –effect of number of hosts etc.

03/06/2002CSE581, Winter 2002 | Punit Shah 6 Time Series Analysis Primer Correlation –Under similar circumstances if any two events exhibits an identical(opposite) pattern, then events are called positively(negatively) correlated. –Range for degree of correlation is [-1, 1]. –Correlation models. Long range dependence –Current event is positively correlated to the future event. Heavy tail –Non-negligible random distribution in the tail, e.g. hyperbolic CDF plot. Simplest distribution is Pareto. p(x) ~ x -  ; 0<  < 2

03/06/2002CSE581, Winter 2002 | Punit Shah 7 Self-Similarity Term introduced by Mandelbrot in Let X = (Xt: t = 0, 1, 2, ….) be a time series mean  and variance  2 lim r(k) = k (-  ), 0 <  < 1 k  autocorrelation function For each m = 1, 2, 3 … X (m) = (X k (m) : k = 1, 2 …m) is new time series, i.e. original series is divided into m non- overlapping segments, whose autocorrelation function is r (m) (k). If r (m) (k) = r(k), then X is called (asymptotically) second order self-similar with degree H = 1 -  /2. Where X k (m) = (X km-m+1 + … + X km )/m Also by  k r(k) = , self-similar means long-range dependence.

03/06/2002CSE581, Winter 2002 | Punit Shah 8 Self-similar

03/06/2002CSE581, Winter 2002 | Punit Shah 9 Self-similar  Xi = i=1,m ‘Self-Similarity’ == Burstiness

03/06/2002CSE581, Winter 2002 | Punit Shah 10 Ethernet Traffic Data Collection Data collected over four years, Aug 1989 to Feb 1992 to account for various network topologies. Main traffic at the time (1994) rlogin,e- mail, NFS, local radio station audio. Hosts ~27M packets. An instance of data collection encompassed low, medium, busy hours. Timestamp with 20  s accuracy.

03/06/2002CSE581, Winter 2002 | Punit Shah 11 Packets/unit time (empirical)

03/06/2002CSE581, Winter 2002 | Punit Shah 12 Packets/unit time (synthetic)

03/06/2002CSE581, Winter 2002 | Punit Shah 13 Statistical tests for self-similarity Variance-time plot –variance of log(X (m) ) is plotted against log(m); straight line with slope -  > -1; H = 1 -  /2 R/S plot (rescaled adjusted range stats.) –plot grows according to power law with exponent H as a n, i.e. n H periodogram –slope of the power spectrum of the series

03/06/2002CSE581, Winter 2002 | Punit Shah 14 Ethernet Variance Time plot Increasing m, slowly decreasing variance. Curve will cross threshold-line, if not self-similar.

03/06/2002CSE581, Winter 2002 | Punit Shah 15 Ethernet Traffic Analysis Ethernet traffic is self-similar. Unlike common belief, during busy times degree if self-similarity (burstiness) increases. >>50% traffic TCP packets, but no apparent effect of the non-TCP packets.

03/06/2002CSE581, Winter 2002 | Punit Shah 16 Web Traffic Data collection Traces collected from the real users accessing the web documents (Nov 94 - May 95) using HTTP v0.9 and 1.0 (No parallel connections) –4700 sessions –591 users –575,775 URL requests (46,830 unique per session) –130,140 files transferred Each file request is logged –URL –session, user, workstation ID –timestamp –size of doc, file transfer time

03/06/2002CSE581, Winter 2002 | Punit Shah 17 Trace Analysis Web traffic is self-similar

03/06/2002CSE581, Winter 2002 | Punit Shah 18 Reasons for the self-similarity Web transmission times –Distribution is highly variable. Available files are heavy-tailed. –Multi-media files to be blamed (image, audio, video) Quite time –Active off and inactive off

03/06/2002CSE581, Winter 2002 | Punit Shah 19 Quite Times

03/06/2002CSE581, Winter 2002 | Punit Shah 20 Quite Time Distribution

03/06/2002CSE581, Winter 2002 | Punit Shah 21 Generating Web Workload SURGE User Equivalence (UE) –Synthesized behavior should emulate the users –Multi-threaded program. HTTP v1.0. No parallel connections Distribution models –File sizes –Request sizes File size + Protocol Headers zero, if already cached Popularity –Zipf’s law: if files are ordered in decreasing popularity, then reference to a file is inversely proportional to its rank. P  1/r –Empirical data shows the popular web-docs are extremely popular and others receive a few hits

03/06/2002CSE581, Winter 2002 | Punit Shah 22 Model Parameters (contd.) Embedded object count –Determines a quite time, specifically ‘active off’ Temporal Locality (Caching) –Probability that same object would be requested again –Effect on network access –Stack distance OFF Times –Important parameter, self-similarity is lost if OFF times are ignored Matching problem: Assign the popularity to each file for given distribution of the file size and empirical request size (count?) distribution

03/06/2002CSE581, Winter 2002 | Punit Shah 23 SURGE Approach Use different (well known) models for each of the model parameter

03/06/2002CSE581, Winter 2002 | Punit Shah 24 SURGE Validation Compared with SpecWeb96 (specbench.org) –#of HTTP requests per second (h) –#of threads (t), per thread h/t requests Packets/sec - baseline –tests for 70,300, 500 packets/sec

03/06/2002CSE581, Winter 2002 | Punit Shah 25 Results Roughly similar #of TCP packets and requests in 30min run Mean active TCP connection is v/s 13.9 for SURGE, with very high variance of 3.92 (0.18) indicating self-similarity Server CPU utilization, active TCP connections are quite higher then the SepcWeb96

03/06/2002CSE581, Winter 2002 | Punit Shah 26 Active TCP Connections PPS  SpecWeb96 SURGE

03/06/2002CSE581, Winter 2002 | Punit Shah 27 CPU Utilization SpecWeb96 SURGEPPS 

03/06/2002CSE581, Winter 2002 | Punit Shah 28 Self-Similarity SpecWeb96 SURGEPPS 

03/06/2002CSE581, Winter 2002 | Punit Shah 29 Conclusion Self-similarity (burstiness) is integral part of the network traffic behavior. Degree of self-similarity increases with the load. Server and network load is radically different than the non-self-similar models. Nature of the congestion produced by the self- similar traffic is drastically different from the non self-similar traffic.