Tracking the Evolution of Web Traffic: 1999-2003 Felix Hernandez-Campos, Kevin Jeffay F. Donelson Smith IEEE/ACM International Symposium on Modeling, Analysis.

Slides:



Advertisements
Similar presentations
Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.
Advertisements

Camarillo / Schulzrinne / Kantola November 26th, 2001 SIP over SCTP performance analysis
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Network Services Networking for Home and Small Businesses – Chapter 6.
Workloads Experimental environment prototype real sys exec- driven sim trace- driven sim stochastic sim Live workload Benchmark applications Micro- benchmark.
1 “Tracking the Evolution of Web Traffic: Felix Hernandez-Campos, Kevin Jeffay, F. Donelson Smith IEEE/ACM International Symposium on Modeling,
The War Between Mice and Elephants LIANG GUO, IBRAHIM MATTA Computer Science Department Boston University ICNP (International Conference on Network Protocols)
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
Multi-Layer Analysis of Web Browsing Performance for Wireless PDAs Adesola Omotayo & Carey Williamson June 1, 2015.
An Empirical Study of Real Audio Traffic A. Mena and J. Heidemann USC/Information Sciences Institute In Proceedings of IEEE Infocom Tel-Aviv, Israel March.
Fresh Analysis of Streaming Media Stored on the Web Rabin Karki M.S. Thesis Presentation Advisor: Mark Claypool Reader: Emmanuel Agu 10 Jan, 2011.
The War Between Mice and Elephants Presented By Eric Wang Liang Guo and Ibrahim Matta Boston University ICNP
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Hypertext Transfer Protocol Kyle Roth Mark Hoover.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Advanced Computer Networks - RED Tuning Paper 1 Tuning RED for Web Traffic Mikkel Christiansen, Kevin Jeffay, David Ott, Donelson Smith UNC at Chapel Hill.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
The Transport Layer Chapter 6. The Transport Service Services Provided to the Upper Layers Transport Service Primitives Berkeley Sockets An Example of.
ACN: RED Tuning Paper1 Tuning RED for Web Traffic Mikkel Christiansen, Kevin Jeffay, David Ott, Donelson Smith UNC at Chapel Hill SIGCOMM 2000 Stockholm.
ACN: RED Tuning Paper1 Tuning RED for Web Traffic Mikkel Christiansen, Kevin Jeffay, David Ott, Donelson Smith UNC at Chapel Hill SIGCOMM 2000 Stockholm.
Dynamic parallel access to replicated content in the Internet Pablo Rodriguez and Ernst W. Biersack IEEE/ACM Transactions on Networking, August 2002.
© Asifuddin Mohammad Empirical Model for HTTP Network Traffic Asifuddin Mohammad Club presentation 27 February 2009Abbreviated Paper Title1.
Computer Networks Transport Layer. Topics F Introduction  F Connection Issues F TCP.
Design and Implementation of a Server Director Project for the LCCN Lab at the Technion.
Web Transfer Latency Study Presented by Ye Xia WebTP Presentation, Aug. 28, 2000 Paper Presented: Paul Barford and Mark Crovella, “Critical Path Analysis.
Investigating Forms of Simulating Web Traffic Yixin Hua Eswin Anzueto Computer Science Department Worcester Polytechnic Institute Worcester, MA.
Towards Understanding Modern Web Traffic
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Network Services Networking for Home and Small Businesses – Chapter.
Chapter 1: Introduction to Web Applications. This chapter gives an overview of the Internet, and where the World Wide Web fits in. It then outlines the.
Maintaining Performance while Saving Energy on Wireless LANs Ronny Krashinsky Term Project
Networks QUME 185 Introduction to Computer Applications.
Protocol(TCP/IP, HTTP) 송준화 조경민 2001/03/13. Network Computing Lab.2 Layering of TCP/IP-based protocols.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Network Services Networking for Home and Small Businesses – Chapter 6.
ICOM 6115©Manuel Rodriguez-Martinez ICOM 6115 – Computer Networks and the WWW Manuel Rodriguez-Martinez, Ph.D. Lecture 3.
World Wide Web Aaditya Bhatia CS 147. Agenda History OSI model Hardware IP address DNS Server - Client Peer-to-peer Web-Sites Web programming Search engines.
Proxy Servers.
The Intranet.
ECEN “Internet Protocols and Modeling”, Spring 2012 Course Materials: Papers, Reference Texts: Bertsekas/Gallager, Stuber, Stallings, etc Class.
Critical Path Analysis of TCP Transactions Authors:Paul Barford (University of Wisconsin-Madison) Mark Crovella (University of Boston) Member, IEEE Source:IEEE/ACM.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
S305 – Network Infrastructure Chapter 5 Network and Transport Layers.
Measuring and Mitigating Web Performance Bottlenecks in Broadband Access Networks Srikanth Sundaresan, Nick Feamster (Georgia Tech) Renata Teixeira (Inria)
Doc.: IEEE /1317r0 Submission December 2009 Vinko Erceg, BroadcomSlide 1 Internet Traffic Modeling Date: Authors: NameAffiliationsAddressPhone .
Web Services. 2 Internet Collection of physically interconnected computers. Messages decomposed into packets. Packets transmitted from source to destination.
1. Layered Architecture of Communication Networks: TCP/IP Model
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Computer Network Architecture Lecture 6: OSI Model Layers Examples 1 20/12/2012.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
#16 Application Measurement Presentation by Bobin John.
TCP/IP1 Address Resolution Protocol Internet uses IP address to recognize a computer. But IP address needs to be translated to physical address (NIC).
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
Communication Networks NETW 501 Tutorial 2
Web page – Html –base file – Objects (first DNS) First get html file Then browser reads the html and gets the objects.
Week-7 (Lecture-1) Internet Protocols: TCP/IP, Client Server: Protocol: Specification in networking and Communications It defines the procedures to be.
Chapter 5 Network and Transport Layers
The Intranet.
Internet and Intranet.
Networking for Home and Small Businesses – Chapter 6
Web Caching? Web Caching:.
Networking for Home and Small Businesses – Chapter 6
Internet and Intranet.
Computer Communication & Networks
Internet and Intranet.
CSCD 330 Network Programming
Network Traffic Modeling
Hypertext Transfer Protocol (HTTP)
The War Between Mice & Elephants by, Matt Hartling & Sumit Kumbhar
Internet Applications & Programming
Networking for Home and Small Businesses – Chapter 6
Protocol Application TCP/IP Layer Model
Internet and Intranet.
Presentation transcript:

Tracking the Evolution of Web Traffic: Felix Hernandez-Campos, Kevin Jeffay F. Donelson Smith IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS) Orlando, FL, October 2003

Introduction Since mid-1990, Web traffic dominant on Internet –Need to understand effects and technology Since 1990s, Web has evolved –HTTP delivers more than HTML pages – news, IM, transactions Better understand Web as Internet traffic

This Paper Analysis of 1 terabyte of TCP/IP headers –From UNC during 1999, 2001, 2003 –Compare to other researchers Contribute –Empirical models for simulation –Characterization of TCP using new HTTP 1.1 –Characterization of TCP using new server loads, banner ads, etc

Outline Introduction(done) Related Work(next) Data Sets Analysis Comparison with Others Sampling Issues Conclusions

Related Work Web traffic typically based on two studies –Mah [10], gathered in 1995 –Barford, Crovella + [2,3,7] in 1995, 1998 But –Small users sets (students, single labs) –Small data sets (up to 1 million objects) –Before HTTP 1.1 –Old This paper –200 million objects, 35k users, in 2003

Outline Introduction(done) Related Work(done) Data Sets(next) Analysis Comparison with Others Sampling Issues Conclusions

Data Sets Gather TCP/IP headers from Web servers to clients –Sequence numbers, Ack numbers Data: –Fall 1999 (6 one-hour samples, over 7 days) –Spring 2001 (3 four-hour samples, 7 days) –2003? (8 one-hour traces over 7 days) Network: –1999 OC-3 (155 Mbps) –2001 OC048 (2.4 Gbps) –2003 (same?) –(WPI: 50 Mbps during term, 15 Mbps during summer)

Outline Introduction(done) Related Work(done) Data Sets(done) Analysis(next) Comparison with Others Sampling Issues Conclusions

Request Sizes Requests becoming larger Still typically fit in one packet

Response Sizes Responses becoming smaller Median fits in 1 packet (small!)

Long-tailed Contributes to Self-Similarity Response Size Tail

Inferring User and Browser Characteristics Have TCP data, try to “infer” what user is doing at HTTP layer Time-sorted all flows Assume each IP is one user (fewer NATS on campus) First request is “page” and subsequent requests are “objects” in page If idle for more than 1 second, “think time” Note, does not include client cache

Number of Consecutive Requests to Same Server Most requests get only one page per server Trend is increasing

Number of Objects per Page Many pages simple (1 object) Some pages complex (100 objs) Both trends increasing

Primary versus Secondary Servers Slightly more requests to non- primary servers Increasing trend

Response Sizes Primary vs. Non- Primary Only minor differences in object sizes to different servers

Limitations of Methodology TCP analysis solid (harder to mistake number of packets, flows, etc) HTTP analysis less certain –Pipelined exchanges (look like one) –User/browser interactions (stop, reload) –Browser and proxy caches –TCP processing to deal with loss, duplication, re-ordering

Outline Introduction(done) Related Work(done) Data Sets(done) Analysis(done) Comparison with Others(next) Sampling Issues Conclusions

SURGE (BU) with UNC Striking evolution over time (Mah (UCB) is similar)

Summary Data Notice trend in median sizes Largest sizes are because of larger samples

Outline Introduction(done) Related Work(done) Data Sets(done) Analysis(done) Comparison with Others(done) Sampling Issues(next) Conclusions

Network Traces Size is a concern –1 hour, headers (68 bytes) only, consumes 30 Gigabytes of storage –Processing takes hours –Capturing can slow down routers So: –Do lengths of traces affect distribution shape? –Do incomplete TCP connections affect distribution shapes?

Size versus Sub-Samples 90 seconds plenty! Note, tail (max) may be different

Incompletes don’t hurt Incomplete and Complete Connections

Conclusions Captured and Analyzed Web traffic for 35,000 UNC people, 3 sets from 3 years Find: –HTTP request sizes are increasing –HTTP response sizes are decreasing –Largest HTTP responses are increasing –Web pages complexity is increasing (more servers per page)

Future Work Effects of persistent connections and pipelining? What about other (non-port 80) traffic over HTTP? –About ½ of all TCP traffic “other” Are all objects Web objects? –As opposed to re-direction requests, error messages –This may help understand Web structure