1 CS 268: Lecture 14 Internet Measurements Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer Sciences.

Slides:



Advertisements
Similar presentations
Internet Measurement Conference 2003 Source-Level IP Packet Bursts: Causes and Effects Hao Jiang Constantinos Dovrolis (hjiang,
Advertisements

Connection-level Analysis and Modeling of Network Traffic understanding the cause of bursts control and improve performance detect changes of network state.
Doc.: IEEE /1216r1 Submission November 2009 BroadcomSlide 1 Internet Traffic Modeling Date: Authors: NameAffiliationsAddressPhone .
FAST TCP Anwis Das Ajay Gulati Slides adapted from : IETF presentation slides Link:
Pathload A measurement tool for end-to-end available bandwidth Manish Jain, Univ-Delaware Constantinos Dovrolis, Univ-Delaware Sigcomm 02.
Network Simulation One tool to simulation network protocols for the Internet is the network simulator (NS) The simulation environment needs to be set-
Computer Science Generating Streaming Access Workload for Performance Evaluation Shudong Jin 3nd Year Ph.D. Student (Advisor: Azer Bestavros)
1 Estimating Shared Congestion Among Internet Paths Weidong Cui, Sridhar Machiraju Randy H. Katz, Ion Stoica Electrical Engineering and Computer Science.
Identifying Performance Bottlenecks in CDNs through TCP-Level Monitoring Peng Sun Minlan Yu, Michael J. Freedman, Jennifer Rexford Princeton University.
Profiling Network Performance in Multi-tier Datacenter Applications
Katz, Stoica F04 EECS 122 Introduction to Computer Networks (Fall 2003) Network simulator 2 (ns-2) Department of Electrical Engineering and Computer Sciences.
Introduction Future wireless systems will be characterized by their heterogeneity - availability of multiple access systems in the same physical space.
Internet Traffic Patterns Learning outcomes –Be aware of how information is transmitted on the Internet –Understand the concept of Internet traffic –Identify.
1 Is the Round-trip Time Correlated with the Number of Packets in Flight? Saad Biaz, Auburn University Nitin H. Vaidya, University of Illinois at Urbana-Champaign.
On the Constancy of Internet Path Properties Yin Zhang, Nick Duffield AT&T Labs Vern Paxson, Scott Shenker ACIRI Internet Measurement Workshop 2001 Presented.
EECS122 – Lecture 5 Department of Electrical Engineering and Computer Sciences University of California Berkeley.
Katz, Stoica F04 EECS 122: Introduction to Computer Networks Performance Modeling Computer Science Division Department of Electrical Engineering and Computer.
Network Traffic Measurement and Modeling CSCI 780, Fall 2005.
Available bandwidth measurement as simple as running wget D. Antoniades, M. Athanatos, A. Papadogiannakis, P. Markatos Institute of Computer Science (ICS),
A Nonstationary Poisson View of Internet Traffic T. Karagiannis, M. Molle, M. Faloutsos University of California, Riverside A. Broido University of California,
1 End-to-End Detection of Shared Bottlenecks Sridhar Machiraju and Weidong Cui Sahara Winter Retreat 2003.
Chapter 4 Network Layer slides are modified from J. Kurose & K. Ross CPE 400 / 600 Computer Communication Networks Lecture 13.
Variance of Aggregated Web Traffic Robert Morris MIT Laboratory for Computer Science IEEE INFOCOM 2000’
E2E Routing Behavior in the Internet Vern Paxson Sigcomm 1996 Slides are adopted from Ion Stoica’s lecture at UCB.
Copyright © 2005 Department of Computer Science CPSC 641 Winter Network Traffic Measurement A focus of networking research for 20+ years Collect.
Data Communication and Networks
Rethinking Internet Traffic Management: From Multiple Decompositions to a Practical Protocol Jiayue He Princeton University Joint work with Martin Suchara,
Origins of Long Range Dependence Myths and Legends Aleksandar Kuzmanovic 01/08/2001.
Self-Similar through High-Variability: Statistical Analysis of Ethernet LAN Traffic at the Source Level Walter Willinger, Murad S. Taqqu, Robert Sherman,
CS 6401 Network Traffic Characteristics Outline Motivation Self-similarity Ethernet traffic WAN traffic Web traffic.
On the Characteristics and Origins of Internet Flow Rates ACM SIGCOMM 2002 Yin Zhang Lee Breslau Vern Paxson Scott Shenker AT&T Labs – Research
Ch. 28 Q and A IS 333 Spring Q1 Q: What is network latency? 1.Changes in delay and duration of the changes 2.time required to transfer data across.
21.1 Chapter 21 Network Layer: Address Mapping, Error Reporting, and Multicasting Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction.
Self-Similarity of Network Traffic Presented by Wei Lu Supervised by Niclas Meier 05/
Traffic Modeling.
P2P Architecture Case Study: Gnutella Network
Transport Layer 4 2: Transport Layer 4.
POSTECH DP&NM Lab. Internet Traffic Monitoring and Analysis: Methods and Applications (1) 4. Active Monitoring Techniques.
Estimating Bandwidth of Mobile Users Sept 2003 Rohit Kapoor CSD, UCLA.
Transport Layer: TCP and UDP. Overview of TCP/IP protocols Comparing TCP and UDP TCP connection: establishment, data transfer, and termination Allocation.
1 Lecture 14 High-speed TCP connections Wraparound Keeping the pipeline full Estimating RTT Fairness of TCP congestion control Internet resource allocation.
Presented by Rajan Includes slides presented by Andrew Sprouse, Northeastern University CISC 856 TCP/IP and Upper Layer Protocols Date:May 03, 2011.
Measurement and Modeling of Packet Loss in the Internet Maya Yajnik.
CS551: End-to-End Packet Dynamics Paxon’99 Christos Papadopoulos (
HighSpeed TCP for High Bandwidth-Delay Product Networks Raj Kettimuthu.
Networking Fundamentals. Basics Network – collection of nodes and links that cooperate for communication Nodes – computer systems –Internal (routers,
Detecting the Long-Range Dependence in the Internet Traffic with Packet Trains Péter Hága, Gábor Vattay Department Of Physics of Complex Systems Eötvös.
Multiplicative Wavelet Traffic Model and pathChirp: Efficient Available Bandwidth Estimation Vinay Ribeiro.
N. Hu (CMU)L. Li (Bell labs) Z. M. Mao. (U. Michigan) P. Steenkiste (CMU) J. Wang (AT&T) Infocom 2005 Presented By Mohammad Malli PhD student seminar Planete.
On the Characteristics and Origins of Internet Flow Rates ACM SIGCOMM 2002 ICIR AT&T Labs – Research
A Bandwidth Estimation Method for IP Version 6 Networks Marshall Crocker Department of Electrical and Computer Engineering Mississippi State University.
정하경 MMLAB Fundamentals of Internet Measurement: a Tutorial Nevil Brownlee, Chris Lossley, “Fundamentals of Internet Measurement: a Tutorial,” CMG journal.
The New Policy for Enterprise Networking Robert Bays Chief Scientist June 2002.
TCP Congestion Control 컴퓨터공학과 인공지능 연구실 서 영우. TCP congestion control2 Contents 1. Introduction 2. Slow-start 3. Congestion avoidance 4. Fast retransmit.
Development of a QoE Model Himadeepa Karlapudi 03/07/03.
1 Transport Layer: Basics Outline Intro to transport UDP Congestion control basics.
Precision Measurements with the EVERGROW Traffic Observatory Péter Hága István Csabai.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
Notices of the AMS, September Internet traffic Standard Poisson models don’t capture long-range correlations. Poisson Measured “bursty” on all time.
Bandwidth estimation: metrics, measurement techniques, and tools Presenter: Yuhang Wang.
CIS679: UDP and Multimedia r Review of last lecture r UDP and multimedia.
CS 268: Lecture 6 Scott Shenker and Ion Stoica
Network Core and QoS.
Network Traffic Modeling
Javad Ghaderi, Tianxiong Ji and R. Srikant
CS4470 Computer Networking Protocols
Transport Layer: Congestion Control
pathChirp Efficient Available Bandwidth Estimation
pathChirp Efficient Available Bandwidth Estimation
Network Core and QoS.
Presentation transcript:

1 CS 268: Lecture 14 Internet Measurements Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer Sciences University of California, Berkeley Berkeley, CA

2 Project Presentations  Five slides  Five minutes  Five questions  You’re outta there!

3  1 st slide: Title  2 nd slide: motivations and problem formulation -Why is the problem important? -What is challenging/hard about your problem  3 rd slide: main idea of your solution  4 th slide: status  5 th slide: future plans and schedule

4 Poll  How many people know: -Lamport clocks -Timestamp vectors -Byzantine agreement -Epidemic/gossip dissemination algorithms -Consistency -Bayou

5 Internet Measurements  This field went from casual sideline to serious science with Vern Paxson’s thesis in 1997  Now there is a massive literature in Internet measurements  Two devoted conferences (PAM and IMC) in addition to standard networking conferences

6 Today’s Lecture  Provide a quick overview of the kinds of techniques used and questions addressed  Tom Anderson (UW) will lead us in a design exercise

7 Philosophical Question  Why do we take Internet measurements?

8 Measurement Techniques  Passive: -Traces -Netflow data -Network telescopes -BGP looking glass sites -……  Active: -Ping -Traceroutes -……

9 Measurement Infrastructure  NIMI  Planetlab  Telescopes (CIED)  Traceroute servers  …..

10 Varieties of Measurement Studies  Basic characterization from direct measurements -Raw statistics -Model fitting -Just because it is “direct” doesn’t mean it is easy! Look at the care taken by Paxson in the two readings  Deep inference -Estimating something that can’t be directly measured

11 Examples of Direct Studies  Packet sizes and protocol type  Flow sizes  Route stability  DNS usage  Stationarity  …….

12 A Few Results  Most flows are small, but most bytes in large flows  Most flows are slow, but most bytes are in fast flows -Rate/size highly correlated (not due to slow start)  Losses are reasonably modeled as Poisson arrival of loss events followed by geometric loss train  Internet routing does not yield good paths…. In 2003,  Bytes: TCP 83%UDP 16%  Packets:TCP 75%UDP 22%  Flows:TCP 56%UDP 33%

13 Traffic Variances  A Bellcore team discovered that ethernet traffic had strage behavior  No matter what time scale they averaged it over, the variances were still large!  How could that be?

14 Self Similarity, LRD, and All That….  Correlation: r(t) = - 2  Short-range correlation: Sum r(t) finite  Long-range correlation (LRD): Sum r(t) infinite  Example of LRD: r(t) ~ t -.5

15 Aggregates and Variance  Consider a sum of M iid random variables: -Variance of sums ~ 1/M  Consider process with short-range correlations: -On large-enough time scales, intervals are iid -Variance should decay as 1/M  Only with LRD can the variance decay more slowly -Often as a power law with p<1

16 LRD Comes from Many Sources?  Poisson arrivals, with long-tailed sizes is one mechanism -Most files small -Most bytes in large files  Pareto distributions are very common!

17 Zipf and Pareto  Zipf: F(rank) ~ rank -b -F is frequency, size, etc. -1 < b  Pareto: P(size) ~ size -a -1 < a < 2 -Log(size) has geometric distribution! -Pareto distributed interarrival times means logs are uncorrelated  a = 1+1/b and b = 1/(a-1)  The tail of the distribution can’t be ignored!

18 Examples of Zipf/Pareto  Income distribution  Asteroid sizes  Letter frequencies  File sizes  Telnet packet interarrivals  Web access frequencies (impact on caching)  AS connectivity

19 Examples of Inference Tools  Bandwidth estimation  Critical path analysis  Network tomography  Flow rate causes  …..

20 Bandwidth Estimation (I)  Send pair of packets back-to-back  If not intermediate packets intervene, their spacing is a function of the bandwidth of the bottleneck link  Send a series of packet-pairs, measure the minimal delays  Doesn’t work well: often overestimate capacity!  Can do better with many packets back-to-back

21 Bandwidth Estimation (II)  Send variable sized packets, k hops (using TTL).  Delay at link i: Di = Ai + P/Ci -Ai is size-independent delay -P is packet size -Ci is capacity of link  Model delay of k hops: Sum (i = 1 to k) Di

22 Possible Causes of Flow Rates  Application: doesn’t generate data fast enough  Opportunity: never leaves slow-start  Receiver: limited dby receive window  Sender: limited by sending buffer  Bandwidth: using full bottleneck bandwidth  Congestion: flow is responding to packet loss  Transport: none of the above…

23 Differentiating Causes  Take trace of flow  Estimate RTT  Look at window epochs  Identify where TCP is in cycle