1 Network Measurements and Sampling Nick Duffield, Carsten Lund, and Mikkel Thorup AT&T Labs-Research, Florham Park, NJ.

Slides:



Advertisements
Similar presentations
New Directions in Traffic Measurement and Accounting Cristian Estan – UCSD George Varghese - UCSD Reviewed by Michela Becchi Discussion Leaders Andrew.
Advertisements

Chapter 10: Estimating with Confidence
Variance reduction techniques. 2 Introduction Simulation models should be coded such that they are efficient. Efficiency in terms of programming ensures.
Fast, Memory-Efficient Traffic Estimation by Coincidence Counting Fang Hao 1, Murali Kodialam 1, T. V. Lakshman 1, Hui Zhang 2, 1 Bell Labs, Lucent Technologies.
Evaluation of Header Field Entropy for Hash-Based Packet Selection Evaluation of Header Field Entropy for Hash-Based Packet Selection Christian Henke,
Sampling and Flow Measurement Eric Purpus 5/18/04.
Trajectory Sampling for Direct Traffic Observation Matthias Grossglauser joint work with Nick Duffield AT&T Labs – Research.
Probabilistic Aggregation in Distributed Networks Ling Huang, Ben Zhao, Anthony Joseph and John Kubiatowicz {hling, ravenben, adj,
Traffic Engineering With Traditional IP Routing Protocols
Sampling Distributions
Error Propagation. Uncertainty Uncertainty reflects the knowledge that a measured value is related to the mean. Probable error is the range from the mean.
Sociology 601: Class 5, September 15, 2009
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
Statistics & Modeling By Yan Gao. Terms of measured data Terms used in describing data –For example: “mean of a dataset” –An objectively measurable quantity.
Chapter 7 Sampling and Sampling Distributions
Chapter 10 Sampling and Sampling Distributions
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Multi-Scale Analysis for Network Traffic Prediction and Anomaly Detection Ling Huang Joint work with Anthony Joseph and Nina Taft January, 2005.
1 Deriving Traffic Demands for Operational IP Networks: Methodology and Experience Anja Feldmann*, Albert Greenberg, Carsten Lund, Nick Reingold, Jennifer.
Code and Decoder Design of LDPC Codes for Gbps Systems Jeremy Thorpe Presented to: Microsoft Research
1 Deriving Traffic Demands for Operational IP Networks: Methodology and Experience Anja Feldmann*, Albert Greenberg, Carsten Lund, Nick Reingold, Jennifer.
Part III: Inference Topic 6 Sampling and Sampling Distributions
1 An Information Theoretic Approach to Network Trace Compression Y. Liu, D. Towsley, J. Weng and D. Goeckel.
Tradeoffs in CDN Designs for Throughput Oriented Traffic Minlan Yu University of Southern California 1 Joint work with Wenjie Jiang, Haoyuan Li, and Ion.
AGG-NANOG IP Network Traffic Engineering Albert Greenberg Internet and Networking Systems Research Lab AT&T Labs - Research; Florham Park, NJ See.
Not All Microseconds are Equal: Fine-Grained Per-Flow Measurements with Reference Latency Interpolation Myungjin Lee †, Nick Duffield‡, Ramana Rao Kompella†
UCSC 1 Aman ShaikhICNP 2003 An Efficient Algorithm for OSPF Subnet Aggregation ICNP 2003 Aman Shaikh Dongmei Wang, Guangzhi Li, Jennifer Yates, Charles.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Predicting and Bypassing End-to-End Internet Service Degradation Anat Bremler-BarrEdith CohenHaim KaplanYishay Mansour Tel-Aviv UniversityAT&T Labs Tel-Aviv.
Shannon Lab 1AT&T – Research Traffic Engineering with Estimated Traffic Matrices Matthew Roughan Mikkel Thorup
SIGCOMM 2002 New Directions in Traffic Measurement and Accounting Focusing on the Elephants, Ignoring the Mice Cristian Estan and George Varghese University.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
CHAPTER 8 Estimating with Confidence
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Topic 5 Statistical inference: point and interval estimate
Lecture 14 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Using Measurement Data to Construct a Network-Wide View Jennifer Rexford AT&T Labs—Research Florham Park, NJ
Statistical Sampling & Analysis of Sample Data
Bruno Ribeiro CS69000-DM1 Topics in Data Mining. Bruno Ribeiro  Reviews of next week’s papers due Friday 5pm (Sunday 11:59pm submission closes) ◦ Assignment.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
The Common Shock Model for Correlations Between Lines of Insurance
Aditya Akella The Performance Benefits of Multihoming Aditya Akella CMU With Bruce Maggs, Srini Seshan, Anees Shaikh and Ramesh Sitaraman.
ACN: RED paper1 Random Early Detection Gateways for Congestion Avoidance Sally Floyd and Van Jacobson, IEEE Transactions on Networking, Vol.1, No. 4, (Aug.
Hung X. Nguyen and Matthew Roughan The University of Adelaide, Australia SAIL: Statistically Accurate Internet Loss Measurements.
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
Challenges and Opportunities Posed by Power Laws in Network Analysis Bruno Ribeiro UMass Amherst MURI REVIEW MEETING Berkeley, 26 th Oct 2011.
Chapter 6. Control Charts for Variables. Subgroup Data with Unknown  and 
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
1 Chapter 6 Estimates and Sample Sizes 6-1 Estimating a Population Mean: Large Samples / σ Known 6-2 Estimating a Population Mean: Small Samples / σ Unknown.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 10.1 Confidence Intervals: The Basics.
April 4th, 2002George Wai Wong1 Deriving IP Traffic Demands for an ISP Backbone Network Prepared for EECE565 – Data Communications.
Trajectory Sampling for Direct Traffic Oberservation N.G. Duffield and Matthias Grossglauser IEEE/ACM Transactions on Networking, Vol. 9, No. 3 June 2001.
Review Session Jehan-François Pâris. Agenda Statistical Analysis of Outputs Operational Analysis Case Studies Linear Regression.
SCREAM: Sketch Resource Allocation for Software-defined Measurement Masoud Moshref, Minlan Yu, Ramesh Govindan, Amin Vahdat (CoNEXT’15)
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Lecture 13 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
1 Netflow Collection and Aggregation in the AT&T Common Backbone Carsten Lund.
Variance reduction techniques Mat Simulation
Behavioral Statistics
Modeling, sampling, generating Networks with MRV
ISP and Egress Path Selection for Multihomed Networks
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
SCREAM: Sketch Resource Allocation for Software-defined Measurement
Daniela Stan Raicu School of CTI, DePaul University
Presentation transcript:

1 Network Measurements and Sampling Nick Duffield, Carsten Lund, and Mikkel Thorup AT&T Labs-Research, Florham Park, NJ

2 IP Network The Edge of Network Links Routers

3 Some Questions that are hard to answer about IP networks o Given a hotspot, how do we change network configuration to fix the problem. ¨ What exactly is the problem? Type of traffic? o Traffic usage. Which applications are using the network? o How do we design an efficient network? ¨ Where do the traffic want to go?

4 Traffic Matrix Construction o Traffic Matrix ¨ Point to Point Demands ¨ Point to Multi-Point Demands –Needed to model in networks with multi-homing and peering. (e.g., ISPs etc.) o Input for many tasks ¨ Network Design ¨ Reroute traffic ¨ Failure analysis ¨ OSFP weight optimization (Thorup et al.)

5 Measurement Options o SNMP ¨ Counting packets on interfaces o Packet sniffers/Probes ¨ Whole packet header/contents o Flow measurements ¨ Router support ¨ Network-wide measurements o Packet sampling (sFlow) o Trajectory sampling o Etc.

6 Traffic Matrix: SNMP based methods o Network Tomography ¨ From link measurement determine traffic matrix, using temporal correlations. o Network Tomogravity ¨ Starts with a model of the traffic matrix from edge measurements. ¨ Uses internal measurements to find a consistent solutions that is close to the traffic matrix that the model predicted.

7 Traffic Matrix: Flow based methods. o Detailed measurement where the traffic enter and determine from the destination IP address, where the traffic will exits the network ¨ Need Prefix level routing information to determine the exit point

8 Traffic Matrix: Method Comparison o SNMP based ¨ Less accurate ¨ Harder to deal with multi-point destinations –Tomography do not. –Tomogravity do try to model it. ¨ Measurement instrumentation/gathering is standard. o Flow based ¨ More accurate ¨ Deal with the multi-point destinations ¨ Measurement instrumentation/gathering is not standard. ¨ Need to deal with huge amount of detailed measurements

9 Outline of the rest of the talk o Context ¨ What are Flows? ¨ Measurement Infrastructure o The main idea ¨ Size dependent sampling ¨ Optimal sampling o How do this compare to size independent sampling? o Billing application

10 flow 1flow 2flow 3 flow 4 IP Flow Abstraction o IP flow abstraction ¨ set of packets identified with “same” address, ports, etc. ¨ packets that are “close” together in time o IP flow summaries ¨ reports of measured flows from routers –flow identifiers, total packets/bytes, start and end times o Several flow definitions in commercial use

11 Collection System Flow Data Aggregate Data Routers Flow Collectors Central Server

12 Router CPU RAM Collectors CPU RAM Central Server Disk CPU RAM Bandwidth Resources in the Measurement System

13 Measure/Export All Traffic Flows? o Flow volumes ¨ one OC48 (2.4 Gbps Trunk)  several GB flow summaries per hour o Cost ¨ network resources for transmission ¨ storage/processing at the collection system

14 Flow Sampling? o Sampling ¨ statisticians reflex action to large datasets o Export selected flows ¨ reduce transmission/storage/processing costs o Sufficiently accurate? ¨ Depends on application. –Traffic Engineering (where do traffic flow) –Traffic Analysis (application usage, etc.) –Billing l risk of overcharging (  irate customers) l risk of undercharging (  irate shareholders)

15 Packet Sampling and Flow Sampling o Packet Sampling ¨ when router can’t form flows at line rate –scaling at a single router o Flow sampling ¨ managing volume of flow statistics –scaling across downstream measurement infrastructure o Complementary ¨ could combine –e.g. 1 in N packet sampling + flow sampling

16 Usage Estimation o Each flow i has ¨ “size” x i –bytes or packets ¨ “color” c i –combination of IP address, port, ToS etc that maps to billable stream ( = customer + billing class) o Goal ¨ to estimate total usage X(c) in each color c

17 Basic Ideas o Match sampling method to flow characteristics ¨ high fraction of traffic found in small fraction of long flows –sample long flows more frequently than short flows l large contributions to usage more reliably estimated o Show how to relate sampling and accuracy ¨ simple rules to achieve desired accuracy

18 Size independent flow sampling bad o Sample 1 in N flows ¨ estimate total bytes by N times sampled bytes o Problem: ¨ long flow lengths –estimate sensitive to inclusion or omission of a single large flow

19 Size dependent flow sampling o Sample flow summary of size x with prob. p(x) o Estimate usage X by ¨ boost up size x by factor 1/p(x) in estimate X’ –compensate against chance of being sampled o Chose p(x) to be increasing in x ¨ longer flows more likely to be sampled ¨ compare size independent sampling: p(x) =1/N

20 Statistical Properties o Fixed set of flow sizes {x 1, x 2, …,x n } ¨ we only consider randomness of sampling o X’ is unbiased estimator of actual usage X =  i x i ¨  X’ = X: averaging over all possible samplings ¨ holds for all probability functions p(x) o Proof: ¨ X’ =  i w i x i /p(x i ) –w i random variable l w i =1 with prob. p(x i ), 0 otherwise –  w i = p(x i ) hence  X’ =   i w i x i /p(x i )=  i x i =X

21 What is best choice of p(x)? o Trade-off accuracy vs. number of samples o Express trade-off through cost function ¨ cost = variance(X’) + z 2 average number of samples –parameter z: relative importance of variance vs. # samples o Which choice of p(x) minimizes cost? o p z (x) = min { 1, x/z } ¨ flows with size  z: always selected ¨ flows with size < z: selected with prob. proportional to their size o Trade-off ¨ smaller z –more samples, lower variance ¨ larger z –fewer samples, higher variance o Will call sampling with p z (x) “optimal” p z (x) z 1 x

22 Optimal vs. size independent sampling o NetFlow traces ¨ 1000’s cable users, 1 week o Color flows ¨ by customer-side IP address c o Compare ¨ 1 in N sampling ¨ optimal sampling –same average sampling rate o Measure of accuracy ¨ weighted mean relative error o Heavy tailed flow size distribution is our friend! ¨ allows more accurate encoding of usage information

23 Charging and Sampling Error o Optimal sampling ¨ no sampling error for flows larger than z o Exploit in charging scheme ¨ fixed charge for small usage ¨ usage sensitive charge only for usage above insensitivity level L o Charge according to estimated usage f(X’(c)) = a + b max{ L, X’(c) } –coefficients a, b and level L could depend on color c o Only usage above L needs reliable estimation f(X) X L

24 Accuracy and Parameter Choice o Given target accuracy ¨ relate sampling threshold z to level L o Theorem ¨ Variance(X’)  z X (tight bound) ¨ now assume: z   2 L –Std.Dev. X’   X if X  L l bound sampling error of estimated usage > L –Std.Dev. f(X’)   f(X) l bound error of charge based on estimated usage o Bounds hold for any flow sizes {x i } ¨ no assumption on flow size distribution –just choose z   2 L

25 Example o Target parameters ¨ L = 10 7,  = 10%  z = 10 5 o Scatter plot ¨ ratio estimated/actual usage vs. actual usage –each color c ¨ observe better estimation of higher usage o Want to avoid ¨ ratio > 1+  = 1.1 and usage > L = 10 7 o Less than 1 in 1000 “bad” points

26 o Aim: ¨ reduce chance of overestimating usage o Method: ¨ theorem gave bound: Var(X’)  z X ¨ anticipate upwards variations in X’ by subtracting off multiples of std. dev. –charge according to ¨ again: no assumptions on flow size distribution Compensating variance for mean

27 Example: s=1 o Scatter pushed down: ¨ no points with ratio>1.1 and usage > 10 7 o Drawback ¨ more unbillable usage –when X’ s <X o Small unbillable usage for heavy users ¨ ratio  1 ¨ Std.Dev.(X’)/X’ vanishes as X grows

28 Example: s=2 o Scatter pushed down further: ¨ no points with ratio > 1 o Trade off ¨ unbillable usage vs. overestimation sunbill. bytes X’ s >X? 0-0.1%50% 13.1%3% 26.2%0%

29 How to reduce unbillable usage? o Make sampling more accurate ¨ reduce z! o For unbillable fraction <  ¨ chose s z   2 L o Example: ¨ s = 2,  = 10% ¨ reduce z –from 10 5 to 10 4 o Alternative ¨ increase coefficent a in charge f(X) to cover costs

30 Tension between accuracy and volume o Want to reduce z ¨ better accuracy, less unbillable usage o Drawback ¨ increased sample volume o Solution ¨ make billing period longer instead –usage roughly proportional to billing period –allows increased charge insensitivity level L ¨ sampling production rate controlled by threshold z –rate r  x f(x)p z (x) l flow arrival rate r, fraction f(x) of flows size x o Need only z =  2 L ¨ larger L allows smaller error  for given z

31 Summary o Size dependent optimal sampling ¨ preferentially sample large flows –more accurate usage estimates for given sample volume –sample flow of size x with probability p z (x) o Charging from measured usage X’ ¨ charge f(X’) = a +b max{L,X’} –fixed charge for usage below insensitivity level L –only need to reliably estimate usage above L o Sampling/charging accuracy ¨ choose z =  2 L to get standard error  o Variance compensation ¨ replace X’ by o Longer billing cycle ¨ increases L, better accuracy (  ) at given sampling rate (z)

32 Papers o