Download presentation
Presentation is loading. Please wait.
Published byJoseph Flowers Modified over 9 years ago
1
SIGCOMM 2002 New Directions in Traffic Measurement and Accounting Focusing on the Elephants, Ignoring the Mice Cristian Estan and George Varghese University of California, San Diego
2
SIGCOMM 2002 Talk outline Problem definition Sample and hold Multistage filters Validation, measurements Conclusions
3
SIGCOMM 2002 Traffic analysis today Router Fast link Measurement module Sampled packets Workstation Large raw data Collection and analysis software Concise analysis results Offline analysis
4
SIGCOMM 2002 Our research agenda Router Real-time analysis Is it doable? Is it better? Fast link Measurement module Concise analysis results
5
SIGCOMM 2002 What is traffic analysis used for? Network planning: need to know traffic between pairs of networks (traffic matrix) Accounting: usage based billing Detecting DoS attacks: flood attacks Application characterization: breaking up the traffic based on port numbers …
6
SIGCOMM 2002 Common abstractions Packets are grouped together into streams based on header fields Traffic matrix – by source and destination AS DoS attacks – by destination IP address Measuring large streams (this paper) Estimating the number of active streams (poster) …
7
SIGCOMM 2002 Why is measuring streams hard? Cheap memories (DRAM) are too slow to count all packets Fast memories (SRAM) are too small to keep counters for all streams Opportunity: elephants matter, mice don’t Problem: usually we don’t know in advance which streams are large
8
SIGCOMM 2002 Problem definition Given a fixed definition for streams, measure large streams accurately Large = above 1% of link capacity over a 1 minute interval Assumptions Mice don’t matter Accuracy of results important
9
SIGCOMM 2002 Talk outline Problem definition Sample and hold Multistage filters Validation, measurements Conclusions
10
SIGCOMM 2002 How does sample and hold work? stream memory stream1 1 Sample Insert
11
SIGCOMM 2002 How does sample and hold work? stream memory stream1 1stream1 2 Update
12
SIGCOMM 2002 How does sample and hold work? stream memory stream1 2 stream2 1 Sample Insert
13
SIGCOMM 2002 Why is sample & hold better? uncertainty Sample and hold Ordinary sampling
14
SIGCOMM 2002 Comparing the relative error of the estimate for a stream at 1/F of the link bandwidth Memory limited to M entries How much better is it? Measure Ordinary sampling Sample and hold Error √ F/MF/M Memory accesses 1/S1
15
SIGCOMM 2002 Talk outline Problem definition Sample and hold Multistage filters Validation, measurements Conclusions
16
SIGCOMM 2002 Multistage filters Characteristics: No large stream is ever omitted Very few entries are used by small streams Better performance but implementation and tuning is more complex
17
SIGCOMM 2002 How do multistage filters work? stream memory Array of counters Hash(Pink)
18
SIGCOMM 2002 How do multistage filters work? stream memory Array of counters Hash(Green)
19
SIGCOMM 2002 How do multistage filters work? stream memory Array of counters Hash(Green)
20
SIGCOMM 2002 How do multistage filters work? stream memory
21
SIGCOMM 2002 How do multistage filters work? stream memory Collisions are OK
22
SIGCOMM 2002 How do multistage filters work? stream memory stream1 1 Insert Reached threshold
23
SIGCOMM 2002 How do multistage filters work? stream memory stream1 1
24
SIGCOMM 2002 How do multistage filters work? stream memory stream1 1 stream2 1
25
SIGCOMM 2002 Stage 2 How do multistage filters work? stream memory stream1 1 Stage 1
26
SIGCOMM 2002 Conservative update Gray = all prior packets
27
SIGCOMM 2002 Conservative update Redundant
28
SIGCOMM 2002 Conservative update
29
SIGCOMM 2002 Talk outline Problem definition Sample and hold Multistage filters Validation, measurements Conclusions
30
SIGCOMM 2002 Validation Analytical evaluation Comparison of analytical results to measured performance Comparison of full measurement devices using different algorithms
31
SIGCOMM 2002 On traces, algorithms much better than analysis predicts Number of stages Percentage of small streamspassingfilter (log scale) TheoryZipfActual Conservativeupdate
32
SIGCOMM 2002 Measurement results Setup: OC48 trace, 100,000 TCP flows, 5 second intervals, ordinary sampling - unlimited memory, sampling 1 in 16 our algorithms - 1Mbit, adapting parameters to keep it around 90% full Large streams (above 0.1%): ordinary sampling has an error of 9% sample and hold 0.075%, multistage filter 0.037%
33
SIGCOMM 2002 Talk outline Problem definition Sample and hold Multistage filters Validation, measurements Conclusions
34
SIGCOMM 2002 Our contributions Abstraction: Real-time packet analysis abstractions can help systematize router implementations. While the notion of elephants and mice is inherent in earlier work, we abstracted measurement of large streams - it can be used by many applications.
35
SIGCOMM 2002 Our contributions (2) Algorithms: Sample and hold is a simple and efficient algorithm for identifying and measuring large streams. Multistage filters with conservative update perform better but are more complex. Both can be used for real-time as well as offline analysis.
36
SIGCOMM 2002 Our contributions (3) Validation: Theoretical results that make no assumptions on traffic distribution Simulations on traces are orders of magnitude better Preliminary hardware design (John Huber) indicates feasibility at OC192 speeds
37
SIGCOMM 2002 Thank you!
38
SIGCOMM 2002 Optimizations to sample and hold Preserving entries: Keep large entries from one measurement interval to the next Reduces error by a factor of 6 Early removal: Quickly remove entries that do not accumulate much traffic Reduces memory usage by 25%
39
SIGCOMM 2002 Optimizations to multistage filters Preserving entries: Keep large entries from one measurement interval to the next Reduces error by a factor of 5 Shielding: Large streams identified in previous intervals don’t pass through the filter Reduces memory usage by up to 70%
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.