Traffic Matrix Estimation: Existing Techniques and New Directions A. Medina (Sprint Labs, Boston University), N. Taft (Sprint Labs), K. Salamatian (University.

Slides:



Advertisements
Similar presentations
Responsive Yet Stable Traffic Engineering Srikanth Kandula Dina Katabi, Bruce Davie, and Anna Charny.
Advertisements

Traffic Dynamics at a Commercial Backbone POP Nina Taft Sprint ATL Co-authors: Supratik Bhattacharyya, Jorjeta Jetcheva, Christophe Diot.
Collaborators: Mark Coates, Rui Castro, Ryan King, Mike Rabbat, Yolanda Tsang, Vinay Ribeiro, Shri Sarvotham, Rolf Reidi Network Bandwidth Estimation and.
Advanced Technology Laboratories Traffic Matrix Estimation in Non- Stationary Environments Presented by R. L. Cruz Department of Electrical & Computer.
Variance reduction techniques. 2 Introduction Simulation models should be coded such that they are efficient. Efficiency in terms of programming ensures.
1 EL736 Communications Networks II: Design and Algorithms Class8: Networks with Shortest-Path Routing Yong Liu 10/31/2007.
Infocom 2003 An Approach to Alleviate Link Overload as Observed on an IP Backbone Tuesday, April 1 st Infocom 2003 Sundar Iyer 1,2, Supratik Bhattacharrya.
SIGCOMM 2003 Making Intra-Domain Routing Robust to Changing and Uncertain Traffic Demands: Understanding Fundamental Tradeoffs David Applegate Edith Cohen.
Planning under Uncertainty
GS 540 week 6. HMM basics Given a sequence, and state parameters: – Each possible path through the states has a certain probability of emitting the sequence.
Dynamic routing – QoS routing Other approaches to QoS routing Traffic Engineering Practical Traffic Engineering.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
1 Resource Management in IP Telephony Networks Matthew Caesar, Dipak Ghosal, Randy H. Katz {mccaesar,
Traffic Engineering With Traditional IP Routing Protocols
Parallel Simulation etc Roger Curry Presentation on Load Balancing.
An Algebraic Approach to Practical and Scalable Overlay Network Monitoring Yan Chen, David Bindel, Hanhee Song, Randy H. Katz Presented by Mahesh Balakrishnan.
Load Balancing for Routing Sundar Iyer Stanford University.
ASWP – Ad-hoc Routing with Interference Consideration June 28, 2005.
Traffic Engineering for ISP Networks
Predictive End-to-End Reservations via A Hierarchical Clearing House Endeavour Retreat June 19-21, 2000 Chen-Nee Chuah (Advisor: Professor Randy H. Katz)
Network Protocols Designed for Optimizability Jennifer Rexford Princeton University
Multi-Scale Analysis for Network Traffic Prediction and Anomaly Detection Ling Huang Joint work with Anthony Joseph and Nina Taft January, 2005.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
© John M. Abowd 2005, all rights reserved Statistical Tools for Data Integration John M. Abowd April 2005.
Call Admission and Redirection in IP Telephony A Performance Study Matthew Caesar, Dipak Ghosal, Randy Katz {mccaesar,
On Multi-Path Routing Aditya Akella 03/25/02. What is Multi-Path Routing?  Dynamically route traffic Multiple paths to a destination Path taken dependant.
Probability Grid: A Location Estimation Scheme for Wireless Sensor Networks Presented by cychen Date : 3/7 In Secon (Sensor and Ad Hoc Communications and.
Comparison of MSTP and (G)ELS Benchmarking Carrier Ethernet Technologies Workshop Session AI.1: Scientific and Technical Results Krakow, Poland April 30,
The Delta Routing Project Low-loss Routing for Hybrid Private Networks George Porter (UCB) Minwen Ji, Ph.D. (SRC - HP Labs)
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
Traffic Matrix Estimation for Traffic Engineering Mehmet Umut Demircin.
Introduction to Monte Carlo Methods D.J.C. Mackay.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Cost-Performance Tradeoffs in MPLS and IP Routing Selma Yilmaz Ibrahim Matta Boston University.
Topology Design for Service Overlay Networks with Bandwidth Guarantees Sibelius Vieira* Jorg Liebeherr** *Department of Computer Science Catholic University.
Particle Filtering in Network Tomography
1 Performance Evaluation of Computer Networks: Part II Objectives r Simulation Modeling r Classification of Simulation Modeling r Discrete-Event Simulation.
9/15/2015CS622 - MIRO Presentation1 Wen Xu and Jennifer Rexford Department of Computer Science Princeton University Chuck Short CS622 Dr. C. Edward Chow.
Shannon Lab 1AT&T – Research Traffic Engineering with Estimated Traffic Matrices Matthew Roughan Mikkel Thorup
A Fair and Dynamic Load Balancing Mechanism F. Larroca and J.L. Rougier International Workshop on Traffic Management and Traffic Engineering for the Future.
Routing Protocol Evaluation David Holmer
CSC321: Neural Networks Lecture 12: Clustering Geoffrey Hinton.
Algorithms for Allocating Wavelength Converters in All-Optical Networks Authors: Goaxi Xiao and Yiu-Wing Leung Presented by: Douglas L. Potts CEG 790 Summer.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Unsupervised Learning: Clustering Some material adapted from slides by Andrew Moore, CMU. Visit for
Aemen Lodhi (Georgia Tech) Amogh Dhamdhere (CAIDA)
Load-Balancing Routing in Multichannel Hybrid Wireless Networks With Single Network Interface So, J.; Vaidya, N. H.; Vehicular Technology, IEEE Transactions.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Conceptual Differences Between Cube Analyst and Cube Analyst Drive Austen C. Duffy, Ph.D. Computational Mathematician, Citilabs.
Interconnect simulation. Different levels for Evaluating an architecture Numerical models – Mathematic formulations to obtain performance characteristics.
Interconnect simulation. Different levels for Evaluating an architecture Numerical models – Mathematic formulations to obtain performance characteristics.
Some questions about multipath Damon Wischik, UCL Trilogy UCL.
Taming Internet Traffic Some notes on modeling the wild nature of OD flows Augustin Soule Kavé Salamatian Antonio Nucci Nina Taft Univ. Paris VI Sprintlabs.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
1 Slides by Yong Liu 1, Deep Medhi 2, and Michał Pióro 3 1 Polytechnic University, New York, USA 2 University of Missouri-Kansas City, USA 3 Warsaw University.
HELSINKI UNIVERSITY OF TECHNOLOGY Visa Holopainen 1/18.
NetQuest: A Flexible Framework for Large-Scale Network Measurement Lili Qiu University of Texas at Austin Joint work with Han Hee Song.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
1 ISMA Backbone Traffic Inference MAKE SYSTEMS THE NETWORK RESOURCE PLANNING COMPANY ISP Backbone Traffic Inference Methods to Support Traffic Engineering.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Internet Traffic Demand and Traffic Matrix Estimation
Network Anomography Yin Zhang Joint work with Zihui Ge, Albert Greenberg, Matthew Roughan Internet Measurement.
ISP and Egress Path Selection for Multihomed Networks
Routing and Logistics with TransCAD
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
L12. Network optimization
Congestion Control, Quality of Service, & Internetworking
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Chapter-5 Traffic Engineering.
Srinivasan Seetharaman - College of Computing, Georgia Tech
Presentation transcript:

Traffic Matrix Estimation: Existing Techniques and New Directions A. Medina (Sprint Labs, Boston University), N. Taft (Sprint Labs), K. Salamatian (University of Paris VI), S. Bhattacharyya, C. Diot (Sprint Labs) Presented by Matthew Caesar

Problem scope Environment: –Single ISP, provides SLAs to customers Goal: Estimate traffic matrix –Amount of traffic flowing between each (origin, destination) pair –Hard to measure exactly (requires extensive logging and/or offline parsing) Why would we want to know the traffic matrix? –Helps determine load balancing, routing protocols configuration, dimensioning, provisioning, failover strategies –Allows quantification of cost of providing QoS vs. overprovisioning

Solution idea Main idea: –Measure utilization (“link count”) on each network link Can be easily done in router fast path Done via snmp query –Find a set of OD flows that would produce the measured link counts Sticky issue: how to find the set of OD flows? –Three techniques: Linear Programming (LP) Bayesian estimation Expectation Maximization (EM)

Traffic Estimation Assumptions can be operator’s knowledge (eg. maybe some pairs are always zero) Prior TM: sometimes need seed TM to start with Routing Matrix Link counts (link utilizations)

Problem setup See whiteboard

Scheme #1: Linear Programming (LP) Linear program: –Objective function + constraints Main idea: –Try to maximize the total amount of traffic routed through the network –Given contraints: Total traffic must be less than the measured link count Flow conservation Observations: –Leads to solutions where OD pairs with few intermediate hops will be assigned large amts of bandwidth, while more distant pairs will get much less bandwidth –Solution: put more weight on pairs separated by greater distances

Scheme #2: Bayesian Inference See whiteboard

Scheme #3: Expectation Maximization (EM) See whiteboard

Evaluation Method Impossible to obtain “real” traffic matrix via direct measurement. –Therefore, use simulations How to characterize flow between OD pairs? –Tried Constant, Poisson, Gaussian, Uniform and Bimodal (flash crowd) TMs

Results: Linear programming vs. Statistical methods Linear programming method performs poorly –Assigns zero to many OD pairs, increasing error –Problem: tries to match OD pairs to link counts –Different objective functions give similar results –  error too high for use in practical networks Bayesian and EM: –EM beats Bayesian in terms of average error and worst case error –Estimation errors correlated to heavily shared links (links with many OD flows are more likely to be mis- estimated)

Results: Goodness of prior Goodness of prior matrix (seed values) –Bayesian is much more sensitive to the prior matrix than EM However, EM is also quite sensitive Perhaps because: EM method has deterministic convergence behavior (can be analyzed) while Bayesian has stochastic convergence (it oscillates) –After a certain point, additional measurements don’t provide additional gain Measuring over long periods of time only gives small additional improvement

Results: Marginal gains What improvement could be gained if we could measure some components of the traffic matrix directly? –Carrier may have the option to deploy a certain amount of monitoring equipment 3 ways to add rows: –Randomly, row-sum (by traffic volume), and error magnitude Results: –Error rate drops off roughly linearly with each additional row added –Bayesian not sensitive to order rows are added –EM does better when rows added by largest-error first –  reduction in adding a row is 2% for 13 OD pairs

Other results Which OD pairs are most difficult to estimate? –Error increases as the link-sharing factor increases, also as path length increases How to characterize OD flows? –Poisson and Gaussian assumption holds well, but only for certain hours during the day.

Recommendations Network operators know a lot about their network. We need to devise methods to allow incorporation of network specific information into the estimation scheme. We need a better model of OD flows through an ISP. –Possible solution: “gravity models” based on utility factor (see whiteboard) We need a good way to generate good prior TMs.

References: Statistical INference: theorem-long.html eksl.cs.umass.edu/library/Statistics/Expectation- Maximization/ Traffic Matrix Estimation: user-slides.ppt