Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin Joint work with Han Hee Song and Lili.

Slides:



Advertisements
Similar presentations
Minimizing Probing Cost for Detecting Interface Failures: Algorithms and Scalability Analysis Hung Nguyen (Univ. of Adelaide, Australia) Renata Teixeira.
Advertisements

Predictable Performance Optimization for Wireless Networks Lili Qiu University of Texas at Austin Joint work with Yi Li, Yin Zhang,
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
Minimizing Multi-Hop Wireless Routing State under Application- based Accuracy Constraints Mustafa Kilavuz & Murat Yuksel University of Nevada, Reno.
Robust Network Compressive Sensing Lili Qiu UT Austin NSF Workshop Nov. 12, 2014.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast Mohamed Hafeeda, Ahsan Habib et al. Presented By: Abhishek Gupta.
Configurable restoration in overlay networks Matthew Caesar, Takashi Suzuki.
Traffic Engineering With Traditional IP Routing Protocols
Server-based Inference of Internet Performance V. N. Padmanabhan, L. Qiu, and H. Wang.
NetQuest: A Flexible Framework for Internet Measurement Lili Qiu Joint work with Mike Dahlin, Harrick Vin, and Yin Zhang UT Austin.
Bottleneck Routing Games in Communication Networks Ron Banner and Ariel Orda Department of Electrical Engineering Technion- Israel Institute of Technology.
Path Protection in MPLS Networks Ashish Gupta Design and Evaluation of Fault Tolerance Algorithms with Performance Constraints.
1 Network Tomography Venkat Padmanabhan Lili Qiu MSR Tab Meeting 22 Oct 2001.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast M. Hefeeda, A. Habib, B. Botev, D. Xu, and B. Bhargava ACM Multimedia 2003, November 2003.
A General approach to MPLS Path Protection using Segments Ashish Gupta Ashish Gupta.
© 2005, it - instituto de telecomunicações. Todos os direitos reservados. Gerhard Maierbacher Scalable Coding Solutions for Wireless Sensor Networks IT.
RRAPID: Real-time Recovery based on Active Probing, Introspection, and Decentralization Takashi Suzuki Matthew Caesar.
Scalable and Deterministic Overlay Network Diagnosis Yao Zhao, Yan Chen Northwestern Lab for Internet and Security Technology (LIST) Dept. of Computer.
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
Building a Strong Foundation for a Future Internet Jennifer Rexford ’91 Computer Science Department (and Electrical Engineering and the Center for IT Policy)
Not All Microseconds are Equal: Fine-Grained Per-Flow Measurements with Reference Latency Interpolation Myungjin Lee †, Nick Duffield‡, Ramana Rao Kompella†
Optimal Placement and Selection of Camera Network Nodes for Target Localization A. O. Ercan, D. B. Yang, A. El Gamal and L. J. Guibas Stanford University.
©2003/04 Alessandro Bogliolo Background Information theory Probability theory Algorithms.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Network Planète Chadi Barakat
1 Linear Methods for Classification Lecture Notes for CMPUT 466/551 Nilanjan Ray.
Particle Filtering in Network Tomography
Network Aware Resource Allocation in Distributed Clouds.
Physical Layer Informed Adaptive Video Streaming Over LTE Xiufeng Xie, Xinyu Zhang Unviersity of Winscosin-Madison Swarun KumarLi Erran Li MIT Bell Labs.
Topology aggregation and Multi-constraint QoS routing Presented by Almas Ansari.
Spatio-Temporal Compressive Sensing Yin Zhang The University of Texas at Austin Joint work with Matthew Roughan.
A novel approach of gateway selection and placement in cellular Wi-Fi system Presented By Rajesh Prasad.
On AS-Level Path Inference Jia Wang (AT&T Labs Research) Joint work with Z. Morley Mao (University of Michigan, Ann Arbor) Lili Qiu (University of Texas,
Budapest University of Technology and Economics Department of Telecommunications and Media Informatics Optimized QoS Protection of Ethernet Trees Tibor.
Wireless Mesh Network 指導教授:吳和庭教授、柯開維教授 報告:江昀庭 Source reference: Akyildiz, I.F. and Xudong Wang “A survey on wireless mesh networks” IEEE Communications.
1 Passive Network Tomography Using Bayesian Inference Lili Qiu Joint work with Venkata N. Padmanabhan and Helen J. Wang Microsoft Research Internet Measurement.
Energy-Efficient Signal Processing and Communication Algorithms for Scalable Distributed Fusion.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Towards Efficient Large-Scale VPN Monitoring and Diagnosis under Operational Constraints Yao Zhao, Zhaosheng Zhu, Yan Chen, Northwestern University Dan.
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
Computational Intelligence: Methods and Applications Lecture 23 Logistic discrimination and support vectors Włodzisław Duch Dept. of Informatics, UMK Google:
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
TCP with Variance Control for Multihop IEEE Wireless Networks Jiwei Chen, Mario Gerla, Yeng-zhong Lee.
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
1 Network Tomography Using Passive End-to-End Measurements Venkata N. Padmanabhan Lili Qiu Helen J. Wang Microsoft Research DIMACS’2002.
De novo discovery of mutated driver pathways in cancer Discussion leader: Matthew Bernstein Scribe: Kun-Chieh Wang Computational Network Biology BMI 826/Computer.
Troubleshooting Mesh Networks Lili Qiu Joint Work with Victor Bahl, Ananth Rao, Lidong Zhou Microsoft Research Mesh Networking Summit 2004.
DISTIN: Distributed Inference and Optimization in WSNs A Message-Passing Perspective SCOM Team
Probabilistic Model-Driven Recovery in Distributed Systems Kaustubh R. Joshi, Matti A. Hiltunen, William H. Sanders, and Richard D. Schlichting May 2,
Wireless Mesh Networks Myungchul Kim
NetQuest: A Flexible Framework for Large-Scale Network Measurement Lili Qiu University of Texas at Austin Joint work with Han Hee Song.
Sporadic model building for efficiency enhancement of the hierarchical BOA Genetic Programming and Evolvable Machines (2008) 9: Martin Pelikan, Kumara.
Network Anomography Yin Zhang Joint work with Zihui Ge, Albert Greenberg, Matthew Roughan Internet Measurement.
Computational Sensing = Modeling + Optimization CENS seminar Jan 28, 2005 Miodrag Potkonjak Key Contributors: Bradley Bennet, Alberto.
Efficient Geographic Routing in Multihop Wireless Networks Seungjoon Lee*, Bobby Bhattacharjee*, and Suman Banerjee** *Department of Computer Science University.
1 Network Tomography Using Passive End-to-End Measurements Lili Qiu Joint work with Venkata N. Padmanabhan and Helen J. Wang.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
Optimization-based Cross-Layer Design in Networked Control Systems Jia Bai, Emeka P. Eyisi Yuan Xue and Xenofon D. Koutsoukos.
Accurate WiFi Packet Delivery Rate Estimation and Applications Owais Khan and Lili Qiu. The University of Texas at Austin 1 Infocom 2016, San Francisco.
1 Chapter 6 Reformulation-Linearization Technique and Applications.
Data Mining Techniques Applied in Advanced Manufacturing PRESENT BY WEI SUN.
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
Experimental Design for Practical Network Diagnosis
Predictable performance optimization for wireless networks
10701 / Machine Learning Today: - Cross validation,
Backbone Traffic Engineering
Presentation transcript:

Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin Joint work with Han Hee Song and Lili Qiu MSR EdgeNet Summit June 2, 2006

2 Practical Network Diagnosis Ideal –Every network element is self-monitoring, self-reporting, self-…, there is no silent failures … –Oracle walks through the haystack of data, accurately pinpoints root causes, and suggests response actions Reality –Finite resources (CPU, BW, human cycles, …)  cannot afford to instrument/monitor every element –Decentralized, autonomous nature of the Internet  infeasible to instrument/monitor every organization –Protocol layering minimizes information exposure  difficult to obtain complete information at every layer Practical network diagnosis: Maximize diagnosis accuracy under given resource constraint and information availability

3 Design of Diagnosis Experiments Input –A candidate set of diagnosis experiments Reflects infrastructure constraints –Information availability Existing information already available Information provided by each new experiment –Resource constraint E.g., number of experiments to conduct (per hour), number of monitors available Output: A diagnosis experimental plan –A subset of experiments to conduct –Configuration of various control parameters E.g., frequency, duration, sampling ratio, …

4 Example: Network Benchmarking 1000s of virtual networks over the same physical network Wants to summarize the performance of each virtual net –E.g. traffic-weighted average of individual virtual path performance (loss, delay, jitter, …) –Similar problem exists for monitoring per-application/customer performance Challenge: Cannot afford to monitor all individual virtual paths –N 2 explosion times 1000s of virtual nets Solution: monitor a subset of virtual paths and infer the rest Q: which subset of virtual paths to monitor? R R R R R R R

5 Example: Client-based Diagnosis Clients probe each other Use tomography/inference to localize trouble spot –E.g. links/regions with high loss rate, delay jitter, etc. Challenge: Pair-wise probing too expensive due to N 2 explosion Solution: monitor a subset of paths and infer the link performance Q: which subset of paths to probe? C& W UUNet AOL Sprint Qwest AT&T Why is it so slow?

6 More Examples Wireless sniffer placement –Input: A set of locations to place wireless sniffers –Not all locations possible – some people hate to be surrounded by sniffers Monitoring quality at each candidate location –E.g. probabilities for capturing packets from different APs Expected workload of different APs Locations of existing sniffers –Output: K additional locations for placing sniffers Cross-layer diagnosis –Infer layer-2 properties based on layer-3 performance –Which subset of layer-3 paths to probe?

7 Beyond Networking Software debugging –Select a given number of tests to maximize the coverage of corner cases Car crash test –Crash a given number of cars to find a maximal number of defects Medicine design –Conducting a given number of tests to maximize the chance of finding an effective ingredient Many more …

8 Need Common Solution Framework Can we have a framework that solves them all? –As opposed to ad hoc solutions for individual problems Key requirements: –Scalable: work for large networks (e.g nodes) –Flexible: accommodate different applications Differentiated design –Different quantities have different importance, e.g., a subset of paths belong to a major customer Augmented design –Conduct additional experiments given existing observations, e.g., after measurement failures Multi-user design –Multiple users interested in different parts of network or have different objective functions

9NetQuest A baby step towards such a framework –“NetQuest: A flexible framework for large-scale network measurement”, Han Hee Song, Lili Qiu and Yin Zhang. ACM SIGMETRICS Achieves scalability and flexibility by combining –Bayesian experimental design –Statistical inference Developed in the context of e2e performance monitoring Can extend to other network monitoring/ diagnosis problems

10 What We Want A function f(x) of link performance x –We use a linear function f(x)=F*x in this talk x1x1 x 11 x4x4 x5x5 x 10 x9x9 x7x7 x6x6 x8x8 3 x2x2 x3x3 Ex. 1: average link delay f(x) = (x1+…+x11)/11 Ex. 2: end-to-end delays Apply to any additive metric, eg. Log (1 – loss rate)

11 Problem Formulation What we can measure: e2e performance Network performance estimation –Goal: e2e performance on some paths  f(x) –Design of experiments Select a subset of paths S to probe such that we can estimate f(x) based on the observed performance y S, A S, and y S =A S x –Network inference Given e2e performance, infer link performance Infer x based on y=F*x, y, and F

12 Design of Experiments State of the art –Probe every path (e.g., RON) Not scalable since # paths grow quadratically with #nodes –Rank-based approach [sigcomm04] Let A denote routing matrix Monitor rank(A) paths that are linearly independent to exactly reconstruct end-to-end path properties Still very expensive Select a “best” subset of paths to probe so that we can accurately infer f(x) How to quantify goodness of a subset of paths?

13 Bayesian Experimental Design A good design maximizes the expected utility under the optimal inference algorithm Different utility functions yield different design criteria –Let, where is covariance matrix of x –Bayesian A-optimality Goal: minimize the squared error –Bayesian D-optimality Goal: maximize the expected gain in Shannon information

14 Search Algorithm Given a design criterion, next step is to find s rows of A to optimize –This problem is NP-hard –We use a sequential search algorithm to greedily select the row that results in the largest improvement in –Better search algorithms?

15Flexibility Differentiated design –Give higher weights to the important rows of matrix F Augmented design –Ensure the newly selected paths in conjunction with previously monitored paths maximize the utility Multi-user design –New design criteria: a linear combination of different users’ design criteria

16 Network Inference Goal: find x s.t. Y=Ax Main challenge: under-constrained problem L2-norm minimization L1-norm minimization Maximum entropy estimation

17 Evaluation Methodology Data sets Accuracy metric # nodes# overlay nodes # paths# linksRank PlanetLab-RTT Planetlab-loss Brite-n1000-o Brite-n5000-o

18 Comparison of DOE Algorithms: Estimating Network-Wide Mean RTT A-optimal yields the lowest error.

19 Comparison of DOE Algorithms: Estimating Per-Path RTT A-optimal yields the lowest error.

20 Differentiated Design: Inference Error on Preferred Paths Lower error on the paths with higher weights.

21 Differentiated Design: Inference Error on the Remaining Paths Error on the remaining paths increases slightly.

22 Augmented Design A-optimal is most effective in augmenting an existing design.

23 Multi-user Design A-optimal yields the lowest error.

24Summary Our contributions –Bring Bayesian experimental design to network measurement and diagnosis –Develop a flexible framework to accommodate different design requirements –Experimentally show its effectiveness Future work –Making measurement design fault tolerant –Applying our technique to other diagnosis problems –Extend our framework to incorporate additional design constraints

25 Thank you!