1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.

Slides:



Advertisements
Similar presentations
Micro-loop Prevention Methods draft-bryant-shand-lf-conv-frmwk-00.txt draft-zinin-microloop-analysis-00.txt.
Advertisements

Understanding the Impact of Route Reflection in Internal BGP Ph.D. Final Defense presented by Jong Han (Jonathan) Park July 15 th,
1 End-to-End Routing Behavior in the Internet Internet Routing Instability Presented by Carlos Flores Gaurav Jain May 31st CS 6390 Advanced Computer.
Part IV: BGP Routing Instability. March 8, BGP routing updates  Route updates at prefix level  No activity in “steady state”  Routing messages.
1 Experimental Study of Internet Stability and Wide-Area Backbone Failure Craig Labovitz, Abha Ahuja Merit Network, Inc Presented by Changchun Zou.
Consensus Routing: The Internet as a Distributed System John P. John, Ethan Katz-Bassett, Arvind Krishnamurthy, and Thomas Anderson Presented.
1 BGP Anomaly Detection in an ISP Jian Wu (U. Michigan) Z. Morley Mao (U. Michigan) Jennifer Rexford (Princeton) Jia Wang (AT&T Labs)
By Hitesh Ballani, Paul Francis, Xinyang Zhang Slides by Benson Luk for CS 217B.
Transient BGP Loops Do they matter, and what can be done about them? Nate Kushman MIT/Akamai Srikanth Kandula, Dina Katabi and John Wroclawski.
End-to-End Routing Behavior in the Internet Vern Paxson Presented by Zhichun Li.
Detecting Traffic Differentiation in Backbone ISPs with NetPolice Ying Zhang Zhuoqing Morley Mao Ming Zhang.
DYNAMICS OF PREFIX USAGE AT AN EDGE ROUTER Kaustubh Gadkari, Dan Massey and Christos Papadopoulos 1.
1 Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network Jian Wu (University of Michigan) Z. Morley Mao (University.
University of Massachusetts at Amherst 1 Flooding Attacks by Exploiting Persistent Forwarding Loops Jianhong Xia, Lixin Gao and Teng Fei University of.
Accurate Real-Time Identification of IP Prefix Hijacking Z. Morley Mao Xin Hu 2007 IEEE Symposium on and Privacy Oakland, California 2007 IEEE Symposium.
Internet Routing Instability Labovitz et al. Sigcomm 1997 Largely adopted from Ion Stoica’s slide at UCB.
Dynamics of Hot-Potato Routing in IP Networks Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Measurement in the Internet. Outline Internet topology Bandwidth estimation Tomography Workload characterization Routing dynamics.
E2E Routing Behavior in the Internet Vern Paxson Sigcomm 1996 Slides are adopted from Ion Stoica’s lecture at UCB.
RD-CSY /09 Distance Vector Routing Protocols.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao UC Berkeley C. Cranor, M. Rabinovich,
Ningning HuCarnegie Mellon University1 A Measurement Study of Internet Bottlenecks Ningning Hu (CMU) Joint work with Li Erran Li (Bell Lab) Zhuoqing Morley.
End-to-End Issues. Route Diversity  Load balancing o Per packet splitting o Per flow splitting  Spill over  Route change o Failure o policy  Route.
A victim-centric peer-assisted framework for monitoring and troubleshooting routing problems.
Hot Potatoes Heat Up BGP Routing Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira, Aman Shaikh, and.
Dynamics of Hot-Potato Routing in IP Networks Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira, Aman.
1 Studying Black Holes on the Internet with Hubble Ethan Katz-Bassett, Harsha V. Madhyastha, John P. John, Arvind Krishnamurthy, David Wetherall, Thomas.
Low-Rate TCP-Targeted DoS Attack Disrupts Internet Routing
Predicting and Bypassing End-to-End Internet Service Degradation Anat Bremler-BarrEdith CohenHaim KaplanYishay Mansour Tel-Aviv UniversityAT&T Labs Tel-Aviv.
Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Authors Renata Teixeira, Aman Shaikh and Jennifer Rexford(AT&T), Tim Griffin(Intel) Presenter : Farrukh Shahzad.
Distance Vector Routing Protocols W.lilakiatsakun.
M. Menelaou CCNA2 DYNAMIC ROUTING. M. Menelaou DYNAMIC ROUTING Dynamic routing protocols can help simplify the life of a network administrator Routing.
1 © 2004, Cisco Systems, Inc. All rights reserved. CCNA 2 v3.1 Module 7 Distance Vector Routing Protocols.
“Intra-Network Routing Scheme using Mobile Agents” by Ajay L. Thakur.
1 The Research on Analyzing Time- Series Data and Anomaly Detection in Internet Flow Yoshiaki HARADA Graduate School of Information Science and Electrical.
Reducing Transient Disconnectivity using Anomaly-Cognizant Forwarding Andrey Ermolinskiy, Scott Shenker University of California – Berkeley and ICSI.
Prediction Assisted Single-copy Routing in Underwater Delay Tolerant Networks Zheng Guo, Bing Wang and Jun-Hong Cui Computer Science & Engineering Department,
Dynamics of Hot-Potato Routing in IP Networks Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira (UCSD),
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
Active Measurements on the AT&T IP Backbone Len Ciavattone, Al Morton, Gomathi Ramachandran AT&T Labs.
A Firewall for Routers: Protecting Against Routing Misbehavior1 June 26, A Firewall for Routers: Protecting Against Routing Misbehavior Jia Wang.
BGP topics to be discussed in the next few weeks: –Excessive route update –Routing instability –BGP policy issues –BGP route slow convergence problem –Interaction.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
On Understanding of Transient Interdomain Routing Failures Feng Wang, Lixin Gao, Jia Wang, and Jian Qiu Department of Electrical and Computer Engineering.
A Light-Weight Distributed Scheme for Detecting IP Prefix Hijacks in Real-Time Lusheng Ji†, Joint work with Changxi Zheng‡, Dan Pei†, Jia Wang†, Paul Francis‡
Detection of Routing Loops and Analysis of Its Causes Sue Moon Dept. of Computer Science KAIST Joint work with Urs Hengartner, Ashwin Sridharan, Richard.
1 Quantifying Path Exploration in the Internet Ricardo Oliveira, Rafit Izhak-Ratzin, Lixia Zhang, UCLA Beichuan Zhang, UArizona Dan Pei, AT&T Labs -- Research.
By, Matt Guidry Yashas Shankar.  Analyze BGP beacons which are announced and withdrawn, usually within two hour intervals.  The withdraws have an effect.
1 © 2004, Cisco Systems, Inc. All rights reserved. CCNA 2 v3.1 Module 7 Distance Vector Routing Protocols.
Distance Vector Routing Protocols Dynamic Routing.
Routing and Routing Protocols
Low-Rate TCP-Targeted DoS Attack Disrupts Internet Routing Ying Zhang Z. Morley Mao Jia Wang Presented in NDSS07 Prepared by : Hale Ismet.
End-to-End Routing Behavior in the Internet Vern Paxson Presented by Sankalp Kohli and Patrick Wong.
BGP Routing Stability of Popular Destinations Jennifer Rexford, Jia Wang, Zhen Xiao, and Yin Zhang AT&T Labs—Research Florham Park, NJ All flaps are not.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
1 Effective Diagnosis of Routing Disruptions from End Systems Ying Zhang Z. Morley Mao Ming Zhang.
1 © 2004, Cisco Systems, Inc. All rights reserved. Routing and Routing Protocols: Routing Static.
Placing Relay Nodes for Intra-Domain Path Diversity Meeyoung Cha Sue Moon Chong-Dae Park Aman Shaikh Proc. of IEEE INFOCOM 2006 Speaker 游鎮鴻.
Optimization-based Cross-Layer Design in Networked Control Systems Jia Bai, Emeka P. Eyisi Yuan Xue and Xenofon D. Koutsoukos.
1 On the Impact of Route Monitor Selection Ying Zhang* Zheng Zhang # Z. Morley Mao* Y. Charlie Hu # Bruce M. Maggs ^ University of Michigan* Purdue University.
PlanetSeer: Internet Path Failure Monitoring and Characterization in Wide-Area Services Ming Zhang, Chi Zhang Vivek Pai, Larry Peterson, Randy Wang Princeton.
Monitoring Persistently Congested Internet Links
Jian Wu (University of Michigan)
Routing and Routing Protocols: Routing Static
Routing and Routing Protocols: Routing Static
Transient BGP Loops Do they matter, and what can be done about them?
Distance Vector Routing Protocols
Presentation transcript:

1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang

2 Internet routing changes  Various causes  Link failures, configuration changes, topology changes, etc.  Direct influence on the data plane  Transient data-plane disruption  Packet loss, increased delay, forwarding loops Internet C BR C C Destination Source Old path New path C BR C C C C

Motivation  Frequent routing dynamics can cause transient disruption in the data plane  Inconsistent routes during convergence  Real-time applications can be affected  Predicting performance impact can assist more intelligent route selection 3

Measuring and predicting the impact  Comprehensively measure the impact of routing changes  Characterize the properties of routing changes that cause traffic disruption  Search for pattern to help prediction 4

Outline  Motivation  Methodology  Characterization of data-plane failures  Failure prediction model 5

Methodology  Data collection  Control plane: local real-time BGP updates  Data plane: ping and traceroute probes for each update  A light weight active probing methodology  A coarse-grained performance metric: reachability  Destination reachable: any ping reply  Scalable to many destinations with live IPs  Measurement-based approach  No simplifying assumptions  Empirical evidence 6

Our approach  Focus: measure data-plane failures caused by routing changes  Coarse-grained performance metrics  Methodology: light-weight active probing  Triggered by locally observed routing updates  Probing target of a live IP within the prefix 7 Prefix P Old path New path C BR AS C Update Prefix: P, AS path: A D B C BR AS B AS A C BR AS D Measurement Framework Internet

Our approach  Focus: measure data-plane failure caused by routing changes  Methodology: light-weight active probing  Triggered by locally observed routing updates  Probing target of a live IP within the prefix 8 Live IP 1 within Prefix P Old path New path C BR AS C Ping C BR AS B AS A C BR AS D Measurement Framework Internet Traceroute Ping, traceroute

Probing control  Background probing  Identifying persistent failures  Verifying live IP’s response  Resource control  Ignoring updates due to table transfers  Imposing maximum probing duration  Accuracy control  Impose maximum waiting duration 9

Outline  Motivation  Methodology  Characterization of data-plane failures  Failure prediction model 10

Characterization of data-plane failures  Failure types  Reachability failure  Ping reply is not received due to network problems  Forwarding loops  A subset of reachability failures  Transient loops observed in the path  Failure properties  Affected networks  Failure duration  Failure predictability 11

Overall reachability failure statistics 12 IncidencePrefixAS Unreachable Loop6%23%33% Other36%72%38% All42%73%63% Reachable57%83%98% Internet experiments for 11 weeks

Affected network locations  Understanding the networks affected by routing changes  Most Ases are near the edge and in foreign countries  Small fraction of destinations experiencing many unreachable incidences 13

Failure durations  Short duration  Most last less than 300 seconds  Transient routing failure, convergence delay  10% incidences with longer duration  Configuration errors or path failures 14

Failure predictability  Destination prefix information  Appearance probability  Probability of an unreachable incidence for prefix D  Destination prefix and AS path segments  Conditional probability on AS path segments  Probability of an unreachable event occurring given a particular AS path segment  Responsible AS  Where traceroute stops 15

Outline  Motivation  Methodology  Characterization of data plane failure  Failure prediction model 16

Prediction model  Prefix and AS segment information  The data plane failure likelihood ratio  P(Y=1|R;D): the conditional probability of data-plane failure given a routing update R for prefix D  Assuming the failure on each AS is independent x i is the responsible AS in history data x i is the responsible AS in history data 17

Evaluation  The trade-off between selectivity and sensitivity  is the decision threshold which determines false positives and false negative route  Receiver operating characteristic  Evaluation results  60% detection rate with 18% false positives 18

Conclusion  Developed an efficient framework for measuring and predicting data-plane failures caused by routing changes  Identified patterns to accurately predict data-plane failures  Provided suggestions for more intelligent route selections 19