BGP-lens: Patterns and Anomalies in Internet Routing Updates

Slides:



Advertisements
Similar presentations
ABSTRACT Due to the Internets sheer size, complexity, and various routing policies, it is difficult if not impossible to locate the causes of large volumes.
Advertisements

Multihoming and Multi-path Routing
Internet Measurement Conference 2003 Source-Level IP Packet Bursts: Causes and Effects Hao Jiang Constantinos Dovrolis (hjiang,
1 BGP Policy Atoms Yehuda Afek Omer Ben-Shalom Anat Bremler-Barr Tel-Aviv University.
Comparing IPv4 and IPv6 from the perspective of BGP dynamic activity Geoff Huston APNIC February 2012.
Technical Aspects of Peering Session 4. Overview Peering checklist/requirements Peering step by step Peering arrangements and options Exercises.
The Connectivity and Fault-Tolerance of the Internet Topology
Internet Routing Instability
1 BGP Anomaly Detection in an ISP Jian Wu (U. Michigan) Z. Morley Mao (U. Michigan) Jennifer Rexford (Princeton) Jia Wang (AT&T Labs)
Efficient Distribution Mining and Classification Yasushi Sakurai (NTT Communication Science Labs), Rosalynn Chong (University of British Columbia), Lei.
1 Interdomain Routing Protocols. 2 Autonomous Systems An autonomous system (AS) is a region of the Internet that is administered by a single entity and.
BGP-lens: Patterns and Anomalies in Internet Routing Updates B. Aditya Prakash 1, Nicholas Valler 2, David Andersen 1, Michalis Faloutsos 2, Christos Faloutsos.
1 Measurement of Highly Active Prefixes in BGP Ricardo V. Oliveira, Rafit Izhak-Ratzin, Beichuan Zhang, Lixia Zhang GLOBECOM’05.
An Engineering Approach to Computer Networking
CMU SCS Mining Billion-node Graphs Christos Faloutsos CMU.
DYNAMICS OF PREFIX USAGE AT AN EDGE ROUTER Kaustubh Gadkari, Dan Massey and Christos Papadopoulos 1.
1 Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network Jian Wu (University of Michigan) Z. Morley Mao (University.
CMU SCS Data Mining Meets Systems: Tools and Case Studies Christos Faloutsos SCS CMU.
Learning-Based Anomaly Detection in BGP Updates Jian Zhang Jennifer Rexford Joan Feigenbaum.
A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University
Lecture Week 3 Introduction to Dynamic Routing Protocol Routing Protocols and Concepts.
Transport Layer 3-1 Chapter 4 Network Layer Computer Networking: A Top Down Approach 6 th edition Jim Kurose, Keith Ross Addison-Wesley March 2012  CPSC.
EQ-BGP: an efficient inter- domain QoS routing protocol Andrzej Bęben Institute of Telecommunications Warsaw University of Technology,
Authors Renata Teixeira, Aman Shaikh and Jennifer Rexford(AT&T), Tim Griffin(Intel) Presenter : Farrukh Shahzad.
1 Computer Communication & Networks Lecture 22 Network Layer: Delivery, Forwarding, Routing (contd.)
Tracking with Unreliable Node Sequences Ziguo Zhong, Ting Zhu, Dan Wang and Tian He Computer Science and Engineering, University of Minnesota Infocom 2009.
Network Anomography Yin Zhang – University of Texas at Austin Zihui Ge and Albert Greenberg – AT&T Labs Matthew Roughan – University of Adelaide IMC 2005.
BGP topics to be discussed in the next few weeks: –Excessive route update –Routing instability –BGP policy issues –BGP route slow convergence problem –Interaction.
1 Route Optimization for Large Scale Network Mobility Assisted by BGP Feriel Mimoune, Farid Nait-Abdesselam, Tarik Taleb and Kazuo Hashimoto GLOBECOM 2007.
TCOM 509 – Internet Protocols (TCP/IP) Lecture 06_a Routing Protocols: RIP, OSPF, BGP Instructor: Dr. Li-Chuan Chen Date: 10/06/2003 Based in part upon.
Exploiting Network Structure for Proactive Spam Mitigation Shobha Venkataraman * Joint work with Subhabrata Sen §, Oliver Spatscheck §, Patrick Haffner.
1 7-Jan-16 S Ward Abingdon and Witney College Dynamic Routing CCNA Exploration Semester 2 Chapter 3.
BGP Routing Stability of Popular Destinations Jennifer Rexford, Jia Wang, Zhen Xiao, and Yin Zhang AT&T Labs—Research Florham Park, NJ All flaps are not.
DataJewel 1 : Tightly Integrating Visualization with Temporal Data Mining Mihael Ankerst, David H. Jones, Anne Kao, Changzhou Wang 1 US patent pending.
Chapter 4: Network Layer
Lec4: Introduction to Dynamic Routing Protocol
Introduction to Dynamic Routing Protocol
CS 3700 Networks and Distributed Systems
BGP Routing Stability of Popular Destinations
BGP 1. BGP Overview 2. Multihoming 3. Configuring BGP.
CS 3700 Networks and Distributed Systems
Cellular IP: A New Approach to Internet Host Mobility
Jian Wu (University of Michigan)
Chapter 4: Network Layer
Power Prefixes Prioritization for Smarter BGP Reconvergence
BGP supplement Abhigyan Sharma.
Interdomain Traffic Engineering with BGP
Trickle: Code Propagation and Maintenance
NetMine: Mining Tools for Large Graphs
Introduction to Dynamic Routing Protocol
CS 457 – Lecture 14 Global Internet
Introduction to Dynamic Routing Protocol
Dynamic Routing Protocols
Delivery and Routing of IP Packets
Static Routing 1st semester
COS 561: Advanced Computer Networks
CS 3700 Networks and Distributed Systems
COMPUTER NETWORKS CS610 Lecture-42 Hammad Khalid Khan.
Chapter 4: Network Layer
CS 3700 Networks and Distributed Systems
Chapter 4: Network Layer
COS 561: Advanced Computer Networks
Chapter 4: Network Layer
Identifying problematic inter-domain routing issues
COMP/ELEC 429/556 Introduction to Computer Networks
2005 – A BGP Year in Review February 2006 Geoff Huston
Visualization of Temporal Difference of BGP Routing Information
Chapter 4: Network Layer
An Engineering Approach to Computer Networking
Data Pre-processing Lecture Notes for Chapter 2
Presentation transcript:

BGP-lens: Patterns and Anomalies in Internet Routing Updates B. Aditya Prakash1, Nicholas Valler2, David Andersen1, Michalis Faloutsos2, Christos Faloutsos1 1Carnegie Mellon University 2UC-Riverside KDD 2009, Paris

Introduction Each Row is an update Border Gateway Protocol (BGP) Internet Routing Protocol Router sending messages to each other Keeps path information up-to-date Ideal Setting - no BGP updates Really – many updates link failures, router restarts, malicious behavior Time peerAS originAS prefix 2005-02-17 12:39:42 ATT SPRINT 204.29.119.0/24 2005-02-17 12:39:43 VERIZON AOL 204.29.80.0/24 2005-02-17 12:39:46 WASH ATLA 204.29.79.0/24 ….

Automated Tool needed! Introduction contd. Question: Find patterns/anomalies? Challenges: Millions of updates sent over network Data has multiple dimensions Noisy Measurements Impossible for human to sift through updates Automated Tool needed!

The Data 18 million update messages – over two years! Time peerAS originAS prefix 2005-02-17 12:39:42 ATT SPRINT 204.29.119.0/24 2005-02-17 12:39:43 VERIZON AOL 204.29.80.0/24 2005-02-17 12:39:46 WASH ATLA 204.29.79.0/24 …. Data from Datapository.net Abilene Network 18 million update messages – over two years!

Our Approach Look at a simple time-series Focus on just the time # of updates received every b seconds (bin size) Specific Problem we are tackling Given such time-series Report patterns and anomalies Also find suspicious entities (paths, ASes etc.) Time 2005-02-17 12:39:42 2005-02-17 12:39:43 2005-02-17 12:39:46 2005-02-17 12:40:01 …. Time peerAS originAS prefix 2005-02-17 12:39:42 ATT SPRINT 204.29.119.0/24 2005-02-17 12:39:43 VERIZON AOL 204.29.80.0/24 2005-02-17 12:39:46 WASH ATLA 204.29.79.0/24 …. time b secs Bin: 0 1 2 … Count: 4 2 6 …

Real data: Washington Router Bin Size = 600s Very Bursty! # of Updates Traditional Tools like FFT, auto-regression don’t work  Bin number (‘Time’)

Outline Introduction and Problem Statement Techniques BGP-lens at work Temporal Analysis Frequency Analysis BGP-lens at work Conclusions

Temporal Analysis Bin size: 10s First Cut: Take log-linear plot emphasizes small values over high values

But: Bin size is important!

Bin size: 600s ‘Clotheslines’

Clotheslines Q1: Why Clotheslines? Q2: How to automate this discovery? Near consecutive updates over long time-period Can be Route Flapping advertise/withdraw same path frequently important to identify Q2: How to automate this discovery?

Proposal: Marginals to Rescue PDF of volume of updates Number of time-bins with volume Extremes == Height of the clotheslines!

Marginals to Rescue PDF of volume of updates Number of time-bins with volume

Algorithm - Clotheslines Details! For marginals plot use the median filtering approach to determine ‘outliers’; For each time interval found, report the most consistent IPs/ASes etc. High Level Idea only – details in paper!

Outline Introduction and Problem Statement Techniques BGP-lens at work Temporal Analysis Frequency Analysis BGP-lens at work Conclusions 1-2 sentences leading to wavelets (self-similar, same on many scales, used tests)

time -> High energy Low energy ‘Tornado’ does not touch down Low Freq. High energy Low energy ‘Tornado’ does not touch down High Freq. time -> Signal Basic tool – wavelet scalogram plots values of wavelet coefficients in the scale-space domain scale vs time lower frequencies on the top darker color -> high energy in the co-efficient at finer scale – fast short cycles at coarser scale – slow long cycles

In real data… E2

E2 ~ 8 hrs ~ 20,000 updates!

Why Prolonged Spike? Bursts of short duration Can represent malicious behavior Or simple router restarts! Exact cause hard to find – but important for system-administrators

Algorithm – Prolonged Spikes Details! Basic idea: find tornados from scalogram Find suitable starting point at higher levels Extend downward as much as possible The finest scale where tornado stops the shortest time period to look for a prolonged spike Again, details in paper!

Scalability 

BGP-lens: User Interface optional # of suspicious events sysadmin wants to check Ready to be used as is scans multiple levels ranks deviations duration: length of events to be checked (think daily vs weekly vs monthly)

Outline Introduction and Problem Statement Techniques BGP-lens at work Temporal Analysis Frequency Analysis BGP-lens at work Conclusions

BGP-lens at Work We found real events too . examples- Event 1: 50-clothesline Prefix and Origin-AS pointed to Alabama Supercomputing Net When contacted sysadmins attributed changes to route flapping “the route for 207.157.115.0/24 was appearing and disappearing in [the] IGP routing table ... [which] may have caused BGP to flap.” Anomaly went undetected and unresolved for 30 days!

Results from real data Event 2 Prolonged Spike May 12th 2006 – 8hr spike Most persistent IPs/ASes Primary and middle schools in a large district in a country Two more spikes Jan18-19, 2006 and Aug 1

Conclusions Studied huge real data (~18 million updates) Developed two new techniques effective spots subtle phenomena like clotheslines and prolonged spikes scalable BGP-lens: a user-friendly tool provides reasonable defaults provides easy-to-use knobs leads like IPs/ASes

Thank You! Any questions? www.cs.cmu.edu/~badityap Author-Reel! We thank NSF, USA for their support. Author-Reel!

Extra - Frequency Analysis Data is self-similar! we used the entropy-plot measure also called the b-model [26] Corresponds to b-model of 75-25 Multi-resolution techniques needed!

Extra - FFT

Extra – Marginals for 10sec

Extra – Prolonged Spike Algorithm