Comparative Analysis of Internet Topology Data Sets

Slides:



Advertisements
Similar presentations
University of Nevada, Reno Router-level Internet Topology Mapping CS790 Presentation Modified from Dr. Gunes slides by Talha OZ.
Advertisements

© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Introduction to IPv4 Introduction to Networks.
Addressing the Network IPv4
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
CCNA2 Module 4. Discovering and Connecting to Neighbors Enable and disable CDP Use the show cdp neighbors command Determine which neighboring devices.
1 Semester 2 Module 4 Learning about Other Devices Yuda college of business James Chen
Internet Topology Mapping
Router-level Internet Topology Mapping By Talha OZ.
1 A survey of Internet Topology Discovery. 2 Outline Motivations Internet topology IP Interface Level Router Level AS Level PoP Level.
Network Measurement Bandwidth Analysis. Why measure bandwidth? Network congestion has increased tremendously. Network congestion has increased tremendously.
Measuring ISP topologies with Rocketfuel Ratul Mahajan Neil Spring David Wetherall University of Washington ACM SIGCOMM 2002.
1 Network Topology Measurement Yang Chen CS 8803.
Copyright 2003 CCNA 1 Chapter 7 TCP/IP Protocol Suite and IP Addressing By Your Name.
1 CS 4396 Computer Networks Lab The Internet. 2 A Definition On October 24, 1995, the FNC unanimously passed a resolution defining the term Internet.
IPlane Status Harsha V. Madhyastha University of California San Diego.
INTERNET TOPOLOGY MAPPING INTERNET MAPPING PROBING OVERHEAD MINIMIZATION  Intra- and inter-monitor redundancy reduction IBRAHIM ETHEM COSKUN University.
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 1 v3.0 Module 9 TCP/IP Protocol Suite and IP Addressing.
1 26-Aug-15 Addressing the network using IPv4 Lecture # 2 Engr. Orland G. Basas Prepared by: Engr. Orland G. Basas IT Lecturer.
CCNA Introduction to Networking 5.0 Rick Graziani Cabrillo College
Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.
Managing Network connections. Network Cabling Ethernet Topology Bus topology – Connects each node in a line – Has no central connection point Star topology.
Unit 4, Lesson 11 How Data Travels the Internet
CCNA 1 v3.0 Module 9 TCP/IP Protocol Suite and IP Addressing
Chapter 4. After completion of this chapter, you should be able to: Explain “what is the Internet? And how we connect to the Internet using an ISP. Explain.
Objectives: Chapter 5: Network/Internet Layer  How Networks are connected Network/Internet Layer Routed Protocols Routing Protocols Autonomous Systems.
Connecting to a Network Lesson 5. Objectives Understand the OSI Reference Model and its relationship to Windows 7 networking Install and configure networking.
Tony McGregor RIPE NCC Visiting Researcher The University of Waikato DAR Active measurement in the large.
UNDERSTANDING THE HOST-TO-HOST COMMUNICATIONS MODEL - OSI LAYER & TCP/IP MODEL 1.
Chapter 15 – Part 2 Networks The Internal Operating System The Architecture of Computer Hardware and Systems Software: An Information Technology Approach.
1 The Internet Introductory material. An overview lecture that covers Internet related topics, including a definition of the Internet, an overview of its.
Lecture 14 Internet Measurements. 2 Web of interconnected networks Grows with no central authority Autonomous Systems optimize local communication efficiency.
IP1 The Underlying Technologies. What is inside the Internet? Or What are the key underlying technologies that make it work so successfully? –Packet Switching.
CCNA 1 v3.0 Module 9 TCP/IP Protocol Suite and IP Addressing
Internet Measurements. 2 Web of interconnected networks Grows with no central authority Autonomous Systems optimize local communication efficiency The.
Lecture 14: Internet Measurement CS 765: Complex Networks.
Lecture 17 Internet Measurements. 2 Web of interconnected networks Grows with no central authority Autonomous Systems optimize local communication efficiency.
1 The Internet Introductory material. An overview lecture that covers Internet related topics, including a definition of the Internet, an overview of its.
Internet Measurements. 2 Web of interconnected networks Grows with no central authority Autonomous Systems optimize local communication efficiency The.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
TCP/IP Protocol Suite and IP Addressing Presented By : Dupien AMS.
The OSI Model. Understanding the OSI Model In early 1980s, manufacturers began to standardize networking so that networks from different manufacturers.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the network IPv4 CCNA Exploration Semester 1 – Chapter 6.
ITMT Windows 7 Configuration Chapter 5 – Connecting to a Network ITMT 1371 – Windows 7 Configuration 1.
NT1210 Introduction to Networking
Lecture 2: Internet Measurement CS 790g: Complex Networks.
1 © 2004, Cisco Systems, Inc. All rights reserved. CCNA 2 v3.1 Module 8 TCP/IP Suite Error and Control Messages.
June 11, 2002 Abilene Route Quality Control Initiative Aaron D. Britt Guy Almes Route Optimization.
1 Computer Networks Chapter 5. Network layer The network layer is concerned with getting packets from the source all the way to the destination. Getting.
IP Addresses IPv4 IPv6.
COMPUTER NETWORKS and INTERNETS
DDoS Attacks on Financial Institutions Presentation
Mapa de Topología usando sondas RIPE Atlas
How The Internet Works Lecture 2
Distributed Systems.
Network Tools and Utilities
Comparative Analysis of Internet Topology Data Sets
Improved Algorithms for Network Topology Discovery
Internet and Intranet.
Computer Communication and Networking
Introduction to Networking
RESOLVING IP ALIASES USING DISTRIBUTED SYSTEMS
Chapter 9 Objectives Understand TCP/IP Protocol.
Internet and Intranet.
Measured Impact of Crooked Traceroute
Pong: Diagnosing Spatio-Temporal Internet Congestion Properties
Internet and Intranet.
The Internet Introductory material.
Lecture 26: Internet Topology CS 765: Complex Networks.
IPv6 Addressing By Aman Agrawal Archisman Bhattacharya
Internet and Intranet.
Presentation transcript:

Comparative Analysis of Internet Topology Data Sets Jay Thom

Outline Introduction Background Related Works Project Goal Conclusion Why is this important? Background History Internet Topology Measurement Related Works Data Sources Papers Project Goal Data Collection and Preparation Analysis Conclusion

Why Is This Important? My motivation: Internet Topology Measurement Our project Challenges with data collection Dealing with large amounts of data What can be learned? Project goals

Some History… Circuit Switching 1877 - First commercial circuit switched networks appear

Some History… Packet Switching First packet-switched network (ARPANET) sends a message from UCLA to Stanford, 1969 Attempted to send the word “LOGIN”, but crashed, sending the message “LO”

History of the Internet 1970 – First network protocol introduced: Network Control Protocol (NCP) 1970 – First network applications begin to appear 1972 – First ’hot’ application introduced; email 1982 – TCP/IP protocol introduced 1986 – NSFNET interconnects computer centers at several universities 1990 – First commercial connections to the network appear 1991 – World Wide Web goes live, widespread access to the network begins 2016 - http://www.internetlivestats.com/ Why is all of this important?

The Network of Networks Internet is made up of 55,483 autonomous systems (Ases) Each system is managed independently Cooperation within the network is voluntary ASes seek to improve their own performance, maintain competitive relationships Topological details of each AS is undisclosed, proprietary How do we know what the Internet actually looks like? What tools do we have to monitor it?

Measurement Tools – Ping

Measurement Tools - Traceroute

Measurement Platforms CAIDA – Archipelago (Ark) Measurement Lab (M-Lab) University of Washington Information Plane (iPlane) Ripe NNC Atlas University of Southern California ISI Ant Census PlanetLab

Ark Statistics CAIDA – Center for Applied Internet Data Analysis University of California San Diego Measurement and data curation (archives) 165 monitors in 57 countries Growing by 766 million traces per month Growing by 316 GB per month stored 41 billion traces performed (total) Data stored in a binary format (.warts file) Began running in 1998

Ark Raspberry Pi Monitor

Ark Monitor Locations

Ripe Atlas Ripe NNC – Reseaux IP Europeens Network Coordination Centre Regional Internet Registry (RIR) for Europe, Middle East, and Central Asia Based in Amsterdam, Netherlands 13,554 probes (small network devices) 208 anchors (rack-mounted servers) Traces are performed regularly from probes to anchors Anchors can also perform user-defined traces to any IP address Traces are stored at anchors, and can be downloaded in .json format Established in 1992

Ripe Atlas Hardware Rack mounted anchor Small probe (connected anywhere)

Ripe Atlas Anchors

Ripe Atlas Probes

PlanetLab Nodes hosted by corporate/academic institutions 1090 nodes in 507 countries Affiliated users are granted a ”slice” to run experiments on most* available nodes UNR has 2 PlanetLab nodes in operation Used by M-Lab and iPlane as vantage points for measurements Slowly dying…only about 176 nodes currently in operation * Some nodes are reserved for exclusive use by M-Lab

PlanetLab

M-Lab Utilizes PlanetLab nodes to generate traces Supports a number of tools, and archives all data making it accessible to the public. such as: Glasnost - detects prioritization or censorship of network traffic Network Diagnostic Tool (NDT) – measures TCP performance Neubot – for studying broadband performance NPAD – diagnoses issues in a network plan to improve performance OONI – censorship, surveillance, traffic manipulation. Paris Traceroute – network topology mapping SideStream – TCP state information Mlab-collectd – monitors M-Lab slices on PlanetLab A collaborative effort involving Google, New America, Princeton University, and others.

iPlane Utilizes PlanetLab nodes to perform traceroutes to infer router level topology. Clusters interfaces into PoPs: iPlane clusters interfaces that are in the same Point of Presence (PoP). For this, every interface in the router atlas is probed using UDP and ICMP packets. Interfaces that respond with the same source address or have similar return TTLs to all the vantage points are clustered together. Measures link attributes: loss rate, bandwidth capacity of all inter-cluster links. Performs route prediction: iPlane composes segments of observed Internet paths to predict the end-to-end path between any pair of end-hosts, and uses this prediction to estimate end-to-end performance for overlay services.

Ant Census Started in 2003 Scans IPv4 space every 42 days 4.3 billion addresses Utilizes ICMP ping, looks for response Shows ownership of all 256 /8 subnets Brighter – more replies About 6% of addresses respond

Problems Extract all files and convert to a common format Each platform storing data in a different format on a different file system Ripe – uncompressed .json format, data stored at each anchor Ark – compressed .warts files (binary), nested file system, password protected iPlane – compressed binary files, custom format (C++), password protected M-Lab – Google Cloud storage, compressed files within files Ant Census – released once every two months Extract all files and convert to a common format 1 month = 1TB of data

Related Work Pietro Marchetta et al, “Topology discovery at the router level: a new hybrid tool targeting ISP Networks”, 2011 An attempt to find a new way to collect measurements without traceroute and alias resolution. Introduce a new tool, Merlin, a central server controls distributed vantage points to probe a targeted AS. Uses MRINFO, a ping-like tool to monitor active multicast groups, utilizes IGMP messages ASK_NEIGHBORS and NEIGHBORS_REPLY. Deployed recursively to all routers in an AS and gives a listing of each router’s multicast neighbors. Performs worse than traceroute for inter-AS measurements, but much better for mapping the core of an AS.

Related Work Benoit Donnet and Timur Friedman, “Internet Topology Discovery: A Survey”, 2007 Discusses the four levels of Internet topology: IP interface level, Router level, PoP level, and AS level Seek to build a formal graph from network measurement data From this perspective, study characteristics of the network in terms of average degree, degree distribution, clustering coefficient, and between-ness centrality Would like to build a visualization of the network based on these factors No actual data collection.

Related Work Hakan Kardes, Mehmet Gunes, Talha Oz, “Cheleby: A subnet-level Internet Topology Mapping System”, 2012 Divide available PlanetLab nodes into 7 teams based on geographic location Each node is assigned a block of IP addresses from other sources, plus the first address from each /24 subnet (to reach all subnets) Each monitor probes 4 destination blocks at a time, and each block is only probed by 1 monitor Each process is independent, and eventually all blocks are probed by all monitors Amassed data set is analyzed to discover features in the network topology (i.e. number of nodes, edges, alias resolution statistics)

Related Work Kimberly Claffy et al, “Internet Mapping: from Art to Science”, 2009 Seeking to improve on previous tool (SKITTER) This was the paper written to introduce the Ark project by CAIDA Intend to amass largest data set Plan to build repository for measurement data from their study as well as others Will make data available to research community

Related Work Bradley Huffaker et al, “Internet Topology Data Comparison”, 2012 Studies topology at IP, Router, and AS level Notes that different topology studies are producing conflicting results Discusses metrics; Average node degree Graph size Number of edges Node degree distribution Clustering Mean local clustering Discusses possible inaccuracies in previous works because of a lack of IP alias resolution leading to an over-estimation of the number of routers

Related Work Vaibhav Bajpai and Jorgen Schownwilder, “A Survey on Internet Performance Measurement Platforms and Related Standardization Efforts”, 2015 Survey of network performance tools Focus on performance metrics rather than topology (bandwidth, reachability, censorship, throttling, etc. Consider some of the same platforms, but from another perspective

Related Work John Heidemann et al, “A Survey of the Visible Internet”, 2008 Many hosts are hidden (firewalls, private IP space), but there is much to be learned from the visible address space Census: walk the entire address space and look for responsive hosts Survey: frequently sample a fraction of that space Some results: 3.6% of the allocated space are actually occupied by visible hosts ¼ of responsive /24 subnet blocks are less than 5% filled 9% of responsive /24 subnet blocks are more than ½ filled 16% (34 million IPs) are responsive and stable Estimated from this, 60 million Internet-accessible computers exist

Conclusion History Why is this important? Measurement Platforms Problems Related Work Project Goals