On the Geographic Location of Internet Resources Mark Crovella Boston University Computer Science with Anukool Lakhina, John Byers, and Ibrahim Matta or.

Slides:



Advertisements
Similar presentations
University of Nevada, Reno Router-level Internet Topology Mapping CS790 Presentation Modified from Dr. Gunes slides by Talha OZ.
Advertisements

1 Generating Network Topologies That Obey Power LawsPalmer/Steffan Carnegie Mellon Generating Network Topologies That Obey Power Laws Christopher R. Palmer.
Complex Networks Advanced Computer Networks: Part1.
COS 461 Fall 1997 Routing COS 461 Fall 1997 Typical Structure.
Analysis and Modeling of Social Networks Foudalis Ilias.
School of Information University of Michigan Network resilience Lecture 20.
Authors: Venkata N. Padmanabhan and Lakshminarayanan Subramanian Publisher: SIGCOMM 2001 Presenter: Chai-Yi Chu Date: 2013/03/06 1.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Small World Networks Lecture 5.
Internet Topology Caterina Scoglio KSU. Why need for Internet Topology models To evaluate performance of algorithms and protocols Realistic models at.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
CS728 Lecture 5 Generative Graph Models and the Web.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Small-World Graphs for High Performance Networking Reem Alshahrani Kent State University.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
New Directions in Internet Topology Measurement and Modeling John Byers Topology Modeling Group, Boston University CS: Mark Crovella, Marwan Fayed, Anukool.
On the Geographic Location of Internet Resources CSCI 780, Fall 2005.
1 A survey of Internet Topology Discovery. 2 Outline Motivations Internet topology IP Interface Level Router Level AS Level PoP Level.
Network Design IS250 Spring 2010 John Chuang. 2 Questions  What does the Internet look like? -Why do we care?  Are there any structural invariants?
An Algebraic Approach to Practical and Scalable Overlay Network Monitoring Yan Chen, David Bindel, Hanhee Song, Randy H. Katz Presented by Mahesh Balakrishnan.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Computer Science 1 An Approach to Universal Topology Generation Alberto Medina Anukool Lakhina Ibrahim Matta John Byers
On Power-Law Relationships of the Internet Topology CSCI 780, Fall 2005.
Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja, Abhijit Bose, Farham Jahanian Presented By Harpal Singh Bassali.
Heuristics for Internet Map Discovery Ramesh Govindan USC/Information Sciences Institute Joint work with Hongsuda Tangmunarunkit.
Computer Science Sampling Biases in IP Topology Measurements John Byers with Anukool Lakhina, Mark Crovella and Peng Xie Department of Computer Science.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Heuristics for Internet Map Discovery R. Govindan, H. Tangmunarunkit Presented by Zach Schneirov.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
The structure of the Internet. The Internet as a graph Remember: the Internet is a collection of networks called autonomous systems (ASs) The Internet.
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
Summary from Previous Lecture Real networks: –AS-level N= 12709, M=27384 (Jan 02 data) route-views.oregon-ix.net, hhtp://abroude.ripe.net/ris/rawdata –
Measuring ISP topologies with Rocketfuel Ratul Mahajan Neil Spring David Wetherall University of Washington ACM SIGCOMM 2002.
1 Network Topology Measurement Yang Chen CS 8803.
PALMTREE M. Engin TozalKamil Sarac The University of Texas at Dallas.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Modeling Internet Topology Ellen W. Zegura College of Computing Georgia Tech.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
(jeez y) Where is the Internet? Answers from : (G. Whilikers) Out there. (Mike) the way I see it, the "internet" has to be somewhere. a router collects.
Towards Highly Reliable Enterprise Network Services via Inference of Multi-level Dependencies Paramvir Bahl, Ranveer Chandra, Albert Greenberg, Srikanth.
Subnet Level Internet Topology Generator Mehmet Burak AKGUN CS790 Complex Networks.
Traceroute-like exploration of unknown networks: a statistical analysis A. Barrat, LPT, Université Paris-Sud, France I. Alvarez-Hamelin (LPT, France) L.
Lecture 8 Page 1 Advanced Network Security Review of Networking Basics: Internet Architecture, Routing, and Naming Advanced Network Security Peter Reiher.
A Place-based Model for the Internet Topology Xiaotao Cai Victor T.-S. Shi William Perrizo NDSU {Xiaotao.cai, Victor.shi,
“The Geography of the Internet Infrastructure: A simulation approach based on the Barabasi-Albert model” Sandra Vinciguerra and Keon Frenken URU – Utrecht.
Issues with Inferring Internet Topological Attributes Lisa Amini ab, Anees Shaikh a, Henning Schulzrinne b a IBM T.J. Watson Research Center b Columbia.
1 Spectral Analysis of Power-Law Graphs and its Application to Internet Topologies Milena Mihail Georgia Tech.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Aemen Lodhi (Georgia Tech) Amogh Dhamdhere (CAIDA)
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
The Geography and Evolution of the European Internet Infrastructure Sandra Vinciguerra URU – Utrecht University
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
Measurement in the Internet Measurement in the Internet Paul Barford University of Wisconsin - Madison Spring, 2001.
Hierarchical Organization in Complex Networks by Ravasz and Barabasi İlhan Kaya Boğaziçi University.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
A Place-based Model for the Internet Topology Xiaotao Cai Victor T.-S. Shi William Perrizo NDSU {Xiaotao.cai, Victor.shi,
1 On the Impact of Route Monitor Selection Ying Zhang* Zheng Zhang # Z. Morley Mao* Y. Charlie Hu # Bruce M. Maggs ^ University of Michigan* Purdue University.
Link-Level Internet Structures
Topics In Social Computing (67810)
No Direction Home: The True cost of Routing Around Decoys
Subnet Level Internet Topology Generator
Measured Impact of Crooked Traceroute
Peer-to-Peer and Social Networks Fall 2017
Lecture 26: Internet Topology CS 765: Complex Networks.
Presentation transcript:

On the Geographic Location of Internet Resources Mark Crovella Boston University Computer Science with Anukool Lakhina, John Byers, and Ibrahim Matta or Where on Earth is the Internet?

Some observations about the Internet Rapid, decentralized growth: –90% of Internet systems were added in the last four years –Connecting to the network can be a purely local operation This rapid, decentralized growth has opened significant questions about the physical structure of the network; e.g., –The number of hosts connected to the network –The properties of network links (delay, bandwidth) –The interconnection pattern of hosts and routers –The interconnection relationships of ISPs –The geographic locations of hosts, routers and links

Internet Science Engineering or Science? Engineering: study of things made Science: study of things found Although the Internet is a engineered artifact, it now presents us with questions that are better approached from a scientific posture

For example: where is the Internet? ? ? ? ? ? What is the relationship between geography and the network?

Why does this matter? Our motivation is in developing better generators for “representative network topologies” –Many simulation based results in networking depend critically on network topology –Topology generation is still fairly ad hoc However, there is a scientific goal as well! –We need to understand what drives Internet growth –Basic investigations will pay off in unexpected ways

Assumptions and Definitions We will treat the Internet as an undirected graph embedded on the Earth’s surface –Nodes correspond to routers or interfaces –Edges correspond to physical router-router links –Not concerned with hosts (end systems) We will ignore many higher and lower level questions –Autonomous systems –Link layers

Some Basic Questions What is the relationship between population and network geometry? What effect does distance have on network topology? Our Basic Approach Obtain IP-level router maps  Mercator and Skitter Find geographic locations of those routers  Ixia’s IxMapping service

Mercator: Govindan et al., USC/ISI, ICSI Discovers a router-level map of the Internet using active probing Uses hop-limited UDP packets Addresses a number of difficult problems: –What IP addresses represent valid hosts? –How to observe cross-links from a single vantage point? –How to resolve aliases?

Mercator: Govindan et al., USC/ISI, ICSI Skitter: Moore et al., CAIDA Based on active probing from a single site Resolves aliases Uses loose source routing to explore alternate paths Traceroutes from 19 monitors to large set of destinations Does not resolve aliases Destinations attempt to cover IP address space

Datasets Mercator Collected August ,263 routers 320,149 links Skitter Collected January ,107 interfaces 1,075,454 links

IxMapping: Moore et al., CAIDA Given an IP address, infers geographic location based on a variety of heuristics –Hostnames, DNS LOC, whois e.g., 190.ATM8-0-0.GW3.BOS1.ALTER.NET is in Boston Able to map –99% of Mercator routers –98.5% of Skitter interfaces Similar to GeoTrack [Padmanabhan] which exhibits reasonable accuracy –Median error of 64 mi –90% queries within 250 mi –for well connected nodes

Putting it together Used most recent Mercator dataset (2001) –217,000 routers (after cleanup) –37,000 unique locations –279,000 links Mapped all routers using NetGeo –88% “City”, 10% “Country”, 2% “State/Prov” –Precision on the order of tens of miles or more

Where are the routers? USA

Europe

Routers and People: World (Grid size: ~150 mi x 150 mi) Ugh!

People Per Interface (Skitter) Pop (M) Intfs (K) People / Intfc Online (M) Online / Intfc Africa8378,379100, S. America34110,13133, ,161 Mexico1544,36135, W. Europe36695,9933, ,489 Japan13637,6493, ,250 Australia1818, USA299282,0481, World5,653563,52110, Pop (M) Intfs (K) People / Intfc Online (M) Online / Intfc Africa8378,379100, S. America34110,13133, ,161 Mexico1544,36135, W. Europe36695,9933, ,489 Japan13637,6493, ,250 Australia1818, USA299282,0481, World5,653563,52110,

Interfaces and People: USA, Skitter Grid size: ~90 mi x 90 mi

Routers and People Upper, Mercator; Lower, Skitter USA Europe Japan

Router Location: Summary Router location is strongly driven by both population density and economic development Superlinear relationship between router and population density: R  k P a k varies with economic development (users online) a is greater than one More routers per person in more densely populated areas

Models for Network Topology Spatial Models Structural Models Degree-based Models

Spatial model: Waxman, 1988 Nodes are distributed randomly (uniformly) in the plane. Probability that two nodes separated by distance d are connected: P[C|d] =  exp(-d/  L) 0  ,   1; L = diameter of region  : degree of distance sensitivity  : edge density A spatially imbedded random graph

Structural Models Real networks have structure –Always connected! –Formed by interconnection of component networks –Distinction between transit and stub networks imposes a hierarchy on resulting graph Tiers: Doar, 1996 GT-ITM: Calvert, Doar, Zegura, 1997

Degree-based Models Faloutsos, Faloutsos, & Faloutsos, 1999: –Empirically measured networks show a power law in degree distribution: P[node has degree d] = k d -a Barabasi & Albert, 1999: –This property will be present in a graph where: Nodes and links are added incrementally Probability of connecting to a node is proportional to its degree (preferential connectivity) BRITE: Medina, Matta, Byers, 2001

Empirical Evidence Interested in influence of distance on link formation: f(d) = P[C|d] i.e., Probability two nodes separated by distance d are connected Estimated as: number links of length d f(d) = number of router pairs separated by d

f(d) for USA (Skitter) Distance Sensitive Distance Insensitive

Link Distance Preference for USA Skitter, d < 250, semi-log plot  L  140 mi.

Link distance preference: all regions Upper, Mercator; Lower, Skitter; small d USA  L  140 mi Europe  L  80 mi Japan  L  140 mi

Large d: distance insensitivity F(d) =  f(u) d u=1 USA data, Skitter

Distance insensitivity, all regions Upper, Mercator; Lower, Skitter; large d USA Europe Japan

Limits to distance sensitivity MercatorSkitter Limit% Links < Limit Limit% Links < Limit USA820 mi.82.1%818 mi.77.2% Europe383 mi.97.3%366 mi.95.4% Japan165 mi.91.5%116 mi.92.8%

Link Formation: Summary Link formation seems to be a mixture of distance-dependent and –independent processes Waxman (exponential) model remarkably good for large fraction of all links! –But, crucial difference is that we are using a very irregular spatial distribution of nodes Small fraction of non-local links are very important (structural)

Generating Topologies: a new recipe 1.Good models for population density exist –CIESIN’s Gridded Population of the World –2.5 arc minutes (<5 mi 2 ) … very high quality –Time trends also available 2.Router density then follows from population relationship 3.Link formation driven by hybrid process –Distance-dependent and –independent

Related Work Matrix.net: –Uses DNS hostname allocations –proprietary location methods Akamai –Extensive peering and measurement infrastructure Padmanabhan and Subramanian, 2000: –assessed accuracy of geographic mapping techniques Yook, Jeong, and Barabasi, 2001: –Similar goals –Find linear (not exponential) distance dependence

Final Thoughts The Internet has fully interpenetrated human society Scientific understanding of the net is essential Applications: –Simulation –Security –Reliability –Planning

Thanks! CAIDA: –David Moore –k claffy –Andre Broido Notre Dame: –Lazslo Barabasi USC: –Ramesh Govindan –Hongsuda Tangmunarunkit

Routers and People: North America

Subdividing the Data N. US C. A. S. US

Economic Heterogeneity

What Influences the Formation of Links? Waxman, 1988: spatial model Zegura et al., 1997: structural model –Explicitly captures hierarchical structure Barabasi et al., 1999: degree-based preferential connectivity –Matches observed power-law node degree –Inspired by Faloutsos et al., 1999 Strogatz & Watts, 1998: small-world properties –Captures “six degrees of separation”

Routers and Economics Matrix.net: hosts Mercator: routers

Penultimate Geographic Map, Oct 1980

Last Complete Geographic Map, Aug 1982