L-24 Adaptive Applications 1. State of the Art – Manual Adaptation Objective: automating adaptation ? CaliforniaNew York 2.

Slides:



Advertisements
Similar presentations
T. S. Eugene Ng Mellon University1 Towards Global Network Positioning T. S. Eugene Ng and Hui Zhang Department of Computer.
Advertisements

Intel Research Internet Coordinate Systems - 03/03/2004 Internet Coordinate Systems Marcelo Pias Intel Research Cambridge
COS 461 Fall 1997 Routing COS 461 Fall 1997 Typical Structure.
Ningning HuCarnegie Mellon University1 Optimizing Network Performance In Replicated Hosting Peter Steenkiste (CMU) with Ningning Hu (CMU), Oliver Spatscheck.
Delay bounded Routing in Vehicular Ad-hoc Networks Antonios Skordylis Niki Trigoni MobiHoc 2008 Slides by Alex Papadimitriou.
Fabián E. Bustamante, 2007 Meridian: A lightweight network location service without virtual coordinates B. Wong, A. Slivkins and E. Gün Sirer SIGCOM 2005.
EL9331 Meridian: A Lightweight Network Location Service without Virtual Coordinates Bernard Wong, Aleksandrs Slivkins, Emin Gun Sirer SIGCOMM’05 ( Slides.
1 Turning Heterogeneity into an Advantage in Overlay Routing Gisik Kwon Dept. of Computer Science and Engineering Arizona State University Published in.
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
Generated Waypoint Efficiency: The efficiency considered here is defined as follows: As can be seen from the graph, for the obstruction radius values (200,
By Hitesh Ballani, Paul Francis, Xinyang Zhang Slides by Benson Luk for CS 217B.
IPlane: An Information Plane for Distributed Services Offence by: Anup Goyal Sagar Vemuri.
15-441: Computer Networking Lecture 26: Networking Future.
An Algebraic Approach to Practical and Scalable Overlay Network Monitoring Yan Chen, David Bindel, Hanhee Song, Randy H. Katz Presented by Mahesh Balakrishnan.
King : Estimating latency between arbitrary Internet end hosts Krishna Gummadi, Stefan Saroiu Steven D. Gribble University of Washington Presented by:
Traffic Engineering Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
Overlay Networks EECS 122: Lecture 18 Department of Electrical Engineering and Computer Sciences University of California Berkeley.
A Network Measurement Architecture for Adaptive Networked Applications Mark Stemm* Randy H. Katz Computer Science Division University of California at.
Predicting Communication Latency in the Internet Dragan Milic Universität Bern.
T. S. Eugene Ng Mellon University1 Global Network Positioning: A New Approach to Network Distance Prediction Tze Sing Eugene.
Measurement and Monitoring Nick Feamster Georgia Tech.
Spring Routing & Switching Umar Kalim Dept. of Communication Systems Engineering 06/04/2007.
Searching in Unstructured Networks Joining Theory with P-P2P.
Network Measurement Bandwidth Analysis. Why measure bandwidth? Network congestion has increased tremendously. Network congestion has increased tremendously.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
1 Network Topology Measurement Yang Chen CS 8803.
Ad Hoc Wireless Routing COS 461: Computer Networks
On the Power of Off-line Data in Approximating Internet Distances Danny Raz Technion - Israel Institute.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
1 Pertemuan 20 Teknik Routing Matakuliah: H0174/Jaringan Komputer Tahun: 2006 Versi: 1/0.
Packet-Switching Networks Routing in Packet Networks.
Quantifying the Causes of Path Inflation Neil Spring, Ratul Mahajan, and Thomas Anderson Presented by Luv Kohli COMP November 24, 2003.
Application-Layer Anycasting By Samarat Bhattacharjee et al. Presented by Matt Miller September 30, 2002.
HERO: Online Real-time Vehicle Tracking in Shanghai Xuejia Lu 11/17/2008.
Network Aware Resource Allocation in Distributed Clouds.
Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Advanced Networking Lab. Given two IP addresses, the estimation algorithm for the path and latency between them is as follows: Step 1: Map IP addresses.
TDTS21: Advanced Networking Lecture 7: Internet topology Based on slides from P. Gill and D. Choffnes Revised 2015 by N. Carlsson.
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
TOMA: A Viable Solution for Large- Scale Multicast Service Support Li Lao, Jun-Hong Cui, and Mario Gerla UCLA and University of Connecticut Networking.
Resource Addressable Network (RAN) An Adaptive Peer-to-Peer Substrate for Internet-Scale Service Platforms RAN Concept & Design  Adaptive, self-organizing,
1 Vivaldi: A Decentralized Network Coordinate System Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris Presented by: Chen Qian.
A Light-Weight Distributed Scheme for Detecting IP Prefix Hijacks in Real-Time Lusheng Ji†, Joint work with Changxi Zheng‡, Dan Pei†, Jia Wang†, Paul Francis‡
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
Network Coordinates : Internet Distance Estimation Jieming ZHU
CSE534- Fundamentals of Computer Networking Lecture 12-13: Internet Connectivity + IXPs (The Underbelly of the Internet) Based on slides by D. Choffnes.
ECE 544 Project3 Group 9 Brien Range Sidhika Varshney Sanhitha Rao Puskuru.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
Indian Institute of Technology Bombay 1 Communication Networks Prof. D. Manjunath
Chapter 25 Internet Routing. Static Routing manually configured routes that do not change Used by hosts whose routing table contains one static route.
Distance Vector Routing
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
15-829A/18-849B/95-811A/19-729A Internet-Scale Sensor Systems: Design and Policy Review.
Doc.: IEEE /2200r2 Submission July 2007 Sandesh Goel, Marvell et alSlide 1 Route Metric Proposal Date: Authors:
Performance Comparison of Ad Hoc Network Routing Protocols Presented by Venkata Suresh Tamminiedi Computer Science Department Georgia State University.
PATH DIVERSITY WITH FORWARD ERROR CORRECTION SYSTEM FOR PACKET SWITCHED NETWORKS Thinh Nguyen and Avideh Zakhor IEEE INFOCOM 2003.
PlanetSeer: Internet Path Failure Monitoring and Characterization in Wide-Area Services Ming Zhang, Chi Zhang Vivek Pai, Larry Peterson, Randy Wang Princeton.
Route Metric Proposal Date: Authors: July 2007 Month Year
Lecture 13 – Network Mapping
Vivaldi: A Decentralized Network Coordinate System
PROTEAN: A Scalable Architecture for Active Networks
Intra-Domain Routing Jacob Strauss September 14, 2006.
Early Measurements of a Cluster-based Architecture for P2P Systems
CMPE 252A : Computer Networks
Route Metric Proposal Date: Authors: July 2007 Month Year
Presentation transcript:

L-24 Adaptive Applications 1

State of the Art – Manual Adaptation Objective: automating adaptation ? CaliforniaNew York 2

Motivation Large-scale distributed services and applications  Napster, Gnutella, End System Multicast, etc Large number of configuration choices K participants  O(K 2 ) e2e paths to consider Stanford MIT CMU Berkeley CMU MIT Stanford Berkeley Stanford MIT CMU Berkeley CMU MIT Stanford Berkeley Stanford MIT CMU Berkeley CMU MIT Stanford Berkeley 3

Why is Automated Adaptation Hard? Must infer Internet performance  Scalability  Accuracy  Tradeoff with timeliness Support for a variety of applications  Different performance metrics  API requirements Layered implementations hide information 4

Tools to Automate Adaptation Tools to facilitate the creation of adaptive networked applications Adapting on longer time scale (minutes)  Deciding what actions to perform  Deciding where to perform actions  Need to predict performance Adapting on short time scale (round-trip time)  Deciding how to perform action  Need to determine correct rate of transmission 5

Adaptation on Different Time Scales ? CaliforniaNew York Long Time Scale Short Time Scale Content Negotiation Server Selection Adaptive Media 6

Motivation Source: planet-lab.org What’s the closest server to a client in Brazil ? Geographical distances server1 -> 4500 miles server2 -> 6000 miles …… Client Server 7

Motivation Difficulties:  Geographical distances ≠ network distances  Routing policies/Connectivity  GPS not available  Client needs ‘N’ distances to select the closest server 8

Motivation Source: planet-lab.org Network Latency (time) Network Latency server1 -> 120 ms server2 -> 130 ms …… 9

Motivation Network latency = network distance  E.g. ping measurements Still have the issue of ‘N’ distances…  Need ‘N’ measurements (high overhead)  Update list of network distances  How do we solve this problem ? 10

Outline Active Measurements Passive Observation Network Coordinates 11

Network Distance Round-trip propagation and transmission delay Reflects Internet topology and routing A good first order performance optimization metric  Helps achieve low communication delay  A reasonable indicator of TCP throughput  Can weed out most bad choices But the O(N 2 ) network distances are also hard to determine efficiently in Internet-scale systems 12

Active Measurements Network distance can be measured with ping-pong messages But active measurement does not scale 13

Scaling Alternatives 14

State of the Art: IDMaps [Francis et al ‘99] A network distance prediction service Tracer HOPS Server A B 50msA/B 15

Assumptions Probe nodes approximate direct path  May require large number  Careful placement may help Requires that distance between end-points is approximated by sum  Triangle inequality must hold (i.e., (a,c) > (a,b) + (b,c) 16

Triangle Inequality in the Internet 17

A More Detailed Internet Map How do we …  build a structured atlas of the Internet?  predict routing between arbitrary end-hosts?  measure properties of links in the core?  measure links at the edge? 18

Build a Structural Atlas of the Internet Use PlanetLab + public traceroute servers  Over 700 geographically distributed vantage points Build an atlas of Internet routes  Perform traceroutes to a random sample of BGP prefixes  Cluster interfaces into PoPs  Repeat daily from vantage points 19

Model for Path Prediction S D V2 (Rio) V1 (Seattle) (Portland) (Paris) V3 (Chicago) I Identify candidate paths by intersecting observed routes Choose candidate path that models Internet routing Actual path unknown V4 (Atlanta) I2I2 20

Example of Path Prediction Actual path: RTT 298ms Predicted path: RTT 310ms 21

Predicting Path Properties To estimate end-to-end path properties between arbitrary S and D  Use measured atlas to predict route  Combine properties of  Links in the core along predicted route  Access links at either end LatencySum of link latencies Loss-rateProduct of link loss-rates BandwidthMinimum of link bandwidths 22

Outline Active Measurements Passive Observation Network Coordinates 23

SPAND Design Choices Measurements are shared  Hosts share performance information by placing it in a per-domain repository Measurements are passive  Application-to-application traffic is used to measure network performance Measurements are application-specific  When possible, measure application response time, not bandwidth, latency, hop count, etc. 24

SPAND Architecture Data Perf. Reports Perf Query/ Response Client Packet Capture Host Client Performance Server Internet 25

SPAND Assumptions Geographic Stability: Performance observed by nearby clients is similar  works within a domain Amount of Sharing: Multiple clients within domain access same destinations within reasonable time period  strong locality exists Temporal Stability: Recent measurements are indicative of future performance  true for 10’s of minutes 26

Prediction Accuracy Packet capture trace of IBM Watson traffic Compare predictions to actual throughputs Cumulative Probability Ratio of Predicted to Actual Throughput 27

Outline Active Measurements Passive Observation Network Coordinates 28

First Key Insight With millions of hosts, “What are the O(N 2 ) network distances?” may be the wrong question Instead, could we ask: “Where are the hosts in the Internet?”  What does it mean to ask “Where are the hosts in the Internet?” Do we need a complete topology map?  Can we build an extremely simple geometric model of the Internet? 29

New Fundamental Concept: “Internet Position” Using GNP, every host can have an “Internet position”  O(N) positions, as opposed to O(N 2 ) distances Accurate network distance estimates can be rapidly computed from “Internet positions” “Internet position” is a local property that can be determined before applications need it Can be an interface for independent systems to interact 30 y (x 2,y 2,z 2 ) x z (x 1,y 1,z 1 ) (x 3,y 3,z 3 ) (x 4,y 4,z 4 )

Vision: Internet Positioning Service Enable every host to independently determine its Internet position Internet position should be as fundamental as IP address  “Where” as well as “Who” (2,0) (6,0) (1,3) (2,4) (5,4) (7,3) 31

Global Network Positioning (GNP) Coordinates Model the Internet as a geometric space (e.g. 3-D Euclidean) Characterize the position of any end host with geometric coordinates Use geometric distances to predict network distances y (x 2,y 2,z 2 ) x z (x 1,y 1,z 1 ) (x 3,y 3,z 3 ) (x 4,y 4,z 4 ) 32

Landmark Operations (Basic Design) Measure inter-Landmark distances  Use minimum of several round-trip time (RTT) samples Compute coordinates by minimizing the discrepancy between measured distances and geometric distances  Cast as a generic multi-dimensional minimization problem, solved by a central node 33 y x Internet (x 2,y 2 ) (x 1,y 1 ) (x 3,y 3 ) L1L1 L2L2 L3L3 L1L1 L2L2 L3L3

Ordinary Host Operations (Basic Design) Each host measures its distances to all the Landmarks Compute coordinates by minimizing the discrepancy between measured distances and geometric distances  Cast as a generic multi-dimensional minimization problem, solved by each host 34 x Internet (x 4,y 4 ) L1L1 L2L2 L3L3 y (x 2,y 2 ) (x 1,y 1 ) (x 3,y 3 ) L2L2 L1L1 L3L3

Overall Accuracy

Why the Difference? IDMaps overpredicts IDMaps GNP (1-dimensional model) 36

Alternate Motivation Select nodes based on a set of system properties Real-world problems  Locate closest game server  Distribute web-crawling to nearby hosts  Perform efficient application level multicast  Satisfy a Service Level Agreement  Provide inter-node latency bounds for clusters 37

Underlying Abstract Problems I. Finding closest node to target II. Finding the closest node to the center of a set of targets III. Finding a node that is <r i ms from target t i for all targets 38

Meridian Approach Solve node selection directly without computing coordinates  Combine query routing with active measurements 3 Design Goals  Accurate: Find satisfying nodes with high probability  General: Users can express their network location requirements  Scalable: O(log N) state per node Design Tradeoffs  Active measurements incur higher query latencies  Overhead more dependent on query load 39

Multi-resolution Rings Organize peers into small fixed number of concentric rings Radii of rings grow outwards exponentially Logarithmic number of peers per ring Retains a sufficient number of pointers to remote regions 40

Multi-resolution Ring structure For the i th ring: Inner Radius r i = s i-1 Outer Radius R i = s i  is a constant s is multiplicative increase factor r 0 = 0, R 0 =  Each node keeps track of finite rings 41

Ring Membership Management Number of nodes per ring represents tradeoff between accuracy and overhead Geographical diversity maintained within each ring Ring membership management run in background 42

Gossip Based Node Discovery Aimed to assist each node to maintain a few pointers to a diverse set of nodes Protocol 1. Each node A randomly picks a node B from each of its rings and sends a gossip packet to B containing a randomly chosen node from each of its rings 2. On receiving the packet, node B determines through direct probes its latency to A and to each of the nodes contained in the gossip packet from A 3. After sending a gossip packet to a node in each of its rings, node A waits until the start of its next gossip period and then begins again from step 1 43

Closest Node Discovery Client sends closest node discovery request for target T to Meridian node A Node A determines latency to T, say d Node A probes its ring members within distance (1-β). d to (1+β). d, where β is the acceptance threshold between 0 and 1 The request is then forwarded to closest node discovered that is closer than β times the distance d to T Process continues until no node that is β times closer can be found 44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

Revisit: Why is Automated Adaptation Hard? Must infer Internet performance  Scalability  Accuracy  Tradeoff with timeliness Support for a variety of applications  Different performance metrics  API requirements Layered implementations hide information 64

65

Locality-aware P2P: P2P’s Attempt to Improve Network Efficiency P2P has flexibility in shaping communication patterns Locality-aware P2P tries to use this flexibility to improve network efficiency  E.g., Karagiannis et al. 2005, Bindal et al. 2006, Choffnes et al (Ono)

Problems of Locality-aware P2P Locality-aware P2P needs to reverse engineer network topology, traffic load and network policy Locality-aware P2P may not achieve network efficiency Choose congested links Traverse costly interdomain links ISP 0 ISP K ISP 1 ISP 2

68

Can Miss Intersections Helps in reuse of measurements without loss of accuracy Fewer links to be measured S D V1V1 I Cluster interfaces that have similar routing performance V3V3 69

Cluster Interfaces into PoPs Interfaces on the same router use the same routing table Routers at the same location within an AS will have similar routing tables  Discover locations based on DNS names  Invalidate inferred locations if incorrect  Discover co-located interfaces  Nearby interfaces have similar reverse paths back to each vantage point 70

Does Path Prediction work? Used atlas measured from PlanetLab to predict paths from public traceroute servers 68% of path predictions are perfect 1  |Intersection of ASes | |Union of ASes | 71

Challenges in building iPlane How do we …  build a structured atlas of the Internet?  predict routing between arbitrary end-hosts?  measure properties of links in the core?  measure links at the edge? 72

Measuring Links in the Core Only need to measure inter-cluster links Objectives  Probe each link mostly once  Distribute probing load evenly across vantage points  Probe each link from closest vantage point Frontier Search algorithm selects paths that cover all links  Parallelized BFS across PlanetLab nodes To span atlas measured from 200 PlanetLab sites  Each node has to measure around 700 links 73

Challenges in building iPlane How do we …  build a structured atlas of the Internet?  predict routing between arbitrary end-hosts?  measure properties of links in the core?  measure links at the edge? 74

Measuring the Edge Participate in BitTorrent swarms  Popular application: wide coverage of end-hosts Passively monitor TCP connections to measure access link properties  Will not raise alarms 75

Reusability of Measurements Measurements to multiple addresses in the same /24 within 20% of each other in 66% of cases Reuse bandwidth measurements within a /24 prefix 76