Sequoia: Supporting Latency-Aware Applications through Prediction Trees Dahlia Malkhi, MSR and Hebrew U Joint work with Ittai Abraham, Mahesh Balakrishnan,

Slides:



Advertisements
Similar presentations
Hidden Metric Spaces and Navigability of Complex Networks
Advertisements

Topology-Aware Overlay Construction and Server Selection Sylvia Ratnasamy Mark Handley Richard Karp Scott Shenker Infocom 2002.
Sequoia: Virtual-Tree Models for Internet Path Metrics Rama Microsoft Research Also:Ittai Abraham (Hebrew Univ.) Mahesh Balakrishnan (Cornell) Archit Gupta.
Intel Research Internet Coordinate Systems - 03/03/2004 Internet Coordinate Systems Marcelo Pias Intel Research Cambridge
Evaluation of a Scalable P2P Lookup Protocol for Internet Applications
Kademlia: A Peer-to-peer Information System Based on the XOR Metric.
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Topologically-Aware Overlay Construction and Server Selection Sylvia Ratnasamy, Mark Handly, Richard Karp and Scott Shenker Presented by Shreeram Sahasrabudhe.
1 K-clustering in Wireless Ad Hoc Networks Fernandess and Malkhi Hebrew University of Jerusalem Presented by: Ashish Deopura.
1 K-clustering in Wireless Ad Hoc Networks using local search Rachel Ben-Eliyahu-Zohary JCE and BGU Joint work with Ran Giladi (BGU) and Stuart Sheiber.
Fabián E. Bustamante, 2007 Meridian: A lightweight network location service without virtual coordinates B. Wong, A. Slivkins and E. Gün Sirer SIGCOM 2005.
EL9331 Meridian: A Lightweight Network Location Service without Virtual Coordinates Bernard Wong, Aleksandrs Slivkins, Emin Gun Sirer SIGCOMM’05 ( Slides.
SplitStream: High- Bandwidth Multicast in Cooperative Environments Monica Tudora.
Structuring Unstructured Peer-to-Peer Networks Stefan Schmid Roger Wattenhofer Distributed Computing Group HiPC 2007 Goa, India.
LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Routing, Anycast, and Multicast for Mesh and Sensor Networks Roland Flury Roger Wattenhofer RAM Distributed Computing Group.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice S 3 : A Scalable Sensing Service.
IPlane: An Information Plane for Distributed Services Offence by: Anup Goyal Sagar Vemuri.
Vivaldi Coordinate Service Justin Ma, Patrick Verkaik, Michael Vrable Department of Computer Science And Engineering UCSD CSE222A, Winter 2005.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker A Scalable, Content- Addressable Network (CAN) ACIRI U.C.Berkeley Tahoe Networks.
An Algebraic Approach to Practical and Scalable Overlay Network Monitoring Yan Chen, David Bindel, Hanhee Song, Randy H. Katz Presented by Mahesh Balakrishnan.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Informed Detour Selection Helps Reliability Boulat A. Bash.
Exploring Tradeoffs in Failure Detection in P2P Networks Shelley Zhuang, Ion Stoica, Randy Katz HIIT Short Course August 18-20, 2003.
Efficient Hop ID based Routing for Sparse Ad Hoc Networks Yao Zhao 1, Bo Li 2, Qian Zhang 2, Yan Chen 1, Wenwu Zhu 3 1 Lab for Internet & Security Technology,
Distributed Quad-Tree for Spatial Querying in Wireless Sensor Networks (WSNs) Murat Demirbas, Xuming Lu Dept of Computer Science and Engineering, University.
Exploring Tradeoffs in Failure Detection in P2P Networks Shelley Zhuang, Ion Stoica, Randy Katz Sahara Retreat January, 2003.
Robust and Efficient Path Diversity in Application-Layer Multicast for Video Streaming Ruixiong Tian, Qian Zhang, Senior Member, IEEE, Zhe Xiang, Yongqiang.
1 PLuSH – Mesh Tree Fast and Robust Wide-Area Remote Execution Mikhail Afanasyev ‧ Jose Garcia ‧ Brian Lum.
Predicting Communication Latency in the Internet Dragan Milic Universität Bern.
Dept. of Computer Science Distributed Computing Group Asymptotically Optimal Mobile Ad-Hoc Routing Fabian Kuhn Roger Wattenhofer Aaron Zollinger.
1 Sonia Fahmy and Minseok Kwon Department of Computer Sciences Purdue University For slides, technical reports, and implementations, please see:
1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.
Locality Aware Network Solutions Dahlia Malkhi The Hebrew University of Jerusalem.
1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.
Mario Čagalj supervised by prof. Jean-Pierre Hubaux (EPFL-DSC-ICA) and prof. Christian Enz (EPFL-DE-LEG, CSEM) Wireless Sensor Networks:
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
P2P Architecture Case Study: Gnutella Network
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
Path Stitching: Internet-Wide Path and Delay Estimation from Existing Measurements DK Lee, Keon Jang, Changhyun Lee, Sue Moon, Gianluca Iannaccone* ASIAFI.
Thesis Proposal Data Consistency in DHTs. Background Peer-to-peer systems have become increasingly popular Lots of P2P applications around us –File sharing,
Phoenix: A Weight-Based Network Coordinate System Using Matrix Factorization Yang Chen Department of Computer Science Duke University
Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks IPTPS Feb. 25, 2005 Byung-Gon Chun, Ben Y. Zhao, and John Kubiatowicz.
Rate-based Data Propagation in Sensor Networks Gurdip Singh and Sandeep Pujar Computing and Information Sciences Sanjoy Das Electrical and Computer Engineering.
1 Reading Report 5 Yin Chen 2 Mar 2004 Reference: Chord: A Scalable Peer-To-Peer Lookup Service for Internet Applications, Ion Stoica, Robert Morris, david.
IEEE Globecom 2010 Tan Le Yong Liu Department of Electrical and Computer Engineering Polytechnic Institute of NYU Opportunistic Overlay Multicast in Wireless.
Paper Group: 20 Overlay Networks 2 nd March, 2004 Above papers are original works of respective authors, referenced here for academic purposes only Chetan.
Peer Pressure: Distributed Recovery in Gnutella Pedram Keyani Brian Larson Muthukumar Senthil Computer Science Department Stanford University.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Locality Aware Network Solutions Dahlia Malkhi The Hebrew University of Jerusalem.
L-24 Adaptive Applications 1. State of the Art – Manual Adaptation Objective: automating adaptation ? CaliforniaNew York 2.
A Membership Management Protocol for Mobile P2P Networks Mohamed Karim SBAI, Emna SALHI, Chadi BARAKAT.
University “Ss. Cyril and Methodus” SKOPJE Cluster-based MDS Algorithm for Nodes Localization in Wireless Sensor Networks Ass. Biljana Stojkoska.
LightFlood: An Efficient Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
Network Coordinates : Internet Distance Estimation Jieming ZHU
On the Impact of Clustering on Measurement Reduction May 14 th, D. Saucez, B. Donnet, O. Bonaventure Thanks to P. François.
Sharp Hybrid Adaptive Routing Protocol for Mobile Ad Hoc Networks
Topologically-Aware Overlay Construction and Sever Selection Sylvia Ratnasamy, Mark Handley, Richard Karp, Scott Shenker.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
Security Kim Soo Jin. 2 Contents Background Introduction Secure multicast using clustering Spatial Clustering Simulation Experiment Conclusions.
Gang Wang, Shining Wu, Guodong Wang, Beixing Deng, Xing Li Tsinghua University Tsinghua Univ. Oct Experimental Study on Neighbor Selection Policy.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Oct 23, 2005FOCS Metric Embeddings with Relaxed Guarantees Alex Slivkins Cornell University Joint work with Ittai Abraham, Yair Bartal, Hubert Chan,
Vivaldi: A Decentralized Network Coordinate System
Multi-phase process mining
A Scalable Content Addressable Network
Routing in Networks with Low Doubling Dimension
Presentation transcript:

Sequoia: Supporting Latency-Aware Applications through Prediction Trees Dahlia Malkhi, MSR and Hebrew U Joint work with Ittai Abraham, Mahesh Balakrishnan, Fabian Kuhn, Rama Ramasubramanian, Nir Sonenschein, and Kunal Talwar

Introduction  Latency-awareness is critical for internet applications CDNs, P2P file sharing, network monitoring, etc.  Latency-enabled functionality: Closest node discovery Locality-based clustering Detour routing Spanning trees

Current State of the Art  Application-specific approaches Closest node [Meridian, Oasis, …] Detour routing [OneHop Source Routing, Detour, Akella et al…] Clustering [SDIMS, …]  General-purpose frameworks Measurement based inference [iPlane]  Requires intrusive, expensive measurements Coordinate-based latency prediction [Vivaldi, PIC, GNP, ICS, Virtual Landmarks, PCoord, NPS, Lighthouse, IDMaps]  Needs substantial work to support applications

Goals Internet Topology is not directly known to end-hosts Inter-node Ping latencies are available Model Clustering, Closest Node Discovery… End-Host Pings

Big Insight “It's not a big truck. It's a series of tubes.” Ted Stevens, Senator from Alaska

Prediction Trees Tree of Virtual Routing Nodes Distance between nodes is path length on the tree Interior: Virtual Routing Nodes Leaves: Physical End-hosts

Metric Embedding into Trees  Generally hard  Ultra-metric: MST yields HST representing distances precisely  Tree: MST yields the right tree  Tree-metric: Buneman’s Steiner tree yields the right tree

Is the Internet a Tree?  The Four-Point Condition: Given 4 points A,B,C,D: If AC+BD ≥ AD+BC ≥ AB+CD, AC+BC = AD+BC ≥ AB+CD AC+BD = AD+BC ≥ AB+CD

Is the Internet a Tree? We can model it as one! Relaxed Four Point Condition: AC+BD=[AD+BC]+2  * min{AB,CD} Internet Latencies are very close to a tree metric Random power-law graph latencies are also very close to a tree metric

New Challenge: Embed Relaxed Tree Metrics in Trees  Teriffic experimental evidence  Distance prediction, clustering, finding closest nodes, spanning-trees  (1+O(ε log(n)) ) / (1 – O(ε log(n)) ) Steiner approximation  (1+O(ε log(n)) ) stretch lower bound  (1 + O(ε)) Steiner approximation for metrics generated from relaxed tree metric graphs  (1 + O(ε)) approximation by log(n) Steiner trees

Some Open Directions  Close lower/upper gap  Embed random graphs with power-law degrees in trees Relaxed tree-metric condition of such graphs  Embed into distribution on trees  Embed into fixed-size collection of trees Lower bound  Instance-specific Steiner-tree approximation

Thanks! Sponsored link: LOCALITY 2007, Satellite workshop at PODC 2007, August, Portland Oregon

(Re)constructing the Tree ABC A B C Cx = (AC+BC-AB)/2 Ax = (AB+AC-BC)/2 Bx = (AB+BC-AC)/2

Growing the Tree…

Towards a Distributed System  Virtual nodes are emulated by ‘surrogate’ physical nodes  Distributed Tree-Building Protocol  Discrete Event Simulator: executed on ping data from PlanetLab, King Datasets

Latency Prediction: Mechanism  Distance Labels Path to Root planet0.jaist.ac.jp label = ( , , ) planetlab1.cs.ubc.ca label= ( , , ) Path between them = ( , , ) Distance = = ms

Latency Prediction: Performance I PlanetLab, 117 nodes, 1 month

Latency Prediction: Performance II King Dataset (Harvard), 1835 nodes

Latency Prediction: Multiple Trees  Each node joins x randomly selected trees out of t total trees  Existing theoretical work on modeling a graph metric with a distribution of dominating trees…  To predict latencies between 2 nodes, select all trees both nodes belong to, and pick median T1T2T3 dist(B,C) = median(dist T1 (B,C), dist T2 (B,C))

Latency Prediction: Multiple Trees

Closest Node Discovery: Mechanism A xz y BCD Problem: Can’t ping inner virtual nodes, only physical leaf nodes Solution: Each virtual node maintains ping-able representatives for each virtual neighbor T Traverse the tree, always picking the neighbor that’s closer to the target More Overhead  More Accuracy Number of representatives per neighbor BD CB Number of parallel queries

Closest Node Discovery: Performance I PlanetLab, 117 nodes, 1 month ping data

Closest Node Discovery: Performance II King Dataset (Meridian), 2500 nodes Meridian performance on same data: ~1 ms Vivaldi, GNP: ranging from 8 ms to 18 ms (from Wong et al, Sigcomm 05) q = queries, r = reps

Clustering PlanetLab Europe

Clustering PlanetLab Poland, Germany, Scandinavia

Clustering PlanetLab Poland

Work in Progress  Explore better tree-building algorithms  Model other properties using these trees Loss rate, bandwidth  Build a robust distributed system: Failure Handling, Tree Balancing

Conclusions  Prediction Trees are a promising way of modeling internet latencies Simple yet powerful abstraction  Latency estimation comparable with coordinate schemes  Closest Node Discovery comparable to Meridian  Good locality-based Clustering

THANK YOU!

King Dataset (harvard)  1835 Nodes

Four Point Condition  Relaxed 4PC: d(AC)+d(BC) = [d(AD)+d(BC)] +  * [d(AB)+d(CD)]

Prediction Trees Tree of Virtual Routing Nodes End-hosts Interior nodes are virtual ‘steiner’ nodes Leaf nodes are physical end-hosts Estimated Distance is Path Length on the Tree

Clustering PlanetLab All European Nodes

Clustering PlanetLab German and Scandinavian nodes Hostname ends with “de” OR “fi” OR “no” OR “se”