Symphony: Distributed Hashing in a Small World Gurmeet Singh Manku Mayank Bawa Prabhakar Raghavan Presented by Satpreet Singh.

Slides:



Advertisements
Similar presentations
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Advertisements

Scalable Content-Addressable Network Lintao Liu
CHORD – peer to peer lookup protocol Shankar Karthik Vaithianathan & Aravind Sivaraman University of Central Florida.
The Chord P2P Network Some slides have been borowed from the original presentation by the authors.
Chord: A scalable peer-to- peer lookup service for Internet applications Ion Stoica, Robert Morris, David Karger, M. Frans Kaashock, Hari Balakrishnan.
1 1 Chord: A scalable Peer-to-peer Lookup Service for Internet Applications Dariotaki Roula
P2P Network Structured Networks: Distributed Hash Tables Pedro García López Universitat Rovira I Virgili
Xiaowei Yang CompSci 356: Computer Network Architectures Lecture 22: Overlay Networks Xiaowei Yang
Distributed Hash Tables CPE 401 / 601 Computer Network Systems Modified from Ashwin Bharambe and Robert Morris.
Robert Morris, M. Frans Kaashoek, David Karger, Hari Balakrishnan, Ion Stoica, David Liben-Nowell, Frank Dabek Chord: A scalable peer-to-peer look-up.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Small-world Overlay P2P Network
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
1 Canon in G Major: Designing DHTs with Hierarchical Structure Prasanna Ganesan (Stanford University) Krishna Gummadi (U. of Washington) Hector Garcia-Molina.
Peer to Peer File Sharing Huseyin Ozgur TAN. What is Peer-to-Peer?  Every node is designed to(but may not by user choice) provide some service that helps.
Positive Feedback Loops in DHTs or Be Careful How You Simulate January 13, 2004 Sean Rhea, Dennis Geels, Timothy Roscoe, and John Kubiatowicz From “Handling.
Distributed Lookup Systems
University of Oregon Slides from Gotz and Wehrle + Chord paper
SCALLOP A Scalable and Load-Balanced Peer- to-Peer Lookup Protocol for High- Performance Distributed System Jerry Chou, Tai-Yi Huang & Kuang-Li Huang Embedded.
1 Canon in G Major: Designing DHTs with Hierarchical Structure Prasanna Ganesan (Stanford University) Krishna Gummadi (U. of Washington) Hector Garcia-Molina.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Viceroy: A scalable and dynamic emulation of the Butterfly Presented in CS294-4 by Sailesh Krishnamurthy Sep 22, 2003.
Mercury: Supporting Scalable Multi-Attribute Range Queries A. Bharambe, M. Agrawal, S. Seshan In Proceedings of the SIGCOMM’04, USA Παρουσίαση: Τζιοβάρα.
The Small World Phenomenon: An Algorithmic Perspective by Anton Karatoun.
Wide-area cooperative storage with CFS
Beacon Vector Routing: Scalable Point-to-Point Routing in Wireless Sensornets.
File Sharing : Hash/Lookup Yossi Shasho (HW in last slide) Based on Chord: A Scalable Peer-to-peer Lookup Service for Internet ApplicationsChord: A Scalable.
“Umbrella”: A novel fixed-size DHT protocol A.D. Sotiriou.
Ji-Yong Shin * Bernard Wong +, and Emin Gün Sirer * * Cornell University + University of Waterloo 2 nd ACM Symposium on Cloud ComputingOct 27, 2011 Small-World.
1 Koorde: A Simple Degree Optimal DHT Frans Kaashoek, David Karger MIT Brought to you by the IRIS project.
Distributed Hash Table Systems Hui Zhang University of Southern California.
Other Structured P2P Systems CAN, BATON Lecture 4 1.
Designing a DHT for low latency and high throughput Robert Vollmann P2P Information Systems.
09/07/2004Peer-to-Peer Systems in Mobile Ad-hoc Networks 1 Lookup Service for Peer-to-Peer Systems in Mobile Ad-hoc Networks M. Tech Project Presentation.
Chord & CFS Presenter: Gang ZhouNov. 11th, University of Virginia.
1 Reading Report 5 Yin Chen 2 Mar 2004 Reference: Chord: A Scalable Peer-To-Peer Lookup Service for Internet Applications, Ion Stoica, Robert Morris, david.
Searching via Your Neighbor’s Neighbor: The Power of Lookahead in P2P Networks Moni Naor Udi Wieder The Weizmann Institute of Science Gurmeet Manku Stanford.
CCAN: Cache-based CAN Using the Small World Model Shanghai Jiaotong University Internet Computing R&D Center.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
On Heterogeneous Overlay Construction and Random Node Selection in Unstructured P2P Networks Presenter: 游創文.
Chord Advanced issues. Analysis Theorem. Search takes O (log N) time (Note that in general, 2 m may be much larger than N) Proof. After log N forwarding.
Lecture 12 Distributed Hash Tables CPE 401/601 Computer Network Systems slides are modified from Jennifer Rexford.
LightFlood: An Efficient Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Chord Advanced issues. Analysis Search takes O(log(N)) time –Proof 1 (intuition): At each step, distance between query and peer hosting the object reduces.
Minimizing Churn in Distributed Systems P. Brighten Godfrey, Scott Shenker, and Ion Stoica UC Berkeley SIGCOMM’06.
1 Presented by Jing Sun Computer Science and Engineering Department University of Conneticut.
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
CS 347Notes081 CS 347: Parallel and Distributed Data Management Notes 08: P2P Systems.
Improving Fault Tolerance in AODV Matthew J. Miller Jungmin So.
1 Roie Melamed, Technion AT&T Labs Araneola: A Scalable Reliable Multicast System for Dynamic Wide Area Environments Roie Melamed, Idit Keidar Technion.
CS694 - DHT1 Distributed Hash Table Systems Hui Zhang University of Southern California.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,
The Chord P2P Network Some slides taken from the original presentation by the authors.
The Chord P2P Network Some slides have been borrowed from the original presentation by the authors.
Distributed Hash Tables
A Scalable Peer-to-peer Lookup Service for Internet Applications
(slides by Nick Feamster)
SKIP GRAPHS James Aspnes Gauri Shah SODA 2003.
SCOPE: Scalable Consistency in Structured P2P Systems
A Probabilistic Routing Protocol for Mobile Ad Hoc Networks
A Scalable Content Addressable Network
MIT LCS Proceedings of the 2001 ACM SIGCOMM Conference
Exploiting Routing Redundancy via Structured Peer-to-Peer Overlays
Presentation transcript:

Symphony: Distributed Hashing in a Small World Gurmeet Singh Manku Mayank Bawa Prabhakar Raghavan Presented by Satpreet Singh

Motivation GOAL: To maintain a large DHT over a WAN DESIRED CHARACTERISTICS: Scalability (work for a range of network sizes) Stability (handle churn) Performance (provide low-latency lookups & low maintenance costs with churn) Flexibility (provide design knobs, preferably run-time) Simplicity (easy to understand, code, deploy…) SOLUTION: Symphony (fuses ideas from Chord & Klienberg’s Small World greedy routing algorithm/result)

Features/Advantages of Symphony 1.Low state maintenance (conseq. of low degree) Fewer pings/keep-alives, less (ambient) control traffic Distributed locking and coordination overhead over smaller sets of nodes Smaller bootstrapping time when a node joins Smaller recovery time when a node leaves 2. Smooth out-degree vs latency tradeoff Only protocol that offers this tuning knob even at run time! Out-degree not fixed at runtime, or as function of network size. 3. Flexibility and support for heterogeneity Different nodes can have different number of links 4. Fault tolerance Only short links are bolstered. No backups for long links

Architecture: Overview node long link short link A typical Symphony network Establish keyspace as [0, 1) (wrap around a ring like in Chord) Every node ‘manages’ subrange from own-id. to next-clockwise- node’s-id. (~ equi-sized) Objects hash to m-bit hash-key K, managed by node that manages real number containing K/2 m 2 Short-links: one with each immediate neighbor

Architecture: Overview node long link short link A typical Symphony network k (≥ 1) Long Links (uni-/bi-direct.) - draws a rand. number (x) from a Probability Distribution Function - contacts manager of (x) using a Routing Protocol - Establishes a link (if incoming links at manager ≤ 2k, if not resample PDF) PDF is a type of harmonic distribution (so, Symphony) P n (x) = 1/(x ln [n]) in x Є [1/n, 1] = 0 otherwise PDF estimates [n] using an Estimation Protocol

Network Size Estimation Protocol Goal: to estimate ‘n’ - the current total number of nodes in the DHT So if, - S is any set of s distinct nodes, -X s is the sum of segment-lengths managed by them, Estimated n = s/X s All s nodes update their estimate, Experimentally s = 3 found good enough, So simply use node and it’s two immediate neighbors Fact: Impact of increasing s on avg. latency is insignificant x = Length of arc 1/x = Estimate of n (Idea from Viceroy)

Routing Protocol(s) Klienberg’s Small World result: A message can be routed to any node by greedy routing in O(log 2 n) hops, in a construction where each node has one link to each of it’s 4 directional neighbors and a single long-distance link to a node chosen from a suitable PDF. To lookup hash key x Є [0,1), contact the manager of x : Unidirectional Routing Protocol: Node forwards a lookup for x along (short or long) link that minimizes the clockwise distance to x Bidirectional Routing Protocol: Node forwards a lookup for x along (short or long) link that minimizes the absolute distance to x In both cases, expected path length in an n-node network with k = O(1) links is O(1/k log 2 n) hops. Bidirectional & 1-Lookahead reduce latency by 40% and 30% each

Join/Leave Protocols JOIN: The new node chooses its id x from [0, 1) uniformly at random Using the routing protocol it identifies the current manager of x It then runs the estimation protocol using s = 3 X then uses P n x to establish its long distance links Cost = k links * O(1/k log 2 n) msgs. = O(log 2 n) messages LEAVE: All out- and in- links to x’s long distance neighbors are snapped Other nodes whose outgoing links to x are just broken, reinstate those links with other nodes The immediate neighbors of x establish short-links between themselves Successor of x initiates estimation protocol over s = 3 neighbors Cost = O(log 2 n) messages

Re-linking Protocols etc… RE-LINKING: n x = x’s current estimate of n n x link = x’s estimate when long distance links were last established When n x and n x link differ → stale estimate Re-link only when n x / n x link is not in the range [0.5, 2] Re-linking gains are marginal, cost high: O(log 2 n) messages LOOKAHEAD: Node can maintain list of neighbor’s neighbors Improves choice of neighbor for routing queries No extra messages – piggyback on keep-alives of TCP link Cost = O(k 2 ) space. Number of long-links remains unchanged FAULT TOLERANCE: Deletion of short links more detrimental as leads to node isolation Make f copies of node’s content in f next clockwise nodes

Experimental Data: SETUP: Large DHT: 2 5 to 2 15 nodes simulated in network Four kinds of test networks: Static, Expanding, Expanding-Relink & Dynamic Estimate Protocol Performance: estimate improves for log(n) neighbors, but impact on avg. latency is minimal (later)

Experimental Data: Routing Protocol Performance: Increasing links beyond 2 has marginal benefits. Bidirectional routing is good (30% reduction in latency)

Experimental Data: Lookahead Performance: 1-Lookahead reduces avg. latency by 40% for small value of k. Also, it does not entail an increase in the no. of long-links per node. Neighbor-lists are exchanged periodically piggy-backed on normal routing traffic or keep-alives

Experimental Data: Fault-tolerance motivation: [Left] On deleting a random set of links (short + long), successful lookups drops quite quickly [deletion of short links causes node isolation quickly] [Right] Impact of removing only long links not as severe (and only avg. latency goes up) Thus, only fortify short links. Make f copies of content in clockwise direction.

Comparisons & Conclusions: Conclusions: For large DHTs, ( 2 5 to 2 15 nodes) Symphony outperforms others Avg. TCP links = 10; Latency is about 8 hops; Lower costs of Join/Leave compared to Chord etc. Number of neighbors not fixed at outset; No backup links

Wrapping up… Symphony... is a simple protocol for managing large DHTs supports a dynamic network of hosts with relatively short lifetimes scales well has low lookup latency has low maintenance cost requires few neighbors per node supports heterogenity in nodes (run-time knobs) provides flexibility in design Questions… ?