SQUARE Scalable Quorum-based Atomic Memory with Local Reconfiguration Vincent Gramoli, Emmanuelle Anceaume, Antonino Virgillito.

Slides:



Advertisements
Similar presentations
CAN 1.Distributed Hash Tables a)DHT recap b)Uses c)Example – CAN.
Advertisements

Scalable Content-Addressable Network Lintao Liu
CS 5204 – Operating Systems1 Paxos Student Presentation by Jeremy Trimble.
1Department of Electrical Engineering and Computer Science, University of Michigan, USA. 2Department of Computer Science, National University of Singapore,
Distributed Shared Memory, Related Issues, and New Challenges in Large-Scale Dynamic Systems Vincent Gramoli 1.
1 Data-Centric Storage in Sensornets with GHT, A Geographic Hash Table Sylvia Ratnasamy, Scott Shenker, Brad Karp, Ramesh Govindan, Deborah Estrin, Li.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
Coding for Atomic Shared Memory Emulation Viveck R. Cadambe (MIT) Joint with Prof. Nancy Lynch (MIT), Prof. Muriel Médard (MIT) and Dr. Peter Musial (EMC)
Small-world Overlay P2P Network
Augustin Chaintreau Pierre FraigniaudEmmanuelle Lebhar ThomsonCNRSCNRS ParisUniversite Paris DiderotUniversite Paris Diderot Paper Discussed by: Ranjeet.
Applications over P2P Structured Overlays Antonino Virgillito.
OPODIS 05 Reconfigurable Distributed Storage for Dynamic Networks Gregory Chockler, Seth Gilbert, Vincent Gramoli, Peter M Musial, Alexander A Shvartsman.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker A Scalable, Content- Addressable Network (CAN) ACIRI U.C.Berkeley Tahoe Networks.
P2P Architecture for Self-* Atomic Memory Emmanuelle Anceaume Maria Gradinariu Vincent Gramoli Antonino Virgillito.
A Scalable Content-Addressable Network Authors: S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker University of California, Berkeley Presenter:
Scalable Adaptive Data Dissemination Under Heterogeneous Environment Yan Chen, John Kubiatowicz and Ben Zhao UC Berkeley.
1 A Scalable Content- Addressable Network S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Proceedings of ACM SIGCOMM ’01 Sections: 3.5 & 3.7.
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Timed Quorum Systems … for large-scale and dynamic environments Vincent Gramoli, Michel Raynal.
Two Techniques For Improving Distributed Database Performance ICS 214B Presentation Ambarish Dey Vasanth Venkatachalam March 18, 2004.
November 22, 2007 Vincent Gramoli1/61 Distributed Shared Memory for Large-Scale Dynamic Systems Vincent Gramoli supervised by Michel Raynal.
Mario Čagalj supervised by prof. Jean-Pierre Hubaux (EPFL-DSC-ICA) and prof. Christian Enz (EPFL-DE-LEG, CSEM) Wireless Sensor Networks:
“Umbrella”: A novel fixed-size DHT protocol A.D. Sotiriou.
CS401 presentation1 Effective Replica Allocation in Ad Hoc Networks for Improving Data Accessibility Takahiro Hara Presented by Mingsheng Peng (Proc. IEEE.
1 A scalable Content- Addressable Network Sylvia Rathnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker Pirammanayagam Manickavasagam.
A Delaunay Triangulation Architecture Supporting Churn and User Mobility in MMVEs Mohsen Ghaffari, Behnoosh Hariri and Shervin Shirmohammadi Advanced Communications.
Thesis Proposal Data Consistency in DHTs. Background Peer-to-peer systems have become increasingly popular Lots of P2P applications around us –File sharing,
CONTENT ADDRESSABLE NETWORK Sylvia Ratsanamy, Mark Handley Paul Francis, Richard Karp Scott Shenker.
GeoGrid: A scalable Location Service Network Authors: J.Zhang, G.Zhang, L.Liu Georgia Institute of Technology presented by Olga Weiss Com S 587x, Fall.
Distributed Shared Memory: A Survey of Issues and Algorithms B,. Nitzberg and V. Lo University of Oregon.
Ahmad Al-Shishtawy 1,2,Tareq Jamal Khan 1, and Vladimir Vlassov KTH Royal Institute of Technology, Stockholm, Sweden {ahmadas, tareqjk,
Structuring P2P networks for efficient searching Rishi Kant and Abderrahim Laabid Abderrahim Laabid.
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Improving the Efficiency of Fault-Tolerant Distributed Shared-Memory Algorithms Eli Sadovnik and Steven Homberg Second Annual MIT PRIMES Conference, May.
Classification and Analysis of Distributed Event Filtering Algorithms Sven Bittner Dr. Annika Hinze University of Waikato New Zealand Presentation at CoopIS.
Ch 10 Shared memory via message passing Problems –Explicit user action needed –Address spaces are distinct –Small Granularity of Transfer Distributed Shared.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
Rendezvous Regions: A Scalable Architecture for Service Location and Data-Centric Storage in Large-Scale Wireless Sensor Networks Karim Seada, Ahmed Helmy.
A Membership Management Protocol for Mobile P2P Networks Mohamed Karim SBAI, Emna SALHI, Chadi BARAKAT.
P2P Group Meeting (ICS/FORTH) Monday, 28 March, 2005 A Scalable Content-Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp,
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
Chapter 4 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University Building Dependable Distributed Systems.
An Optimal Broadcast Algorithm for Content-Addressable Networks Ludovic Henrio Fabrice Huet Justine Rochas 1 18/12/ OPODIS (Nice)
BATON A Balanced Tree Structure for Peer-to-Peer Networks H. V. Jagadish, Beng Chin Ooi, Quang Hieu Vu.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
Topologically-Aware Overlay Construction and Sever Selection Sylvia Ratnasamy, Mark Handley, Richard Karp, Scott Shenker.
EEC 688/788 Secure and Dependable Computing Lecture 9 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Antidio Viguria Ann Krueger A Nonblocking Quorum Consensus Protocol for Replicated Data Divyakant Agrawal and Arthur J. Bernstein Paper Presentation: Dependable.
Large Scale Sharing Marco F. Duarte COMP 520: Distributed Systems September 19, 2004.
TRUST Self-Organizing Systems Emin G ü n Sirer, Cornell University.
Distribution and components
Plethora: Infrastructure and System Design
Providing Secure Storage on the Internet
Effective Replica Allocation
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
Outline Announcements Lab2 Distributed File Systems 1/17/2019 COP5611.
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
A Scalable Content Addressable Network
EEC 688/788 Secure and Dependable Computing
Fault-Tolerant SemiFast Implementations of Atomic Read/Write Registers
Presentation transcript:

SQUARE Scalable Quorum-based Atomic Memory with Local Reconfiguration Vincent Gramoli, Emmanuelle Anceaume, Antonino Virgillito

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Context and Motivations Distributed systems become  Large-scale  Dynamic  Unpredictable Challenges in Distributed Shared Memory  Atomic Consistency  Load Support

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Distributed Shared Memory (DSM) Atomic Consistency  Object Composition: we focus on a single object  Read operations return the last value written Replicated Object  Replica is a node maintaining the value of the object  Memory is the set of replicas Read/Write Operations [ABD95]  Any client can read and modify (write) the object  To do so, it contacts quorums (a.k.a. mutually intersecting sets) of replicas

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Existing DSM solutions Lack of Independence Non-terminating operation may block others undefinitely Lack of Scalability Lack of Adaptiveness Memory Underloaded Memory (unused resource) Overloaded Memory (bursts of load) Memory Clients keep track of all memory replicas (replacing a replica is complex)

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito System Model Object replicated on failure-prone nodes  The replicas r 1, …, r k share a 2-dim coordinate space r1r1 r2r2 r3r3 r4r4 r5r5 r6r6 r7r7 r8r8 … … r k-1 rkrk

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito System Model Unreliable communication through neighborhood  Each replica r i can communicate only with its nearest neighbors riri

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito System Model Topology takeover mechanism (CAN [RFH+01] )  Upon node failure/departure the space sharing is modified accordingly If a node r i fails, a takeover node r j replaces it riri rjrj

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito System Model Topology takeover mechanism (CAN [RFH+01] )  Upon node failure/departure the space sharing is modified accordingly If a node r i fails, a takeover node r j replaces it rjrj

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Introducing our Dynamic Quorums Dual-type Dynamic Quorums  Vertical Quorum: All replicas responsible of an abscissa x  Horizontal Quorum: All replicas responsible of an ordinate y Intersection for Atomicity requirement  Values are propagated (consulted) at a vertical (horizontal) quorum  Thus, all consultations obtain the lastly propagated value x y For any horizontal quorum H and any vertical quorum V: H  V ≠ Ø

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito SQUARE features Atomicity and Independence  Atomic operations are independent from each other Local Knowledge  Reactive Quorum Access: wrapping around the torus Fast Adaptive Read Operations  Single phase operations: accessing a single horizontal quorum is sufficient Memory Adaptiveness  If overloaded (global approximation), then expand  If underloaded (local observation), then shrink

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Operation Execution Basic Read Operation: 1)Get up-to-date value, 2) Propagate this value on a vertical Quorum. Basic Write Operation: 1)Get up-to-date value, 2) Propagate the value to write (and a higher version number) twice on the same vertical quorum Fast Adaptive Read Op: 1)Get up-to-date value once on a single horizontal quorum.

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Adjustment of the overlay size SQUARE thwarts if the requested replica is overloaded: Other replicas on its diagonal are contacted in turn until a non- overloaded one is found SQUARE expands if all contacted replicas are overloaded: A node outside the memory is added, and the object value is replicated at this node. SQUARE shrinks if a replica gets underloaded: The replica simply leaves the memory after neighbors notification.

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Simulation Results Self-Adaptiveness

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Simulation Results Load-Balancing

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Conclusion Atomic Consistency is guaranteed  Using dynamic quorum intersection,  Each failed/leaving participant is replaced to ensure quorum availability. Adaptiveness makes the algorithm tunable  Minimizing operation latency as much as possible,  Maximizing capability to support bursts of load. Perspective on operation speed up  Kleinberg’s model to route in polylog(q) hops

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Some References [CGG+05] Reconfigurable distributed storage for dynamic networks. G. Chockler, S. Gilbert, V. Gramoli, P. M. Musial, and A. A. Shvartsman. In Proc. of 9th Int’l Conf. on Principles of Distributed Systems (OPODIS’05), [AGGV05] P2P Architecture for Self-*Atomic Memory E. Anceaume, M. Gradinariu, V. Gramoli, A. Virgillito In Proc of the 8th Intl Symposium on Parallel Architectures, Algorithms,and Networks (I- SPAN’05) 214–219, [RFH+01] A Scalable Content Adressable Network S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker In Proc. of the ACM SIGCOMM, 161–172, [ABD95] Sharing Memory Robustly in Message Passing Systems H. Attiya, A. Bar-Noy, D. Dolev In Journal of the ACM, 42(1):124–142, 1995.

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Simulation Results Operation Latency Request rate Read latency Write latency Max. memory size Max. hor quorum size Max. vert. quorum size 1/ / / / /

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Simulation Results Fault-tolerance

ACM SAC’07 March 14th V. Gramoli, E. Anceaume, A. Virgillito Simulation Results Scalability