The Chubby Lock Service for Loosely-coupled Distributed Systems Mike Mosharaf Chowdhury.

Slides:



Advertisements
Similar presentations
Paxos and Zookeeper Roy Campbell.
Advertisements

Thialfi: A Client Notification Service for Internet-Scale Applications
Wait-free coordination for Internet-scale systems
Megastore: Providing Scalable, Highly Available Storage for Interactive Services. Presented by: Hanan Hamdan Supervised by: Dr. Amer Badarneh 1.
W3C Workshop on Web Services Mark Nottingham
HUG – India Meet November 28, 2009 Noida Apache ZooKeeper Aby Abraham.
The Google File System Authors : Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung Presentation by: Vijay Kumar Chalasani 1CS5204 – Operating Systems.
Chubby Lock server for distributed applications 1Dennis Kafura – CS5204 – Operating Systems.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Google Chubby Lock Service Steve Ko Computer Sciences and Engineering University at Buffalo.
Presented By Alon Adler – Based on OSDI ’12 (USENIX Association)
Lecture 6 – Google File System (GFS) CSE 490h – Introduction to Distributed Computing, Winter 2008 Except as otherwise noted, the content of this presentation.
Distributed Systems CS Google Chubby and Message Ordering Recitation 4, Sep 29, 2011 Majd F. Sakr, Vinay Kolar, Mohammad Hammoud.
The Chubby lock service for loosely-coupled distributed systems Presented by Petko Nikolov Cornell University 3/12/09.
Distributed File System: Design Comparisons II Pei Cao Cisco Systems, Inc.
Homework 2 In the docs folder of your Berkeley DB, have a careful look at documentation on how to configure BDB in main memory. In the docs folder of your.
CHUBBY and PAXOS Sergio Bernales 1Dennis Kafura – CS5204 – Operating Systems.
7/2/2015EECS 584, Fall Bigtable: A Distributed Storage System for Structured Data Jing Zhang Reference: Handling Large Datasets at Google: Current.
Distributed storage for structured data
Bigtable: A Distributed Storage System for Structured Data
BigTable CSE 490h, Autumn What is BigTable? z “A BigTable is a sparse, distributed, persistent multidimensional sorted map. The map is indexed by.
Case Study - GFS.
File Systems (2). Readings r Silbershatz et al: 11.8.
Inexpensive Scalable Information Access Many Internet applications need to access data for millions of concurrent users Relational DBMS technology cannot.
Google Distributed System and Hadoop Lakshmi Thyagarajan.
Gowtham Rajappan. HDFS – Hadoop Distributed File System modeled on Google GFS. Hadoop MapReduce – Similar to Google MapReduce Hbase – Similar to Google.
Synchronization Methods for Multicore Programming Brendan Lynch.
The Chubby lock service for loosely-coupled distributed systems Mike Burrows (Google), OSDI 2006 Shimin Chen Big Data Reading Group.
Presented by: Alvaro Llanos E.  Motivation and Overview  Frangipani Architecture overview  Similar DFS  PETAL: Distributed virtual disks ◦ Overview.
Distributed File Systems Concepts & Overview. Goals and Criteria Goal: present to a user a coherent, efficient, and manageable system for long-term data.
WINDOWS AZURE STORAGE 11 de Mayo, 2011 Gisela Torres – Windows Azure MVP Aventia-Renacimiento Twitter:
1 The Google File System Reporter: You-Wei Zhang.
Google Bigtable Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber.
Other Google Technologies Peng Bo School of EECS, Peking University 7/15/2008 Refer to Aaron Kimball’s slides.
Megastore: Providing Scalable, Highly Available Storage for Interactive Services J. Baker, C. Bond, J.C. Corbett, JJ Furman, A. Khorlin, J. Larson, J-M.
Google Bigtable Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber.
Bigtable: A Distributed Storage System for Structured Data 1.
Google Bigtable Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber.
Bigtable: A Distributed Storage System for Structured Data Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows,
CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM S. A. Weil, S. A. Brandt, E. L. Miller D. D. E. Long, C. Maltzahn U. C. Santa Cruz OSDI 2006.
MapReduce and GFS. Introduction r To understand Google’s file system let us look at the sort of processing that needs to be done r We will look at MapReduce.
The Google File System by S. Ghemawat, H. Gobioff, and S-T. Leung CSCI 485 lecture by Shahram Ghandeharizadeh Computer Science Department University of.
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
Paxos A Consensus Algorithm for Fault Tolerant Replication.
Eduardo Gutarra Velez. Outline Distributed Filesystems Motivation Google Filesystem Architecture The Metadata Consistency Model File Mutation.
GFS. Google r Servers are a mix of commodity machines and machines specifically designed for Google m Not necessarily the fastest m Purchases are based.
Lock Services in Distributed File Systems Shaan Mahbubani Anshuman Gupta Ravi Vijay Anup Tapadia UCSD CSE 221 Operating Systems - Winter 07.
CSE 486/586, Spring 2014 CSE 486/586 Distributed Systems Google Chubby Lock Service Steve Ko Computer Sciences and Engineering University at Buffalo.
© Spinnaker Labs, Inc. Google Cluster Computing Faculty Training Workshop Module VII: Other Google Technologies.
CSE 486/586, Spring 2013 CSE 486/586 Distributed Systems Google Chubby Lock Service Steve Ko Computer Sciences and Engineering University at Buffalo.
CSC590 Selected Topics Bigtable: A Distributed Storage System for Structured Data Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A.
© Spinnaker Labs, Inc. Chubby. © Spinnaker Labs, Inc. What is it? A coarse-grained lock service –Other distributed systems can use this to synchronize.
Bigtable : A Distributed Storage System for Structured Data Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach Mike Burrows,
Synchronization in Distributed File Systems Advanced Operating System Zhuoli Lin Professor Zhang.
Zookeeper Wait-Free Coordination for Internet-Scale Systems.
Bigtable: A Distributed Storage System for Structured Data Google Inc. OSDI 2006.
ZOOKEEPER. CONTENTS ZooKeeper Overview ZooKeeper Basics ZooKeeper Architecture Getting Started with ZooKeeper.
Truly Distributed File Systems Paul Timmins CS 535.
Bigtable A Distributed Storage System for Structured Data.
From Coulouris, Dollimore, Kindberg and Blair Distributed Systems: Concepts and Design Chapter 3 System Models.
The Chubby lock service for loosely-coupled distributed systems Mike Burrows, Google Inc. Presented By: Harish Rayapudi, Shiva Prasad Malladi.
Ion Stoica, UC Berkeley November 7, 2016
Data Management with Google File System Pramod Bhatotia wp. mpi-sws
Slide credits: Thomas Kao
CPS 512 midterm exam #1, 10/7/2016 Your name please: ___________________ NetID:___________ /60 /40 /10.
EECS 498 Introduction to Distributed Systems Fall 2017
آزمايشگاه سيستمهای هوشمند علی کمالی زمستان 95
Ali Ghodsi and Ion Stoica, UC Berkeley February 21, 2018
Wait-free coordination for Internet-scale systems
Cloud Computing Storage Systems
Presentation transcript:

The Chubby Lock Service for Loosely-coupled Distributed Systems Mike Mosharaf Chowdhury

Problem (Un)Locking-as-a-service – Leader election – Synchronization –…–… Uses Paxos to solve the distributed consensus problem in an asynchronous environment

Overview Goals/Non-goals Primary – Availability – Reliability – Usability & deployability Secondary – Performance Non-goals – Storage capacity Use Cases Planned usage by – GFS, – BigTable, and – MegaStore Also heavily used as – Internal name service – MapReduce rendezvous point

System Structure Proxy Server R R R R R R R R M Reads are satisfied by the master alone Writes are acknowledged after updating a majority

How to Use it? UNIX File system like interface – API modeled in a similar way Read/write locks on each file/directory – Advisory locks – Coarse-grained Event notification – After corresponding action

Cogs KeepAlives – Piggybacks event notifications, cache invalidations etc. Leases/TimeOuts – Master and client-side local leases – Failover handling

Developers are … Our developers sometimes do not plan for high availability in the way one would wish. A lock-based interface is more familiar to our programmers. Our developers are confused by non-intuitive caching semantics, so we prefer consistent caching. However, mistakes, misunderstanding and the differing expectations of our developers lead to efforts that are similar to attacks. Despite attempts at education, our developers regularly write loops that retry indefinitely when a file is not present, or poll a file by opening it and closing it repeatedly when one might expect they would open the file just once. Developers also fail to appreciate the difference between a service being up, and that service being available to their applications. Human 

Hit or Miss? Scales to clients – Can be scaled further using proxies and partitioning 61 outages in total – 52 under 30s (almost no impact) – 1 outage due to overload Data loss on 6 occasions – 4 due to software error (fixed) – 2 due to operators Scalable Available Reliable

More Numbers Naming-related events – 60% file opens – 46% stored files 10 clients use each cached file out of 230k – 14% of cache are negative caches for names KeepAlive account for 93% traffic – Uses UDP instead of TCP