Pond: the OceanStore Prototype CS 6464 Cornell University Presented by Yeounoh Chung.

Slides:



Advertisements
Similar presentations
Tapestry: Decentralized Routing and Location SPAM Summer 2001 Ben Y. Zhao CS Division, U. C. Berkeley.
Advertisements

What is OceanStore? - 10^10 users with files each - Goals: Durability, Availability, Enc. & Auth, High performance - Worldwide infrastructure to.
Slides for Chapter 10: Peer-to-Peer Systems From Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edition 4, © Addison-Wesley.
C-Store: Data Management in the Cloud Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY Jun 5, 2009.
POND: the OceanStore Prototype Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon, Ben Zhao and John Kubiatowicz UC, Berkeley File and Storage.
Pond The OceanStore Prototype. Pond -- Dennis Geels -- January 2003 Talk Outline System overview Implementation status Results from FAST paper Conclusion.
Pond: the OceanStore Prototype Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon,
Pond The OceanStore Prototype. Introduction Problem: Rising cost of storage management Observations: Universal connectivity via Internet $100 terabyte.
Pond: the OceanStore Prototype Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon,
David Choffnes, Winter 2006 OceanStore Maintenance-Free Global Data StorageMaintenance-Free Global Data Storage, S. Rhea, C. Wells, P. Eaton, D. Geels,
OceanStore: An Infrastructure for Global-Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
Outline for today Structured overlay as infrastructures Survey of design solutions Analysis of designs.
1 Principles of Reliable Distributed Systems Tutorial 12: Frangipani Spring 2009 Alex Shraer.
Coda file system: Disconnected operation By Wallis Chau May 7, 2003.
OceanStore Exploiting Peer-to-Peer for a Self-Repairing, Secure and Persistent Storage Utility John Kubiatowicz University of California at Berkeley.
OceanStore An Architecture for Global-scale Persistent Storage By John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
OceanStore Status and Directions ROC/OceanStore Retreat 1/13/03 John Kubiatowicz University of California at Berkeley.
Naming and Integrity: Self-Verifying Data in Peer-to-Peer Systems Hakim Weatherspoon, Chris Wells, John Kubiatowicz University of California, Berkeley.
Erasure Coding vs. Replication: A Quantiative Comparison
OceanStore: Data Security in an Insecure world John Kubiatowicz.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
OceanStore: An Architecture for Global-Scale Persistent Storage Professor John Kubiatowicz, University of California at Berkeley
Opportunities for Continuous Tuning in a Global Scale File System John Kubiatowicz University of California at Berkeley.
Slide 1 ISTORE: System Support for Introspective Storage Appliances Aaron Brown, David Oppenheimer, and David Patterson Computer Science Division University.
Concurrency Control & Caching Consistency Issues and Survey Dingshan He November 18, 2002.
OceanStore/Tapestry Toward Global-Scale, Self-Repairing, Secure and Persistent Storage Anthony D. Joseph John Kubiatowicz Sahara Retreat, January 2003.
Wide-area cooperative storage with CFS
OceanStore An Architecture for Global-Scale Persistent Storage Motivation Feature Application Specific Components - Secure Naming - Update - Access Control-
Long Term Durability with Seagull Hakim Weatherspoon (Joint work with Jeremy Stribling and OceanStore group) University of California, Berkeley ROC/Sahara/OceanStore.
OceanStore: An Architecture for Global - Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patric Eaton, Dennis Geels,
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google∗
Federated, Available, and Reliable Storage for an Incompletely Trusted Environment Atul Adya, Bill Bolosky, Miguel Castro, Gerald Cermak, Ronnie Chaiken,
1 The Google File System Reporter: You-Wei Zhang.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
OceanStore: An Architecture for Global-Scale Persistent Storage John Kubiatowicz, et al ASPLOS 2000.
1 JTE HPC/FS Pastis: a peer-to-peer file system for persistant large-scale storage Jean-Michel Busca Fabio Picconi Pierre Sens LIP6, Université Paris 6.
Failure Resilience in the Peer-to-Peer-System OceanStore Speaker: Corinna Richter.
1 The Design of a Robust Peer-to-Peer System Gisik Kwon Dept. of Computer Science and Engineering Arizona State University Reference: SIGOPS European Workshop.
Low-Overhead Byzantine Fault-Tolerant Storage James Hendricks, Gregory R. Ganger Carnegie Mellon University Michael K. Reiter University of North Carolina.
Pond: the OceanStore Prototype Sean Rhea, Patric Eaton, Dennis Gells, Hakim Weatherspoon, Ben Zhao, and John Kubiatowicz University of California, Berkeley.
1 Configurable Security for Scavenged Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh with: Samer Al-Kiswany, Matei Ripeanu.
OceanStore: An Infrastructure for Global-Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
From Coulouris, Dollimore, Kindberg and Blair Distributed Systems: Concepts and Design Edition 5, © Addison-Wesley 2012 Slides for Chapter 10: Peer-to-Peer.
What is a Distributed File System?? Allows transparent access to remote files over a network. Examples: Network File System (NFS) by Sun Microsystems.
OceanStore: In Search of Global-Scale, Persistent Storage John Kubiatowicz UC Berkeley.
Storage Management and Caching in PAST A Large-scale persistent peer-to-peer storage utility Presented by Albert Tannous CSE 598D: Storage Systems – Dr.
OceanStore: An Architecture for Global- Scale Persistent Storage.
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
Toward Achieving Tapeless Backup at PB Scales Hakim Weatherspoon University of California, Berkeley Frontiers in Distributed Information Systems San Francisco.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
POND: THE OCEANSTORE PROTOTYPE S. Rea, P. Eaton, D. Geels, H. Weatherspoon, J. Kubiatowicz U. C. Berkeley.
Secure Location-Independent Autonomic Storage Architectures GR/S44501/01 February January 2007 Graham Kirby, Alan Dearle, Ron Morrison & Stuart.
Distributed File Systems Questions answered in this lecture: Why are distributed file systems useful? What is difficult about distributed file systems?
THE EVOLUTION OF CODA M. Satyanarayanan Carnegie-Mellon University.
Database Laboratory Regular Seminar TaeHoon Kim Article.
Seminar On Rain Technology
CS791Aravind Elango Maintenance-Free Global Data Storage Sean Rhea, Chris Wells, Patrick Eaten, Dennis Geels, Ben Zhao, Hakim Weatherspoon and John Kubiatowicz.
OceanStore : An Architecture for Global-Scale Persistent Storage Jaewoo Kim, Youngho Yi, Minsik Cho.
OceanStore: An Architecture for Global-Scale Persistent Storage
Making the Archive Real
Providing Secure Storage on the Internet
Building a Database on S3
7.1. CONSISTENCY AND REPLICATION INTRODUCTION
Pond: the OceanStore Prototype
OceanStore: An Architecture for Global-Scale Persistent Storage
Content Distribution Network
Outline for today Oceanstore: An architecture for Global-Scale Persistent Storage – University of California, Berkeley. ASPLOS 2000 Feasibility of a Serverless.
Presentation transcript:

Pond: the OceanStore Prototype CS 6464 Cornell University Presented by Yeounoh Chung

Motivation / Introduction “OceanStore is an Internet-scale, persistent data store” “for the first time, one can imagine providing truly durable, self maintaining storage to every computer user.” “vision” of highly available, reliable, and persistent, data store utility model - Amazon S3 ?!

Motivation / Introduction OceanStore (1999)Amazon S3 (2009) Universal availability High durability Incremental Scalability Self-maintaining Self-organizing Virtualization Transparency Monthly access fee Untrusted infrastructure Two-tier system High availability High durability High scalability Self-maintaining Self-organizing Virtualization Transparency Pay-per-use fee Trusted infrastructure Single-tier system

Outline Motivation / Introduction System Overview Consistency Persistency Failure Tolerance Implementation Performance Conclusion Related Work

System Overview Hakim Weatherspoon Dennis Geels Sean Rhea Patrick Eaton Ben Zhao OceanStore Cloud OceanStore Cloud

System Overview (Data Object)

System Overview (Update Model) Updates are applied atomically An array of actions guarded by a predicate No explicit locks Application-specific consistency - e.g. database, mailbox

System Overview (Tapestry) Scalable overlay network, built on TCP/IP Performs DOLR based on GUID - Virtualization - Location independence Locality aware Self-organizing self-maintaining HotOS Attendee Paul Hogan

System Overview (Primary Rep.) Each data object is assigned an inner-ring Apply updates and create new versions Byzantine fault-tolerant Ability to change inner-ring servers any time - public key cryptography, proactive threshold signature, Tapestry Responsible party

System Overview (Achi. Storage) Erasure codes are more space efficient Fragments are distributed uniformly among archival storage servers - BGUID, n_frag Pond uses Cauchy Reed-Solomon code Z W W ZY X f f -1

System Overview (Caching) Promiscuous caching Whole-block caching Host caches the read block and publishes its po session in Tapestry Pond uses LRU Use Heartbeat to get the most recent copy

System Overview (Diss. Tree)

Outline Motivation / Introduction System Overview Consistency Persistency Failure Tolerance Implementation Performance Conclusion Related Work

Consistency (Primary Replica) Read-only blocks Application-specific consistency Primary-copy replication - heartbeat

Persistency (Archival Storage) Archival storage Aggressive replication Monitoring - Introspection Replacement - Tapestry

Failure Tolerance (Everybody) All newly created blocks are encoded and stored in Archival servers Aggressive replication Byzantine agreement protocol for inner-ring Responsible Party - single point of failure? - scalability?

Outline Motivation / Introduction System Overview Consistency Persistency Failure Tolerance Implementation Performance Conclusion Related Work

Implementation Built in Java, atop SEDA Major subsystems are functional - self-organizing Tapestry - primary replica with Byzantine agreement - self-organizing dissemination tree - erasure-coding archive - application interface: NFS, IMAP/SMTP, HTTP

Implementation

Outline Motivation / Introduction System Overview Consistency Persistency Failure Tolerance Implementation Performance Conclusion Related Work

Performance Updata performance Dissemination tree performance Archival retrieval performance The Andrew Benchmark

Performance (test beds) Local cluster - 42 machines at Berkeley - 2x 1.0 GHz CPU, 1.5 GB SDRAM, 2x 36 GB hard drives - gigabit Ethernet adaptor and switch PlanetLab nodes across 43 sites GHz, 1 GB memory

Performance (update)

Performance (archival)

Performance (dissemination tree)

Performance (Andrew benchmark)

Conclusion Pond is a working subset of the vision Promising in WAN Threshold signatures, erasure-coded archival are expensive Pond is fault tolerant system, but it is not tested with any failed node Any thoughts?

Related Work FarSite ITTC, COCA PAST, CFS, IVY Pangaea