Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

Creating HIPAA-Compliant Medical Data Applications with Amazon Web Services Presented by, Tulika Srivastava Purdue University.
OceanStore: An Infrastructure for Global-Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage John Kubiatowicz.
OceanStore Global-Scale Persistent Storage John Kubiatowicz.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
OceanStore Global-Scale Persistent Storage John Kubiatowicz University of California at Berkeley.
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage John Kubiatowicz.
OceanStore Global-Scale Persistent Storage John Kubiatowicz.
OceanStore: An Architecture for Global-Scale Persistent Storage John Kubiatowicz University of California at Berkeley.
1 OceanStore Global-Scale Persistent Storage Ying Lu CSCE496/896 Spring 2011.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
Introduction to z/OS Basics © 2006 IBM Corporation Chapter 8: Designing and developing applications for z/OS.
The Oceanic Data Utility: (OceanStore) Global-Scale Persistent Storage John Kubiatowicz.
OceanStore: Data Security in an Insecure world John Kubiatowicz.
OceanStore Theoretical Issues and Open Problems John Kubiatowicz University of California at Berkeley.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
OceanStore: An Architecture for Global-Scale Persistent Storage Professor John Kubiatowicz, University of California at Berkeley
Opportunities for Continuous Tuning in a Global Scale File System John Kubiatowicz University of California at Berkeley.
Concurrency Control & Caching Consistency Issues and Survey Dingshan He November 18, 2002.
OceanStore/Tapestry Toward Global-Scale, Self-Repairing, Secure and Persistent Storage Anthony D. Joseph John Kubiatowicz Sahara Retreat, January 2003.
Live for today as if it is your last day but plan for tomorrow as if it will last forever!
OceanStore An Architecture for Global-Scale Persistent Storage Motivation Feature Application Specific Components - Secure Naming - Update - Access Control-
OceanStore: An Architecture for Global - Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patric Eaton, Dennis Geels,
P EER - TO -P EER N ETWORKS Michael Fine 1. W HAT ARE P EER -T O -P EER N ETWORKS ? Napster Social networking spawned from this concept. Emerged in the.
Pond: the OceanStore Prototype Sean Rhea, Patric Eaton, Dennis Gells, Hakim Weatherspoon, Ben Zhao, and John Kubiatowicz University of California, Berkeley.
OceanStore: An Infrastructure for Global-Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Cloud Computing Project By:Jessica, Fadiah, and Bill.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
OceanStore: An Architecture for Global- Scale Persistent Storage.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
POND: THE OCEANSTORE PROTOTYPE S. Rea, P. Eaton, D. Geels, H. Weatherspoon, J. Kubiatowicz U. C. Berkeley.
To provide the world with a next generation storage platform for unstructured data, enabling deployment of mobile applications, virtualization solutions,
Societal-Scale Computing: The eXtremes Scalable, Available Internet Services Information Appliances Client Server Clusters Massive Cluster Gigabit Ethernet.
System Architecture CS 560. Project Design The requirements describe the function of a system as seen by the client. The software team must design a system.
Best-in-class enterprise backup for the mobile enterprise Prepared for [Insert customer name] [Date}
OceanStore : An Architecture for Global-Scale Persistent Storage Jaewoo Kim, Youngho Yi, Minsik Cho.
Clouding with Microsoft Azure
Chapter 1 Characterization of Distributed Systems
Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung
Chapter 19: Network Management
Netscape Application Server
What is Cloud Computing - How cloud computing help your Business?
Peer-to-peer networking
File System Implementation
Database Systems: Design, Implementation, and Management Tenth Edition
Cloud Computing By P.Mahesh
Introduction.
Scaling for the Future Katherine Yelick U.C. Berkeley, EECS
Cloud Computing.
Built on the Powerful Microsoft Azure Platform, Lievestro Delivers Care Information, Capacity Management Solutions to Hospitals, Medical Field MICROSOFT.
Data Security for Microsoft Azure
Systems Design Chapter 6.
OceanStore: Data Security in an Insecure world
John D. Kubiatowicz UC Berkeley
Cloud computing mechanisms
Software models - Software Architecture Design Patterns
Quasardb Is a Fast, Reliable, and Highly Scalable Application Database, Built on Microsoft Azure and Designed Not to Buckle Under Demand MICROSOFT AZURE.
An Introduction to Software Architecture
SAMANVITHA RAMAYANAM 18TH FEBRUARY 2010 CPE 691
MAINTAINING SERVER AVAILIBILITY
THE GOOGLE FILE SYSTEM.
Peer-to-peer networking
Content Distribution Network
Salesforce.com Salesforce.com is the world leader in on-demand customer relationship management (CRM) services Manages sales, marketing, customer service,
The Database Environment
Outline for today Oceanstore: An architecture for Global-Scale Persistent Storage – University of California, Berkeley. ASPLOS 2000 Feasibility of a Serverless.
Database management systems
Presentation transcript:

Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage John Kubiatowicz

Ubiquitous Devices  Ubiquitous Storage Consumers of data move, change from one device to another, work in cafes, cars, airplanes, the office, etc. Properties REQUIRED for Endeavour storage substrate: Strong Security: data must be encrypted whenever in the infrastructure; resistance to monitoring Coherence: too much data for naïve users to keep coherent “by hand” Automatic replica management and optimization: huge quantities of data cannot be managed manually Simple and automatic recovery from disasters: probability of failure increases with size of system Utility model: world-scale system requires cooperation across administrative boundaries

Utility-based Infrastructure Canadian OceanStore Sprint AT&T IBM Pac Bell IBM Service provided by confederation of companies Monthly fee paid to one service provider Companies buy and sell capacity from each other

State of the Art? Widely deployed systems: NFS, AFS (/DFS) Single “regions” of failure, caching only at endpoints ClearText exposed at various levels of system Compromised server all data on server compromised Mobile computing community: Coda, Ficus, Bayou Small scale, fixed coherence mechanism Not optimized to take advantage of high-bandwidth connections between server components ClearText also exposed at various levels of system Web caching community: Inktomi, Akamai Specialized, incremental solutions Caching along client/server path, various bottlenecks Database Community: Interfaces not usable by legacy applications ACID update semantics not always appropriate

OceanStore Assumptions Untrusted Infrastructure: Infrastructure is comprised of untrusted components Only cyphertext within the infrastructure Must be careful to avoid leaking information Mostly Well-Connected: Data producers and consumers are connected to a high-bandwidth network most of the time Exploit mechanism such as multicast for quicker consistency between replicas Promiscuous Caching: Data may be cached anywhere, anytime Global optimization through tacit information collection Operations Interface with Conflict Resolution: Applications employ an operations-oriented interface, rather than a file-systems interface Coherence is centered around conflict resolution

OceanStore Technologies I: Naming and Data Location Requirements: Find nearby data without global communication Don’t get in way of rapid relocation of data Search should reflect locality and network efficiency System-level names should help to authenticate data OceanStore Technology: Underlying namespace is flat and built from cryptographic signatures (160-bit SHA-1) Data location is a form of gradient-search of local pools of data (use of attenuated Bloom-filters) Fallback to global, “exact” indexing structure in case data not found with local search

Cascaded-Pools Hierarchy Combined Downward Summary Search in local pool Check local summary Check summaries at each outgoing “Pipe” Local Summary Local Summary Downward Summary Downward Summary Local Summary Local Summary Local Summary Every pool has good randomized index structure (such as Treaps)

OceanStore Technologies II: High-Availability and Disaster Recovery Requirements: Handle diverse, unstable participants in OceanStore Eliminate backup as independent (and fallible) technology Flexible “disaster recovery” for everyone OceanStore Technologies: Use of erasure-codes (Tornado codes) to provide stable storage for archival copies and snapshots of live data Mobile replicas are self-contained centers for logging and conflict resolution Version-based update for painless recovery Redundancy exploited to tolerate variation of performance from network servers (RIVERS)

OceanStore Technologies III: Introspective Monitoring and Optimization Requirements: Reasonable job on a global-scale optimization problem Take advantage of locality whenever possible Sensitivity to limited storage and bandwidth at endpoints Stability in chaotic environment OceanStore Technologies: Introspective Monitoring and analysis of relationships: between different pieces of data between users of a given piece of data Rearrangement of data in response to monitoring: Economic models with analogies to simulated annealing Sub problem of Tacit Information Analysis (option 5)

OceanStore Technologies IV: Rapid Update in an Untrusted Infrastructure Requirements: Scalable coherence mechanism which provides performance even though replicas widely separated Operate directly on encrypted data Updates should not reveal info to untrusted servers OceanStore Technologies: Operations-based interface using conflict resolution Use of incremental cryptographic techniques: No time to decrypt/update/re-encrypt Use of oblivious function techniques to perform this update (fallback to secure hardware in general case) Use of automatic techniques to verify security protocols

Two-Phase Implementation: Year I: Read-Mostly Prototype Construction of data location facility Initial introspective gathering of tacit info and adaptation Initial archival techniques (use of erasure codes) Unix file-system interface under Linux (“legacy apps”) Year III: Full Prototype Final conflict resolution and encryption techniques More sophisticated tacit info gathering and rearrangement Final object interface and integration with Endeavour applications Wide-scale deployment via NTON and Internet-2