The Oceanic Data Utility: (OceanStore) Global-Scale Persistent Storage John Kubiatowicz.

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

Distributed Data Processing
Tapestry: Decentralized Routing and Location SPAM Summer 2001 Ben Y. Zhao CS Division, U. C. Berkeley.
Which server is right for you? Get in Contact with us
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Database Theory Why use database? Data is a valuable corporate resource which needs adequate accuracy, consistency and security controls. The centralized.
Technical Review Group (TRG)Agenda 27/04/06 TRG Remit Membership Operation ICT Strategy ICT Roadmap.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Password?. Project CLASP: Common Login and Access rights across Services Plan
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage John Kubiatowicz.
Password?. Project CLASP: Common Login and Access rights across Services Plan
Active Directory: Final Solution to Enterprise System Integration
Trust Establishment in Pervasive Grid Environments Syed Naqvi, Michel Riguidel TÉLÉCOM PARIS ÉNST É cole N ationale S upérieur des T élécommunications.
Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage John Kubiatowicz.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
OceanStore: An Architecture for Global-Scale Persistent Storage Professor John Kubiatowicz, University of California at Berkeley
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
Building a Strong Foundation for a Future Internet Jennifer Rexford ’91 Computer Science Department (and Electrical Engineering and the Center for IT Policy)
Client-Server Computing in Mobile Environments
Module – 7 network-attached storage (NAS)
SPRING 2011 CLOUD COMPUTING Cloud Computing San José State University Computer Architecture (CS 147) Professor Sin-Min Lee Presentation by Vladimir Serdyukov.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
“ Does Cloud Computing Offer a Viable Option for the Control of Statistical Data: How Safe Are Clouds” Federal Committee for Statistical Methodology (FCSM)
MADE Mobile Agents based system for Distance Evaluation Vikram Jamwal KReSIT, IIT Bombay Guide : Prof. Sridhar Iyer.
Cloud Computing Saneel Bidaye uni-slb2181. What is Cloud Computing? Cloud Computing refers to both the applications delivered as services over the Internet.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
What is Enterprise Architecture?
Scalability Terminology: Farms, Clones, Partitions, and Packs: RACS and RAPS Bill Devlin, Jim Cray, Bill Laing, George Spix Microsoft Research Dec
Microsoft Active Directory(AD) A presentation by Robert, Jasmine, Val and Scott IMT546 December 11, 2004.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
1 Introduction to Database Systems. 2 Database and Database System / A database is a shared collection of logically related data designed to meet the.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Large Scale Test of a storage solution based on an Industry Standard Michael Ernst Brookhaven National Laboratory ADC Retreat Naples, Italy February 2,
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
What is a Distributed File System?? Allows transparent access to remote files over a network. Examples: Network File System (NFS) by Sun Microsystems.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
A Low-bandwidth Network File System Athicha Muthitacharoen et al. Presented by Matt Miller September 12, 2002.
HUSKY CONSULTANTS FRANKLIN VALENCIA WIOLETA MILCZAREK ANTHONY GAGLIARDI JR. BRIAN CONNERY.
Chapter 8 – Cloud Computing
Peer to Peer Computing. What is Peer-to-Peer? A model of communication where every node in the network acts alike. As opposed to the Client-Server model,
Introduction to Active Directory
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Societal-Scale Computing: The eXtremes Scalable, Available Internet Services Information Appliances Client Server Clusters Massive Cluster Gigabit Ethernet.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
“End to End VoIP“ The Challenges of VoIP Access to the Enterprise Charles Rutledge VP Marketing Quintum Technologies
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Cloud Computing for Business Cloud Computing Services Cloud Computing Services.
Cloud Computing 3. TECHNOLOGY GUIDE 3: Cloud Computing 2 Copyright John Wiley & Sons Canada.
What is Cloud Computing 1. Cloud computing is a service that helps you to perform the tasks over the Internet. The users can access resources as they.
Truly Distributed File Systems Paul Timmins CS 535.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
OceanStore : An Architecture for Global-Scale Persistent Storage Jaewoo Kim, Youngho Yi, Minsik Cho.
Clouding with Microsoft Azure
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage
XACML and the Cloud.
Scaling for the Future Katherine Yelick U.C. Berkeley, EECS
Mobile Computing.
CSE 4340/5349 Mobile Systems Engineering
MAINTAINING SERVER AVAILIBILITY
Dynamic Replica Placement for Scalable Content Delivery
Outline for today Oceanstore: An architecture for Global-Scale Persistent Storage – University of California, Berkeley. ASPLOS 2000 Feasibility of a Serverless.
Presentation transcript:

The Oceanic Data Utility: (OceanStore) Global-Scale Persistent Storage John Kubiatowicz

ODU Properties of the ODU Motivation: –Growing quantity of inconsistent data –Widespread mobility of producers and consumers –Simplicity: subsume Web, , filesystems, databases Nomadic Data: Serverless, Homeless –Sharing of information between anyone, anywhere –Promiscuous caching of data enabled by tacit information (option 5/introspection) –Efficient dissemination of information (multicast) –Federation of many different companies, just like phone service or electric grid. Highly-available: data always duplicated –Higher-probability access –Copies placed with low probability of correlated failure Shares technology with options 1,4,5, and 8

ODU Technical Challenges Scalability: performance easy to destroy –vast number of entities: ~billions –cross-administrative domains –Security is not optional: data never cleartext Availability –Should bootstrap redundancy available on global scale –Economies of scale applied to achieving data reliability Maintainability –Too large for human intervention in normal operation Naming: How to maintain global namespace? Indexability –Must enable efficient location/searching of data Consistency/Conflict resolution –Multiple copies must have well-defined relationship

ODU State of the Art? Remote file-system community: NFS, AFS –All have single points of failure –Only caching at endpoints Mobile computing community: Coda –Small scale, fixed coherence mechanism Web caching community: Inktomi, others? –Specialized, incremental solutions –Caching along client/server path, various bottlenecks Database Community: Mariposa –Still small scale, specialized types of queries –Economic model not quite right but on right track Internet backup companies: Medley –Very limited in scope and flexibility PalmPilot: inspired general conflict-resolution

ODU Our Enabling Technologies Data Economy –User pays monthly fee to a primary utility provider who is responsible for reliability of data –Utilities buy and sell capacity (both data and bandwidth); prices set for quantity and reliability –Authoritative naming servers paid per query? Underlying database organization –User-visible structure (e.g. filesystem) synthesized Federation of overlapping data location structures (indices) + Introspection –Separate the absolute authority for data location from moment-to-moment “hearsay” authorities –Partially consistent indices continually adapted to improve performance Conflict Resolution, not consistency –policies set via domain specific language

ODU 3 Year Plan for Success Year1: –Initial design and refinement of four components: naming & security scheme (security based on name) fluid, partially coherent index structures introspection for intelligent migration of data initial take on economic models –Begin prototype implementation with all components Year2: –Finish prototyping and refinement of first-generation –Client implementation for Windows and/or UNIX Year 3: –Second-generation prototype on Millenium infrastructure –formulate plan for large-scale test –Final evaluation and usability results