OceanStore Theoretical Issues and Open Problems John Kubiatowicz University of California at Berkeley.

Slides:



Advertisements
Similar presentations
Perspective on Overlay Networks Panel: Challenges of Computing on a Massive Scale Ben Y. Zhao FuDiCo 2002.
Advertisements

Tapestry: Decentralized Routing and Location SPAM Summer 2001 Ben Y. Zhao CS Division, U. C. Berkeley.
What is OceanStore? - 10^10 users with files each - Goals: Durability, Availability, Enc. & Auth, High performance - Worldwide infrastructure to.
Peer-to-Peer Systems Chapter 25. What is Peer-to-Peer (P2P)? Napster? Gnutella? Most people think of P2P as music sharing.
POND: the OceanStore Prototype Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon, Ben Zhao and John Kubiatowicz UC, Berkeley File and Storage.
Pond: the OceanStore Prototype CS 6464 Cornell University Presented by Yeounoh Chung.
Pond: the OceanStore Prototype Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon,
David Choffnes, Winter 2006 OceanStore Maintenance-Free Global Data StorageMaintenance-Free Global Data Storage, S. Rhea, C. Wells, P. Eaton, D. Geels,
1 Accessing nearby copies of replicated objects Greg Plaxton, Rajmohan Rajaraman, Andrea Richa SPAA 1997.
OceanStore: An Infrastructure for Global-Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage John Kubiatowicz.
1 U.S. Department of the Interior U.S. Geological Survey A Cognitive Agent Based Geospatial Data Distribution System 12 May 2006.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
OceanStore Exploiting Peer-to-Peer for a Self-Repairing, Secure and Persistent Storage Utility John Kubiatowicz University of California at Berkeley.
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage John Kubiatowicz.
CS252/Kubiatowicz Lec 1.1 8/25/03 CS294-4 Peer-to-Peer systems Introduction August 25, 2003 John Kubiatowicz/Anthony Joseph
OceanStore Status and Directions ROC/OceanStore Retreat 1/16/01 John Kubiatowicz University of California at Berkeley.
OceanStore Global-Scale Persistent Storage John Kubiatowicz.
OceanStore: An Architecture for Global-Scale Persistent Storage John Kubiatowicz University of California at Berkeley.
1 OceanStore Global-Scale Persistent Storage Ying Lu CSCE496/896 Spring 2011.
OceanStore Toward Global-Scale, Self-Repairing, Secure and Persistent Storage John Kubiatowicz University of California at Berkeley.
OceanStore Global-Scale Persistent Storage John Kubiatowicz University of California at Berkeley.
OceanStore An Architecture for Global-scale Persistent Storage By John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
Tapestry : An Infrastructure for Fault-tolerant Wide-area Location and Routing Presenter: Chunyuan Liao March 6, 2002 Ben Y.Zhao, John Kubiatowicz, and.
Naming and Integrity: Self-Verifying Data in Peer-to-Peer Systems Hakim Weatherspoon, Chris Wells, John Kubiatowicz University of California, Berkeley.
The Oceanic Data Utility: (OceanStore) Global-Scale Persistent Storage John Kubiatowicz.
OceanStore: Data Security in an Insecure world John Kubiatowicz.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
Freenet A Distributed Anonymous Information Storage and Retrieval System I Clarke O Sandberg I Clarke O Sandberg B WileyT W Hong.
Weaving a Tapestry Distributed Algorithms for Secure Node Integration, Routing and Fault Handling Ben Y. Zhao (John Kubiatowicz, Anthony Joseph) Fault-tolerant.
OceanStore: An Architecture for Global-Scale Persistent Storage Professor John Kubiatowicz, University of California at Berkeley
Opportunities for Continuous Tuning in a Global Scale File System John Kubiatowicz University of California at Berkeley.
OceanStore Toward Global-Scale, Self-Repairing, Secure and Persistent Storage John Kubiatowicz University of California at Berkeley.
Concurrency Control & Caching Consistency Issues and Survey Dingshan He November 18, 2002.
OceanStore/Tapestry Toward Global-Scale, Self-Repairing, Secure and Persistent Storage Anthony D. Joseph John Kubiatowicz Sahara Retreat, January 2003.
Or, Providing High Availability and Adaptability in a Decentralized System Tapestry: Fault-resilient Wide-area Location and Routing Issues Facing Wide-area.
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
OceanStore Global-Scale Persistent Storage John Kubiatowicz University of California at Berkeley.
OceanStore An Architecture for Global-Scale Persistent Storage Motivation Feature Application Specific Components - Secure Naming - Update - Access Control-
OceanStore: An Architecture for Global - Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patric Eaton, Dennis Geels,
Review Session for Fourth Quiz Jehan-François Pâris Summer 2011.
1 Plaxton Routing. 2 Introduction Plaxton routing is a scalable mechanism for accessing nearby copies of objects. Plaxton mesh is a data structure that.
Arnold N. Pears, CoRE Group Uppsala University 3 rd Swedish Networking Workshop Marholmen, September Why Tapestry is not Pastry Presenter.
Jan 17, 2001CSCI {4,6}900: Ubiquitous Computing1 Announcements I will be out of town Monday and Tuesday to present at Multimedia Computing and Networking.
Failure Resilience in the Peer-to-Peer-System OceanStore Speaker: Corinna Richter.
Pond: the OceanStore Prototype Sean Rhea, Patric Eaton, Dennis Gells, Hakim Weatherspoon, Ben Zhao, and John Kubiatowicz University of California, Berkeley.
Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.
OceanStore: An Infrastructure for Global-Scale Persistent Storage John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
OceanStore: In Search of Global-Scale, Persistent Storage John Kubiatowicz UC Berkeley.
Distributed Architectures. Introduction r Computing everywhere: m Desktop, Laptop, Palmtop m Cars, Cellphones m Shoes? Clothing? Walls? r Connectivity.
Rushing Attacks and Defense in Wireless Ad Hoc Network Routing Protocols ► Acts as denial of service by disrupting the flow of data between a source and.
1 More on Plaxton routing There are n nodes, and log B n digits in the id, where B = 2 b The neighbor table of each node consists of - primary neighbors.
OceanStore: An Architecture for Global- Scale Persistent Storage.
Freenet “…an adaptive peer-to-peer network application that permits the publication, replication, and retrieval of data while protecting the anonymity.
Toward Achieving Tapeless Backup at PB Scales Hakim Weatherspoon University of California, Berkeley Frontiers in Distributed Information Systems San Francisco.
POND: THE OCEANSTORE PROTOTYPE S. Rea, P. Eaton, D. Geels, H. Weatherspoon, J. Kubiatowicz U. C. Berkeley.
Tapestry : An Infrastructure for Fault-tolerant Wide-area Location and Routing Presenter : Lee Youn Do Oct 5, 2005 Ben Y.Zhao, John Kubiatowicz, and Anthony.
CS791Aravind Elango Maintenance-Free Global Data Storage Sean Rhea, Chris Wells, Patrick Eaten, Dennis Geels, Ben Zhao, Hakim Weatherspoon and John Kubiatowicz.
OceanStore Global-Scale Persistent Storage John Kubiatowicz University of California at Berkeley.
Overview Issues in Mobile Databases – Data management – Transaction management Mobile Databases and Information Retrieval.
OceanStore : An Architecture for Global-Scale Persistent Storage Jaewoo Kim, Youngho Yi, Minsik Cho.
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage
OceanStore: An Architecture for Global-Scale Persistent Storage
Scaling for the Future Katherine Yelick U.C. Berkeley, EECS
OceanStore August 25, 2003 John Kubiatowicz
OceanStore: Data Security in an Insecure world
John D. Kubiatowicz UC Berkeley
Content Distribution Network
Outline for today Oceanstore: An architecture for Global-Scale Persistent Storage – University of California, Berkeley. ASPLOS 2000 Feasibility of a Serverless.
Presentation transcript:

OceanStore Theoretical Issues and Open Problems John Kubiatowicz University of California at Berkeley

OceanStore:2Theory Lunch (1/31) OceanStore Context: Ubiquitous Computing Computing everywhere: –Desktop, Laptop, Palmtop –Cars, Cellphones –Shoes? Clothing? Walls? Connectivity everywhere: –Rapid growth of bandwidth in the interior of the net –Broadband to the home and office –Wireless technologies such as CMDA, Satelite, laser

OceanStore:3Theory Lunch (1/31) Questions about information: Where is persistent information stored? –Want: Geographic independence for availability, durability, and freedom to adapt to circumstances How is it protected? –Want: Encryption for privacy, signatures for authenticity, and Byzantine commitment for integrity Can we make it indestructible? –Want: Redundancy with continuous repair and redistribution for long-term durability Is it hard to manage? –Want: automatic optimization, diagnosis and repair Who owns the aggregate resouces? –Want: Utility Infrastructure!

OceanStore:4Theory Lunch (1/31) Utility-based Infrastructure Pac Bell Sprint IBM AT&T Canadian OceanStore IBM Transparent data service provided by federation of companies: –Monthly fee paid to one service provider –Companies buy and sell capacity from each other

OceanStore:5Theory Lunch (1/31) OceanStore: Everyone’s Data, One Big Utility “The data is just out there” How many files in the OceanStore? –Assume people in world –Say 10,000 files/person (very conservative?) –So files in OceanStore! –If 1 gig files (ok, a stretch), get 1 mole of bytes! Truly impressive number of elements… … but small relative to physical constants Aside: new results: 1.5 Exabytes/year (1.5  )

OceanStore:6Theory Lunch (1/31) OceanStore Assumptions Untrusted Infrastructure: –The OceanStore is comprised of untrusted components –Only ciphertext within the infrastructure Responsible Party: –Some organization (i.e. service provider) guarantees that your data is consistent and durable –Not trusted with content of data, merely its integrity Mostly Well-Connected: –Data producers and consumers are connected to a high- bandwidth network most of the time –Exploit multicast for quicker consistency when possible Promiscuous Caching: –Data may be cached anywhere, anytime Optimistic Concurrency via Conflict Resolution: –Avoid locking in the wide area –Applications use object-based interface for updates

OceanStore:7Theory Lunch (1/31) Game Theoretic Issues? Mechanism design to achieve optimal behavior from service providers Resource allocation algorithms –Quality of service contracts? –Proof of data (challenge/response protocol?) –Proof of resources? –Control over redundancy? –Introspection?

OceanStore:8Theory Lunch (1/31) The Path of an OceanStore Update

OceanStore:9Theory Lunch (1/31) OceanStore Consistency via Conflict Resolution Consistency is form of optimistic concurrency –An update packet contains a series of predicate-action pairs which operate on encrypted data –Each predicate tried in turn: If none match, the update is aborted Otherwise, action of first true predicate is applied Role of Responsible Party –All updates submitted to Responsible Party which chooses a final total order –Byzantine agreement with threshold signatures This is powerful enough to synthesize: –ACID database semantics –release consistency (build and use MCS-style locks) –Extremely loose (weak) consistency

OceanStore:10Theory Lunch (1/31) Oblivious Updates on Encrypted Data? Tentative Scheme: –Divide data into small blocks –Updates on a per-block basis –Predicates derived from techniques for searching on encrypted data Challenges: –Better Predicates on Encrypted Data –The Merge problem: Insert a small block of data into a larger block on a server which doesn’t have the encryption key! TimeStamp Client ID {Pred1, Update1} {Pred2, Update2} {Pred3, Update3} Client Signature

OceanStore:11Theory Lunch (1/31) Other Challenges Reencryption on insecure server –Existance of a one-way function, f [Key1, Key2](Data): Transforms data encrypted with Key 1 into data encrypted with Key 2 Can’t derive Key 1 or Key 2 from f [Key1, Key2] Can’t derive f [Key2, Key1] from f [Key1, Key2] Better Byzantine Algorithm: –Inner ring takes all updates, serializes them, applies their predicate/update pairs, then signs the result –Current scheme derived from Liskov and Castro Has single server in inner ring that acts like general – must be reelected when bad Result must be signed by aggregate. Using Threshold signatures – at the moment, this adds another phase! –Is there a decentralized algorithm? Any update can be sent to any ring member Assume presence of synchronized clocks (We have been looking at periodic heart beats Must keep update traffic and latency low

OceanStore:12Theory Lunch (1/31) Two-levels of Routing Fast, probabilistic search for “routing cache”: –Built from attenuated bloom filters –Approximation to gradient search potential function Redundant Plaxton Mesh used for underlying routing infrastructure: –Randomized data structure with locality properties –Redundant, insensitive to faults, and repairable –Amenable to continuous adaptation to adjust for: Changing network behavior Faulty servers Denial of service attacks

OceanStore:13Theory Lunch (1/31) Basic Plaxton Mesh Incremental suffix-based routing NodeID 0x43FE NodeID 0x13FE NodeID 0xABFE NodeID 0x1290 NodeID 0x239E NodeID 0x73FE NodeID 0x423E NodeID 0x79FE NodeID 0x23FE NodeID 0x73FF NodeID 0x555E NodeID 0x035E NodeID 0x44FE NodeID 0x9990 NodeID 0xF990 NodeID 0x993E NodeID 0x04FE

OceanStore:14Theory Lunch (1/31) Use of Plaxton Mesh Randomization and Locality

OceanStore:15Theory Lunch (1/31) Use of the Plaxton Mesh (the Tapestry infrastructure) OceanStore enhancements for reliability: –Each node has multiple neighbors with given properties –Documents have multiple roots through salted hashes Multiple paths to a given document Multiple roots responsible for welfare of document –Searches along multiple paths simultaneously Tradeoff between reliability and bandwidth utilization? –Routing-level validation of query results Dynamic node insertion and deletion algorithms –Uses redundant neighbors during integration –Continuous repair and incremental optimization of links

OceanStore:16Theory Lunch (1/31) Questions for routing? Analysis of Tapestry properties –Routing Stretch? –Optimality of insertion algorithms? –Fault Tolerance What is the tradeoff between redundancy of searches and tolerance of faults? What is the best (optimal number of messages, etc) use of redundant info in Tapestry under failures? –Repair bandwidth/mechanisms What are the lowest bandwidth repair techniques? Least subject to attack? Better probabilistic potential functions –Goal: Compact Summary of neighbors which can be used to drive an incremental search

OceanStore:17Theory Lunch (1/31) Data Coding Model Two distinct forms of data: active and archival Active Data in Floating Replicas –Per object virtual server –Logging for updates/conflict resolution –Interaction with other replicas to keep data consistent –May appear and disappear like bubbles Archival Data in Erasure-Coded Fragments –OceanStore equivalent of stable store –During commit, previous version coded with erasure-code and spread over 100s or 1000s of nodes –Fragments are self-verifying –Advantage: any 1/2 or 1/4 of fragments regenerates data Law of large numbers advantage to fragmentation Most data in the OceanStore is archival!

OceanStore:18Theory Lunch (1/31) Floating Replica and Deep Archival Coding Ver1: 0x34243 Ver2: 0x49873 Ver3: … Full Cop y Conflict Resolution Logs Ver1: 0x34243 Ver2: 0x49873 Ver3: … Full Cop y Conflict Resolution Logs Ver1: 0x34243 Ver2: 0x49873 Ver3: … Full Cop y Conflict Resolution Logs

OceanStore:19Theory Lunch (1/31) Archival Storage Issues Archival Properties –Behavior under correlated failures –Derivation of correlated failure model of servers which can be used to –Refresh behavior which consumes least “energy” General Question: Thermodynamic description of OceanStore –Entropy management –General tradeoff between redundancy and rate of information retrieval

OceanStore:20Theory Lunch (1/31) For more info: OceanStore vision paper for ASPLOS 2000 “OceanStore: An Architecture for Global-Scale Persistent Storage” OceanStore web site: