Farsite: Ferderated, Available, and Reliable Storage for an Incompletely Trusted Environment Microsoft Reseach, Appear in OSDI’02.

Slides:



Advertisements
Similar presentations
Windows Server ® 2008 File Services Infrastructure Planning and Design Published: June 2010 Updated: November 2011.
Advertisements

G O O G L E F I L E S Y S T E M 陳 仕融 黃 振凱 林 佑恩 Z 1.
FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment Presented by: Boon Thau Loo CS294-4 (Adapted from Adya’s OSDI’02.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Replica Control for Peer-to- Peer Storage Systems.
Ivy: A Read/Write P2P File System Athicha Muthitacharoan, Robert Morris, Thomer Gil and Benjie Chen Presented by Rachel Rubin CS 294-4, Fall 2003.
Lecture 6 – Google File System (GFS) CSE 490h – Introduction to Distributed Computing, Winter 2008 Except as otherwise noted, the content of this presentation.
Storage Management and Caching in PAST, a large-scale, persistent peer- to-peer storage utility Authors: Antony Rowstorn (Microsoft Research) Peter Druschel.
Coda file system: Disconnected operation By Wallis Chau May 7, 2003.
3.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 3: Introducing Active Directory.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
Taming Aggressive Replication in the Pangaea Wide-area File System Y. Saito, C. Kaamanolis, M. Karlsson, M. Mahalingam Presented by Jason Waddle.
G Robert Grimm New York University Farsite: A Serverless File System.
Object Naming & Content based Object Search 2/3/2003.
Hands-On Microsoft Windows Server 2003 Administration Chapter 5 Administering File Resources.
Concurrency Control & Caching Consistency Issues and Survey Dingshan He November 18, 2002.
The Google File System.
Wide-area cooperative storage with CFS
Understanding Active Directory
Chapter 7 Configuring & Managing Distributed File System
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google∗
Presented by: Alvaro Llanos E.  Motivation and Overview  Frangipani Architecture overview  Similar DFS  PETAL: Distributed virtual disks ◦ Overview.
Federated, Available, and Reliable Storage for an Incompletely Trusted Environment Atul Adya, Bill Bolosky, Miguel Castro, Gerald Cermak, Ronnie Chaiken,
Distributed File Systems Concepts & Overview. Goals and Criteria Goal: present to a user a coherent, efficient, and manageable system for long-term data.
FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment.
1 The Google File System Reporter: You-Wei Zhang.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
Thesis Proposal Data Consistency in DHTs. Background Peer-to-peer systems have become increasingly popular Lots of P2P applications around us –File sharing,
Module 5: Planning a DNS Strategy. Overview Planning DNS Servers Planning a Namespace Planning Zones Planning Zone Replication and Delegation Integrating.
1 JTE HPC/FS Pastis: a peer-to-peer file system for persistant large-scale storage Jean-Michel Busca Fabio Picconi Pierre Sens LIP6, Université Paris 6.
FARSITE: Federated, Available and Reliable Storage for an Incompletely Trusted Environment A. Atta, W. J. Bolowsky, M. Castro, G. Cermak, R. Chaiken, J.
Consistency And Replication
Pond: the OceanStore Prototype Sean Rhea, Patric Eaton, Dennis Gells, Hakim Weatherspoon, Ben Zhao, and John Kubiatowicz University of California, Berkeley.
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
Distributed File Systems Case Studies: Sprite Coda.
1 Administering Shared Folders Understanding Shared Folders Planning Shared Folders Sharing Folders Combining Shared Folder Permissions and NTFS Permissions.
Chapter 12: SYSVOL: Old & New BAI617. Chapter Topics What is SysVol? Understanding File Replication System (FRS) Understanding 2008 R2 Distributed.
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
Chen ying 1 Taming aggressive replication in the Pangaea wide-area file system Authors: Yasushi Saito, Christos Karamanolis, Magnus Karlsson, Mallik Mahalingam.
Presenters: Rezan Amiri Sahar Delroshan
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
Peer-to-peer Information Systems Universität des Saarlandes Max-Planck-Institut für Informatik – AG5: Databases and Information Systems Group Prof. Dr.-Ing.
Module 6: Designing Name Resolution. Module Overview Collecting Information for a Name Resolution Design Designing a DNS Server Strategy Designing a DNS.
 Distributed file systems having transaction facility need to support distributed transaction service.  A distributed transaction service is an extension.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
Eduardo Gutarra Velez. Outline Distributed Filesystems Motivation Google Filesystem Architecture The Metadata Consistency Model File Mutation.
Computer Science Lecture 19, page 1 CS677: Distributed OS Last Class: Fault tolerance Reliable communication –One-one communication –One-many communication.
1 JTE HPC/FS Pastis: a peer-to-peer file system for persistant large-scale storage Jean-Michel Busca Fabio Picconi Pierre Sens LIP6, Université Paris 6.
Module 4: Configuring Active Directory Sites and Replication.
POND: THE OCEANSTORE PROTOTYPE S. Rea, P. Eaton, D. Geels, H. Weatherspoon, J. Kubiatowicz U. C. Berkeley.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
Presenter: Seikwon KAIST The Google File System 【 Ghemawat, Gobioff, Leung 】
Module 1: Introduction to Active Directory
Module 11: Configuring and Managing Distributed File System.
Dsitributed File Systems
Robustness in the Salus scalable block store Yang Wang, Manos Kapritsos, Zuocheng Ren, Prince Mahajan, Jeevitha Kirubanandam, Lorenzo Alvisi, and Mike.
The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Presenter: Chao-Han Tsai (Some slides adapted from the Google’s series lectures)
Module 11 Configuring and Managing Distributed File System.
Ivy: A Read/Write Peer-to- Peer File System Authors: Muthitacharoen Athicha, Robert Morris, Thomer M. Gil, and Benjie Chen Presented by Saurabh Jha 1.
Mobility Victoria Krafft CS /25/05. General Idea People and their machines move around Machines want to share data Networks and machines fail Network.
DISTRIBUTED FILE SYSTEM- ENHANCEMENT AND FURTHER DEVELOPMENT BY:- PALLAWI(10BIT0033)
Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung
Nomadic File Systems Uri Moszkowicz 05/02/02.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT -Sumanth Kandagatla Instructor: Prof. Yanqing Zhang Advanced Operating Systems (CSC 8320)
Providing Secure Storage on the Internet
Outline Announcements Lab2 Distributed File Systems 1/17/2019 COP5611.
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
Federated, Available, and Reliable Storage for an Incompletely Trusted Environment Atul Adya, William J. Bolosky, Miguel Castro, Gerald Cermak, Ronnie.
Presentation transcript:

Farsite: Ferderated, Available, and Reliable Storage for an Incompletely Trusted Environment Microsoft Reseach, Appear in OSDI’02

Design Assumption 100,000 machines in a large corporation or university, interconnected by a high-bandwidth, low-latency network Allow large-scale read-only sharing Allow small-scale read/write sharing A small fraction of users misbehave

Enabling Technology Trends Large amount of unused disk space enables the use of replication for reliability Relatively low cost of strong cryptography enables distributed security

Problems Namespace roots – A file system is a hierachical directory namespace, originated at a root – Allows multiple roots, each of which can be regarded as a virtual file server – A root cooresponds to a set of participating machines Trust and Certification – The security of any distributed system is an issue of trust – Manage trust using public-key-cryptographic certificates A namespace certificate A user certificate A machine certificate

Basic System Each machine performs three roles: a client, a member of a directory group, and a file host A directory group: a set of machine that collectively manage file information using a Byzantine-fault-tolerant protocol A file host: a machine used to store file data replicas

Performance Considerations Problems ? – All FS metadata operations involve Byzantine-fault- tolerant protocol(BFT) – BFT is high-cost Solution – Local caching improves read performance (by content leases) – Batch logged updates(write-back caching, due to many writes are deleted or overwritten shortly after they occur)

Security Access control by ACL Privacy – Convergent encryption to protect the file data – Exclusive encryption to protect directory or file names Integrity by a Merkle hash tree

Scalability When a directory group becomes overloaded, it can delegate part of its namespace to another group When open a file/directory with a paticular pathname, it needs to determine which group of machines is responsible for that name – Hint-based pathname translation (caching) like in Sprite

Taming aggressive replication in the Pangaea wide-area file system HP Labs

Design Goals Speed: hide the wide-area networking latency Availability and autonomy Network economy: transfer data between nodes in physical proximity, thereby reducing latency and bandwidth

Structure of a file system Gold replicas – The directory entry of a file lists the file’s gold replicas – Form a clique Bronze replicas

Replica set management Pervasive replication: a replica is created whenever a file is accessed by a user File creation Replica addition: the new replica S must be added to the graph (m edged) – adds an edge to a random gold replica (from a different region than S) – Asks a random gold replica P, to pick the replica (among P’s immediate graph neighbors)closest to S – Asks P to choose m-2 random replicas using random walk Name-space containment

Propagating updates Efficient and reliable update propagation – Delta propagation, harbingers, and using a spanning tree to exploit physical topology Conflict resolution: combing version vectors and last- writer-win rules Lack of strong consistency guarantees: eventually achieved

Questions? Graph-based replica for each file, too much metadata to maintain Like a multicast-based file system, updates are propagated using multicast

Discussion Metadata and data management in a distributed file sytem – Either mutable, but have to trust some machines, like xFS, or Farsite using Byzantine-fault-tolerant to trust part of machines to serialize updates – Or immutable, using logged updates, it relies on each individual user to form the image of a file system – The replication factor of metadata and data maybe differ according to their usage?