Nomadic File Systems Uri Moszkowicz 05/02/02.

Slides:



Advertisements
Similar presentations
Eventual Consistency Jinyang. Sequential consistency Sequential consistency properties: –Latest read must see latest write Handles caching –All writes.
Advertisements

Linearizability Linearizability is a correctness criterion for concurrent object (Herlihy & Wing ACM TOPLAS 1990). It provides the illusion that each operation.
Replication Management. Motivations for Replication Performance enhancement Increased availability Fault tolerance.
Computer Science Lecture 20, page 1 CS677: Distributed OS Today: Coda, xFS Case Study: Coda File System Brief overview of other recent file systems –xFS.
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
“Managing Update Conflicts in Bayou, a Weekly Connected Replicated Storage System” Presented by - RAKESH.K.
Overview of Mobile Computing (3): File System. File System for Mobile Computing Issues for file system design in wireless and mobile environments Design.
G Robert Grimm New York University Disconnected Operation in the Coda File System.
Disconnected Operation in the Coda File System James J. Kistler and M. Satyanarayanan Carnegie Mellon University Presented by Deepak Mehtani.
Disconnected Operation in the Coda File System James J. Kistler and M. Satyanarayanan Carnegie Mellon University Presented by Cong.
Coda file system: Disconnected operation By Wallis Chau May 7, 2003.
“Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System ” Distributed Systems Κωνσταντακοπούλου Τζένη.
Flexible Update Propagation for Weakly Consistent Replication Karin Petersen, Mike K. Spreitzer, Douglas B. Terry, Marvin M. Theimer and Alan J. Demers.
Computer Science Lecture 21, page 1 CS677: Distributed OS Today: Coda, xFS Case Study: Coda File System Brief overview of other recent file systems –xFS.
Department of Electrical Engineering
CSS490 Replication & Fault Tolerance
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Mutual Consistency Detection of mutual inconsistency in distributed systems (Parker, Popek, et. al.) Distributed system with replication for reliability.
G Robert Grimm New York University Bayou: A Weakly Connected Replicated Storage System.
Concurrency Control & Caching Consistency Issues and Survey Dingshan He November 18, 2002.
Jeff Chheng Jun Du.  Distributed file system  Designed for scalability, security, and high availability  Descendant of version 2 of Andrew File System.
Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System D. B. Terry, M. M. Theimer, K. Petersen, A. J. Demers, M. J. Spreitzer.
Mobility Presented by: Mohamed Elhawary. Mobility Distributed file systems increase availability Remote failures may cause serious troubles Server replication.
Client-Server Computing in Mobile Environments
File System. NET+OS 6 File System Architecture Design Goals File System Layer Design Storage Services Layer Design RAM Services Layer Design Flash Services.
70-294: MCSE Guide to Microsoft Windows Server 2003 Active Directory, Enhanced Chapter 7: Active Directory Replication.
Distributed Deadlocks and Transaction Recovery.
Distributed Systems Principles and Paradigms Chapter 10 Distributed File Systems 01 Introduction 02 Communication 03 Processes 04 Naming 05 Synchronization.
Mobility in Distributed Computing With Special Emphasis on Data Mobility.
CS Storage Systems Lecture 14 Consistency and Availability Tradeoffs.
Feb 7, 2001CSCI {4,6}900: Ubiquitous Computing1 Announcements Tomorrow’s class is officially cancelled. If you need someone to go over the reference implementation.
Bayou. References r The Case for Non-transparent Replication: Examples from Bayou Douglas B. Terry, Karin Petersen, Mike J. Spreitzer, and Marvin M. Theimer.
Replication for Mobile Computing Prasun Dewan Department of Computer Science University of North Carolina
Replication ( ) by Ramya Balakumar
Overview – Chapter 11 SQL 710 Overview of Replication
Chapter 12: SYSVOL: Old & New BAI617. Chapter Topics What is SysVol? Understanding File Replication System (FRS) Understanding 2008 R2 Distributed.
Serverless Network File Systems Overview by Joseph Thompson.
IM NTU Distributed Information Systems 2004 Replication Management -- 1 Replication Management Yih-Kuen Tsay Dept. of Information Management National Taiwan.
Mobile File System Byung Chul Tak. AFS  Andrew File System Distributed computing environment developed at CMU provides transparent access to remote shared.
CS425 / CSE424 / ECE428 — Distributed Systems — Fall 2011 Some material derived from slides by Prashant Shenoy (Umass) & courses.washington.edu/css434/students/Coda.ppt.
Distributed File Systems
Information/File Access and Sharing Coda: A Case Study J. Kistler, M. Satyanarayanan. Disconnected operation in the Coda File System. ACM Transaction on.
Write Conflicts in Optimistic Replication Problem: replicas may accept conflicting writes. How to detect/resolve the conflicts? client B client A replica.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
Consistency Guarantees Prasun Dewan Department of Computer Science University of North Carolina
Eventual Consistency Jinyang. Review: Sequential consistency Sequential consistency properties: –All read/write ops follow some total ordering –Read must.
Highly Available Services and Transactions with Replicated Data Jason Lenthe.
Computer Science Lecture 19, page 1 CS677: Distributed OS Last class: Distributed File Systems Issues in distributed file systems Sun’s Network File System.
THE EVOLUTION OF CODA M. Satyanarayanan Carnegie-Mellon University.
Mobility Victoria Krafft CS /25/05. General Idea People and their machines move around Machines want to share data Networks and machines fail Network.
Distributed Databases – Advanced Concepts Chapter 25 in Textbook.
Mobile File Systems.
Coda / AFS Thomas Brown Albert Ng.
Distributed File Systems
Eventual Consistency: Bayou
Chapter 25: Advanced Data Types and New Applications
Lecturer : Dr. Pavle Mogin
Example Replicated File Systems
Disconnected Operation in the Coda File System
EECS 498 Introduction to Distributed Systems Fall 2017
EECS 498 Introduction to Distributed Systems Fall 2017
Today: Coda, xFS Case Study: Coda File System
Chapter 10 Transaction Management and Concurrency Control
Outline The Case for Non-transparent Replication: Examples from Bayou Douglas B. Terry, Karin Petersen, Mike J. Spreitzer, and Marvin M. Theimer. IEEE.
Replica Placement Model: We consider objects (and don’t worry whether they contain just data or code, or both) Distinguish different processes: A process.
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
Gold Rush : Mobile Transaction Middleware with JAVA Object Replication
Last Class: Web Caching
System-Level Support CIS 640.
Presentation transcript:

Nomadic File Systems Uri Moszkowicz 05/02/02

Nomadic File Systems Overview Previous papers Coda Bayou Andrew File System Epidemic protocols

Coda General idea Ability to continue work when file servers are inaccessible through use of caching Transparent operation through optimistic conflict resolution

Coda

Coda Venus States Hoarding – Hoard data anticipating disconnect Emulation – Use of local cache Reintegration – Updating local cache

Coda Hoarding priorities Hoard walking Recent usage Hoard profiles – user specified Children and descendants may be included (ie directories) Hoard walking Since priority based on recent usage, every once in a while need to update file system to reflect priorities 10 min intervals chosen?

Coda Emulation Coda is faithful; all updates accepted At reintegration, validity is checked by servers All writes are logged to be replayed at reintegration Logs may use up all disk space; not gracefully handled (writes disabled)

Coda Reintegration Updates propagated to servers and vice versa (one volume at a time) 4 stages Log parsed, files locked Validation: conflict detection, security, integrity Fetching: shadow files are transferred (as needed) Commit: locks released and changes finalized Failures must be manually examined from logs

Coda Performance Reintegration time Disk size Likelihood of conflicts Significantly affected by hardware (~1990) and application

Bayou Anti-entropy Address Coda limitation Also Mobile replicas cannot reconcile amongst themselves in Coda Also Incremental progress (not volume) Efficient storage management Light-weight replica creation

Bayou Anti-entropy Simple anti-entropy protocol One-way operation between pairs Propagation of write operations Accept-order maintained Replicas can be ordered in any topology Amount of data propagated proportional to update activity at replicas, not size of data being replicated

Bayou Anti-entropy Basic operation Write log version vector sent from one replica to another Receiving server traverses write log and requests all writes it has not yet seen Write log Prune write log entries when stable ie all replicas have seen it Re-execute subsequent writes when learn of earlier write Each write log entry maintains a primary replica to track its stability or encode stability in messages

Bayou Anti-entropy

Bayou Anti-entropy Tuple Store Database (SQL) obtained by executing writes in order; cache for read requests Two views: full and committed Write Log - Ordered list of all writes ever received Undo Log - Facilitates reordering of write log “O” vector – for each vector timestamp of latest write that has been discarded (avoid re-accepting earlier writes because later one omitted) “C” vector – tracks committed writes “F” vector – tracks full writes “C”+”F” used to determine what to needs update

Bayou Anti-entropy Conflict Detection & Resolution Application specified and included in write Dependency checks (detection) write/write + read/write Merge procedures (resolution) Failures resolved later by user through application

Bayou Anti-entropy

Bayou Anti-entropy Consistency Servers can receive writes from other servers & clients, but writes are applied immediately when received Non-determinism must be removed since servers can be in different states Environment information (ie system clock) Exceeding resource limits

Bayou Anti-entropy Security Early version relied on central authentication server No servers? Possible invalid updates maintained on replicas until connected Later version relies on cryptography

Bayou Anti-entropy Protocol extensions Transportation of write log through portable media Causal order through logical clocks & total order of eventual consistency Light-weight server creation and deletion (not specified at volume creation time)

Bayou Anti-entropy Disadvantages Security Large network traffic during checkpoint synchronization Every replica must know about every other replica: complexity grows with size of network

Bayou Anti-entropy Policies When to reconcile Periodic, Manual, System Triggered Choosing which replicas to reconcile with Reachable, updatedness, primary status Deciding when to truncate write-log Available HD & RAM space, network usage Selecting server to create replicas Connection bandwidth, completeness of write-logs

Bayou Anti-entropy Performance ~1000 servers Hierarchy schemes to expand network size Size(tentative writes) = 10*size(committed writes) due to access control certificate Linear increase in anti-entropy time with number of replicas

Bayou Anti-entropy Examples Meeting room scheduler Users select several possible meeting times Upon connection, meeting times resolved based on chosen times or prompted for new time if irresolvable conflict Tentative meeting times exposed to other users

Bayou Anti-entropy Examples Bibliographic database Users enter bibliographic entries At resolution, dependency check can access database to see if two entries are the same (just typed differently), merge, or prompt user User may opt to be disconnected to avoid expensive cellular fees