Advanced Operating Systems - Spring 2009 Lecture 20 – Wednesday April 1 st, 2009 Dan C. Marinescu Office: HEC 439 B.

Slides:



Advertisements
Similar presentations
OSes: 15. Distributed File Systems 1 Operating Systems v Objectives –introduce issues such as naming, stateful and stateless, and replication Certificate.
Advertisements

Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 16 Distributed-File Systems Background Naming and Transparency Remote File.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Lecture 23: Distributed-File Systems (Chapter 17)
Module 17: Distributed-File Systems Background Naming and Transparency Remote File Access Stateful versus Stateless Service File Replication Example Systems.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 17: Distributed-File Systems.
Chapter 17: Distributed-File Systems Silberschatz, Galvin and Gagne ©2005 AE4B33OSS Chapter 17 Distributed-File Systems Background Naming and Transparency.
Chapter 17: Distributed-File Systems Adapted to COP4610 by Robert van Engelen.
1DT057 DISTRIBUTED INFORMATION SYSTEM DISTRIBUTED FILE SYSTEM 1.
Distributed File Systems CS 3100 Distributed File Systems1.
Background Distributed file system (DFS) – a distributed implementation of the classical time-sharing model of a file system, where multiple users share.
Chapter 11: File System Implementation. File-System Structure File-System Implementation Directory Implementation Allocation Methods Free-Space Management.
Distributed File Systems
Chapter 11: File System Implementation. 2 File System Implementation File System Implementation File-System Structure File-System Implementation Directory.
11.1 Silberschatz, Galvin and Gagne ©2009 Operating System Concepts with Java – 8 th Edition Chapter 11: File System Implementation.
Chapter 17 Distributed File Systems By: Amar Deo( ) Ankur Patel( )
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 11: File System Implementation.
Distributed File SystemsCS-502 Fall Distributed File Systems CS-502 Operating Systems Fall 2006 (Slides include materials from Operating System Concepts,
Distributed-File Systems
NFS. The Sun Network File System (NFS) An implementation and a specification of a software system for accessing remote files across LANs. The implementation.
University of Pennsylvania 11/21/00CSE 3801 Distributed File Systems CSE 380 Lecture Note 14 Insup Lee.
Chapter 17: Distributed-File Systems Part 1
Dr. Kalpakis CMSC 421, Operating Systems File System Implementation.
File Systems (2). Readings r Silbershatz et al: 11.8.
Distributed File Systems Concepts & Overview. Goals and Criteria Goal: present to a user a coherent, efficient, and manageable system for long-term data.
O/S 4740 Chapter 10 File-System Implementation. File system It gives us access to persistent data –Info will survive termination of your process –provides.
Distributed File Systems 1 CS502 Spring 2006 Distributed Files Systems CS-502 Operating Systems Spring 2006.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 AE4B33OSS Chapter 11: File System Implementation Chapter 11: File System.
Advanced Operating Systems - Spring 2009 Lecture 21 – Monday April 6 st, 2009 Dan C. Marinescu Office: HEC 439 B. Office.
Distributed File Systems
1 10/2/2015 Chapter 16 Distributed-File Systems 此章不作上课内容 l Background l Naming and Transparency l Remote File Access l Stateful versus Stateless Service.
1DT057 DISTRIBUTED INFORMATION SYSTEM DISTRIBUTED FILE SYSTEM 1.
Distributed File Systems Distributed file system (DFS) – a distributed implementation of the classical time-sharing model of a file system, where multiple.
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
Dr. M. Munlin Network and Distributed System Structures 1 NETE0516 Operating Systems Instructor: ผ. ศ. ดร. หมัดอามีน หมัน หลิน Faculty of Information.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition Chapter 11: File System Implementation.
XE33OSA Chapter 11: File System Implementation. 11.2XE33OSA Silberschatz, Galvin and Gagne ©2005 Chapter 11: File System Implementation Chapter 11: File.
12.1 Silberschatz, Galvin and Gagne ©2003 Operating System Concepts with Java Chapter 12: File System Implementation Chapter 12: File System Implementation.
Ridge Xu 12.1 Operating System Concepts Chapter 12: File System Implementation File System Structure File System Implementation Directory Implementation.
ITEC 502 컴퓨터 시스템 및 실습 Chapter 11-2: File System Implementation Mi-Jung Choi DPNM Lab. Dept. of CSE, POSTECH.
Chapter 11: Implementing File Systems Silberschatz, Galvin and Gagne ©2005 Operating System Principles Chapter 11: Implementing File Systems Chapter.
Chapter 11: File System Implementation. 11.2Operating System – CSCI 380 Chapter 11: File System Implementation Chapter 11: File System Implementation.
Operating System Concepts 7th Edition Abraham SilBerschatz Peter Baer Galvin Greg Gagne Prerequisite: CSE212.
Chapter 11: Implementing File Systems Silberschatz, Galvin and Gagne ©2005 Operating System Principles Chapter 11: Implementing File Systems Chapter.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts Chapter 11: File System Implementation Chapter.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts Chapter 11: File System Implementation Chapter.
CS307 Operating Systems Distributed-File Systems Guihai Chen Department of Computer Science and Engineering Shanghai Jiao Tong University Fall 2011.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 1, 2005 File-System Structure.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 1, 2005 Chapter 11: File.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts Chapter 11: File System Implementation Chapter.
Silberschatz, Galvin and Gagne  Operating System Concepts Distributed File Systems Distributed file system (DFS) – a distributed implementation.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 11: File System Implementation.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 1, 2005 Chapter 11: File.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 1, 2005 Chapter 11: File.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 11: Implementing File-Systems.
CS307 Operating Systems Distributed-File Systems Guihai Chen Department of Computer Science and Engineering Shanghai Jiao Tong University Spring 2012.
Chapter 11: File System Implementation Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 1, 2005 Outline n File-System.
Silberschatz, Galvin and Gagne  Operating System Concepts Distributed File Systems Distributed file system (DFS) – a distributed implementation.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 16 Distributed-File Systems Background Naming and Transparency Remote File.
CSE Operating System Principles File System Interface.
Chapter 12: File System Implementation
Chapter 17: Distributed-File Systems
Chapter 11: File System Implementation
File System Implementation
Chapter 17: Distributed-File Systems
Chapter 15: File System Internals
Chapter 11: File System Implementation
Chapter 15: File System Internals
Chapter 15: File System Internals
Chapter 17: Distributed-File Systems
Presentation transcript:

Advanced Operating Systems - Spring 2009 Lecture 20 – Wednesday April 1 st, 2009 Dan C. Marinescu Office: HEC 439 B. Office hours: M, Wd 3 – 4:30 PM. TA: Chen Yu Office: HEC 354. Office hours: M, Wd 1.00 – 3:00 PM. 1

Last, Current, Next Lecture Last time: File System Implementation Today NFS Distributed File System Next time: Introduction to Distributed Systems 2

The Sun Network File System (NFS) An implementation and a specification of a software system for accessing remote files across LANs (or WANs) for Solaris and SunOS. Set of independent machines with independent file systems, which allows sharing among these file systems in a transparent manner A remote directory is mounted over a local file system directory. The mounted directory looks like an integral subtree of the local file system, replacing the subtree descending from the local directory Specification of the remote directory for the mount operation is nontransparent; the host name of the remote directory has to be provided. Subject to access-rights accreditation, potentially any file system (or directory within a file system), can be mounted remotely on top of any local directory NFS is designed to operate in a heterogeneous environment.(different machines, operating systems, and network architectures; the NFS specifications independent of these media. Based upon UDP/IP protocol and Ethernet 3

NFS Implementation Independence is achieved through the use of RPC primitives built on top of an External Data Representation (XDR) protocol used between two implementation-independent interfaces The NFS specification distinguishes between the services provided by a mount mechanism and the actual remote-file-access services 4

Mounting in NFS Mounts Cascading mounts 5

NFS Mount Protocol User’s view and does not affect the server side Establishes initial logical connection between server and client Mount operation includes name of remote directory to be mounted and name of server machine storing it Mount request is mapped to corresponding RPC and forwarded to mount server running on server machine Export list – specifies local file systems that server exports for mounting, along with names of machines that are permitted to mount them Following a mount request that conforms to its export list, the server returns a file handle—a key for further accesses File handle – a file-system identifier, and an inode number to identify the mounted directory within the exported file system The mount operation changes only the u 6

NFS Protocol Provides a set of remote procedure calls for remote file operations. The procedures support the following operations: searching for a file within a directory reading a set of directory entries manipulating links and directories accessing file attributes reading and writing files NFS servers are stateless; each request has to provide a full set of arguments (NFS V4 is just coming available – very different, stateful) Modified data must be committed to the server’s disk before results are returned to the client (lose advantages of caching) Does not provide concurrency-control mechanisms 7

NFS Architecture UNIX file-system interface (based on open, read, write, and close calls, and file descriptors Virtual File System (VFS) layer – distinguishes local files from remote ones, and local files are further distinguished according to their file-system types The VFS activates file-system- specific operations to handle local requests according to their file- system types Calls the NFS protocol procedures for remote request NFS service layer – bottom layer of the architecture. Implements the NFS protocol 8

NFS Path-Name Translation Performed by breaking the path into component names and performing a separate NFS lookup call for every pair of component name and directory vnode To make lookup faster, a directory name lookup cache on the client’s side holds the vnodes for remote directory names 9

NFS Remote Operations Nearly one-to-one correspondence between regular UNIX system calls and the NFS protocol RPCs (except opening and closing files) NFS adheres to the remote-service paradigm, but employs buffering and caching techniques for the sake of performance File-blocks cache – when a file is opened, the kernel checks with the remote server whether to fetch or revalidate the cached attributes Cached file blocks are used only if the corresponding cached attributes are up to date File-attribute cache – the attribute cache is updated whenever new attributes arrive from the server Clients do not free delayed-write blocks until the server confirms that the data have been written to disk. 10

Service-oriented systems Service  a particular type of function useful to a priori unknown users. The service is provided by software running on one or more system. Server  system providing a service. Server process  the process implementing the software function provided by the server Client  user of a service. Client process  process that can invoke a service. Client interface  set of operations that allowing a client process to communicate with a server process. A client interface for a file service is formed by a set of primitive file operations (create, delete, read, write) The client interface of a DFS should be transparent  not distinguish between local and remote files

Distributed file system Service-oriented system for storage management. Distributed file system (DFS)  a distributed implementation of the classical time-sharing model of a file system, where multiple users share files and storage resources A DFS manages set of dispersed storage devices. The storage space managed by a DFS is composed of different, remotely located, smaller storage spaces Correspondence between constituent storage spaces and sets of files Issues Naming and Transparency Remote File Access Stateful versus Stateless Service File Replication An Example: AFS

Naming, replication, and transparency Naming  mapping between logical and physical objects Replication  A file is replicated in several sites to For fault-tolerance Reduce the access time and the contention. Transparency  DFS hides the network location of the file. Location transparency  file name does not reveal the file’s physical storage location Location independence  file name does not need to be changed when the file’s physical storage location changes Multilevel mapping  abstraction of a file that hides the details of how and where on the disk the file is actually stored. The mapping returns a set of the locations of this file’s replicas; both the existence of multiple copies and their location are hidden

Naming schemes Files name: concatenation of the host name and local path name guarantees a unique systemwide name file migration difficult Attach remote directories to local directories gives the appearance of a coherent directory tree; only previously mounted remote directories can be accessed transparently Total integration of the component file systems A single global name structure spans all the files in the system If a server is unavailable, some arbitrary set of directories on different machines also becomes unavailable

Remote file access Remote-service mechanism  access the remote file when needed File caching  Reduce network traffic by retaining recently accessed disk blocks in a cache, so that repeated accesses to the same information can be handled locally If needed data not already cached, a copy of data is brought from the server to the user Accesses are performed on the cached copy Files identified with one master copy residing at the server machine, but copies of (parts of) the file are scattered in different caches Cache-consistency problem  keep the cached copies consistent with the master file Could be called network virtual memory

Caching where? on disk or main memory Disk caches More reliable Cached data kept on disk are still there during recovery and don’t need to be fetched again Main-memory caches: Permit workstations to be diskless Data can be accessed more quickly Performance speedup in bigger memories Server caches (used to speed up disk I/O) are in main memory regardless of where user caches are located; using main-memory caches on the user machine permits a single caching mechanism for servers and users

Cache update policy Write-through  write data through to disk as soon as they are placed on any cache. Reliable, but poor performance Delayed-write  modifications written to the cache and then written through to the server later Write accesses complete quickly; some data may be overwritten before they are written back, and so need never be written at all Poor reliability; unwritten data will be lost whenever a user machine crashes Variation – scan cache at regular intervals and flush blocks that have been modified since the last scan Write-on-close  write data back to the server when the file is closed. Best for files that are open for long periods and frequently modified

CacheFS CacheFS  software technologies designed to speed up network file sysyem file access. Copies of files are cached on a local disk. Used on several Unix-like operating systems. Developed by Sun in Linux version in 2003.

Consistency Is locally cached copy of the data consistent with the master copy? Client-initiated approach Client initiates a validity check Server checks whether the local data are consistent with the master copy Server-initiated approach Server records, for each client, the (parts of) files it caches When server detects a potential inconsistency, it must react

Caching vs. remote file access Caching many file accesses handled efficiently by the local cache servers are contracted only occasionally reduces server load and network traffic enhances potential for scalability Remote access every remote access across the network; penalty in network traffic, server load, and performance network overhead in transmitting big chunks of data for caching is lower than a series of responses to specific requests in remote-access

Caching vs. remote file access (cont’d) Caching is superior in access patterns with infrequent writes With frequent writes, substantial overhead incurred to overcome cache-consistency problem Benefit from caching when execution carried out on machines with either local disks or large main memories Remote access on diskless, small-memory-capacity machines should be done through remote-service method In caching, the lower intermachine interface is different form the upper user interface In remote-service, the intermachine interface mirrors the local user-file- system interface

Stateful file server Mechanism Client opens a file Server fetches information about the file from its disk, stores it in its memory, and gives the client a connection identifier unique to the client and the open file Identifier is used for subsequent accesses until the session ends Server must reclaim the main-memory space used by clients who are no longer active Increased performance Fewer disk accesses Stateful server knows if a file was opened for sequential access and can thus read ahead the next blocks Potential problems The session requires state information at both client and server Potential of inconsistent state of client and server

Stateless file server Avoids state information by making each request self- contained Each request identifies the file and position in the file No need to establish and terminate a connection by open and close operations

Recovery from failures A stateful server loses all its volatile state in a crash Restore state by recovery protocol based on a dialog with clients, or abort operations that were underway when the crash occurred Server needs to be aware of client failures in order to reclaim space allocated to record the state of crashed client processes (orphan detection and elimination) Stateless server  the effects of server failures and recovery are almost unnoticeable for the client. A newly reincarnated server can respond to a self-contained request without any difficulty

Stateless vs. stateful servers Stateless service: longer request messages slower request processing additional constraints imposed on DFS design Some environments require stateful service A server employing server-initiated cache validation cannot provide stateless service, since it maintains a record of which files are cached by which clients UNIX use of file descriptors and implicit offsets is inherently stateful; servers must maintain tables to map the file descriptors to inodes, and store the current offset within a file