Cloud Computing CS Distributed File Systems and Cloud Storage – Part I

Slides:

Advertisements

Similar presentations

Distributed components

Advertisements

Distributed File Systems Chapter 11

Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.

Distributed Systems CS Distributed File Systems- Part I Lecture 19, Nov 14, 2011 Majd F. Sakr, Mohammad Hammoud andVinay Kolar 1.

P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.

Wide-area cooperative storage with CFS

PRASHANTHI NARAYAN NETTEM.

NFS. The Sun Network File System (NFS) An implementation and a specification of a software system for accessing remote files across LANs. The implementation.

Lecture 8 Epidemic communication, Server implementation.

Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.

Distributed File Systems

Distributed File Systems Concepts & Overview. Goals and Criteria Goal: present to a user a coherent, efficient, and manageable system for long-term data.

1 The Google File System Reporter: You-Wei Zhang.

CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.

Distributed Systems. Interprocess Communication (IPC) Processes are either independent or cooperating – Threads provide a gray area – Cooperating processes.

Chapter 20 Distributed File Systems Copyright © 2008.

Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.

Database Applications (15-415) Part II- Hadoop Lecture 26, April 21, 2015 Mohammad Hammoud.

Distributed File Systems Architecture – 11.1 Processes – 11.2 Communication – 11.3 Naming – 11.4.

Distributed File Systems Architecture – 11.1 Processes – 11.2 Communication – 11.3 Naming – 11.4.

Seminar on Service Oriented Architecture Distributed Systems Architectural Models From Coulouris, 5 th Ed. SOA Seminar Coulouris 5Ed.1.

Distributed Systems CS Distributed File Systems- Part I Lecture 20, Nov 28, 2012 Majd F. Sakr and Mohammad Hammoud 1.

Distributed Systems CS Distributed File Systems Lecture 22, Dec 2, 2015 Mohammad Hammoud 1.

Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.

Distributed Systems: Distributed File Systems Ghada Ahmed, PhD. Assistant Prof., Computer Science Dept. Web:

Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.

An Introduction to GPFS

DISTRIBUTED FILE SYSTEM- ENHANCEMENT AND FURTHER DEVELOPMENT BY:- PALLAWI(10BIT0033)

Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung

Last Class: Introduction

Memory Management.

Distributed File Systems

Distributed File Systems

Cloud Computing CS Distributed File Systems and Cloud Storage – Part II Lecture 13, Feb 27, 2012 Majd F. Sakr, Mohammad Hammoud and Suhail Rehman.

File System Implementation

CHAPTER 3 Architectures for Distributed Systems

Google File System CSE 454 From paper by Ghemawat, Gobioff & Leung.

Distributed Systems CS

Introduction to Networks

A Survey on Distributed File Systems

Database Applications (15-415) Hadoop Lecture 26, April 19, 2016

Distributed Systems CS

NFS and AFS Adapted from slides by Ed Lazowska, Hank Levy, Andrea and Remzi Arpaci-Dussea, Michael Swift.

Ch 11 Distributed File System

Distributed Systems CS

Advanced Operating Systems Chapter 11 Distributed File systems 11

Chapter 15: File System Internals

Today: Coda, xFS Case Study: Coda File System

Multiple Processor Systems

CSE 451: Operating Systems Winter Module 22 Distributed File Systems

Distributed File Systems

DISTRIBUTED FILE SYSTEMS

Distributed File Systems

CSE 451: Operating Systems Spring Module 21 Distributed File Systems

Distributed File Systems

Distributed Systems CS

Multiple Processor and Distributed Systems

CSE 451: Operating Systems Winter Module 22 Distributed File Systems

Chapter 15: File System Internals

Today: Distributed File Systems

DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S

THE GOOGLE FILE SYSTEM.

Introduction To Distributed Systems

Database System Architectures

Distributed File Systems

Chapter 15: File System Internals

Distributed Systems (15-440)

Lecture 4: File-System Interface

Distributed File Systems

Presentation transcript:

Cloud Computing CS 15-319 Distributed File Systems and Cloud Storage – Part I Lecture 12, Feb 22, 2012 Majd F. Sakr, Mohammad Hammoud and Suhail Rehman

Today… Last two sessions Pregel, Dryad and GraphLab Today’s session Distributed File Systems- Part I Announcement: Project update is due today

Discussion on Distributed File Systems Distributed File Systems (DFSs) Basics DFS Aspects

Distributed File Systems Why File Systems? To organize data (as files) To provide a means for applications to store, access, and modify data Why Distributed File Systems? Big data continues to grow In contrary to a local file system, a distributed file system (DFS) can hold big data and provide access to this data to many clients distributed across a network

File Server (Providing NAS) NAS versus SAN Another term for DFS is network attached storage (NAS), referring to attaching storage to network servers that provide file systems A similar sounding term that refers to a very different approach is storage area network (SAN) SAN makes storage devices (not file systems) available over a network SAN can typically: Provide extremely high data throughput and is usually implemented with Fibre Channel storage Be costly. Each node connected to the SAN must have a Fibre Channel host bus adapter (HBA) to connect to the Fibre channel network Client Computer File Server (Providing NAS) Client Computer LAN SAN Client Computer Database Server Client Computer

Benefits of DFSs DFSs provide: File sharing over a network: without a DFS, we would have to exchange files by e-mail or use applications such as the Internet’s FTP Transparent files accesses: A user’s programs can access remote files as if they are local. The remote files have no special APIs; they are accessed just like local ones Easy file management: managing a DFS is easier than managing multiple local file systems

DFS Components DFS information can be typically categorized as follows: The data state: This is the contents of files The attribute state (meta data): This is the information about each file (e.g., file’s size and access control list) The open-file state: This includes which files are open or otherwise in use, as well as describing how files are locked Designing a DFS entails determining how its various components are placed. Specifically, by component placement we indicate: What resides on the servers What resides on the clients

DFS Component Placement (1) The data and the attribute states permanently reside on the server’s local file system, but recently accessed or modified information might reside in server and/or client caches The open-file state is transitory; it changes as processes open and close files Data Cache Data Cache Local File System Attribute Cache Network Attribute Cache Open-File State Open-File State Client Server

DFS Component Placement (2) Three basic concerns govern the DFS components placement strategy: Access speed: Caching information on clients improves performance considerably Consistency: If clients cache information, do all parties share the same view of it? Recovery: If one or more computers crash, to what extent are the others affected? How much information is lost?

Discussion on Distributed File Systems Distributed File Systems (DFSs) Basics Basics DFS Aspects

DFS Aspects Aspect Description Aspect Description Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs?

DFS Aspects Aspect Description Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs?

Architectures Client-Server Distributed File Systems Cluster-Based Distributed File Systems Symmetric Distributed File Systems

Network File System Many distributed file systems are organized along the lines of client-server architectures Sun Microsystem’s Network File System (NFS) is one of the most widely-deployed DFSs for Unix-based systems NFS comes with a protocol that describes precisely how a client can access a file stored on a (remote) NFS file server NFS allows a heterogeneous collection of processes, possibly running on different OSs and machines, to share a common file system

Remote Access Model The model underlying NFS and similar systems is that of remote access model In this model, clients: Are offered transparent access to a file system that is managed by a remote server Are normally unaware of the actual location of files Are offered an interface to a file system similar to the interface offered by a conventional local file system Replies from server Server Client File Requests from client to access remote file File stays on server

Upload/Download Model A contrary model, referred to as upload/download model, allows a client to access a file locally after having downloaded it from the server The Internet’s FTP service can be used this way when a client downloads a complete file, modifies it, and then puts it back File moved to client Server Client File File New File All accesses are done at the client side When client is done, file is returned to the server

The Basic NFS Architecture Client Server System call layer System call layer Virtual File System (VFS) layer Virtual File System (VFS) layer Local file system interface NFS client NFS server Local file system interface Virtually, all modern OSs provide VFS The whole idea of VFS is to hide the differences between various file systems NFS client takes care of handling access to files stored at a remote server NFS is largely independent of local file systems RPC client stub RPC server stub A Client Request in NFS Network

Architectures Client-Server Distributed File Systems Cluster-Based Distributed File Systems Symmetric Distributed File Systems

Data-Intensive Applications Today there is a deluge of large data-intensive applications Most data-intensive applications fall into one of two styles of computing: Internet services (or cloud computing) High-performance computing (HPC) Cloud computing and HPC applications run typically on thousands of compute nodes and handle big data Visualization of entropy in Terascale Supernova Initiative application. Image from Kwan-Liu Ma’s visualization team at UC Davis

Cluster-Based Distributed File Systems The underlying cluster-based file system is a key component for providing scalable data-intensive application performance The cluster-based file system divides and distributes big data, using file striping techniques, for allowing concurrent data accesses The cluster-based file system could be either a cloud computing or an HPC oriented distributed file system Google File System (GFS) and S3 are examples of cloud computing DFSs Parallel Virtual File System (PVFS) and IBM’s General Parallel File System (GPFS) are examples of HPC DFSs

File Striping Techniques Server clusters are often used for parallel applications and their associated file systems are adjusted to satisfy their requirements One well-known technique is to deploy file-striping techniques, by which a single file is distributed across multiple servers Hence, it becomes possible to fetch different parts in parallel Accessing file parts in parallel a b d e a a c e c b b d c d e

Round-Robin Distribution (1) How to stripe a file over multiple machines? Round-Robin is typically a reasonable default solution Logical File 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Stripe Size Striping Unit Server 1 Server 2 Server 3 Server 4 4 8 12 1 5 9 13 2 6 10 14 3 7 11 15

Round-Robin Distribution (2) Clients perform writes/reads of file at various regions Client I: 512K write, offset 0 Client II: 512K write, offset 512 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 4 1 5 2 6 3 7 8 12 9 13 10 14 11 15 Server 1 Server 2 Server 3 Server 4 4 8 12 1 5 9 13 2 6 10 14 3 7 11 15

2D Round-Robin Distribution (1) What happens when we have many servers (say 100s)? 2D distribution can help Logical File 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Stripe Size Striping Unit Server 1 Server 2 Server 3 Server 4 2 4 6 1 3 5 7 8 10 12 14 9 11 13 15 Group Size = 2

2D Round-Robin Distribution (2) 2D distribution can limit the number of servers per client Client I: 512K write, offset 0 Client II: 512K write, offset 512 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 4 1 5 2 6 3 7 8 12 9 13 10 14 11 15 Server 1 Server 2 Server 3 Server 4 2 4 6 1 3 5 7 8 10 12 14 9 11 13 15 Group Size = 2

General Purpose Applications For general-purpose data-intensive applications, or those with irregular or many different types of data structures, file striping may not be effective In those cases, it is often more convenient to partition the file system as a whole and simply store files on different servers File block of file a a b c d e a c e b d a c e b d

GFS Data Distribution Policy The Google File System (GFS) is a cloud-computing-based scalable DFS for large distributed data-intensive applications GFS divides large files into multiple pieces called chunks or blocks (by default 64MB) and stores them on different data servers This design is referred to as block-based design Each GFS chunk has a unique 64-bit identifier and is stored as a file in the lower-layer local file system on the data server GFS distributes chunks across cluster data servers using a random distribution policy

GFS Random Distribution Policy Server 0 (Writer) Server 1 Server 2 Server 3 0M Blk 0 Blk 0 Blk 1 Blk 0 64M Blk 1 Blk 2 Blk 2 Blk 1 128M Blk 2 Blk 3 Blk 4 Blk 4 Blk 3 Blk 3 Blk 6 192M 256M Blk 4 Blk 5 Blk 5 Blk 5 320M Blk 6 Blk 6 384M Load Imbalance

GFS Architecture The storage and compute capabilities of a cluster are organized in two ways: Co-locate storage and compute in the same node Separate storage nodes from compute nodes GFS File name, chunk index GFS client Master Contact address The master keeps track of where a chunk is located The GFS master is contacted for metadata information The GFS master maintains a name space, along with a mapping from file name to chunks The chunk servers keep an account of what they have stored Chunks are replicated (server-side- No data caching; but it does cache metadata) to handle failures (No RAID, No SAN) Replicas are updated serially Chunk Id, range Chunk Server Chunk Server Chunk Server Chunk data Linux File System Linux File System Linux File System

PVFS Data Distribution Policy Parallel Virtual File System (PVFS) is an HPC-based scalable DFS for large distributed data-intensive applications PVFS divides large files into multiple pieces called stripe units (by default 64KB) and stores them on different data servers This design is referred to as object-based design Unlike the block-based design of GFS, PVFS stores an object (or a handle) as a file that includes all the stripe units at a data server PVFS distributes stripe units across cluster data servers using a round robin policy

PVFS Round-Robin Distribution Policy Server 0 (Writer) Server 1 Server 2 Server 3 0M Blk 0 Blk 0 Blk 0 Blk 1 64M Blk 1 Blk 1 Blk 2 Blk 2 128M Blk 2 Blk 3 Blk 3 Blk 3 192M Blk 4 Blk 4 Blk 4 Blk 5 256M Blk 5 Blk 5 Blk 6 Blk 6 320M Blk 6 384M Load Balance

PVFS Architecture The storage and compute capabilities of a cluster are organized in two ways: Co-locate storage and compute in the same node Separate storage nodes from compute nodes PVFS Metadata Manager Network Compute Nodes I/O Nodes

Architectures Client-Server Distributed File Systems Cluster-Based Distributed File Systems Symmetric Distributed File Systems

Ivy Fully symmetric organizations that are based on peer-to-peer technology also exist All current proposals use a DHT-based system for distributing data, combined with a key-based lookup mechanism As an example, Ivy is a distributed file system that is built using a Chord DHT-based system Data storage in Ivy is realized by a block-oriented (i.e., blocks are distributed over storage nodes) distributed storage called DHash DHT is a class of a decentralized distributed system that provides a lookup service similar to a hash table DHT-based systems do not consider network proximity at all

Ivy Architecture Ivy consists of 3 separate layers: Ivy Ivy Ivy DHash Node where a file system is rooted File System Layer Ivy Ivy Ivy Block-Oriented Storage DHash DHash DHash DHT Layer Chord Chord Chord

DFS Aspects Aspect Description Aspect Description Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs?

Processes (1) Cooperating processes in DFSs are usually the storage servers and file manager(s) The most important aspect concerning DFS processes is whether they should be stateless or stateful Stateless Approach: Does not require that servers maintain any client state When a server crashes, there is no need to enter a recovery phase to bring the server to a previous state Locking a file cannot be easily done E.g., NFSv3 and PVFS (no client-side caching)

Processes (2) Stateful Approach: Requires that a server maintains some client state Clients can make effective use of caches but this would entail an efficient underlying cache consistency protocol Provides a server with the ability to support callbacks (i.e., the ability to do RPC to a client) in order to keep track of its clients E.g., NFSv4 and HDFS With a stateful approach clients can make effective use of caches. This requires an efficient cache consistency protocol. Such protocols often work best in collaboration with a server that maintains some information on files as used by its clients. For example, a server may associate a lease with each file it hands out to a client, promising to give the client exclusive read and write access until the lease expires or is refreshed.

DFS Aspects Aspect Description Aspect Description Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs?

Communication Communication in DFSs is mainly based on remote procedure calls (RPCs) The main reason for choosing RPC is to make the system independent from underlying OSs, networks, and transport protocols In NFS, all communication between a client and server proceeds along the Open Network Computing RPC (ONC RPC) GFS uses RPC and may break a read into multiple RPCs to increase parallelism PVFS currently uses TCP for all its internal communication

RPCs in NFS Client Server Lookup Up until NFSv4, the client was made responsible for making the server’s life as easy as possible by keeping requests simple The drawback becomes apparent when considering the use of NFS in a wide-area system In that case, the extra latency of a second RPC leads to performance degradation To circumvent such a problem, NFSv4 supports compound procedures Lookup name Read Time Read file data Client Server Lookup Open Read Lookup name Open file Time Read file data

Next Class: DFS Aspects Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs? Aspect Description Architecture How are DFSs generally organized? Processes Who are the cooperating processes? Are processes stateful or stateless? Communication What is the typical communication paradigm followed by DFSs? How do processes in DFSs communicate? Naming How is naming often handled in DFSs? Synchronization What are the file sharing semantics adopted by DFSs? Consistency and Replication What are the various features of client-side caching as well as server-side replication? Fault Tolerance How is fault tolerance handled in DFSs?