Download presentation
Presentation is loading. Please wait.
Published byLindsay Mitchell Modified over 9 years ago
1
Ch 11 Distributed File System Ch11.1 Architecture Lei Zhang Oct 26 2015
2
Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based
3
Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based
4
Intro to NFS 2 ways of C-S Arch Naive way. RPC
5
Intro to NFS- basics Although implemented by SUN Solaris, it is the predominant FS implementation on Unix System Layered Structure
6
NFS API Interfaces
7
Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based
8
Cluster-Based Distributed File Systems Downsides of a C-S Arch Performance bottle neck Single-Point-Failure Solution: Files(resources) can be stored on a few servers A big file across multi servers File Stripping for big structured files Many files on different servers Most files are not well structured
9
Cluster-Based Distributed File Systems How to support file access in a Data Center? Files permanently growing File size might be multi gigabytes. A server might be malfunction File access request from any client should be responded in any condition
10
Cluster-Based Distributed File Systems
11
GFS, how does it work? A cluster has a master node, which ONLY keeps meta information of files A big file is splited into CHUNKS, a CHUNK of size 64Mbs. Chunks are spread on many chunk servers More details on GFS Chunks are replicated --- Redundancy Master does not keep up-to-date of chunk locations A Chunks server knows what exactly it stores. If client retrieval failed(low probability), ask Master again, master update latest info from chunk servers
12
Cluster-Based Distributed File Systems GFS, how does it work? File update. Client pushes back updated file chunk to corresponding chunk server Chunk server conducts the backup/replication Master node is kept out of this loop, bottle neck problem is solved I/O performance of a GFS is pretty good and scalability is good as well
13
Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based
14
Symmetric Arch Peer-to-Peer No Client, No server, No Master, No Chunk First realization is Ivy
15
Symmetric Arch
16
Ivy details Data storage. File composed of 8kb data blocks. Content-hash data blocks Public-key based blocks Replication Every block B is stored on K immediate successors, better availability
17
Future Developments Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based
18
Reference 1.Ghemawat, Sanjay, Howard Gobioff, and Shun-Tak Leung. "The Google file system." ACM SIGOPS operating systems review. Vol. 37. No. 5. ACM, 2003. 2.Sandberg, Russel, et al. "Design and implementation of the Sun network filesystem." Proceedings of the Summer USENIX conference. 1985. 3.Muthitacharoen, Athicha, et al. "Ivy: A read/write peer-to-peer file system." ACM SIGOPS Operating Systems Review 36.SI (2002): 31-44. 4.Naor, Moni, and Udi Wieder. "A simple fault tolerant distributed hash table."Peer-to-Peer Systems II. Springer Berlin Heidelberg, 2003. 88-97. 5.Cai, Min, Ann Chervenak, and Martin Frank. "A peer-to-peer replica location service based on a distributed hash table." Proceedings of the 2004 ACM/IEEE conference on Supercomputing. IEEE Computer Society, 2004. 6.Kleiman, Steve R. "Vnodes: An Architecture for Multiple File System Types in Sun UNIX." USENIX Summer. Vol. 86. 1986.
19
Thank you Q&A
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.