Ch 11 Distributed File System Ch11.1 Architecture Lei Zhang Oct 26 2015.

Slides:



Advertisements
Similar presentations
Peer-to-Peer (P2P) Distributed Storage 1Dennis Kafura – CS5204 – Operating Systems.
Advertisements

Sanjay Ghemawat, Howard Gobioff and Shun-Tak Leung
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google Jaehyun Han 1.
The Google File System Authors : Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung Presentation by: Vijay Kumar Chalasani 1CS5204 – Operating Systems.
CS-550: Distributed File Systems [SiS]1 Resource Management in Distributed Systems: Distributed File Systems.
The Google File System (GFS). Introduction Special Assumptions Consistency Model System Design System Interactions Fault Tolerance (Results)
Ken Birman. Distributed File Systems Goal: view a distributed system as a file system Storage is distributed Web tries to make world a collection of hyperlinked.
Distributed File Systems Chapter 11
Lecture 6 – Google File System (GFS) CSE 490h – Introduction to Distributed Computing, Winter 2008 Except as otherwise noted, the content of this presentation.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Distributed Systems CS Distributed File Systems- Part I Lecture 19, Nov 14, 2011 Majd F. Sakr, Mohammad Hammoud andVinay Kolar 1.
Structure Overlay Networks and Chord Presentation by Todd Gardner Figures from: Ion Stoica, Robert Morris, David Liben- Nowell, David R. Karger, M. Frans.
Implementation of Simple Cloud-based Distributed File System Group ID: 4 Baolin Wu, Liushan Yang, Pengyu Ji.
Wide-area cooperative storage with CFS
Northwestern University 2007 Winter – EECS 443 Advanced Operating Systems The Google File System S. Ghemawat, H. Gobioff and S-T. Leung, The Google File.
Inexpensive Scalable Information Access Many Internet applications need to access data for millions of concurrent users Relational DBMS technology cannot.
Distributed File Systems
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google∗
1 The Google File System Reporter: You-Wei Zhang.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
Distributed Systems Concepts and Design Chapter 10: Peer-to-Peer Systems Bruce Hammer, Steve Wallis, Raymond Ho.
Map Reduce for data-intensive computing (Some of the content is adapted from the original authors’ talk at OSDI 04)
Latest Relevant Techniques and Applications for Distributed File Systems Ela Sharda
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
Apache Cassandra - Distributed Database Management System Presented by Jayesh Kawli.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED.
The Google File System Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung
Large Scale Machine Translation Architectures Qin Gao.
MapReduce and GFS. Introduction r To understand Google’s file system let us look at the sort of processing that needs to be done r We will look at MapReduce.
Presenters: Rezan Amiri Sahar Delroshan
Agile Survivable Store PIs: Mustaque Ahamad, Douglas M. Blough, Wenke Lee and H.Venkateswaran PhD Students: Prahlad Fogla, Lei Kong, Subbu Lakshmanan,
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Eduardo Gutarra Velez. Outline Distributed Filesystems Motivation Google Filesystem Architecture The Metadata Consistency Model File Mutation.
GFS. Google r Servers are a mix of commodity machines and machines specifically designed for Google m Not necessarily the fastest m Purchases are based.
Distributed File Systems Architecture – 11.1 Processes – 11.2 Communication – 11.3 Naming – 11.4.
Sun Network File System Presentation 3 Group A4 Sean Hudson, Syeda Taib, Manasi Kapadia.
1 JTE HPC/FS Pastis: a peer-to-peer file system for persistant large-scale storage Jean-Michel Busca Fabio Picconi Pierre Sens LIP6, Université Paris 6.
Distributed systems [Fall 2015] G Lec 1: Course Introduction.
Distributed File Systems Architecture – 11.1 Processes – 11.2 Communication – 11.3 Naming – 11.4.
Toward Efficient and Simplified Distributed Data Intensive Computing IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 22, NO. 6, JUNE 2011PPT.
Eduardo Gutarra Velez. Outline Distributed Filesystems Motivation Google Filesystem Architecture Chunkservers Master Consistency Model File Mutation Garbage.
Distributed Systems CS Distributed File Systems- Part I Lecture 20, Nov 28, 2012 Majd F. Sakr and Mohammad Hammoud 1.
Distributed File Systems Questions answered in this lecture: Why are distributed file systems useful? What is difficult about distributed file systems?
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Lecture 24: GFS.
Distributed Systems CS Distributed File Systems Lecture 22, Dec 2, 2015 Mohammad Hammoud 1.
Implementation of Simple Cloud-based Distributed File System Group ID: 4 Baolin Wu, Liushan Yang, Pengyu Ji.
Peer-to-Peer (P2P) File Systems. P2P File Systems CS 5204 – Fall, Peer-to-Peer Systems Definition: “Peer-to-peer systems can be characterized as.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
Distributed Systems: Distributed File Systems Ghada Ahmed, PhD. Assistant Prof., Computer Science Dept. Web:
The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Presenter: Chao-Han Tsai (Some slides adapted from the Google’s series lectures)
Parallel Virtual File System (PVFS) a.k.a. OrangeFS
Data Management with Google File System Pramod Bhatotia wp. mpi-sws
Cloud Computing CS Distributed File Systems and Cloud Storage – Part I
Google File System CSE 454 From paper by Ghemawat, Gobioff & Leung.
The Google File System Sanjay Ghemawat, Howard Gobioff and Shun-Tak Leung Google Presented by Jiamin Huang EECS 582 – W16.
Sajitha Naduvil-vadukootu
Ch 11 Distributed File System
A Redundant Global Storage Architecture
CSE 451: Operating Systems Winter Module 22 Distributed File Systems
Distributed File Systems
Distributed File Systems
CSE 451: Operating Systems Spring Module 21 Distributed File Systems
Cloud scale storage: The Google File system
Distributed File Systems
CSE 451: Operating Systems Winter Module 22 Distributed File Systems
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Distributed File Systems
Distributed File Systems
Presentation transcript:

Ch 11 Distributed File System Ch11.1 Architecture Lei Zhang Oct

Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based

Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based

Intro to NFS 2 ways of C-S Arch Naive way. RPC

Intro to NFS- basics Although implemented by SUN Solaris, it is the predominant FS implementation on Unix System Layered Structure

NFS API Interfaces

Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based

Cluster-Based Distributed File Systems Downsides of a C-S Arch Performance bottle neck Single-Point-Failure Solution: Files(resources) can be stored on a few servers A big file across multi servers File Stripping for big structured files Many files on different servers Most files are not well structured

Cluster-Based Distributed File Systems How to support file access in a Data Center? Files permanently growing File size might be multi gigabytes. A server might be malfunction File access request from any client should be responded in any condition

Cluster-Based Distributed File Systems

GFS, how does it work? A cluster has a master node, which ONLY keeps meta information of files A big file is splited into CHUNKS, a CHUNK of size 64Mbs. Chunks are spread on many chunk servers More details on GFS Chunks are replicated --- Redundancy Master does not keep up-to-date of chunk locations A Chunks server knows what exactly it stores. If client retrieval failed(low probability), ask Master again, master update latest info from chunk servers

Cluster-Based Distributed File Systems GFS, how does it work? File update. Client pushes back updated file chunk to corresponding chunk server Chunk server conducts the backup/replication Master node is kept out of this loop, bottle neck problem is solved I/O performance of a GFS is pretty good and scalability is good as well

Three Archs to Introduce Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based

Symmetric Arch Peer-to-Peer No Client, No server, No Master, No Chunk First realization is Ivy

Symmetric Arch

Ivy details Data storage. File composed of 8kb data blocks. Content-hash data blocks Public-key based blocks Replication Every block B is stored on K immediate successors, better availability

Future Developments Client-Server Arch (Centralized) NFS Cluster-based Arch (Less Centralized) GFS Symmetric Arch (Fully Distributed) DHT-based

Reference 1.Ghemawat, Sanjay, Howard Gobioff, and Shun-Tak Leung. "The Google file system." ACM SIGOPS operating systems review. Vol. 37. No. 5. ACM, Sandberg, Russel, et al. "Design and implementation of the Sun network filesystem." Proceedings of the Summer USENIX conference Muthitacharoen, Athicha, et al. "Ivy: A read/write peer-to-peer file system." ACM SIGOPS Operating Systems Review 36.SI (2002): Naor, Moni, and Udi Wieder. "A simple fault tolerant distributed hash table."Peer-to-Peer Systems II. Springer Berlin Heidelberg, Cai, Min, Ann Chervenak, and Martin Frank. "A peer-to-peer replica location service based on a distributed hash table." Proceedings of the 2004 ACM/IEEE conference on Supercomputing. IEEE Computer Society, Kleiman, Steve R. "Vnodes: An Architecture for Multiple File System Types in Sun UNIX." USENIX Summer. Vol

Thank you Q&A