Nan DUN, Kenjiro Taura, Akinori Yonezawa. The University of Tokyo. Sep 12, 2007. http://www.yl.is.s.u-tokyo.ac.jp/~dunnan/talk/070912.JSSST07.pptx

Presentation transcript:


• Background
• Related Work
  ◦ SSHFS
  ◦ FUSE
• HandyFS
  ◦ Design and Implementation
  ◦ Evaluation and Problems
• Future Work

• User-explicit sharing operations
  ◦ rcp, scp [OpenSSH]
  ◦ FTP [J. Postel et al.], GridFTP [Globus]
• Conventional Distributed File Systems
  ◦ Intra-cluster sharing: NFSv3, NFSv4
  ◦ Inter-cluster sharing: AFS [OpenAFS], Gfarm [Tatebe et al. '04]

• Complexity of conventional DFSs
  ◦ Client-server configuration
  ◦ Administrative privileges
  ◦ Typical firewall settings of the Grid
• Simplicity of SSHFS
  ◦ Needs only an available SSH connection
  ◦ Client-side mount only
  ◦ User-level operations without privileges

• Create a local mount point
  ◦ mkdir mnt_chiba
• Mount the remote directory on the mount point
  ◦ sshfs chiba000.intrigger.nii.ac.jp:/sharedir mnt_chiba
• Access remote files via the local mount point
  ◦ [ls, rm, cp, mkdir] mnt_chiba

[Figure: SSHFS architecture. A user application issues general file system calls on hostA_mountpoint and hostB_mountpoint; FUSE hands them to SSHFS, which translates them into SSH commands sent through an SSH client to hostA and hostB.]

[Figure: data transfer rate (Mbps), showing the SSHFS-FUSE overhead.]

• Disadvantages of using SSHFS alone
  ◦ Not scalable
    • Users have to create a separate mount point for every host
  ◦ No unified namespace
    • Files under different mount points (hosts) must be manipulated separately
• Motivation of HandyFS
  ◦ Inherit the simplicity of SSHFS
  ◦ Create a scalable mount for many hosts
  ◦ Create a unified namespace ad hoc for files on many hosts

• Create ONE local mount point
  ◦ mkdir mnt_all
• Mount MULTIPLE remote directories on the mount point
  ◦ handyfs chiba000.intrigger.nii.ac.jp:/sharedir1 chiba001.intrigger.nii.ac.jp:/sharedir mnt_all
• Access remote files via the local mount point
  ◦ [ls, rm, cp] mnt_all
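The difference in mounting effort can be sketched in a few lines of Python. This is only an illustration that builds the command lines shown on the slides without running them; the helper names and the per-host mount point names (`mnt_0`, `mnt_1`) are assumptions made for the example.

```python
from typing import List

def sshfs_commands(remotes: List[str]) -> List[List[str]]:
    """One mkdir plus one sshfs call per remote host (hypothetical mount names)."""
    commands = []
    for index, remote in enumerate(remotes):
        mount_point = f"mnt_{index}"
        commands.append(["mkdir", mount_point])
        commands.append(["sshfs", remote, mount_point])
    return commands

def handyfs_commands(remotes: List[str], mount_point: str = "mnt_all") -> List[List[str]]:
    """A single mkdir and a single handyfs call, following the slide's syntax."""
    return [["mkdir", mount_point], ["handyfs", *remotes, mount_point]]

remotes = [
    "chiba000.intrigger.nii.ac.jp:/sharedir1",
    "chiba001.intrigger.nii.ac.jp:/sharedir",
]
per_host = sshfs_commands(remotes)   # grows with the number of hosts
unified = handyfs_commands(remotes)  # always two commands
print(len(per_host), len(unified))
```

With SSHFS alone the command count grows linearly with the number of hosts, while the HandyFS form stays at two commands.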

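[Figure: HandyFS architecture. User space holds the user application, HandyFS, and one SSHFS instance per remote host (Host1, Host2); kernel space holds the FUSE kernel module. Example path: /handyfs_mnt/sshfs_mnt1]
The trick is using FUSE twice:
1. SSHFS mount for each remote host
2. Local mount for the directory merge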

• Directory Merge
  ◦ Source directories and target (virtual) directory
    • Source directory: SSHFS mount point for each host
    • Target directory: directory in the HandyFS namespace

Source directory tree at HostA (/mnt_hostA): /A, /B, /B/C
Source directory tree at HostB (/mnt_hostB): /B, /C, /B/D
Virtual directory tree (/): /A, /B, /B/C, /C, /B/D
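The merge in this example amounts to a set union over the directory paths seen under each source mount point. A minimal Python sketch, assuming each source is given as a plain list of paths (the function name `merge_trees` is hypothetical and not part of HandyFS):

```python
from typing import Dict, Iterable, List

def merge_trees(sources: Dict[str, Iterable[str]]) -> List[str]:
    """Union the directory paths found under every source mount point."""
    virtual = set()
    for _mount_point, paths in sources.items():
        # A directory that exists on several hosts appears once in the result.
        virtual.update(paths)
    return sorted(virtual)

sources = {
    "/mnt_hostA": ["/A", "/B", "/B/C"],
    "/mnt_hostB": ["/B", "/C", "/B/D"],
}
virtual_tree = merge_trees(sources)
print(virtual_tree)  # ['/A', '/B', '/B/C', '/B/D', '/C']
```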

• Directory Merge
  ◦ Source directories and target (virtual) directory
  ◦ Directory table
    • Records which source directories each virtual directory is merged from
    • Example: virtual directory /cmndir is merged from source directories /srcdirA/cmndir and /srcdirB/cmndir
    • Implemented with BerkeleyDB to achieve good performance
  ◦ On-demand merge operation
    • A merge occurs only when a request reaches a path that has not been merged yet

• File Lookup
  1. Look up a file by its full pathname
  2. Retrieve the source directory entries from the directory table
  3. Look up the file in each retrieved source directory
  4. Return the result to the user
  ◦ On-demand merge: if the target pathname has not been merged, merge it and store it in the directory table
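The lookup flow with on-demand merge can be sketched as follows. This is an illustration under stated assumptions, not the HandyFS code: a Python dict stands in for the BerkeleyDB directory table, local temporary directories stand in for SSHFS mount points, and the class and method names are hypothetical.

```python
import os
import tempfile
from typing import Dict, List, Optional

class DirectoryTable:
    """Maps a virtual directory to the source directories it is merged from."""

    def __init__(self, source_roots: List[str]) -> None:
        self.source_roots = source_roots
        self.table: Dict[str, List[str]] = {}  # dict used in place of BerkeleyDB
        self.merge_count = 0

    def sources_of(self, virtual_dir: str) -> List[str]:
        if virtual_dir not in self.table:
            # On-demand merge: runs only the first time a path is requested.
            relative = virtual_dir.lstrip("/")
            candidates = [os.path.join(root, relative) for root in self.source_roots]
            self.table[virtual_dir] = [d for d in candidates if os.path.isdir(d)]
            self.merge_count += 1
        return self.table[virtual_dir]

    def lookup(self, virtual_path: str) -> Optional[str]:
        """Return the first real path backing virtual_path, or None."""
        virtual_dir, name = os.path.split(virtual_path)
        for source_dir in self.sources_of(virtual_dir):
            candidate = os.path.join(source_dir, name)
            if os.path.lexists(candidate):
                return candidate
        return None

with tempfile.TemporaryDirectory() as tmp:
    src_a = os.path.join(tmp, "srcdirA")
    src_b = os.path.join(tmp, "srcdirB")
    os.makedirs(os.path.join(src_a, "cmndir"))
    os.makedirs(os.path.join(src_b, "cmndir"))
    open(os.path.join(src_b, "cmndir", "b.txt"), "w").close()

    table = DirectoryTable([src_a, src_b])
    found = table.lookup("/cmndir/b.txt")       # triggers the merge of /cmndir
    missing = table.lookup("/cmndir/none.txt")  # reuses the stored table entry
    merges = table.merge_count
    print(found, missing, merges)
```

The second lookup does not merge again, which is the point of storing merge results in the table.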

• Directory Split
  ◦ To remove one or several source directories from a virtual directory, there are two alternatives
    • Remove the directory table entries
      - Heavy operation: the entire directory table must be searched
      - Re-insertion overhead if the source becomes available again
    • Hide removed source directories from the user's view by filtering
      - Use an in-memory filter to mark the currently available sources
      - File lookup results are filtered before being returned to the user
      - Efficient and reusable

• Directory Split
  1. Look up a file by its full pathname
  2. Retrieve the source directory entries from the directory table
  3. Look up the file in each retrieved source directory
  4. Filter out results that come from removed source directories
  5. Return the result to the user
  ◦ On-demand merge: if the target pathname has not been merged, merge it and store it in the directory table
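The filtering alternative can be illustrated with a small sketch. It assumes the directory table is a dict and the filter is an in-memory set of available source roots; the names `visible_sources`, `split`, and `rejoin` are hypothetical.

```python
from typing import Dict, List, Set

# Directory table from the slides: virtual directory -> source directories.
directory_table: Dict[str, List[str]] = {
    "/cmndir": ["/srcdirA/cmndir", "/srcdirB/cmndir"],
}

# In-memory filter marking the currently available sources.
available_sources: Set[str] = {"/srcdirA", "/srcdirB"}

def visible_sources(virtual_dir: str) -> List[str]:
    """Return table entries whose source root is still marked available."""
    entries = directory_table.get(virtual_dir, [])
    return [
        entry for entry in entries
        if any(entry == root or entry.startswith(root + "/")
               for root in available_sources)
    ]

def split(source_root: str) -> None:
    """Hide a source without touching the directory table."""
    available_sources.discard(source_root)

def rejoin(source_root: str) -> None:
    """Make a source visible again; no re-insertion into the table is needed."""
    available_sources.add(source_root)

before = visible_sources("/cmndir")
split("/srcdirB")
after_split = visible_sources("/cmndir")
rejoin("/srcdirB")
after_rejoin = visible_sources("/cmndir")
print(before, after_split, after_rejoin)
```

Because the table itself is never modified, a split costs one set operation and a source that returns needs no re-merge.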

• File Creation
  ◦ A file is created in a randomly chosen source directory
  ◦ For a directory, an entry is also created in the directory table
• Directory Read
  ◦ Similar to file lookup
    • Look up the target directory
    • Read all entries from the source directories that compose the target directory
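A rough sketch of these two operations, assuming the source directories that compose a virtual directory are already known. Local temporary directories stand in for SSHFS mount points, and the function names are hypothetical.

```python
import os
import random
import tempfile
from typing import List, Optional

def create_file(source_dirs: List[str], name: str,
                rng: Optional[random.Random] = None) -> str:
    """Create an empty file in one randomly chosen source directory."""
    rng = rng or random.Random()
    chosen = rng.choice(source_dirs)
    path = os.path.join(chosen, name)
    open(path, "x").close()  # "x" fails if the file already exists there
    return path

def read_directory(source_dirs: List[str]) -> List[str]:
    """List the union of entries from every source directory."""
    names = set()
    for source_dir in source_dirs:
        names.update(os.listdir(source_dir))
    return sorted(names)

with tempfile.TemporaryDirectory() as tmp:
    dir_a = os.path.join(tmp, "srcdirA")
    dir_b = os.path.join(tmp, "srcdirB")
    os.makedirs(dir_a)
    os.makedirs(dir_b)
    open(os.path.join(dir_a, "a.txt"), "w").close()
    open(os.path.join(dir_b, "b.txt"), "w").close()

    created = create_file([dir_a, dir_b], "new.txt")
    listing = read_directory([dir_a, dir_b])
    print(listing)  # ['a.txt', 'b.txt', 'new.txt']
```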

• Some Practical Problems
  ◦ Same file in multiple sources
    • Not compatible with UNIX file semantics
    • Define a priority for each source directory
    • Only files from the source directory with the highest priority are visible to the user
  ◦ Stale directory table
    • A remote directory tree may be modified by operations local to that host
    • Update the table periodically
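The priority rule for duplicate files can be sketched like this, assuming the sources are supplied in descending priority order together with the file names each one holds. The function name and the sample file names are made up for the illustration.

```python
from typing import Dict, List, Set, Tuple

def visible_files(prioritized: List[Tuple[str, Set[str]]]) -> Dict[str, str]:
    """Map each file name to the highest-priority source that holds it."""
    visible: Dict[str, str] = {}
    for source, names in prioritized:  # highest priority first
        for name in names:
            # setdefault keeps the first (highest-priority) owner of a name.
            visible.setdefault(name, source)
    return visible

prioritized_sources = [
    ("/mnt_hostA", {"data.txt", "a.log"}),  # higher priority
    ("/mnt_hostB", {"data.txt", "b.log"}),  # lower priority
]
view = visible_files(prioritized_sources)
print(view["data.txt"])  # /mnt_hostA
```

The copy of `data.txt` on the lower-priority source stays on disk but is hidden from the merged view.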

• Experimental Environment
  ◦ InTrigger Platform: tokyo.ac.jp/intrigger/

  Hardware/Software       Specification
  CPU                     Core2 Duo 2.13GHz
  Memory                  4GB
  Network                 Gigabit Ethernet
  OS                      Linux
  FUSE Kernel Interface   7.8
  FUSE Library            2.7.0
  SSHFS                   1.8

• Experimental Results
  ◦ Data Transfer Rate
[Figure: data transfer rate, annotated with the SSHFS-FUSE overhead and the TWICE-FUSE-MOUNT overhead.]

• Experimental Results
  ◦ Source Compilation
[Figure: source compilation results, annotated with wide-area latency, SSHFS overhead, and TWICE-FUSE-MOUNT overhead.]

• Performance
  ◦ HandyFS itself
    • Remove the MOUNT-TWICE overhead
  ◦ SSHFS performance
    • Needs to be tuned at the source level
• Scalability
  ◦ From "One-to-All" mount to "All-to-All" mount