Flexibility, Manageability and Performance in a Grid Storage Appliance John Bent, Venkateshwaran Venkataramani, Nick Leroy, Alain Roy, Joseph Stanley,

Slides:



Advertisements
Similar presentations
A Proposal of Capacity and Performance Assured Storage in The PRAGMA Grid Testbed Yusuke Tanimura 1) Hidetaka Koie 1,2) Tomohiro Kudoh 1) Isao Kojima 1)
Advertisements

30-31 Jan 2003J G Jensen, RAL/WP5 Storage Elephant Grid Access to Mass Storage.
Current methods for negotiating firewalls for the Condor ® system Bruce Beckles (University of Cambridge Computing Service) Se-Chang Son (University of.
Threads, SMP, and Microkernels
High Performance Cluster Computing Architectures and Systems Hai Jin Internet and Cluster Computing Center.
A Computation Management Agent for Multi-Institutional Grids
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
© 2007 Pearson Education Inc., Upper Saddle River, NJ. All rights reserved.1 Computer Networks and Internets with Internet Applications, 4e By Douglas.
1 Last Class: Introduction Operating system = interface between user & architecture Importance of OS OS history: Change is only constant User-level Applications.
PRASHANTHI NARAYAN NETTEM.
© DSRG 2001www.cs.agh.edu.pl Cross Grid Workshop - Kraków Krzysztof Zieliński, Sławomir Zieliński University of Mining and Metallurgy {kz,
File Systems (2). Readings r Silbershatz et al: 11.8.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Enabling Grids for E-sciencE Medical image processing web portal : Requirements analysis. An almost end user point of view … H. Benoit-Cattin,
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Chapter 3 Operating Systems Introduction to CS 1 st Semester, 2015 Sanghyun Park.
PROOF: the Parallel ROOT Facility Scheduling and Load-balancing ACAT 2007 Jan Iwaszkiewicz ¹ ² Gerardo Ganis ¹ Fons Rademakers ¹ ¹ CERN PH/SFT ² University.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
Microkernels, virtualization, exokernels Tutorial 1 – CSC469.
SEDA: An Architecture for Well-Conditioned, Scalable Internet Services
GridFS Targeting Data Sharing in Grid Environments Marcelo Nery dos Santos / Renato Cerqueira PUC-Rio, Brazil Presented by: Francisco Silva.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
 Introduction to Operating System Introduction to Operating System  Types Of An Operating System Types Of An Operating System  Single User Single User.
Networked Storage Technologies Douglas Thain University of Wisconsin GriPhyN NSF Project Review January 2003 Chicago.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
SUMA: A Scientific Metacomputer Cardinale, Yudith Figueira, Carlos Hernández, Emilio Baquero, Eduardo Berbín, Luis Bouza, Roberto Gamess, Eric García,
Peer-to-Peer Distributed Shared Memory? Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan/Bretagne France Dagstuhl seminar, October 2003.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Processes and Threads Processes have two characteristics: – Resource ownership - process includes a virtual address space to hold the process image – Scheduling/execution.
1 Introduction to Microsoft Windows 2000 Windows 2000 Overview Windows 2000 Architecture Overview Windows 2000 Directory Services Overview Logging On to.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Nick LeRoy & Jeff Weber Computer Sciences Department University of Wisconsin-Madison Managing.
Peter F. Couvares (based on material from Tevfik Kosar, Nick LeRoy, and Jeff Weber) Associate Researcher, Condor Team Computer Sciences Department University.
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
Large Scale Parallel File System and Cluster Management ICT, CAS.
Processes Introduction to Operating Systems: Module 3.
Tevfik Kosar Computer Sciences Department University of Wisconsin-Madison Managing and Scheduling Data.
Operating Systems Objective n The historic background n What the OS means? n Characteristics and types of OS n General Concept of Computer System.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
CEDPS Data Services Ann Chervenak USC Information Sciences Institute.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Problem-solving on large-scale clusters: theory and applications Lecture 4: GFS & Course Wrap-up.
DotSlash – or how to deal with 15 minutes of fame Weibin Zhao Henning Schulzrinne Columbia University CATT/WICAT Annual Research Review November 14, 2003.
Grid Appliance The World of Virtual Resource Sharing Group # 14 Dhairya Gala Priyank Shah.
6.894: Distributed Operating System Engineering Lecturers: Frans Kaashoek Robert Morris
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
File Transfer And Access (FTP, TFTP, NFS). Remote File Access, Transfer and Storage Networks For different goals variety of approaches to remote file.
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
Distributed File Systems Questions answered in this lecture: Why are distributed file systems useful? What is difficult about distributed file systems?
April 25, 2006Parag Mhashilkar, Fermilab1 Resource Selection in OSG & SAM-On-The-Fly Parag Mhashilkar Fermi National Accelerator Laboratory Condor Week.
NeST: Network Storage John Bent, Venkateshwaran V Miron Livny, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
VIRTUAL NETWORK COMPUTING SUBMITTED BY:- Ankur Yadav Ashish Solanki Charu Swaroop Harsha Jain.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Introduction to Operating Systems Concepts
Introduction to Distributed Platforms
NeST: Network Storage Flexible Commodity Storage Appliances
dCache “Intro” a layperson perspective Frank Würthwein UCSD
Grid Computing.
Ákos Frohner EGEE'08 September 2008
Auburn University COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques (2) Dr. Xiao Qin Auburn University.
Overview Introduction VPS Understanding VPS Architecture
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
NeST: Network Storage Technologies
Wide Area Workload Management Work Package DATAGRID project
Virtual Memory: Working Sets
Lecture Topics: 11/1 Hand back midterms
Presentation transcript:

Flexibility, Manageability and Performance in a Grid Storage Appliance John Bent, Venkateshwaran Venkataramani, Nick Leroy, Alain Roy, Joseph Stanley, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, and Miron Livny University of Wisconsin

Two trends D a t a s e t s P e r f o rm a n c e Storage appliances address both trends.

Storage Appliances: + and - Storage Appliances: Great for basic file service Easy to manage: Plug in and it works Good performance: Specialized just for I/O Reliable and available too Storage Appliances for the Grid: Mismatch? Inflexible: Few, specific protocols (e.g., NFS) Costly: 10x the cost of PC + a few disks Difficult to integrate: Just one piece of the puzzle

A Solution: NeST NeST: A Storage Appliance for the Grid Flexible: Multiple simultaneous protocols Virtual protocol layer Low-cost: Use commodity machines Dynamic adaptation Grid-aware: Integrate w/ higher-level systems Design specifically for the Grid

Outline Introduction General architecture Design goals Flexibility Low-cost Grid-aware features NeST in the Grid example Conclusions

Physical storage layer ChirpHTTPGrid FTPNFS Common protocol layer Dispatcher Transfer Mgr Concurrency Models Storage Mgr Control flow Datal flow NeST: Protocol layer Physical network layer Virtualizes different protocols Mediates access to network

Physical storage layer ChirpHTTPGrid FTPNFS Common protocol layer Dispatcher Transfer Mgr Concurrency Models Storage Mgr Control flow Datal flow NeST: Dispatcher Physical network layer Mediates interaction between other components Gathers information, advertises

Physical storage layer ChirpHTTPGrid FTPNFS Common protocol layer Dispatcher Transfer Mgr Concurrency Models Storage Mgr Control flow Datal flow NeST: Storage manager Physical network layer Space management Access control Virtualizes physical storage

Physical storage layer ChirpHTTPGrid FTPNFS Common protocol layer Dispatcher Transfer Mgr Concurrency Models Storage Mgr Control flow Datal flow NeST: Transfer manager Physical network layer Implementss cheduling policies Chooses concurrency model

Outline Introduction General architecture Design goals Flexibility Low-cost Grid-aware features NeST in the Grid example Conclusions

Flexibility: Multiple protocols Problem: How to support multiple protocols? One approach: Just a Bunch of Servers (JBOS) Problems with JBOS Lack of control (scheduling) Painful administration No shared code Larger memory footprint nfsd wu-ftpd httpd JBOS Server

NeST: Flexibility By Design NeST: Integrate protocols and gain advantage Implementation like VFS Integration introduces new challenges Different protocols allow different auth models More expensive to add a new protocol Less fault isolation

NeST vs JBOS Chirp GridFTP HTTP NFS Server bandwidth (MB/s) Total For each protocol, NeST is comparable to JBOS server. wu-ftpd Apache linux nfsd Linux cluster - Dual PIII - 1 GB Ram - linux Each protocol - 4 clients - 10 MB files

Exerting scheduling control Different scheduling policies FCFS Cache-aware [USENIX ‘02] Proportional share Proportional share scheduling Allows administrators to set protocol proportions e.g. favor NFS Very difficult in JBOS

Proportional share Server bandwidth (MB/s) FCFS1:1:1:11:2:1:11:1:1:4 Scheduling configuration In most cases, achieves Jain’s metric of fairness > 0.98 (1 is “fair”). Linux cluster - Dual PIII - 1 GB Ram - linux Each protocol - 4 clients - 10 MB files

Outline Introduction General architecture Design goals Flexibility Low-cost Grid-aware features NeST in the Grid example Conclusions

Low-Cost: New challenges Desire: Run on arbitrary OS on arbitrary PC Software-only, user-level storage appliance Currently on Linux (release 0.9) and Solaris (beta) Problem: Portable performance Performance under load is platform / workload dependent Threads or processes on some systems, events on others May also be workload dependent (e.g. whether in cache) NeST approach: Dynamic adaptivity Simultaneously support multiple concurrency models Monitor performance using each model Bias towards better model over time

Adaptive Concurrency Solaris: 1K files Ave time per request (sec) Events Threads Linux: 10 MB files Events Threads Ave time per request (ms) Adaptive Dynamic adaptation approaches “ideal” without static information.

Outline Introduction General architecture Design goals Flexibility Low-cost Grid-aware features NeST in the Grid example Conclusions

Grid-Aware Mechanisms Basic functionality Users and groups: Dynamic creation/deletion does not need administrative intervention Access control: Generic AFS-style ACLs Advanced functionality QoS: Preferential scheduling Advertises into global scheduling systems Flexible protocol and authentication mechanisms Self-cleaning storage guarantees: Lots

Storage guarantees: Lots Characteristics of Lots: Capacity: Total amount of data lot can store Duration: Time for which data is guaranteed to exist Set of files: Multiple files may co-exist within lot Self-cleaning Expired lots become “best-effort” lots Lot management Either default set created by administrator, OR use resource management protocol to create before usage Implementation: File system quotas Advantage: Integrates cleanly with local access methods Disadvantage: Performance hit for large writes

Outline Introduction General architecture Design goals Flexibility Low-cost Grid-aware features NeST in the Grid example Conclusions

NeST in the Grid Linux NeST Solaris NeST HomeRemote Global Execution Manager compute node compute node compute node compute node Condor Tribune Advertisement

NeST in the Grid N N Home Remote Global Execution Mgr GridFTP 3) 3) Global coordinates xfer 1) 1) Home submits jobs 6) 6) Global terminates space 5) 5) Global coordinates xfer NFS 4) 4) Global starts jobs 2) 2) Global reserves space

Conclusions NeST: A storage appliance for the Grid Gain manageability Without sacrificing performance Design goals: Flexibility: Virtual protocol architecture Low-cost: Adaptation mechanisms Grid-aware: Space management Current status: release 0.9 available Future work Hot deployable NeSTs, lot management extensions

Questions?

NeST: Architecture Common Protocol Layer GridFTPNFSHTTPChirp Dispatcher Storage Manager FTP The Network Physical Storage Transfer Manager

Example client interaction Common Protocol Layer GFTPNFSHTTPChirp Dispatcher Storage Manager FTP The Network Physical Storage Transfer Manager Client sends mkdir Dispatcher asks protocol handler to parse asks storage mgr to mkdir checks permissions sends ack through handler Client puts file in new dir Dispatcher asks protocol handler to parse asks storage mgr for permission hands off to transfer mgr Transfer manager monitors transfer sends final ack