Www.nestproject.org NeST: Network Storage John Bent, Venkateshwaran V Miron Livny, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau.

Slides:



Advertisements
Similar presentations
A Proposal of Capacity and Performance Assured Storage in The PRAGMA Grid Testbed Yusuke Tanimura 1) Hidetaka Koie 1,2) Tomohiro Kudoh 1) Isao Kojima 1)
Advertisements

Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
EU-GRID Work Program Massimo Sgaravatto – INFN Padova Cristina Vistoli – INFN Cnaf as INFN members of the EU-GRID technical team.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
The Condor Data Access Framework GridFTP / NeST Day 31 July 2001 Douglas Thain.
1© Copyright 2015 EMC Corporation. All rights reserved. SDN INTELLIGENT NETWORKING IMPLICATIONS FOR END-TO-END INTERNETWORKING Simone Mangiante Senior.
PRASHANTHI NARAYAN NETTEM.
.NET Mobile Application Development Introduction to Mobile and Distributed Applications.
DISTRIBUTED COMPUTING
NeST: Network Storage Technologies Building I/O Appliances on Commodity Systems John Bent, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau and Miron Livny.
Module – 7 network-attached storage (NAS)
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Harnessing the Capacity of Computational.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Networked Storage Technologies Douglas Thain University of Wisconsin GriPhyN NSF Project Review January 2003 Chicago.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.
LOGO Service and network administration Storage Virtualization.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Chapter 20 Distributed File Systems Copyright © 2008.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Welcome and Condor Project Overview.
Introduction. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
Types of Operating Systems
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Eduardo Gutarra Velez. Outline Distributed Filesystems Motivation Google Filesystem Architecture The Metadata Consistency Model File Mutation.
Tevfik Kosar Computer Sciences Department University of Wisconsin-Madison Managing and Scheduling Data.
Flexibility, Manageability and Performance in a Grid Storage Appliance John Bent, Venkateshwaran Venkataramani, Nick Leroy, Alain Roy, Joseph Stanley,
Business and Partnering Opportunities: “Windows Server 8” Continuous Availability Designing Systems for Continuous Availability and Scalability Session.
CEDPS Data Services Ann Chervenak USC Information Sciences Institute.
Types of Operating Systems 1 Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015.
Department of Computing, School of Electrical Engineering and Computer Sciences, NUST - Islamabad KTH Applied Information Security Lab Secure Sharding.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
A Fully Automated Fault- tolerant System for Distributed Video Processing and Off­site Replication George Kola, Tevfik Kosar and Miron Livny University.
6.894: Distributed Operating System Engineering Lecturers: Frans Kaashoek Robert Morris
Chapter 2 Operating System Overview Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
David Adams ATLAS ATLAS distributed data management David Adams BNL February 22, 2005 Database working group ATLAS software workshop.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
The Kangaroo Approach to Data Movement on the Grid Author: D. Thain, J. Basney, S.-C. Son, and M. Livny From: HPDC 2001 Presenter: NClab, KAIST, Hyonik.
File Transfer And Access (FTP, TFTP, NFS). Remote File Access, Transfer and Storage Networks For different goals variety of approaches to remote file.
STORAGE ARCHITECTURE/ MASTER): Where IP and FC Storage Fit in Your Enterprise Randy Kerns Senior Partner The Evaluator Group.
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
Distributed File Systems Questions answered in this lecture: Why are distributed file systems useful? What is difficult about distributed file systems?
Group # 14 Dhairya Gala Priyank Shah. Introduction to Grid Appliance The Grid appliance is a plug-and-play virtual machine appliance intended for Grid.
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
Scott Koranda, UWM & NCSA 20 November 2016www.griphyn.org Lightweight Replication of Heavyweight Data Scott Koranda University of Wisconsin-Milwaukee &
Disk Cache Main memory buffer contains most recently accessed disk sectors Cache is organized by blocks, block size = sector’s A hash table is used to.
Chapter 1 Characterization of Distributed Systems
CASTOR: possible evolution into the LHC era
GridOS: Operating System Services for Grid Architectures
COMPSCI 110 Operating Systems
Give Your Data the Edge A Scalable Data Delivery Platform
NeST: Network Storage Flexible Commodity Storage Appliances
dCache “Intro” a layperson perspective Frank Würthwein UCSD
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Ákos Frohner EGEE'08 September 2008
Module – 7 network-attached storage (NAS)
Replication Middleware for Cloud Based Storage Service
Hadoop Technopoints.
Building continuously available systems with Hyper-V
Operating Systems : Overview
NeST: Network Storage Technologies
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
Presentation transcript:

NeST: Network Storage John Bent, Venkateshwaran V Miron Livny, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau

NeST: Network storage Flexible, commodity based, software-only storage appliances A storage appliance must be – easy to deploy – self-configurable to minimize administrative costs – reliable but also recoverable – secure against intrusion yet transparent to legit users

From commodity to appliance Why build appliance on commodity system? – Want to ride commodity cost curve Avoid high cost of specialized servers (e.g. NetApp) – Leverage high availability of commodity systems – Allow on-demand acquisition and deployment Challenge of building on commodity systems – Must be portable across multiple operating systems Additional challenge to achieve high performance at user level – Must be adaptable across a range of storage devices

NeST structure Protocol layer – Pluggable protocols map diverse protocols into common control flows – Analogous to Linux v-nodes Transfer layer – Different concurrency architectures maximize system throughput across diverse platforms Storage layer – Provides abstract interface to disks and memory

NeST Structure Protocol Layer GFTPNeSTWiNDHTTPNFS Control Logic Concurrency Architecture Pool of processes Nonblocking Pool of threads Storage Layer Raw diskLocal FSRAIDMemory

Many Protocols, Single Server Single administrative interface - Set policies, manage user accounts Maintainable S/W - Shared code base reduces replication, increases maintainability Different protocols for different purposes - Grid FTP for wide-area transfers - Chirp for local-area accesses and reservations Single point of control - Storage quotas/guarantees can be supported - Bandwidth can be controlled & QoS provided

Concurrency architecture Three difficult goals – Low latency – High bandwidth – Multiple simultaneous clients No single portable solution – Multiple models provide solutions on a range of different platforms Multi-threaded Multi-process Single process event-driven – Control logic dynamically selects “best” model

Storage Layer Abstract storage models – Virtual storage model akin to virtual protocol layer – RAID, JBOD, etc. – Memory storage model also a possibility Provide file system interface to remote memory Useful for store and forward buffering systems like Kangaroo Single interface for storage resource management – Reservations – Access control – User and group management

NeST, Condor and the Grid Wide area scheduling requirements: – Buffers for wide area data movements – Staging for local replicas of distributed datasets – Guaranteed availability for informed scheduling – Flexible/dynamic management of user accounts Example: Distributed repository Local NeST 1) Reserve space at NeST 2) Initiate transfer from distributed repository 3) Schedule jobs at local cluster. 4) Jobs access data locally

Future work HTTP administrative interface Define metrics by which to measure – Deployability – Manageability – Reliability Allow reservations to have a “cost” Provide mechanisms by which files can have arbitrary, searchable metadata