HEC FSIO Workshop August 9, 2011 Communications and Protocols Active Networks and Active Object Storage John A. Chandy Department of Electrical and Computer.

Slides:



Advertisements
Similar presentations
An Overview of ABFT in cloud computing
Advertisements

Boxwood: Distributed Data Structures as Storage Infrastructure Lidong Zhou Microsoft Research Silicon Valley Team Members: Chandu Thekkath, Marc Najork,
Copyright © 2007, GemStone Systems Inc. All Rights Reserved. Optimize computations with Grid data caching OGF21 Jags Ramnarayan Chief Architect, GemFire.
Towards Automating the Configuration of a Distributed Storage System Lauro B. Costa Matei Ripeanu {lauroc, NetSysLab University of British.
The Storage Resource Broker and.
Hard Disk Drives Chapter 7.
Mass Storage Systems. Readings r Chapter , 12.7.
© 2003 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Performance Measurements of a User-Space.
Issues of HPC software From the experience of TH-1A Lu Yutong NUDT.
INTRODUCTION TO SIMULATION WITH OMNET++ José Daniel García Sánchez ARCOS Group – University Carlos III of Madrid.
© 2009 EMC Corporation. All rights reserved. Direct Attached Storage and Introduction to SCSI Module 2.1.
1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.
COMP1214 Systems & Platforms: Operating Systems Concepts Dr. Yvonne Howard – Rikki Prince – 1.
Executional Architecture
Copyright © 2004 Pearson Education, Inc.. Chapter 13 Disk Storage, Basic File Structures, and Hashing.
Reliable and Efficient Programming Abstractions for Sensor Networks Nupur Kothari, Ramki Gummadi (USC), Todd Millstein (UCLA) and Ramesh Govindan (USC)
Jaringan Informasi Pengantar Sistem Terdistribusi oleh Ir. Risanuri Hidayat, M.Sc.
Priority Research Direction (I/O Models, Abstractions and Software) Key challenges What will you do to address the challenges? – Develop newer I/O models.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Spark: Cluster Computing with Working Sets
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts Amherst Operating Systems CMPSCI 377 Lecture.
1 Lecture 23: Multiprocessors Today’s topics:  RAID  Multiprocessor taxonomy  Snooping-based cache coherence protocol.
Overview of Lustre ECE, U of MN Changjin Hong (Prof. Tewfik’s group) Monday, Aug. 19, 2002.
Distributed Systems Architecture Presentation II Presenters Rose Kit & Turgut Tezir.
Storage Area Network (SAN)
Storage Networking Technologies and Virtualization Section 2 DAS and Introduction to SCSI1.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
Storage Systems in HPC John A. Chandy Department of Electrical and Computer Engineering University of Connecticut.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
Object-based Storage Long Liu Outline Why do we need object based storage? What is object based storage? How to take advantage of it? What's.
Module 9 Review Questions 1. The ability for a system to continue when a hardware failure occurs is A. Failure tolerance B. Hardware tolerance C. Fault.
1 Advanced Storage Technologies for High Performance Computing Sorin, Faibish EMC NAS Senior Technologist IDC HPC User Forum, April 14-16, Norfolk, VA.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
HPCS Lab. High Throughput, Low latency and Reliable Remote File Access Hiroki Ohtsuji and Osamu Tatebe University of Tsukuba, Japan / JST CREST.
Chapter 5 Section 2 : Storage Networking Technologies and Virtualization.
Latest Relevant Techniques and Applications for Distributed File Systems Ela Sharda
CSE 451: Operating Systems Section 10 Project 3 wrap-up, final exam review.
Copyright © George Coulouris, Jean Dollimore, Tim Kindberg This material is made available for private study and for direct.
Embedded Runtime Reconfigurable Nodes for wireless sensor networks applications Chris Morales Kaz Onishi 1.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Embedding Constraint Satisfaction using Parallel Soft-Core Processors on FPGAs Prasad Subramanian, Brandon Eames, Department of Electrical Engineering,
Towards Exascale File I/O Yutaka Ishikawa University of Tokyo, Japan 2009/05/21.
OPERATING SYSTEM SUPPORT DISTRIBUTED SYSTEMS CHAPTER 6 Lawrence Heyman July 8, 2002.
FREERIDE: System Support for High Performance Data Mining Ruoming Jin Leo Glimcher Xuan Zhang Ge Yang Gagan Agrawal Department of Computer and Information.
Alok Choudhary Dept. of Electrical & Computer Engineering And Kellogg School of Management Northwestern University I/O and Storage: Challenges Moving Forward.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 1.
AFS/OSD Project R.Belloni, L.Giammarino, A.Maslennikov, G.Palumbo, H.Reuter, R.Toebbicke.
Storage Networking. Storage Trends Storage grows %/year, gets more complicated It’s necessary to pool storage for flexibility Intelligent storage.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
Simulation of O2 offline processing – 02/2015 Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture Eugen Mudnić.
Next Generation of Apache Hadoop MapReduce Owen
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
An Introduction to GPFS
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
Parallel Virtual File System (PVFS) a.k.a. OrangeFS
Video Security Design Workshop:
Introduction to Distributed Platforms
Direct Attached Storage and Introduction to SCSI
Storage Networking.
Unit OS10: Fault Tolerance
Apache Spark Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing Aditya Waghaye October 3, 2016 CS848 – University.
Direct Attached Storage and Introduction to SCSI
Storage Networking.
NFS and AFS Adapted from slides by Ed Lazowska, Hank Levy, Andrea and Remzi Arpaci-Dussea, Michael Swift.
COT 4600 Operating Systems Fall 2010
CS 295: Modern Systems Organizing Storage Devices
Convergence of Big Data and Extreme Computing
Presentation transcript:

HEC FSIO Workshop August 9, 2011 Communications and Protocols Active Networks and Active Object Storage John A. Chandy Department of Electrical and Computer Engineering Janardhan Singaraju, Ajith Thamarakuzhi, Cengiz Karakoyunlu, Orko Momin, Mike Runde, Paul Wortman

Active Storage Networks Active DisksActive Disks –Intelligence at the disk can distribute computation to parallel disks –Process data in streams –Disks only have local view of data Active Storage NetworkActive Storage Network –Network has a global view of data –Distributed caching of file system metadata and data –Redundancy optimizations Active Storage Networks HEC FSIO Workshop August 9, 2011

Active storage networks 1 /3 Gbps 1 Gbps An ASN is comprised of a smart switch along with intelligence embedded in the I/O network.An ASN is comprised of a smart switch along with intelligence embedded in the I/O network. Network Switches have global view of the data and can perform in-stream data reduction and transformation operations.Network Switches have global view of the data and can perform in-stream data reduction and transformation operations. ASN can enhance storage node performance as well as improve the computational performance of the parallel I/O systems.ASN can enhance storage node performance as well as improve the computational performance of the parallel I/O systems. Active Storage Networks HEC FSIO Workshop August 9, 2011

Network switch topology 2-dilated flattened butterfly2-dilated flattened butterfly Active Storage Networks HEC FSIO Workshop August 9,

Hardware Implementation NetFPGA board from StanfordNetFPGA board from Stanford 4 GigE connects4 GigE connects 2 SATA connectors for node to node communication2 SATA connectors for node to node communication PCI bus for node to node communicationPCI bus for node to node communication Active Storage Networks HEC FSIO Workshop August 9, 2011

Active Storage Networks Application operationsApplication operations –Reduction operations - min/max, k-means clustering, search –Transformational operations – streaming, sort, File System OperationsFile System Operations –Locking –Redundancy optimizations Active Storage Networks HEC FSIO Workshop August 9, 2011

Parallelization techniques Functional units are re-used on reaching the reconfigurable hardware area limits.Functional units are re-used on reaching the reconfigurable hardware area limits. Data level parallelism by distributing the data to several functional units in several switch elements.Data level parallelism by distributing the data to several functional units in several switch elements. Functional level parallelism by distributing functions to several elements.Functional level parallelism by distributing functions to several elements. Active Storage Networks HEC FSIO Workshop August 9, 2011

K-means clustering Active Storage Networks HEC FSIO Workshop August 9, 2011

Runtime per iteration Active Storage Networks HEC FSIO Workshop August 9, 2011

Data search Active Storage Networks HEC FSIO Workshop August 9, 2011

Kmin/Kmax Active Storage Networks HEC FSIO Workshop August 9, 2011

Data sort Active Storage Networks HEC FSIO Workshop August 9, 2011

Redundancy optimizations Active Storage Networks HEC FSIO Workshop August 9, 2011 RAIDed files, parity calculated in switchRAIDed files, parity calculated in switch

File locking Lock table in switchLock table in switch Active Storage Networks HEC FSIO Workshop August 9, 2011

Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Storage Networks Lessons LearnedLessons Learned –Hardware design is hard –HW Libraries can help –ASNs make most sense for reductions –Storage systems optimizations show promise What needs to get doneWhat needs to get done –Better HW design –Application and FS hooks –When to do ASN and when to do SW?

Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Object Storage Active DisksActive Disks –Intelligence at the disk can distribute computation to parallel disks Active Object Storage for Parallel File SystemsActive Object Storage for Parallel File Systems –Active Disks for OSDs –Use Active Storage to improve parallel file system performance –Use Active Storage to improve parallel file system reliability –Application aware storage and autonomic storage using active OSDs.

Active Disks Can we use OSDs to make Active Disks a reality?Can we use OSDs to make Active Disks a reality? –Application-aware storage Object attributes can give hints to the diskObject attributes can give hints to the disk Application specificApplication specific –Parallel File Systems Felix et al. added a filtering layer to Lustre to provide active processingFelix et al. added a filtering layer to Lustre to provide active processing –T10 OSD? Active Storage Networks HEC FSIO Workshop August 9, 2011

Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD Previous ImplementationPrevious Implementation –Based on disc-osd –Object-oriented (Java) Attach object types to storage objectsAttach object types to storage objects Define methods for object typesDefine methods for object types New ImplementationNew Implementation –Based on osc-osd (supported by Panasas) –RPC - Call functions on OSD remotely –Execute Engines – C, Java, Python, etc.

Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD How do you move code from client to target within OSD framework?How do you move code from client to target within OSD framework? –Create an object with the code –Each function object has a special attribute that defines the type of associated execute engine –OSD can support multiple execute engines

Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD How do you execute the method remotely within the OSD framework?How do you execute the method remotely within the OSD framework? –New EXECUTE FUNCTION command so that we can invoke a function –We use the CDB continuation to specify the parameters –Results (if any) returned directly or written to a new object From T10/08-185r5 changes to OSD-2

Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD Client OSD iSCSI OSD Target C Engine Java Engine Function Object.so,.class

Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD Status:Status: –C and Java engines complete –Python engine soon –OrangeFS support for OSDs

Summary Active storage networksActive storage networks –Improves performance of computation kernels –Useful in parallel file system optimizations Active storage for improved file system performanceActive storage for improved file system performance Acknowledgements: NSF CCF , CCF Acknowledgements: NSF CCF , CCF Active Storage Networks HEC FSIO Workshop August 9, 2011

Communication and Protocols Active Storage Networks HEC FSIO Workshop August 9, 2011 Coherence schemesCoherence schemes Scalable abstractions for scientific dataScalable abstractions for scientific data Scalable replication, relocation, failure detection, and fault toleranceScalable replication, relocation, failure detection, and fault tolerance Topology aware storage layoutTopology aware storage layout Wide area storage access protocolsWide area storage access protocols Cloud storage?Cloud storage? Inter-stack communication?Inter-stack communication? Memory hierarchy?Memory hierarchy?