

Testing the Zambeel Aztera
Chris Brew, Fermilab CD/CSS/SCS

Caveat: This is very much a work in progress. The results presented are from jobs run in the last two days and have not been fully checked for validity.

Aim
To test the overall performance of the Zambeel Aztera Network Attached Storage (NAS) system
To develop a suite of tests that can be used for comparisons between different NAS devices

What is it?
From a user's point of view it is just a large “black box”: a network-attached NFS server serving out a 2.8TB filesystem.
Behind the scenes it consists of multiple “compute” nodes and “storage” nodes. Compute nodes interface between the storage and the clients, making multiple disks across multiple nodes appear as one (or more) filesystems. Load balancers share the load amongst the compute nodes. Very redundant: every component is duplicated, and data is mirrored and striped across storage nodes.

Tests
Manageability Tests:
– Ease of setup
– Ease of reconfiguration
Performance Tests:
– Max Throughput: Read and Write
– Max Throughput Reading a Single File
– Simultaneous Reads and Writes
– Creation, Listing and Deletion of Large Numbers of Small Files
– Data Integrity
Failure Tests:
– Fail various parts of the system and see what happens

Tools
Reader/Writer:
– Fermi-built tools that write and read a file
IOZone:
– Standard benchmark tool with many options. Used here for the data integrity tests.
Bonnie++:
– Another standard benchmark tool. Used for its file creation/deletion test.
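
The Fermi-built Reader/Writer tools themselves are not shown in these slides. As a rough illustration of what "write and read a file" while timing it might look like, here is a minimal Python sketch. The function names, the 1 MiB block size and the MB/s reporting are all assumptions made for this example, not details of the real tools.

```python
import os
import time

CHUNK = 1024 * 1024  # 1 MiB per write/read call (assumed block size)


def write_file(path, size_bytes, chunk=CHUNK):
    """Write size_bytes of zeros to path and return the rate in MB/s."""
    block = b"\0" * chunk
    remaining = size_bytes
    start = time.perf_counter()
    with open(path, "wb") as f:
        while remaining > 0:
            n = min(chunk, remaining)
            f.write(block[:n])
            remaining -= n
        f.flush()
        os.fsync(f.fileno())  # ask the OS to push the data out of its cache
    elapsed = max(time.perf_counter() - start, 1e-9)
    return size_bytes / elapsed / 1e6


def read_file(path, chunk=CHUNK):
    """Read path to the end and return (bytes_read, rate in MB/s)."""
    total = 0
    start = time.perf_counter()
    with open(path, "rb") as f:
        while True:
            data = f.read(chunk)
            if not data:
                break
            total += len(data)
    elapsed = max(time.perf_counter() - start, 1e-9)
    return total, total / elapsed / 1e6
```

A script could call `write_file("/mnt/nas/test.dat", 10**9)` on each client (the mount point is hypothetical) and then `read_file` on the same path. Note that a read issued right after a write on the same client may be served from that client's cache rather than from the NAS, so a real test would need to account for that.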

Why Three Tools?
All have different strengths/weaknesses:
– Reader/Writer: Very simple and easy to script around
– IOZone: Many, many modes
– Bonnie++: The only one with a file creation/deletion test
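
To make the file creation/deletion test more concrete, the following Python sketch times the creation, listing and deletion of many small files in one directory. It is not how Bonnie++ implements its test; the function name, the file count and the file size are assumptions chosen only for illustration.

```python
import os
import time


def small_file_test(directory, count=1000, size=1024):
    """Time creation, listing and deletion of `count` small files."""
    payload = b"x" * size
    names = [os.path.join(directory, f"f{i:06d}") for i in range(count)]

    t0 = time.perf_counter()
    for name in names:            # creation phase
        with open(name, "wb") as f:
            f.write(payload)
    t1 = time.perf_counter()
    listed = os.listdir(directory)  # listing phase
    t2 = time.perf_counter()
    for name in names:            # deletion phase
        os.remove(name)
    t3 = time.perf_counter()

    return {
        "files_listed": len(listed),
        "create_s": t1 - t0,
        "list_s": t2 - t1,
        "delete_s": t3 - t2,
    }
```

Run against an empty directory on the NAS mount, the returned dictionary gives the elapsed seconds for each phase, which can then be compared between devices.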

Performance Tests So Far
All tests use the new hotdog nodes (Dual 1.6 GHz, 1GB Memory, 100Mb/s Network)
Completed Tests:
– Max Throughput: n clients, each with 1 or 2 processes, writing/reading 1GB files. On each node, the rates of all files completed before the first node finished its fifth file are averaged; these per-node averages are then summed across the nodes.
– Single File Throughput: same as the previous read test, but all nodes/processes read the same file.
– Client comparison: measure the single-client read/write speeds for different clients.
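
The Max Throughput aggregation can be sketched in Python as follows. The input layout (a dictionary mapping node name to a list of `(finish_time, rate)` pairs) is hypothetical, and the sketch assumes that "before the first node to complete 5 files" includes that node's fifth file itself; the slides do not settle that detail.

```python
def aggregate_throughput(results, files_for_cutoff=5):
    """Sum per-node average rates over files finished by the cutoff.

    results: {node_name: [(finish_time_s, rate), ...]}  (assumed layout)
    The cutoff is the earliest time at which any node has completed
    `files_for_cutoff` files.
    """
    cutoff_times = []
    for files in results.values():
        finishes = sorted(t for t, _ in files)
        if len(finishes) >= files_for_cutoff:
            cutoff_times.append(finishes[files_for_cutoff - 1])
    if not cutoff_times:
        raise ValueError("no node completed enough files")
    cutoff = min(cutoff_times)

    total = 0.0
    for files in results.values():
        # Keep only files this node finished at or before the cutoff.
        rates = [rate for t, rate in files if t <= cutoff]
        if rates:
            total += sum(rates) / len(rates)  # per-node average
    return total
```

Cutting off at the first node to finish keeps the measurement inside the window where every client is still busy, so slower nodes are not credited with rates achieved after the load has dropped.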

Maximum Throughput

Tuning Changes Throughput

Different Clients/Different Performance

Reading a Single File

Failure Testing
Whilst the system was up and clients were reading and writing data, we removed and replaced, one by one:
– Compute Module
– Storage Module
– Load Balancer
Performance was obviously degraded, but the system stayed up and kept serving its clients

Conclusions
For 50 clients the system gives a total throughput of ~120Mb/s for writes and ~220Mb/s for reads
Reliability seems to be high. It can survive any single component failure
With these tools we have the basis of a test suite and procedure for comparing the different storage technologies that are becoming available