A Case for Redundant Arrays of Inexpensive Disks (RAID) -1988

Slides:



Advertisements
Similar presentations
A Case for Redundant Arrays Of Inexpensive Disks Paper By David A Patterson Garth Gibson Randy H Katz University of California Berkeley.
Advertisements

Disk Arrays COEN 180. Large Storage Systems Collection of disks to store large amount of data. Performance advantage: Each drive can satisfy only so many.
Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
A CASE FOR REDUNDANT ARRAYS OF INEXPENSIVE DISKS (RAID) D. A. Patterson, G. A. Gibson, R. H. Katz University of California, Berkeley.
I/O Chapter 8. Outline Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
CS224 Spring 2011 Computer Organization CS224 Chapter 6A: Disk Systems With thanks to M.J. Irwin, D. Patterson, and J. Hennessy for some lecture slide.
1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.
CSCE430/830 Computer Architecture
CA 714CA Midterm Review. C5 Cache Optimization Reduce miss penalty –Hardware and software Reduce miss rate –Hardware and software Reduce hit time –Hardware.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
RAID CS147 Dr. Sin-Min Lee By Kenneth Chen. History Norman Ken Ouchi at IBM was awarded U.S. Patent 4,092,732 titled "System for recovering data stored.
1 A Case for Redundant Arrays of Inexpensive Disks Patterson, Gibson and Katz (Seminal paper) Chen, Lee, Gibson, Katz, Patterson (Survey) Circa late 80s..
Sean Traber CS-147 Fall  7.9 RAID  RAID Level 0  RAID Level 1  RAID Level 2  RAID Level 3  RAID Level 4 
CSCE 212 Chapter 8 Storage, Networks, and Other Peripherals Instructor: Jason D. Bakos.
RAID Technology. Use Arrays of Small Disks? 14” 10”5.25”3.5” Disk Array: 1 disk design Conventional: 4 disk designs Low End High End Katz and Patterson.
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
CPSC-608 Database Systems Fall 2008 Instructor: Jianer Chen Office: HRBB 309B Phone: Notes #6.
1 Storage (cont’d) Disk scheduling Reducing seek time (cont’d) Reducing rotational latency RAIDs.
Other Disk Details. 2 Disk Formatting After manufacturing disk has no information –Is stack of platters coated with magnetizable metal oxide Before use,
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
Lecture 3: A Case for RAID (Part 1) Prof. Shahram Ghandeharizadeh Computer Science Department University of Southern California.
S.1 Review: Major Components of a Computer Processor Control Datapath Memory Devices Input Output Cache Main Memory Secondary Memory (Disk)
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Storage & Peripherals Disks, Networks, and Other Devices.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
RAID COP 5611 Advanced Operating Systems Adapted from Andy Wang’s slides at FSU.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
RAID REDUNDANT ARRAY OF INEXPENSIVE DISKS. Why RAID?
Disks Chapter 5 Thursday, April 5, Today’s Schedule Input/Output – Disks (Chapter 5.4)  Magnetic vs. Optical Disks  RAID levels and functions.
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
The concept of RAID in Databases By Junaid Ali Siddiqui.
CS 6290 I/O and Storage Milos Prvulovic. Storage Systems I/O performance (bandwidth, latency) –Bandwidth improving, but not as fast as CPU –Latency improving.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
1 Lecture 23: Storage Systems Topics: disk access, bus design, evaluation metrics, RAID (Sections )
Abstract Increases in CPU and memory will be wasted if not matched by similar performance in I/O SLED vs. RAID 5 levels of RAID and respective cost/performance.
1 Lecture: Storage, GPUs Topics: disks, RAID, reliability, GPUs (Appendix D, Ch 4)
Cloud Computing Vs RAID Group 21 Fangfei Li John Soh Course: CSCI4707.
W4118 Operating Systems Instructor: Junfeng Yang.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
CS Introduction to Operating Systems
RAID.
CMPE Database Systems Workshop June 16 Class Meeting
A Case for Redundant Arrays of Inexpensive Disks (RAID)
Multiple Platters.
External Memory.
RAID, Programmed I/O, Interrupt Driven I/O, DMA, Operating System
Module: Storage Systems
Steve Ko Computer Sciences and Engineering University at Buffalo
Disks and RAID.
RAID Non-Redundant (RAID Level 0) has the lowest cost of any RAID
I/O System Chapter 5 Designed by .VAS.
Dave Eckhardt Disk Arrays Dave Eckhardt
Vladimir Stojanovic & Nicholas Weaver
IT 251 Computer Organization and Architecture
CS 554: Advanced Database System Notes 02: Hardware
Chapter 10 Mass-Storage Structure
Lecture 13 I/O.
RAID RAID Mukesh N Tekwani
ICOM 6005 – Database Management Systems Design
Lecture 28: Reliability Today’s topics: GPU wrap-up Disk basics RAID
TECHNICAL SEMINAR PRESENTATION
RAID Redundant Array of Inexpensive (Independent) Disks
RAID RAID Mukesh N Tekwani April 23, 2019
Presentation transcript:

A Case for Redundant Arrays of Inexpensive Disks (RAID) -1988 -David A. Patterson, Garth Gibson, and Randy H. Katz Presenter: Craig Bergstrom

Outline Motivation: The Pending I/O Crisis A Potential Solution, Caveats A Better Solution, Raid Levels 1-5 Conclusion and Evaluation

Assumptions The only failures of concern are failures of the disk-head assembly during normal operation. Times-to-failure are independent, and exponentially distributed. Minimum user request of data is of a sector, and sector is small relative to a track.

Problem: The Pending I/O Crisis CPU speeds are increasing very quickly. (Joy’s law) Primary memory speeds are increasing very quickly. Although secondary storage size is increasing sufficiently fast, its speed is not keeping pace with the rest of the system. (Seek time)

Hard Disk Architecture

Secondary I/O Access Time Seek Time: Time it takes to position the head over the track to be read/written (measured in ms) Rotational Latency: Time from when the read/write head is over the right track until data begins being transferred from the magnetic disk. (For 7200rpm, worst case ~ 8.33ms) Also internal and external data transfer times. (very fast)

Latency vs. Bandwidth Taken from: “Latency Lags Bandwidth” by David A. Patterson. Communications of the ACM October/2004

Amdahl’s Law S = initial time / improved time = Relative Speedup F = Portion of time spent on improved components K= Speedup achieved in improved components If current systems spend 10% of their time in I/O and everything else increases by factor of 10x, effective speedup is only 5x. If everything else increases by 100x, effective speedup is only 10x.

Solution: Arrays of Disks Motivation: Consumer disks offer a high performance/price ratio.

Problem: Reliability MTTFarray = MTTFdisk/N Where N is the number of disks in the array This means that the MTTF of an array of 75 inexpensive Conners CP3100’s will be 30,000 hours / 75 = 400 hours.

Solution: Redundant Arrays We add redundancy to recover in the event of failure of a single component. Break disks into groups, each with a pool of disks for redundancy. When one disk in the group fails, we use the redundant information to replicate that which has been lost. Periodic replacement of failed disks MTTR = Mean Time To Repair

RAID Level 1: Mirroring Raid level 1 mirrors data across two disks, creating a redundant copy. Tandem: doubles the number of controllers for fault tolerance. Every write to a data disk is also a write to a check disk. (increased overhead) A read costs essentially the same as with a single disk.

RAID Level 2: Hamming Code for EEC Computer architect designers added redundant chips to correct single errors and detect double errors in groups of chips. These same principals can be applied to secondary storage by bit-interleaving data across disks and adding enough check bits to correct errors. Poor performance on small reads/writes. 10 data disks require 4 check disks, 25 data disks require 5 check disks.

RAID Level 3 Single Check Disk Per Group Most check disks in level 2 are used to determine which disk failed. Often, disk controllers provide the same functionality. If we know which disk failed, then we only need one check disk per group. Poor performance on small reads/writes.

RAID Level 4 Parallel Reads Interleave sectors instead of bits. This sector by sector interleaving allows us to read multiple sectors at a time. However, writing still requires exclusive access to the check disk which results in poor write performance.

RAID Level 5 No Single Check Disk To avoid exclusive writes, we spread check data across all of the data disks.

Raid Levels 2-5 Performance All figures for a group size of 10 disks.

Performance Relative To SLED

Conclusions RAID is a cost effective option to achieve improved size and performance for secondary storage I/O.

Evaluation Overall, This was an excellent argument supporting the use of RAID instead of Single Large Expensive Disks. The credibility of the argument is slightly tainted by the fact that faults having to do with cabling and packaging are ignored. Some of the particulars may be outdated due to the fact that this paper was published in 1988.

Citations Patterson, David A., “Garth Gibson, and Randy H. Katz. “A Case for Redundant Arrays of Inexpensive Disks (RAID)” Proceedings of the 1998 ACM SIGMOD international conference on Management of data. (1993): 109-116. Patterson, David A., “Latency Lags Bandwidth”, Communications of the ACM. Volume 47. (2004): 71-75. Chen, Peter M., et. al. “RAID: High-Performance, Reliable Secondary Storage” ACM Computing Surveys. Oct 29, 1993. <http://citeseer.ist.psu.edu/chen94raid.html>