Storage System: RAID Questions answered in this lecture: What is RAID? How does one trade-off between: performance, capacity, and reliability? What is.

Slides:



Advertisements
Similar presentations
Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
Advertisements

A CASE FOR REDUNDANT ARRAYS OF INEXPENSIVE DISKS (RAID) D. A. Patterson, G. A. Gibson, R. H. Katz University of California, Berkeley.
Faculty of Information Technology Department of Computer Science Computer Organization Chapter 7 External Memory Mohammad Sharaf.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID Redundant Array of Independent Disks
 RAID stands for Redundant Array of Independent Disks  A system of arranging multiple disks for redundancy (or performance)  Term first coined in 1987.
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Disks and RAID.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
Lecture 16 Raid. Device Protocol Variants Status checks: polling vs. interrupts Data: PIO vs. DMA Control: special instructions vs. memory-mapped I/O.
RAID CS5493/7493. RAID : What is it? Redundant Array of Independent Disks configured into a single logical storage unit.
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
Project Announcement Grading: Part A: 3/4 of total grade Part B: 1/4 of total grade Actual grade formula: 3/4*(Max(A1, 0.7*A2)) + 1/4*(Max(B_self, 0.9B_with_staff_code)))
Reliability of Disk Systems. Reliability So far, we looked at ways to improve the performance of disk systems. Next, we will look at ways to improve the.
1 Storage (cont’d) Disk scheduling Reducing seek time (cont’d) Reducing rotational latency RAIDs.
Other Disk Details. 2 Disk Formatting After manufacturing disk has no information –Is stack of platters coated with magnetizable metal oxide Before use,
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
High Performance Computing Course Notes High Performance Storage.
CS 333 Introduction to Operating Systems Class 18 - File System Performance Jonathan Walpole Computer Science Portland State University.
Cse Feb-001 CSE 451 Section February 24, 2000 Project 3 – VM.
I/O Systems and Storage Systems May 22, 2000 Instructor: Gary Kimura.
RAID Systems CS Introduction to Operating Systems.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
COMP25212 ARRAY OF DISKS Sergio Davies Feb/Mar 2014COMP25212 – Storage 2.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
CSE 321b Computer Organization (2) تنظيم الحاسب (2) 3 rd year, Computer Engineering Winter 2015 Lecture #4 Dr. Hazem Ibrahim Shehata Dept. of Computer.
Redundant Array of Independent Disks
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
RAID COP 5611 Advanced Operating Systems Adapted from Andy Wang’s slides at FSU.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
RAID REDUNDANT ARRAY OF INEXPENSIVE DISKS. Why RAID?
CS 153 Design of Operating Systems Spring 2015 Lecture 22: File system optimizations.
RAID Disk Arrays Hank Levy. 212/5/2015 Basic Problems Disks are improving, but much less fast than CPUs We can use multiple disks for improving performance.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Introduction to RAID Rogério Perino de Oliveira Neves Patrick De Causmaecker
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
RAID Presentation Raid is an acronym for “Redundant array of independent Drives”, or Redundant array of inexpensive drives”. The main concept of RAID is.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
W4118 Operating Systems Instructor: Junfeng Yang.
Reliability of Disk Systems. Reliability So far, we looked at ways to improve the performance of disk systems. Next, we will look at ways to improve the.
Lecture 17 Raid. Device Protocol Variants Status checks: polling vs. interrupts Data: PIO vs. DMA Control: special instructions vs. memory-mapped I/O.
Lecture 14 – RAID Thanks to Greg Ganger and Remzi Arapaci-Dusseau for slides Distributed Systems.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
Magnetic Disks Have cylinders, sectors platters, tracks, heads virtual and real disk blocks (x cylinders, y heads, z sectors per track) Relatively slow,
CS Introduction to Operating Systems
Storage HDD, SSD and RAID.
RAID Overview.
RAID Thanks to Greg Ganger and Remzi Arapaci-Dusseau for slides
Jonathan Walpole Computer Science Portland State University
Multiple Platters.
Disks and RAID.
Vladimir Stojanovic & Nicholas Weaver
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani
RAID Disk Arrays Hank Levy 1.
CSE 451: Operating Systems Spring 2005 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
TECHNICAL SEMINAR PRESENTATION
RAID Redundant Array of Inexpensive (Independent) Disks
UNIT IV RAID.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Autumn 2004 Redundant Arrays of Inexpensive Disks (RAID) Hank Levy 1.
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani April 23, 2019
CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
Presentation transcript:

Storage System: RAID Questions answered in this lecture: What is RAID? How does one trade-off between: performance, capacity, and reliability? What is RAID-0, RAID-1, RAID-4, and RAID-5? UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department CS 537 Introduction to Operating Systems Andrea C. Arpaci-Dusseau Remzi H. Arpaci-Dusseau

Motivation: Why use multiple disks? Capacity More disks allows us to store more data Performance Access multiple disks in parallel Each disk can be working on independent read or write Overlap seek and rotational positioning time for all Reliability Recover from disk (or single sector) failures Will need to store multiple copies of data to recover RAID: Redundant Array of Inexpensive/Independent Disks

Hardware vs. Software RAID Hardware RAID Storage box you attach to computer Same interface as single disk, but internally much more –Multiple disks –More complex controller –NVRAM (holding parity blocks) Software RAID OS (device driver layer) treats multiple disks like a single disk Software does all extra work Interface for both Linear array of bytes, just like a single disk (but larger)

RAID-0: Striping Stripe blocks across disks in a “chunk” size How to pick a reasonable chunk size? How to calculate where chunk # lives? Disk: Offset within disk:

RAID-0: Striping Evaluate for D disks Capacity: How much space is wasted? Performance: How much faster than 1 disk? Reliability: More or less reliable than 1 disk?

RAID-1: Mirroring Motivation: Handle disk failures Put copy (mirror or replica) of each chunk on another disk Capacity: Reliability: Performance:

RAID-4: Parity Motivation: Improve capacity Idea: Allocate parity block to encode info about blocks Parity checks all other blocks in stripe across other disks Parity block = XOR over others (gives “even” parity) Example: > Parity value? How do you recover from a failed disk? Example: x 0 0 and parity of 1 What is the failed value? P0P0 P1P1 P2P2 P3P3

RAID-4: Parity Capacity: Reliability: Performance: Reads Writes: How to update parity block? –Two different approaches Small number of disks (or large write): Large number of disks (or small write): –Parity disk is the bottleneck P0P0 P1P1 P2P2 P3P3

RAID-5: Rotated Parity Capacity: Reliability: Performance: Reads: Writes: Still requires 4 I/Os per write, but not always to same parity disk 03 6 P3P3 14 P2P2 9 2 P1P1 710 P0P Rotate location of parity across all disks

Advanced Issues What happens if more than one fault? Example: One disk fails plus “latent sector error” on another RAID-5 cannot handle two faults Solution: RAID-6 (e.g., RDP) Add multiple parity blocks Why is NVRAM useful? Example: What if update 2, don’t update P0 before power failure (or crash), and then disk 1 fails? NVRAM solution: Use to store blocks updated in same stripe –If power failure, can replay all writes in NVRAM Software RAID solution: Perform parity scrub over entire disk ’5 811 P0P0 P1P1 P2P2 P3P3

Conclusions RAID turns multiple disks into a larger, faster, more reliable disk RAID-0: Striping Good when performance and capacity really matter, but reliability doesn’t RAID-1: Mirroring Good when reliability and write performance matter, but capacity (cost) doesn’t RAID-5: Rotating Parity Good when capacity and cost matter or workload is read- mostly Good compromise choice