Αρχιτεκτονική Υπολογιστών Ενότητα # 6: RAID Διδάσκων: Γεώργιος Κ. Πολύζος Τμήμα: Πληροφορικής.

Slides:



Advertisements
Similar presentations
By Rakshith Venkatesh Outline What is RAID? RAID configurations used. Performance of each configuration. Implementations. Way.
Advertisements

A Case for Redundant Arrays Of Inexpensive Disks Paper By David A Patterson Garth Gibson Randy H Katz University of California Berkeley.
RAID Redundant Arrays of Independent Disks Courtesy of Satya, Fall 99.
Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
A CASE FOR REDUNDANT ARRAYS OF INEXPENSIVE DISKS (RAID) D. A. Patterson, G. A. Gibson, R. H. Katz University of California, Berkeley.
What is RAID Redundant Array of Independent Disks.
1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.
1 Jason Drown Mark Rodden (Redundant Array of Inexpensive Disks) RAID.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID A RRAYS Redundant Array of Inexpensive Discs.
Triple-Parity RAID and Beyond Hai Lu. RAID RAID, an acronym for redundant array of independent disks or also known as redundant array of inexpensive disks,
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
RAID Redundant Array of Independent Disks
CSCE430/830 Computer Architecture
 RAID stands for Redundant Array of Independent Disks  A system of arranging multiple disks for redundancy (or performance)  Term first coined in 1987.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
Chapter 3 Presented by: Anupam Mittal.  Data protection: Concept of RAID and its Components Data Protection: RAID - 2.
CSE521: Introduction to Computer Architecture Mazin Yousif I/O Subsystem RAID (Redundant Array of Independent Disks)
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
Sean Traber CS-147 Fall  7.9 RAID  RAID Level 0  RAID Level 1  RAID Level 2  RAID Level 3  RAID Level 4 
RAID Technology. Use Arrays of Small Disks? 14” 10”5.25”3.5” Disk Array: 1 disk design Conventional: 4 disk designs Low End High End Katz and Patterson.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
Computer ArchitectureFall 2008 © November 12, 2007 Nael Abu-Ghazaleh Lecture 24 Disk IO.
RAID Systems CS Introduction to Operating Systems.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
By : Nabeel Ahmed Superior University Grw Campus.
Redundant Array of Inexpensive Disks (RAID). Redundant Arrays of Disks Files are "striped" across multiple spindles Redundancy yields high data availability.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
RAID Shuli Han COSC 573 Presentation.
Lecture 4 1 Reliability vs Availability Reliability: Is anything broken? Availability: Is the system still available to the user?
Redundant Array of Independent Disks
RAID: High-Performance, Reliable Secondary Storage Mei Qing & Chaoxia Liao Nov. 20, 2003.
Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Chapter 7: Storage Systems Introduction Magnetic disks Buses RAID: Redundant Arrays of Inexpensive Disks.
RAID COP 5611 Advanced Operating Systems Adapted from Andy Wang’s slides at FSU.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
CSI-09 COMMUNICATION TECHNOLOGY FAULT TOLERANCE AUTHOR: V.V. SUBRAHMANYAM.
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
"1"1 Introduction to Managing Data " Describe problems associated with managing large numbers of disks " List requirements for easily managing large amounts.
The concept of RAID in Databases By Junaid Ali Siddiqui.
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
 Stands for Redundant Array of Independent Disks.  It’s a technology that enables greater levels of performance, reliability and/or large volumes when.
Seminar on RAID TECHNOLOGY Redundant Array of Independent Disk By CHANDAN.R 8 TH ISE, 1ap05is013 Under the guidance of Mr.Mithun.B.N, Lecturer,Dept.ISE.
Cloud Computing Vs RAID Group 21 Fangfei Li John Soh Course: CSCI4707.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID TECHNOLOGY RASHMI ACHARYA CSE(A) RG NO
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
CS Introduction to Operating Systems
CMPE Database Systems Workshop June 16 Class Meeting
RAID Non-Redundant (RAID Level 0) has the lowest cost of any RAID
Vladimir Stojanovic & Nicholas Weaver
HP AutoRAID (Lecture 5, cs262a)
RAID RAID Mukesh N Tekwani
ICOM 6005 – Database Management Systems Design
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
TECHNICAL SEMINAR PRESENTATION
UNIT IV RAID.
Mark Zbikowski and Gary Kimura
Mass-Storage Systems.
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
RAID RAID Mukesh N Tekwani April 23, 2019
Presentation transcript:

Αρχιτεκτονική Υπολογιστών Ενότητα # 6: RAID Διδάσκων: Γεώργιος Κ. Πολύζος Τμήμα: Πληροφορικής

RAID Redundant Array of Independent Disks David A. Patterson, Garth Gibson, and Randy H. Katz, A Case for Redundant Arrays of Inexpensive Disks (RAID) Proc ACM SIGMOD international conference on management of data, Chicago, IL, USA, pp , June DOI= / Based on slides by… Rakshith Venkatesh, John Kubiatowicz,and Anthony D. Joseph

What is RAID? Stands for Redundant Array of Independent Disks enables greater levels of Performance reliability By concurrent use of two or more ‘hard disk drives’ Through: Mirroring Stripping (of data) and Error correction techniques

Array Reliability Reliability of N disks = Reliability of 1 Disk ÷ N 50,000 Hours ÷ 70 disks = 700 hours Disk system MTTF: Drops from 6 years to 1 month! Arrays (without redundancy) too unreliable to be useful! Hot spares support reconstruction in parallel with access: very high media availability can be achieved

RAID Basics (1) Levels of RAID (those in RED are actually used): –RAID 0: striping with no parity (just bandwidth) –RAID 1: Mirroring (simple, fast, but requires 2x storage) Reads faster, writes slower (why?) –RAID 2: bit interleaving with error-correcting codes (ECC) –Dedicated parity disk (RAID level 3), byte-level striping Dedicated parity disk is write bottleneck, since every write also writes parity

RAID Basics (2) RAID 4:dedicated parity disk, block-level striping RAID 5: Rotating parity disk, block-level striping –most popular; rotating disk spreads out parity load RAID 6: RAID 5 with two parity blocks (tolerates two failures) RAID 10: Combines RAID 1 and RAID 0 RAID 6 for today’s drive sizes…

RAID 0 (1) Splits data among two or more disks Provides good performance Lack of data redundancy o there is no fail over support o with this configuration

RAID 0 (2) In the diagram to the right o the odd blocks are written to disk 0 o the even blocks to disk 1 such that A1, A2, A3, A4, … would be the order of blocks read if read sequentially from the beginning Used in read only NFS systems and gaming systems

RAID 0 analysis (1) Failure Rate: MTBF of RAID0 is roughly proportional to the number of disks in the array Pr{disk fail} = 5%, then Pr{(at least) one fails} = 1 – Pr{none fails} = 1 – (1-0.05) 2 = 9.75% [independent!]

RAID 0 analysis (2) Performance: The fragments are written to their respective disks simultaneously on the same sector This allows smaller sections of the entire chunk of data to be read off the drive in parallel, hence good performance

RAID 1 RAID1 is ‘data mirroring’ Two copies of the data are held on two physical disks  the data is always identical Twice as many disks are required to store the same data when compared to RAID 0 Array continues to operate so long as at least one drive is functioning

RAID 1: Disk Mirroring/Shadowing Each disk is fully duplicated onto its "shadow“ Very high availability can be achieved Bandwidth sacrifice on write: –Logical write = two physical writes Reads may be optimized Most expensive solution: 100% capacity overhead recovery group

RAID 1 analysis Failure Rate: If Pr{disk fail} = 5%, then the probability of both the drives failing in a 2 disk array is P{both fail} = (0.05) 2 = 0.25%. Performance: If we use independent disk controllers for each disk, then we can increase the read or write speeds by doing operations in parallel

RAID 5 RAID 5 is an ideal combination of good performance, good fault tolerance and high capacity and storage efficiency An arrangement of parity and CRC to help rebuilding drive data in case of disk failures “Distributed Parity” is the key word here

RAID 5: High I/O Rate Parity Increasing Logical Disk Addresses 1 logical write becomes 4 physical I/Os Independent writes possible because of interleaved parity Reed-Solomon Codes ("Q") for protection during reconstruction D0D1D2 D3 P D4D5D6 P D7 D8D9P D10 D11 D12PD13 D14 D15 PD16D17 D18 D19 D20D21D22 D23 P Disk Columns Stripe Unit Targeted for mixed applications

Problems of Disk Arrays: Small Writes D0D1D2 D3 P D0' + + D1D2 D3 P' new data old data old parity XOR (1. Read) (2. Read) (3. Write) (4. Write) RAID-5: Small Write Algorithm 1 Logical Write = 2 Physical Reads + 2 Physical Writes

RAID 5 analysis (1) MTBF is slightly better than RAID 0 –This is because failure of one disk is not quite a harm –We need more time if 2 or more disks fail Performance is also as good as RAID 0, if not better. –We can read and write parallel blocks of data

RAID 5 analysis (2) One of the drawbacks is that the write involves heavy parity calculations by the RAID controller –Write operations are slower compared to RAID 0 Pretty useful for general purpose uses where ‘read’s’ are more frequent the ‘write’s’

RAID 10 Combines RAID 1 and RAID 0 both - good performance and good failover handling called ‘Nested RAID’

RAID 6+ RAID 6: seen as the best way to guarantee data integrity  uses double parity lesser MTBF compared to RAID5 drawback: longer write time

Implementations (1) Software based RAID: Software implementations are provided by many Operating Systems A software layer sits above the disk device drivers and provides an abstraction layer between the logical drives (RAIDs) and physical drives Server's processor is used to run the RAID software Used for simpler configurations like RAID0 and RAID1

Implementations (2) Hardware based RAID: hardware implementation of RAID requires (at least) a special-purpose RAID controller on a desktop system this may be built into the motherboard the processor is not used for RAID calculations A PCI-bus-based, IDE/ATA hard disk RAID controller, supporting levels 0, 1, and 01.

Road ahead… (1) The expanded use of RAID-6 and other dual-parity schemes is a virtual certainty RAID vendors to support "fast rebuild" features that can restore hundreds of gigabytes in just an hour or so!! Striping (of data) would extend across RAID groups -- not just across drives within a group Hot Spares!!

Road ahead… (2) Improved disk diagnostic features should offer more reliable predictions of impending drive failures, allowing the rebuild process to begin before an actual fault occurs

Τέλος Ενότητας # 6 Μάθημα: Αρχιτεκτονική Υπολογιστών Ενότητα # 6: RAID Διδάσκων: Γεώργιος Κ. Πολύζος Τμήμα: Πληροφορικής