1 Storage Refinement. Outline Disk failures To attack Intermittent failures To attack Media Decay and Write failure –Checksum To attack Disk crash –RAID.

Slides:



Advertisements
Similar presentations
1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.
Advertisements

CS 346 – April 4 Mass storage –Disk formatting –Managing swap space –RAID Commitment –Please finish chapter 12.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID A RRAYS Redundant Array of Inexpensive Discs.
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
 RAID stands for Redundant Array of Independent Disks  A system of arranging multiple disks for redundancy (or performance)  Term first coined in 1987.
“Redundant Array of Inexpensive Disks”. CONTENTS Storage devices. Optical drives. Floppy disk. Hard disk. Components of Hard disks. RAID technology. Levels.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
1 Magnetic Disks 1956: IBM (RAMAC) first disk drive 5 Mb – Mb/in $/year 9 Kb/sec 1980: SEAGATE first 5.25’’ disk drive 5 Mb – 1.96 Mb/in2 625.
CS5226 Hardware Tuning. 2 Application Programmer (e.g., business analyst, Data architect) Sophisticated Application Programmer (e.g., SAP admin) DBA,
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
CSE521: Introduction to Computer Architecture Mazin Yousif I/O Subsystem RAID (Redundant Array of Independent Disks)
Chapter 5: Server Hardware and Availability. Hardware Reliability and LAN The more reliable a component, the more expensive it is. Server hardware is.
CSE 486/586 CSE 486/586 Distributed Systems Case Study: Facebook f4 Steve Ko Computer Sciences and Engineering University at Buffalo.
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
Sean Traber CS-147 Fall  7.9 RAID  RAID Level 0  RAID Level 1  RAID Level 2  RAID Level 3  RAID Level 4 
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
RAID Technology CS350 Computer Organization Section 2 Larkin Young Rob Deaderick Amos Painter Josh Ellis.
Performance/Reliability of Disk Systems So far, we looked at ways to improve the performance of disk systems. Next, we will look at ways to improve the.
CPSC-608 Database Systems Fall 2008 Instructor: Jianer Chen Office: HRBB 309B Phone: Notes #6.
Other Disk Details. 2 Disk Formatting After manufacturing disk has no information –Is stack of platters coated with magnetizable metal oxide Before use,
OS and Hardware Tuning. Tuning Considerations Hardware  Storage subsystem Configuring the disk array Using the controller cache  Components upgrades.
I/O Systems and Storage Systems May 22, 2000 Instructor: Gary Kimura.
RAID Systems CS Introduction to Operating Systems.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
Lecture 39: Review Session #1 Reminders –Final exam, Thursday 3:10pm Sloan 150 –Course evaluation (Blue Course Evaluation) Access through.
By : Nabeel Ahmed Superior University Grw Campus.
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing Kai Hwang, Hai Jin, and Roy Ho.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
DISKS IS421. DISK  A disk consists of Read/write head, and arm  A platter is divided into Tracks and sector  The R/W heads can R/W at the same time.
RAID Shuli Han COSC 573 Presentation.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
Redundant Array of Independent Disks
RAID: High-Performance, Reliable Secondary Storage Mei Qing & Chaoxia Liao Nov. 20, 2003.
Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.
Managing Multi-User Databases AIMS 3710 R. Nakatsu.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
CSI-09 COMMUNICATION TECHNOLOGY FAULT TOLERANCE AUTHOR: V.V. SUBRAHMANYAM.
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
1/14/2005Yan Huang - CSCI5330 Database Implementation – Storage and File Structure Storage and File Structure.
CS Hardware Tuning Xiaofang Zhou School of Computing, NUS Office: S URL:
CS Operating System & Database Performance Tuning Xiaofang Zhou School of Computing, NUS Office: S URL:
"1"1 Introduction to Managing Data " Describe problems associated with managing large numbers of disks " List requirements for easily managing large amounts.
- Disk failure ways and their mitigation - Priya Gangaraju(Class Id-203)
The concept of RAID in Databases By Junaid Ali Siddiqui.
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
Raid Techniques. Redundant Array of Independent Disks RAID is a great system for increasing speed and availability of data. More data protection than.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Disk Failures Skip. Index 13.4 Disk Failures Intermittent Failures Organizing Data by Cylinders Stable Storage Error- Handling.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
RAID TECHNOLOGY RASHMI ACHARYA CSE(A) RG NO
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
CS Introduction to Operating Systems
Managing Multi-User Databases
Vladimir Stojanovic & Nicholas Weaver
CS 554: Advanced Database System Notes 02: Hardware
RAID RAID Mukesh N Tekwani
ICOM 6005 – Database Management Systems Design
TECHNICAL SEMINAR PRESENTATION
UNIT IV RAID.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
RAID RAID Mukesh N Tekwani April 23, 2019
Presentation transcript:

1 Storage Refinement

Outline Disk failures To attack Intermittent failures To attack Media Decay and Write failure –Checksum To attack Disk crash –RAID

3 Disk Failures Partial  Total Intermittent  Permanent

Disk failures Intermittent failure –Repeated tries will be successful Media decay –A bit or bits are permanently corrupted Write failure –Neither can we write successfully, nor can we retrieve the previous written sector Disk crash –The entire disk becomes unreadable, suddenly and permanently

5 Coping with Disk Failures Detection –e.g. Checksum Correction  Redundancy

Outline Disk failures To attack Intermittent failures To attack Media Decay and Write failure –Checksum To attack Disk crash –RAID

Intermittent failures Reading function returns (w,s) –W: data –S: status If s is bad, repeat reading enough times

Outline Disk failures To attack Intermittent failures To attack Media Decay and Write failure –Checksum –Stable storage To attack Disk crash –RAID

Checksums Parity check –Even 1’s –For example: > parity bit is 1 –Any one bit error in reading or writing can be detected –Even bits errors can not be detected

Stable storage Basic idea –Sectors are paired and represent the same sector- contents Error-handling capabilities –Media failure Except that both copy failed, which is rare –Write failure Logical BlockCopy A Copy B

CS 245Notes 211 At what level do we cope? Single Disk –e.g., Error Correcting Codes Disk Array Logical Physical

Outline Disk failures To attack Intermittent failures To attack Media Decay and Write failure –Checksum To attack Disk crash –RAID

Disk crashes Failure model –Mean time to failure RAID Storage System –Providing fault-tolerance by redundancy –Improving the performance

RAID Storage System Redundant Array of Inexpensive Disks –Combine multiple small, inexpensive disk drives into a group to yield performance exceeding that of one large, more expensive drive –Appear to the computer as a single virtual drive –Support fault-tolerance by redundantly storing information in various ways

RAID Types Five types of array architectures, RAID 1 ~ 5 –Different disk fault-tolerance –Different trade-offs in features and performance A non-redundant array of disk drives is often referred to as RAID 0 Only RAID 0, 1, 3 and 5 are commonly used –RAID 2 and 4 do not offer any significant advantages over these other types Certain combination is possible (10, 35 etc) –RAID 10 = RAID 1 + RAID 0

factors Redundancy(cost, usage) Read and write performance 16

17 RAID 0 - Striping No redundancy –No fault tolerance High I/O performance –Parallel I/O RAID 0 implements a striped disk array, the data is broken down into blocks and each block is written to a separate disk drive I/O performance is greatly improved by spreading the I/O load across many channels and drives

RAID 1 – Mirroring Provide good fault tolerance –Works ok if one disk in a pair is down One write = a physical write on each disk One read = either read both or read the less busy one –Could double the read rate Low usage rate, only ½ disks are used

RAID 3 - Parallel Array with Parity One disk for parity, data disks are organized as stripes –N times disk i/o on Parity disks Usage rate: (N-1)/N disks are used N times I/O workloads on parity disk

example Disk 1,2,3 are data disks and disk 4 is for parity checking –If disk 1: disk 2: disk 3: –Then disk 4: if disk1 crashed, according to –disk 2: , –disk 3: , –disk 4: – we can recover disk 1:

Read/Write in parity-based RAID Read: parallel stripes read from multiple disks –Good performance Write: 2 reads + 2 writes –Read old data stripe; read parity stripe (2 reads) –Pnew=(Sold xor Snew) xor Pold XOR old data stripe with new data stripe. XOR result into parity stripe. –Write new data stripe and new parity stripe (2 writes). Bad for write-intensive apps

Example for Write on Pairty- based RAID After writing new data, we need to ensuer the correctness of parity in redundant disk Naïve approach: read another n-1 data disk, recalculate the parity, consuming extra n-1 I/O Good strategy: 4 I/O Example: –Change disk 2 to , parity is –Sold xor Snew= xor = –Pnew= xor =

RAID 5 – Parity Checking Each stripe unit has an extra parity stripe –Parity stripes are distributed N-1 disks are used, the same as RAID3

RAID 10 – Striped Mirroring RAID 10 = Striping + mirroring –A striped array of RAID 1 arrays High performance of RAID 0, and high tolerance of RAID 1 (at the cots of doubling disks).. More information about RAID disks at

Hardware vs. Software RAID Software RAID (volume manager) –Software RAID: run on the server’s CPU –Directly dependent on server CPU performance and load –Occupies host system memory and CPU operation, degrading server performance Hardware RAID (array controller) –Hardware RAID: run on the RAID controller’s CPU –Does not occupy any host system memory. Is not operating system dependent –Host CPU can execute applications while the array adapter's processor simultaneously executes array functions: true hardware multi-tasking

26 Which RAID Level to Use? Data and Index Files read intensive –RAID 5 is best suited for read intensive apps or if the RAID controller cache is effective enough. write intensive –RAID 10 is best suited for write intensive apps. Log File –RAID 1 is appropriate Fault tolerance with high write throughput. Writes are synchronous and sequential. No benefits in striping. Temporary Files –RAID 0 is appropriate. No fault tolerance. High throughput.

Comparing RAID Levels RAID 0RAID 1RAID 5RAID 10 ReadHigh2XHigh WriteHigh1XMediumHigh Fault tolerance NoYes Disk utilization HighLowHighLow Key problems Data lost when any disk fails Use double the disk space Lower throughput with disk failure Very expensive, not scalable Key advantages High I/O performance Very high I/O performance A good overall balance High reliability with good performance

RAID Levels - Data Settings: accounts( number, branchnum, balance); create clustered index c on accounts(number); – rows –Cold Buffer –Dual Xeon (550MHz,512Kb), 1Gb RAM, Internal RAID controller from Adaptec (80Mb), 4x18Gb drives (10000RPM), Windows 2000.

RAID Levels - Transactions No Concurrent Transactions: –Read Intensive: select avg(balance) from accounts; –Write Intensive, e.g. typical insert: insert into accounts values (690466,6840, ); Writes are uniformly distributed.

RAID Levels SQL Server7 on Windows 2000 (SoftRAID means striping/parity at host) Read-Intensive: –Using striping (RAID0, RAID 10, RAID5) increases throughput significantly. Write-Intensive: –Without cache, RAID 5 suffers. With cache, it is ok.

31 Controller Pre-fetching No, Write-back Yes Read-ahead: –Prefetching at the disk controller level. –No information on access pattern. –Better to let database management system do it. Write-back vs. write through: –Write back: transfer terminated as soon as data is written to cache. Batteries to guarantee write back in case of power failure Fast cache flushing is a priority –Write through: transfer terminated as soon as data is written to disk.

Summary: What RAID Provides Fault tolerance –It does not prevent disk drive failures –It enables real-time data recovery High I/O performance Mass data capacity Configuration flexibility Lower protected storage costs Easy maintenance

SCSI Controller Cache - Data Settings: employees(ssnum, name, lat, long, hundreds1, hundreds2); create clustered index c on employees(hundreds2); –Employees table partitioned over two disks; Log on a separate disk; same controller (same channel). – rows per table –Database buffer size limited to 400 Mb. –Dual Xeon (550MHz,512Kb), 1Gb RAM, Internal RAID controller from Adaptec (80Mb), 4x18Gb drives (10000RPM), Windows 2000.

SCSI (not disk) Controller Cache - Transactions No Concurrent Transactions: update employees set lat = long, long = lat where hundreds2 = ?; –cache friendly: update of 20,000 rows (~90Mb) –cache unfriendly: update of 200,000 rows (~900Mb)

35 SCSI Controller Cache SQL Server 7 on Windows Adaptec ServerRaid controller: –80 Mb RAM –Write-back mode Updates Controller cache increases throughput whether operation is cache friendly or not. –Efficient replacement policy!