1 Lecture 23: Storage Systems Disk insides, characteristics, performance, reliability, technology trends, RAID systems Adapted from UCB CS252 S01, Revised.

Slides:



Advertisements
Similar presentations
CPS216: Data-Intensive Computing Systems Data Access from Disks Shivnath Babu.
Advertisements

1 Lecture 22: I/O, Disk Systems Todays topics: I/O overview Disk basics RAID Reminder: Assignment 8 due Tue 11/21.
I/O InterfaceCS510 Computer ArchitecturesLecture Lecture 17 I/O Interfaces and I/O Busses.
IT253: Computer Organization
CS61C L16 Disks © UC Regents 1 CS161 Lecture 25 - Disks Laxmi Bhuyan
Disk Storage SystemsCSCE430/830 Disk Storage Systems CSCE430/830 Computer Architecture Lecturer: Prof. Hong Jiang Courtesy of Yifeng Zhu (U. Maine) Fall,
Secondary Storage Devices: Magnetic Disks
Overview of Mass Storage Structure
RAID Redundant Arrays of Independent Disks Courtesy of Satya, Fall 99.
I/O Chapter 8. Outline Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
CPSC 614:Graduate Computer Architecture Prof
1 CDA3101 Fall 2013 Computer Storage: Practical Aspects 6,13 November 2013 Copyright © 2011 Prabhat Mishra.
Mass StorageCS510 Computer ArchitecturesLecture Lecture 18 Mass Storage Devices: RAID, Data Library, MSS.
CS 136, Advanced Architecture Storage. CS136 2 Case for Storage Shift in focus from computation to communication and storage of information –E.g., Cray.
88 CHAPTER SECONDARY STORAGE. © 2005 The McGraw-Hill Companies, Inc. All Rights Reserved. 8-2 Competencies Distinguish between primary & secondary storage.
1 CSCI 504 Computer Organization Lecture 5: I/O Introduction: Storage Devices & RAID Dr. Yuan-Shun Dai Computer Science 504 Fall 2004 Adapted from D.A.
CS224 Spring 2011 Computer Organization CS224 Chapter 6A: Disk Systems With thanks to M.J. Irwin, D. Patterson, and J. Hennessy for some lecture slide.
I/O Management and Disk Scheduling
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
CPE 731 Advanced Computer Architecture Storage
1 IKI10230 Pengantar Organisasi Komputer Bab 5.3: On-line Storage 23 April 2003 Bobby Nazief Qonita Shahab bahan.
Secondary Storage Memory
CMSC 611: Advanced Computer Architecture
Storage and Disks Fusheng Wang Department of Biomedical Informatics
CSE431 Chapter 6A.1Irwin, PSU, 2008 Chapter 6A: Disk Systems Mary Jane Irwin ( ) [Adapted from Computer Organization.
1 Disks Introduction ***-. 2 Disks: summary / overview / abstract The following gives an introduction to external memory for computers, focusing mainly.
Princess Sumaya Univ. Computer Engineering Dept. Chapter 6:
Computing ESSENTIALS CHAPTER Copyright 2003 The McGraw-Hill Companies, Inc.Copyright 2003 The McGraw-Hill Companies, Inc Secondary Storage computing.
GRAP 3175 Computer Applications for Drafting Unit II Computer Hardware.
Storage and Disks.
EE898 Lec /22/04 EE Architecture of Digital Systems Lecture 3: I/O Introduction and A Little Queueing Theory Prof. Seok-Bum Ko.
CS61C L13 I/O © UC Regents 1 CS 161 Chapter 8 - I/O Lecture 17.
CPE 442 io.1 Introduction To Computer Architecture CpE 442 I/O Systems.
CSCE 212 Chapter 8 Storage, Networks, and Other Peripherals Instructor: Jason D. Bakos.
RAID Technology. Use Arrays of Small Disks? 14” 10”5.25”3.5” Disk Array: 1 disk design Conventional: 4 disk designs Low End High End Katz and Patterson.
CS 430 – Computer Architecture Disks
Disk Storage SystemsCSCE430/830 Disk Storage Systems CSCE430/830 Computer Architecture Lecturer: Prof. Hong Jiang Courtesy of Yifeng Zhu (U. Maine) Fall,
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
CS61C L16 Disks © UC Regents 1 CS61C - Machine Structures Lecture 16 - Disks October 20, 2000 David Patterson
CS61C L40 I/O: Disks (1) Garcia, Fall 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures.
Disk Technologies. Magnetic Disks Purpose: – Long-term, nonvolatile, inexpensive storage for files – Large, inexpensive, slow level in the memory hierarchy.
Cs 61C L15 Disks.1 Patterson Spring 99 ©UCB CS61C Anatomy of I/O Devices: Magnetic Disks Lecture 15 March 10, 1999 Dave Patterson (http.cs.berkeley.edu/~patterson)
12/3/2004EE 42 fall 2004 lecture 391 Lecture #39: Magnetic memory storage Last lecture: –Dynamic Ram –E 2 memory This lecture: –Future memory technologies.
1 CS222: Principles of Database Management Fall 2010 Professor Chen Li Department of Computer Science University of California, Irvine Notes 01.
CS 61C L41 I/O Disks (1) Garcia, Spring 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Storage & Peripherals Disks, Networks, and Other Devices.
Lecture 4 1 Reliability vs Availability Reliability: Is anything broken? Availability: Is the system still available to the user?
1 IKI20210 Pengantar Organisasi Komputer Kuliah No. 21: Peripheral 29 Nopember 2002 Bobby Nazief Johny Moningka
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
Chapter 111 Chapter 11: Hardware (Slides by Hector Garcia-Molina,
Lecture 3: 1 Introduction to Queuing Theory More interested in long term, steady state than in startup => Arrivals = Departures Little’s Law: Mean number.
I/O Computer Organization II 1 Introduction I/O devices can be characterized by – Behavior: input, output, storage – Partner: human or machine – Data rate:
Disk Storage SystemsCSCE430/830 Disk Storage Systems CSCE430/830 Computer Architecture Lecturer: Prof. Hong Jiang Courtesy of Yifeng Zhu (U. Maine) Fall,
Csci 136 Computer Architecture II – IO and Storage Systems Xiuzhen Cheng
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Magnetic Disk Rotational latency Example Find the average rotational latency if the disk rotates at 20,000 rpm.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
COSC 6340: Disks 1 Disks and Files DBMS stores information on (“hard”) disks. This has major implications for DBMS design! » READ: transfer data from disk.
Mohamed Younis CMCS 411, Computer Architecture 1 CMCS Computer Architecture Lecture 25 I/O Systems May 2,
1 Components of the Virtual Memory System  Arrows indicate what happens on a lw virtual address data physical address TLB page table memory cache disk.
CMSC 611: Advanced Computer Architecture I/O & Storage Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Module: Storage Systems
IT 251 Computer Organization and Architecture
Backing Store.
Part V Memory System Design
Disks and Files DBMS stores information on (“hard”) disks.
Introduction I/O devices can be characterized by I/O bus connections
Lecture 21: Storage Systems
Presentation transcript:

1 Lecture 23: Storage Systems Disk insides, characteristics, performance, reliability, technology trends, RAID systems Adapted from UCB CS252 S01, Revised by Zhao Zhang

2 I/O Systems Processor Cache Memory - I/O Bus Main Memory I/O Controller Disk I/O Controller I/O Controller Graphics Network interrupts

3 Storage Technology Drivers Driven by the prevailing computing paradigm 1950s: migration from batch to on-line processing 1990s: migration to ubiquitous computing computers in phones, books, cars, video cameras, … nationwide fiber optical network with wireless tails Today: digital media everywhere Digital forms of voice, picture, and video Data from scientific computing such as earthquake simulation, high energy physical experiments, bioinformatics In forms of personal storages, web server, peer-to-peer storage, grid storage Effects on storage industry: Embedded storage smaller, cheaper, more reliable, lower power Data utilities high capacity, hierarchically managed storage

4 Magnetic Disks Purpose: Long-term, nonvolatile storage Large, inexpensive, slow level in the storage hierarchy Characteristics: Seek Time (~8 ms avg) positional latency rotational latency Transfer rate MByte/sec Blocks Capacity Gigabytes Quadruples every 2 years Sector Track Cylinder Head Platter 7200 RPM = 120 RPS => 8 ms per rev ave rot. latency = 4 ms 128 sectors per track => 0.25 ms per sector 1 KB per sector => 16 MB / s Response time = Queue + Controller + Seek + Rot + Xfer Service time

5 Photo of Disk Head, Arm, Actuator Actuator Arm Head Platters (12) { Spindle

6 Seagate Barracuda GB, 3.5 inch disk 12 platters, 24 surfaces 24,247 cylinders 7,200 RPM; (4.2 ms avg. latency) 7.4/8.2 ms avg. seek (r/w) 64 to 35 MB/s (internal) 0.1 ms controller time 10.3 watts (idle) source: Latency = Queuing Time + Controller time + Seek Time + Rotation Time + Size / Bandwidth per access per byte { + Sector Track Cylinder Head Platter Arm Track Buffer

7 Disk Performance Factors Actual disk seek and rotation time depends on the current head position Seek time: how far is the head to the track? Disk industry standard: assume random position of the head, e.g., average 8ms seek time In practice: disk accesses have locality Rotation time: how far is the head to sector? Can safely assume ½ of rotation time (disk keeps rotating) Revolutions Per Minute Rev/sec 1 revolution = 1/ sec 6.00 ms 1/2 rotation (revolution) 3.00 ms Data Transfer time: What are the rotation speed, disk density, and sectors per transfer? RPM a track of data per 6.00 ms Outer tracks are longer and may support higher bandwidth

8 Disk Performance Example Rule of Thumb: Observed average seek time is typically about 1/4 to 1/3 of quoted seek time (i.e., 3X-4X faster) Rule of Thumb: disks deliver about 3/4 of internal media rate (1.3X slower) for data Calculate time to read 64 KB for UltraStar 72, using 1/3 quoted 7.4ms seek time, 3/4 of 64MB/s internal outer track bandwidth Disk latency = average seek time + average rotational delay + transfer time + controller overhead = (0.33 * 7.4 ms) * 1/(7200 RPM/(60000ms/M)) + 64 KB / (0.75 * 65 MB/s) ms = 2.5 ms /(7200 RPM/(60000ms/M)) + 64 KB / (47 KB/ms) ms = ms = 8.2 ms (64% of 12.7)

9 Disk Characteristics in 2000 $447$435$828

10 Disk Performance/Cost Trends Capacity + 100%/year (2X / 1.0 yrs) Transfer rate (BW) + 40%/year (2X / 2.0 yrs) Rotation + Seek time – 8%/ year (1/2 in 10 yrs) MB/$ > 100%/year (2X / 1.0 yrs) Fewer chips + areal density Seagate 120GB Internal Hard Drive ST A, $150 at staple (list price) Maxtor 120GB 8MB Cache Hard Drive $59.84 after rebate at OfficeDepot IBM Microdrive

11 Disk System Performance Response time = Queue + Device Service time 100% Response Time (ms) Throughput (% total BW) % Proc Queue IOCDevice System-level Metrics: Response Time Throughput Response time = Queue + Controller + service time ()

12 How About Queuing Time? Queuing time can be the most significant one in disk response time More interested in long term, steady state than in startup => Arrivals = Departures Littles Law: Mean number tasks in system = arrival rate x mean reponse time Applies to any system in equilibrium, as long as nothing in black box is creating or destroying tasks ArrivalsDepartures

13 A Little Queuing Theory: Notation Queuing models assume state of equilibrium: input rate = output rate Notation: r average number of arriving customers/second T ser average time to service a customer (tradtionally µ = 1/ T ser ) userver utilization (0..1): u = r x T ser (or u = r / µ ) T q average time/customer in queue = T s er x u / (1 –u) T sys average time/customer in system: T sys = T q + T ser L q average length of queue: L q = r x T q L sys average length of system: L sys = r x T sys Littles Law: Length server = rate x Time server (Mean number customers = arrival rate x mean service time) ProcIOCDevice Queue server System

14 A Little Queuing Theory: Example Processor sends 50 x 8KB disk I/Os per sec, requests & service exponentially distrib., avg. disk service = 12 ms On average, how is the disk utilized? What is the number of requests in the queue? What is the average time a spent in the queue? What is the average response time for a disk request? Notation: raverage number of arriving customers/second= 50 T ser average time to service a customer= 12 ms userver utilization (0..1): u = r x T ser = 50/s x.012s = 0.60 T q average time/customer in queue = T s er x u / (1 – u) = 12x 0.60/(1-0.60) = 12x1.5 = 18 ms T sys average time/customer in system: T sys =T q +T ser = 30 ms L q average length of queue:L q = r x T q = 50/s x 0.018s = 0.9 requests in queue L sys average # tasks in system : L sys = r x T sys = 50/s x 0.030s = 1.5 Look into textbook when you need to work on I/O

15 How to build Large Storage: Disk Array Array Controller String Controller String Controller String Controller String Controller String Controller String Controller... Not practical to build large disks

16 Array Reliability Reliability of N disks = Reliability of 1 Disk ÷ N 50,000 Hours ÷ 70 disks = 700 hours Disk system MTTF: Drops from 6 years to 1 month! (MTTF: Mean Time to Failure) Arrays (without redundancy) too unreliable to be useful! Solution: RAID -- Redundant Arrays of Inexpensive Disks

17 RAID: The Idea P logical record P contains sum of other disks per stripe mod 2 (parity) Striped physical records If disk fails, subtract P from sum of other disks to find missing information RAID-3 shown

18 RAID 4: High I/O Rate Parity D0D1D2 D3 P D4D5D6 PD7 D8D9 PD10 D11 D12 PD13 D14 D15 P D16D17 D18 D19 D20D21D22 D23 P Disk Columns Increasing Logical Disk Address Stripe Insides of 5 disks Example: small read D0 & D5, large write D12-D15 Example: small read D0 & D5, large write D12-D15

19 RAID 5: High I/O Rate Interleaved Parity Independent writes possible because of interleaved parity Independent writes possible because of interleaved parity D0D1D2 D3 P D4D5D6 P D7 D8D9P D10 D11 D12PD13 D14 D15 PD16D17 D18 D19 D20D21D22 D23 P Disk Columns Increasing Logical Disk Addresses Example: write to D0, D5 uses disks 0, 1, 3, 4 No disk hot spot!

20 Future Storage Trends Disks: Extraodinary advance in capacity/drive, $/GB Currently 17 Gbit/sq. inch; can continue past 100 Gbit/sq. inch? Bandwidth, seek time not keeping up: 3.5 inch form factor makes sense? 2.5 inch form factor in near future? 1.0 inch form factor in long term? Tapes Old technique, no investment in innovation Are they already dead? What is a tapeless backup system? Other Storage CD/DVD Compact Flash, USB key storage, MRAM