Lecture 21: Storage Systems

Slides:



Advertisements
Similar presentations
IT253: Computer Organization
Advertisements

I/O Chapter 8. Outline Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Lecture 23: Storage Systems Disk insides, characteristics, performance, reliability, technology trends, RAID systems Adapted from UCB CS252 S01, Revised.
CPE 442 io.1 Introduction To Computer Architecture CpE 442 I/O Systems.
RAID Technology. Use Arrays of Small Disks? 14” 10”5.25”3.5” Disk Array: 1 disk design Conventional: 4 disk designs Low End High End Katz and Patterson.
CS 430 – Computer Architecture Disks
I/O Systems Processor Cache Memory - I/O Bus Main Memory I/O Controller Disk I/O Controller I/O Controller Graphics Network interrupts.
CS4432: Database Systems II Data Storage - Lecture 2 (Sections 13.1 – 13.3) Elke A. Rundensteiner.
Disk Storage SystemsCSCE430/830 Disk Storage Systems CSCE430/830 Computer Architecture Lecturer: Prof. Hong Jiang Courtesy of Yifeng Zhu (U. Maine) Fall,
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
CS61C L16 Disks © UC Regents 1 CS61C - Machine Structures Lecture 16 - Disks October 20, 2000 David Patterson
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
CS61C L40 I/O: Disks (1) Garcia, Fall 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures.
Computer ArchitectureFall 2008 © November 12, 2007 Nael Abu-Ghazaleh Lecture 24 Disk IO.
CS4432: Database Systems II Lecture 2 Timothy Sutherland.
Disk Technologies. Magnetic Disks Purpose: – Long-term, nonvolatile, inexpensive storage for files – Large, inexpensive, slow level in the memory hierarchy.
Cs 61C L15 Disks.1 Patterson Spring 99 ©UCB CS61C Anatomy of I/O Devices: Magnetic Disks Lecture 15 March 10, 1999 Dave Patterson (http.cs.berkeley.edu/~patterson)
S.1 Review: Major Components of a Computer Processor Control Datapath Memory Devices Input Output Cache Main Memory Secondary Memory (Disk)
12/3/2004EE 42 fall 2004 lecture 391 Lecture #39: Magnetic memory storage Last lecture: –Dynamic Ram –E 2 memory This lecture: –Future memory technologies.
1 CS222: Principles of Database Management Fall 2010 Professor Chen Li Department of Computer Science University of California, Irvine Notes 01.
CS 61C L41 I/O Disks (1) Garcia, Spring 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
CS252/Culler Lec 6.1 2/7/02 CS252 Graduate Computer Architecture I/O Introduction: Storage Devices & RAID Jason Hill.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
CS4432: Database Systems II Data Storage (Better Block Organization) 1.
Storage & Peripherals Disks, Networks, and Other Devices.
Lecture 4 1 Reliability vs Availability Reliability: Is anything broken? Availability: Is the system still available to the user?
CPE 631: Storage Electrical and Computer Engineering University of Alabama in Huntsville Aleksandar Milenkovic
1 IKI20210 Pengantar Organisasi Komputer Kuliah No. 21: Peripheral 29 Nopember 2002 Bobby Nazief Johny Moningka
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
Lecture 3: 1 Introduction to Queuing Theory More interested in long term, steady state than in startup => Arrivals = Departures Little’s Law: Mean number.
I/O Computer Organization II 1 Introduction I/O devices can be characterized by – Behavior: input, output, storage – Partner: human or machine – Data rate:
Disk Storage SystemsCSCE430/830 Disk Storage Systems CSCE430/830 Computer Architecture Lecturer: Prof. Hong Jiang Courtesy of Yifeng Zhu (U. Maine) Fall,
Csci 136 Computer Architecture II – IO and Storage Systems Xiuzhen Cheng
Disk Basics CS Introduction to Operating Systems.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Magnetic Disk Rotational latency Example Find the average rotational latency if the disk rotates at 20,000 rpm.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
COSC 6340: Disks 1 Disks and Files DBMS stores information on (“hard”) disks. This has major implications for DBMS design! » READ: transfer data from disk.
Mohamed Younis CMCS 411, Computer Architecture 1 CMCS Computer Architecture Lecture 25 I/O Systems May 2,
1 Lecture 16: Data Storage Wednesday, November 6, 2006.
1 Components of the Virtual Memory System  Arrows indicate what happens on a lw virtual address data physical address TLB page table memory cache disk.
CMSC 611: Advanced Computer Architecture I/O & Storage Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Computer Architecture Principles Dr. Mike Frank
Hardware Technology Trends and Database Opportunities
Module: Storage Systems
Lecture 16: Data Storage Wednesday, November 6, 2006.
IT 251 Computer Organization and Architecture
Backing Store.
IT 344: Operating Systems Winter 2008 Module 13 Secondary Storage
Part V Memory System Design
Disks and Files DBMS stores information on (“hard”) disks.
Introduction to I/O Where does the data for our CPU and memory come from or go to? Computers communicate with the outside world via I/O devices. Input.
Introduction I/O devices can be characterized by I/O bus connections
Lecture 13 I/O.
CSE 451: Operating Systems Winter 2006 Module 13 Secondary Storage
Lecture 28: Reliability Today’s topics: GPU wrap-up Disk basics RAID
CSE 451: Operating Systems Autumn 2003 Lecture 12 Secondary Storage
CSE 451: Operating Systems Winter 2007 Module 13 Secondary Storage
Persistence: hard disk drive
Mass-Storage Systems.
CSE 451: Operating Systems Secondary Storage
CSE 451: Operating Systems Winter 2003 Lecture 12 Secondary Storage
CSE 451: Operating Systems Spring 2005 Module 13 Secondary Storage
CS252 Graduate Computer Architecture I/O Introduction: Storage Devices & RAID Jason Hill.
CSE 451: Operating Systems Autumn 2004 Secondary Storage
CSE 451: Operating Systems Winter 2004 Module 13 Secondary Storage
CS 245: Database System Principles Notes 02: Hardware
Presentation transcript:

Lecture 21: Storage Systems Disk insides, characteristics, performance, reliability, technology trends, RAID systems This lecture will cover sections 1.1-1.4: introduction, application, technology trends, cost and price. Adapted from UCB CS252 S01

I/O Systems Processor Cache Memory - I/O Bus Main Memory I/O interrupts Processor Cache Memory - I/O Bus Main Memory I/O Controller I/O Controller I/O Controller Graphics Disk Disk Network

Storage Technology Drivers Driven by the prevailing computing paradigm 1950s: migration from batch to on-line processing 1990s: migration to ubiquitous computing computers in phones, books, cars, video cameras, … nationwide fiber optical network with wireless tails Today: digital media everywhere Digital forms of voice, picture, and video Data from scientific computing such as earthquake simulation, high energy physical experiments, bioinformatics In forms of personal storages, web server, peer-to-peer storage, grid storage Effects on storage industry: Embedded storage smaller, cheaper, more reliable, lower power Data utilities high capacity, hierarchically managed storage

Magnetic Disks Purpose: Characteristics: Capacity Track Purpose: Long-term, nonvolatile storage Large, inexpensive, slow level in the storage hierarchy Characteristics: Seek Time (~8 ms avg) positional latency rotational latency Transfer rate 10-40 MByte/sec Blocks Capacity Gigabytes Quadruples every 2 years (aerodynamics) Sector Cylinder Platter Head 7200 RPM = 120 RPS => 8 ms per rev ave rot. latency = 4 ms 128 sectors per track => 0.25 ms per sector 1 KB per sector => 16 MB / s Response time = Queue + Controller + Seek + Rot + Xfer Service time

Photo of Disk Head, Arm, Actuator Spindle Arm Head Actuator Platters (12) {

{ Seagate Barracuda 180 + 181.6 GB, 3.5 inch disk 12 platters, 24 surfaces 24,247 cylinders 7,200 RPM; (4.2 ms avg. latency) 7.4/8.2 ms avg. seek (r/w) 64 to 35 MB/s (internal) 0.1 ms controller time 10.3 watts (idle) Track Sector Cylinder Track Buffer Arm Platter Head Latency = Queuing Time + Controller time + Seek Time + Rotation Time + Size / Bandwidth { 47 to 26 MB/s (external) per access + per byte source: www.seagate.com

Disk Performance Factors Actual disk seek and rotation time depends on the current head position Seek time: how far is the head to the track? Disk industry standard: assume random position of the head, e.g., average 8ms seek time In practice: disk accesses have locality Rotation time: how far is the head to sector? Can safely assume ½ of rotation time (disk keeps rotating) 10000 Revolutions Per Minute  166.67 Rev/sec 1 revolution = 1/ 166.67 sec  6.00 ms 1/2 rotation (revolution)  3.00 ms Data Transfer time: What are the rotation speed, disk density, and sectors per transfer? 10000 RPM  a track of data per 6.00 ms Outer tracks are longer and may support higher bandwidth

Disk Performance Example Rule of Thumb: Observed average seek time is typically about 1/4 to 1/3 of quoted seek time (i.e., 3X-4X faster) Rule of Thumb: disks deliver about 3/4 of internal media rate (1.3X slower) for data Calculate time to read 64 KB for UltraStar 72 again, using 1/3 quoted 7.4ms seek time, 3/4 of 64MB/s internal outer track bandwidth Disk latency = average seek time + average rotational delay + transfer time + controller overhead = (0.33 * 7.4 ms) + 0.5 * 1/(7200 RPM/(60000ms/M)) + 64 KB / (0.75 * 65 MB/s) + 0.1 ms = 2.5 ms + 0.5 /(7200 RPM/(60000ms/M)) + 64 KB / (47 KB/ms) + 0.1 ms = 2.5 + 4.2 + 1.4 + 0.1 ms = 8.2 ms (64% of 12.7)

Disk Characteristics in 2000 $828 $447 $435

Disk Performance/Cost Trends Capacity + 100%/year (2X / 1.0 yrs) Transfer rate (BW) + 40%/year (2X / 2.0 yrs) Rotation + Seek time – 8%/ year (1/2 in 10 yrs) MB/$ > 100%/year (2X / 1.0 yrs) Fewer chips + areal density Seagate 120.0GB Internal Hard Drive ST3120026A, $150 at staple (list price) Maxtor 120GB 8MB Cache Hard Drive $59.84 after rebate at OfficeDepot IBM Microdrive

Disk System Performance Response Time (ms) Throughput (% total BW) 100 200 300 0% System-level Metrics: Response Time Throughput Response time = Queue + Controller + service time (√) 100% Proc Queue IOC Device Response time = Queue + Device Service time

How About Queuing Time? Queuing time can be the most significant one in disk response time More interested in long term, steady state than in startup => Arrivals = Departures Little’s Law: Mean number tasks in system = arrival rate x mean reponse time Applies to any system in equilibrium, as long as nothing in black box is creating or destroying tasks Arrivals Departures

A Little Queuing Theory: Notation Proc IOC Device Queue server System Queuing models assume state of equilibrium: input rate = output rate Notation: r average number of arriving customers/second Tser average time to service a customer (tradtionally µ = 1/ Tser ) u server utilization (0..1): u = r x Tser (or u = r / µ ) Tq average time/customer in queue Tsys average time/customer in system: Tsys = Tq + Tser Lq average length of queue: Lq = r x Tq Lsys average length of system: Lsys = r x Tsys Little’s Law: Lengthserver = rate x Timeserver (Mean number customers = arrival rate x mean service time) Sometimes Queue called waiting line, System called Queue

A Little Queuing Theory: Example processor sends 50 x 8KB disk I/Os per sec, requests & service exponentially distrib., avg. disk service = 12 ms On average, how utilized is the disk? What is the number of requests in the queue? What is the average time a spent in the queue? What is the average response time for a disk request? Notation: r average number of arriving customers/second= 50 Tser average time to service a customer= 12 ms u server utilization (0..1): u = r x Tser= 50/s x .012s = 0.60 Tq average time/customer in queue = Ts er x u / (1 – u) = 12x 0.60/(1-0.60) = 12x1.5 = 18 ms Tsys average time/customer in system: Tsys =Tq +Tser= 30 ms Lq average length of queue:Lq= r x Tq = 50/s x 0.018s = 0.9 requests in queue Lsys average # tasks in system : Lsys = r x Tsys = 50/s x 0.030s = 1.5 Look into textbook when you need to work on I/O

How to build Large Storage: Disk Array Controller String Controller . . . String Controller . . . String Controller . . . String Controller . . . String Controller . . . String Controller . . . Not practical to build large disks

Array Reliability Reliability of N disks = Reliability of 1 Disk ÷ N 50,000 Hours ÷ 70 disks = 700 hours Disk system MTTF: Drops from 6 years to 1 month! (MTTF: Mean Time to Failure) • Arrays (without redundancy) too unreliable to be useful! Solution: RAID -- Redundant Arrays of Inexpensive Disks

RAID: The Idea 10010011 11001101 . . . P 1 logical record If disk fails, subtract P from sum of other disks to find missing information 10010011 11001101 . . . logical record P 1 Striped physical records P contains sum of other disks per stripe mod 2 (“parity”) RAID-3 shown

RAID 4: High I/O Rate Parity Increasing Logical Disk Address D0 D1 D2 D3 P Insides of 5 disks D4 D5 D6 D7 P D8 D9 D10 D11 P Example: small read D0 & D5, large write D12-D15 Stripe D12 D13 D14 D15 P D16 D17 D18 D19 P D20 D21 D22 D23 P . . . . . Disk Columns

RAID 5: High I/O Rate Interleaved Parity Increasing Logical Disk Addresses P Independent writes possible because of interleaved parity D4 D5 D6 P D7 D8 D9 P D10 D11 D12 P D13 D14 D15 Example: write to D0, D5 uses disks 0, 1, 3, 4 P D16 D17 D18 D19 No disk hot spot! D20 D21 D22 D23 P . . . . . Disk Columns

Future Storage Trends Disks: Tapes Other Storage Extraodinary advance in capacity/drive, $/GB Currently 17 Gbit/sq. inch; can continue past 100 Gbit/sq. inch? Bandwidth, seek time not keeping up: 3.5 inch form factor makes sense? 2.5 inch form factor in near future? 1.0 inch form factor in long term? Tapes Old technique, no investment in innovation Are they already dead? What is a tapeless backup system? Other Storage CD/DVD Compact Flash, USB key storage, MRAM