Abstract Increases in CPU and memory will be wasted if not matched by similar performance in I/O SLED vs. RAID 5 levels of RAID and respective cost/performance.

Slides:



Advertisements
Similar presentations
A Case for Redundant Arrays Of Inexpensive Disks Paper By David A Patterson Garth Gibson Randy H Katz University of California Berkeley.
Advertisements

Data Storage Solutions Module 1.2. Data Storage Solutions Upon completion of this module, you will be able to: List the common storage media and solutions.
1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.
CA 714CA Midterm Review. C5 Cache Optimization Reduce miss penalty –Hardware and software Reduce miss rate –Hardware and software Reduce hit time –Hardware.
10 REASONS Why it makes a good option for your DB IN-MEMORY DATABASES Presenter #10: Robert Vitolo.
Chapter 6 External Memory Disk and RAID (Redundant Arrays of Independent Disks) CS-147 Fall 2010 Jonathan Wang.
1 Magnetic Disks 1956: IBM (RAMAC) first disk drive 5 Mb – Mb/in $/year 9 Kb/sec 1980: SEAGATE first 5.25’’ disk drive 5 Mb – 1.96 Mb/in2 625.
CSCE 212 Chapter 8 Storage, Networks, and Other Peripherals Instructor: Jason D. Bakos.
RAID Technology. Use Arrays of Small Disks? 14” 10”5.25”3.5” Disk Array: 1 disk design Conventional: 4 disk designs Low End High End Katz and Patterson.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
Lecture 3: A Case for RAID (Part 1) Prof. Shahram Ghandeharizadeh Computer Science Department University of Southern California.
1 Storage Hierarchy Cache Main Memory Virtual Memory File System Tertiary Storage Programs DBMS Capacity & Cost Secondary Storage.
Secondary Storage CSCI 444/544 Operating Systems Fall 2008.
1 CS222: Principles of Database Management Fall 2010 Professor Chen Li Department of Computer Science University of California, Irvine Notes 01.
1 COMP 206: Computer Architecture and Implementation Montek Singh Wed., Nov. 20, 2002 Topic: Storage Systems (Disk Technology)
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
Data Storage Technology
Solid-State Drive Ding Ruogu Kong Liang. A solid-state drive (SSD) is a data storage device that uses solid-state memory to store persistent data.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Secondary Storage Unit 013: Systems Architecture Workbook: Secondary Storage 1G.
Storage System: RAID Questions answered in this lecture: What is RAID? How does one trade-off between: performance, capacity, and reliability? What is.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 5 – Storage Organization.
Module 3 - Storage MIS5122: Enterprise Architecture for IT Auditors.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Computer Hardware. “ Copyright 2005 John Wiley & Sons Inc.” TG 12 Chapter Outline Introduction The Central Processing Unit Computer Memory Evolution of.
COMP25212 ARRAY OF DISKS Sergio Davies Feb/Mar 2014COMP25212 – Storage 2.
Eng. Mohammed Timraz Electronics & Communication Engineer University of Palestine Faculty of Engineering and Urban planning Software Engineering Department.
Memory/Storage Architecture Lab Computer Architecture Lecture Storage and Other I/O Topics.
Lecture 4 1 Reliability vs Availability Reliability: Is anything broken? Availability: Is the system still available to the user?
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
1 CHAPTER 2 COMPUTER HARDWARE. 2 The Significance of Hardware  Pace of hardware development is extremely fast. Keeping up requires a basic understanding.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Chapter 7: Storage Systems Introduction Magnetic disks Buses RAID: Redundant Arrays of Inexpensive Disks.
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
Parity Logging O vercoming the Small Write Problem in Redundant Disk Arrays Daniel Stodolsky Garth Gibson Mark Holland.
RAID REDUNDANT ARRAY OF INEXPENSIVE DISKS. Why RAID?
I/O 1 Computer Organization II © McQuain Introduction I/O devices can be characterized by – Behavior: input, output, storage – Partner:
Chapter 111 Chapter 11: Hardware (Slides by Hector Garcia-Molina,
Disks Chapter 5 Thursday, April 5, Today’s Schedule Input/Output – Disks (Chapter 5.4)  Magnetic vs. Optical Disks  RAID levels and functions.
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
OSes: 11. FS Impl. 1 Operating Systems v Objectives –discuss file storage and access on secondary storage (a hard disk) Certificate Program in Software.
Chapter 2 part 2. Computer Processing Speeds Milliseconds - thousands of a second Microseconds - millionths of a second Nanoseconds - billionths of a.
1/14/2005Yan Huang - CSCI5330 Database Implementation – Storage and File Structure Storage and File Structure.
2.1 Operating System Concepts Chapter 2: Computer-System Structures Computer System Operation Storage Structure Storage Hierarchy Hardware Protection General.
I/O Computer Organization II 1 Introduction I/O devices can be characterized by – Behavior: input, output, storage – Partner: human or machine – Data rate:
+ CS 325: CS Hardware and Software Organization and Architecture Memory Organization.
Price Performance Metrics CS3353. CPU Price Performance Ratio Given – Average of 6 clock cycles per instruction – Clock rating for the cpu – Number of.
Computer Organization & Programming
By : Reem Hasayen. A storage device is a hardware device capable of storing information. There are two types of storage devices used in computers 1. Primary.
CS 6290 I/O and Storage Milos Prvulovic. Storage Systems I/O performance (bandwidth, latency) –Bandwidth improving, but not as fast as CPU –Latency improving.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
1 Lecture 23: Storage Systems Topics: disk access, bus design, evaluation metrics, RAID (Sections )
Storage Systems CSE 598d, Spring 2007 Lecture ?: Rules of thumb in data engineering Paper by Jim Gray and Prashant Shenoy Feb 15, 2007.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
COSC 6340: Disks 1 Disks and Files DBMS stores information on (“hard”) disks. This has major implications for DBMS design! » READ: transfer data from disk.
Lecture Topics: 11/22 HW 7 File systems –block allocation Unix and NT –disk scheduling –file caches –RAID.
W4118 Operating Systems Instructor: Junfeng Yang.
CSCE 385: Computer Architecture Spring 2014 Dr. Mike Turi I/O.
File organization Secondary Storage Devices Lec#7 Presenter: Dr Emad Nabil.
TYPES OF MEMORY.
A Case for Redundant Arrays of Inexpensive Disks (RAID) -1988
A Case for Redundant Arrays of Inexpensive Disks (RAID)
Chapter 2: Computer-System Structures
Module: Storage Systems
Database Management Systems (CS 564)
Vladimir Stojanovic & Nicholas Weaver
Part V Memory System Design
Introduction I/O devices can be characterized by I/O bus connections
Lecture 13 I/O.
CSC3050 – Computer Architecture
Presentation transcript:

Abstract Increases in CPU and memory will be wasted if not matched by similar performance in I/O SLED vs. RAID 5 levels of RAID and respective cost/performance analysis

Background – CPU Performance Unprecedented CPU growth –Gordon Bell: 40% per year between ’74 – ’84 –Bill Joy: Millions of Instructions Per Second = MIPS = 2 Year – 1984 –Mainframes and supercomputers don’t share same growth rate Multiprocessors used to cope with performance

Background – Memory Gene Amdahl: each CPU instruction per second requires one byte of main memory –If system costs are not dominated by the cost of memory the memory chip capacity should grow at same rate Gordon Moore: transistors/chip = 2 Year – 1964 –RAM has quadrupled every 2-3 years Ratio of MB RAM to MIPS has increased in recent years due to dropping memory prices

Background – I/O Primary measure of magnetic disk technology is max number of bits that can be stored per square inch –MAD: maximal areal density –MAD = 10 (Year-1971)/10 Doubled in capacity & halved in price every 3 years

Background – I/O con’t Capacity not the only measure –Performance Main memory increases have kept pace with CPU for 2 reasons –Caches –Speed increases SLED has only increased marginally –Held back because they are mechanical –Seek and rotation delays –Greater density solved only some of problem

Background – I/O con’t Larger main memories and solid state disks as buffers –Problem: volatility High rate of random request for small chunks of data (transaction processing) Low rate of requests for large chunks of data (large simulations on supercomputers)

…the Problem What is the impact of improving the performance of some parts while leaving others the same? Amdahl’s Law: s = 1 _ (1-f) + f/k where: s = effective setup f = fraction of work in faster mode k = speedup while in faster mode

…the Problem con’t Example: –Current applications spend 10% of time in I/O –Using Amdahl’s Law When computers are 10x faster total system speedup will only be 5x When computers are 100x faster, total system speedup will be a mere 10x faster –90% wasted total speedup due to I/O While software & buffering will help in the near term, a solution is needed to avoid crisis

…the Problem con’t Questions: –How can we increase performance of secondary storage and/or disk I/O? –Can a more cost effective solution be found?

…the Solution Arrays of inexpensive disks –PC disks: lower cost and performance –Versus SLED I/Os per second of an inexpensive disk is within a factor of 2 Cheaper per MB, less power consumption –Contains full track buffers and most functions of traditional mainframe controller –Small Computer System Interface (SCSI)

Reliability Unreliable nature of disks forces constant backup MTTF of disk array = MTTF of a single disk Number of disks in array Example: MTTF for 100 CP-3100 disks is 30,000/100 or 300 hours which is less than 2 weeks! Compare to the IBM 3380 which has a MTTF > 30,000 hours Without fault tolerance large arrays of disks are too unreliable to be a viable solution

Better Solution Redundant Arrays of Independent Disks (RAID) Things to consider –Reliability –Overhead cost –Usable storage percentage –Performance

Reliability Regarding reliability –Check disks with redundant information –Replacement of disk occurs in short time (MTTR) Obsolescence –Extremely high MTTF is ‘overkill’ –Are you still using a 20 year old disk?

Overhead Cost Number of extra check disks

Usable Storage Percentage Percentage of total disk space allocated to actual data Another way of viewing costs of overhead

Performance Various applications have various uses for the I/O –Supercomputers: number of reads/writes per second on large chunks of data –Transaction processing: number of individual read/writes per second Also use read-modify-write access

Advantages Full data redundancy (reliability) Much lower cost compared to SLEDs Data rate I/O rate Modular growth potential (scalability) Lower power consumption Availability (after failure)

Disadvantages Capacity Quantity of independent disks to achieve size/performance needs Large number of connections/cables

My Opinion Viable solution to problem assuming other issues are resolved –Actual MTTF rates for arrays –Connectivity to 100 or even 1000 disks Low cost allows entry into consumer market Similar solution to dual-core CPUs –Can’t go any faster so just use 2 of them –Growing trend in computer industry

Problem Solved? Increased performance over current secondary storage solutions? –YES More Cost effective than current standard secondary storage solutions? –YES

The End!!! Questions? Comments? Discussion…