CMPE Database Systems Workshop June 16 Class Meeting

Slides:



Advertisements
Similar presentations
A Case for Redundant Arrays Of Inexpensive Disks Paper By David A Patterson Garth Gibson Randy H Katz University of California Berkeley.
Advertisements

Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.
1 Jason Drown Mark Rodden (Redundant Array of Inexpensive Disks) RAID.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
RAID Redundant Array of Independent Disks
CSCE430/830 Computer Architecture
 RAID stands for Redundant Array of Independent Disks  A system of arranging multiple disks for redundancy (or performance)  Term first coined in 1987.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
2P13 Week 11. A+ Guide to Managing and Maintaining your PC, 6e2 RAID Controllers Redundant Array of Independent (or Inexpensive) Disks Level 0 -- Striped.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
CS 149: Operating Systems February 24 Class Meeting Department of Computer Science San Jose State University Spring 2015 Instructor: Ron Mak
1 Recap (RAID and Storage Architectures). 2 RAID To increase the availability and the performance (bandwidth) of a storage system, instead of a single.
1 Storage (cont’d) Disk scheduling Reducing seek time (cont’d) Reducing rotational latency RAIDs.
©Brooks/Cole, 2003 Chapter 5 Computer Organization.
High Performance Computing Course Notes High Performance Storage.
I/O Systems and Storage Systems May 22, 2000 Instructor: Gary Kimura.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Comp 1001: IT & Architecture - Joe Carthy 1 Information Representation: Summary All Information is stored and transmitted in digital form in a computer.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
RAID Shuli Han COSC 573 Presentation.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
CS 149: Operating Systems February 26 Class Meeting Department of Computer Science San Jose State University Spring 2015 Instructor: Ron Mak
CS 146: Data Structures and Algorithms June 23 Class Meeting Department of Computer Science San Jose State University Summer 2015 Instructor: Ron Mak
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
CS 149: Operating Systems March 3 Class Meeting Department of Computer Science San Jose State University Spring 2015 Instructor: Ron Mak
+ CS 325: CS Hardware and Software Organization and Architecture Memory Organization.
The concept of RAID in Databases By Junaid Ali Siddiqui.
Introduction: Memory Management 2 Ideally programmers want memory that is large fast non volatile Memory hierarchy small amount of fast, expensive memory.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 4 Computer Systems Review.
Memory The term memory is referred to computer’s main memory, or RAM (Random Access Memory). RAM is the location where data and programs are stored (temporarily),
Cloud Computing Vs RAID Group 21 Fangfei Li John Soh Course: CSCI4707.
W4118 Operating Systems Instructor: Junfeng Yang.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
CSE 451: Operating Systems Spring 2010 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center 534.
CS Introduction to Operating Systems
RAID.
A Case for Redundant Arrays of Inexpensive Disks (RAID) -1988
A Case for Redundant Arrays of Inexpensive Disks (RAID)
Chapter 2: Computer-System Structures
RAID Non-Redundant (RAID Level 0) has the lowest cost of any RAID
Dave Eckhardt Disk Arrays Dave Eckhardt
Vladimir Stojanovic & Nicholas Weaver
Database Management System
CSE 451: Operating Systems Spring 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani
ICOM 6005 – Database Management Systems Design
RAID Disk Arrays Hank Levy 1.
CSE 451: Operating Systems Spring 2005 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
TECHNICAL SEMINAR PRESENTATION
UNIT IV RAID.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
CSE 451: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani April 23, 2019
CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB
CSE 451: Operating Systems Winter 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
Presentation transcript:

CMPE 180-38 Database Systems Workshop June 16 Class Meeting Department of Computer Engineering San Jose State University Summer 2017 Instructor: Ron Mak www.cs.sjsu.edu/~mak

Computer System Architecture A major disparity exists between CPU speed and memory speed. CPU technology is advancing faster than memory technology. The CPU can process data faster than it can be fetched from compatibly priced main memory units. Main memory is the bottleneck in terms of system speed and cost. Systems need to include very fast and expensive cache memory.

Memory Hierarchy Small amount of very fast, expensive, volatile cache memory. volatile: contents disappear when the power goes off Gigabytes of medium-speed, medium-price, volatile main memory. RAM: random access memory Gigabytes or terabytes of slow, cheap, non-volatile disk storage.

Memory/Storage Speed Comparisons Suppose your computer ran at human speeds. 1 CPU cycle: 1 second Then the time to retrieve one byte from: SRAM 5 seconds DRAM 2 minutes Flash 1 day Hard drive 2 months Tape 1,000 years

B-Trees A B-tree is a tree data structure suitable for disk drives. It may take up to 11 ms to access data on disk. Today’s modern CPUs can execute billions of instructions per second. Therefore, it’s worth spending a few CPU cycles to reduce the number of disk accesses. B-trees are often used to implement databases.

B-Trees A B-tree is an m-ary tree. Mark Allen Weiss Data Structures and Algorithms in Java (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-257627-9

B-Trees A B-tree of order 5 for a disk drive: Mark Allen Weiss Data Structures and Algorithms in Java (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-257627-9

B-Tree Insertion Mark Allen Weiss Data Structures and Algorithms in Java (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-257627-9

B-Tree Insertion Mark Allen Weiss Data Structures and Algorithms in Java (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-257627-9

B-Tree Insertion Mark Allen Weiss Data Structures and Algorithms in Java (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-257627-9

B-Tree Deletion Mark Allen Weiss Data Structures and Algorithms in Java (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-257627-9

Price per Megabyte of DRAM Operating Systems Concepts, 9th edition Silberschatz, Galvin, and Gagne (c) 2013 John Wiley & Sons. All rights reserved. 978-1-118-06333-0

Price per Megabyte of Magnetic Hard Disk Operating Systems Concepts, 9th edition Silberschatz, Galvin, and Gagne (c) 2013 John Wiley & Sons. All rights reserved. 978-1-118-06333-0

RAID II Prototype (FrankenRAID) Computer History Museum Redundant Array of Independent Disks Initially, Redundant Array of Inexpensive Disks Term was coined in 1987 by David Patterson, Randy Katz and Garth A. Gibson. RAID II Prototype (FrankenRAID) Computer History Museum

RAID Redundancy RAID increases reliability by adding redundancy. Mirroring Maintain multiple copies of data. Striping Spread data blocks of a file across multiple disks. Error correcting codes (ECC) Use parity bits and checksums. Recover lost data.

RAID Levels Different levels of RAID have different levels of reliability and performance costs. RAID levels 0 through 6 RAID level 1+0 or 10 Example: 4 disks of data C: copy of the data P: error-correcting bits Operating Systems Concepts, 9th edition Silberschatz, Galvin, and Gagne (c) 2013 John Wiley & Sons. All rights reserved. 978-1-118-06333-0

RAID 0 Striped data blocks No mirroring Best performance No fault tolerance http://searchstorage.techtarget.com/definition/RAID

RAID 1 Disk mirroring No striping Improved read performance http://searchstorage.techtarget.com/definition/RAID

RAID 2 Striped data blocks Parity information Superseded by RAID 3 http://searchstorage.techtarget.com/definition/RAID

RAID 3 Cannot overlap I/O. An I/O operation addresses all the drives at the same time. Best for single-user systems with long record applications. Striped data blocks One drive stores parity information. http://searchstorage.techtarget.com/definition/RAID

RAID 4 All writes have to update the parity drive. Large stripes No overlapped writes.  Large stripes Read entire records from any single drive. Overlapped reads. http://searchstorage.techtarget.com/definition/RAID

RAID 5 Poor write performance. Block-level striping Striped parity Array can function even if one drive fails. http://searchstorage.techtarget.com/definition/RAID

RAID 6 Continues to function even if two disks fail simultaneously. Higher cost per gigabyte and often has slower write performance than RAID 5. Similar to RAID 5. Includes a second parity scheme that is distributed across the drives in the array. http://searchstorage.techtarget.com/definition/RAID

RAID 10 (1+0) The data is mirrored and the mirrors are striped. Higher performance than RAID 1, but at a much higher cost. Combines RAID 1 and RAID 0. http://searchstorage.techtarget.com/definition/RAID

RAID Tutorials http://www.thegeekstuff.com/2010/08/raid-levels-tutorial/ http://searchstorage.techtarget.com/definition/RAID

Solid-State Drives (SSDs) No moving parts. Fail less often than hard drives. Lessens the need for RAID. Use wear leveling instead for data protection. Each block can tolerate a finite number of program/erase cycles before becoming unreliable. Distribute reads and writes evenly among all the blocks of the device.