Data Management for Decision Support Session-5 Prof. Bharat Bhasker.

Slides:



Advertisements
Similar presentations
Disk Storage, Basic File Structures, and Hashing
Advertisements

RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
Chapter 6 External Memory Disk and RAID (Redundant Arrays of Independent Disks) CS-147 Fall 2010 Jonathan Wang.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
Chapter 5: Server Hardware and Availability. Hardware Reliability and LAN The more reliable a component, the more expensive it is. Server hardware is.
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
Database Implementation Issues CPSC 315 – Programming Studio Spring 2008 Project 1, Lecture 5 Slides adapted from those used by Jennifer Welch.
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
File Organizations and Indexes ISYS 464. Disk Devices Disk drive: Read/write head and access arm. Single-sided, double-sided, disk pack Track, sector,
METU Department of Computer Eng Ceng 302 Introduction to DBMS Disk Storage, Basic File Structures, and Hashing by Pinar Senkul resources: mostly froom.
Efficient Storage and Retrieval of Data
Physical Database Design File Organizations and Indexes ISYS 464.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Chapter 13 Disk Storage, Basic File Structures, and Hashing.
DISK STORAGE INDEX STRUCTURES FOR FILES Lecture 12.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
12.1 Silberschatz, Galvin and Gagne ©2009 Operating System Concepts with Java – 8 th Edition Chapter 12: Mass-Storage Systems.
By : Nabeel Ahmed Superior University Grw Campus.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
RAID Shuli Han COSC 573 Presentation.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
Redundant Array of Independent Disks
L/O/G/O External Memory Chapter 3 (C) CS.216 Computer Architecture and Organization.
File Organizations and Indexes ISYS 464. Disk Devices Disk drive: Read/write head and access arm. Single-sided, double-sided, disk pack Track, sector,
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 17 Disk Storage, Basic File Structures, and Hashing.
Disk Storage, Basic File Structures, and Hashing
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
Basic File Structures and Hashing Lectured by, Jesmin Akhter, Assistant professor, IIT, JU.
Physical Database Design File Organizations and Indexes ISYS 464.
I/O Management and Disk Structure Introduction to Operating Systems: Module 14.
1 Overview of Database Design Process. Data Storage, Indexing Structures for Files 2.
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
Chapter- 14- Index structures for files
Chapter Ten. Storage Categories Storage medium is required to store information/data Primary memory can be accessed by the CPU directly Fast, expensive.
File Structures. 2 Chapter - Objectives Disk Storage Devices Files of Records Operations on Files Unordered Files Ordered Files Hashed Files Dynamic and.
The concept of RAID in Databases By Junaid Ali Siddiqui.
RAID Disk Arrays Hank Levy. 212/5/2015 Basic Problems Disks are improving, but much less fast than CPUs We can use multiple disks for improving performance.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Chapter 13 Disk Storage, Basic File Structures, and Hashing.
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
Storage and File structure COP 4720 Lecture 20 Lecture Notes.
Lec 5 part2 Disk Storage, Basic File Structures, and Hashing.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Chapter 5 Record Storage and Primary File Organizations
Announcements Today –RAID –Begin Indexes Program 1 due Friday –Office Hours today 2-3 pm –I’ll have limited contact over the weekend –later today.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 10: Mass-Storage Systems.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
CS Introduction to Operating Systems
Managing Multi-User Databases
Oracle SQL*Loader
9/12/2018.
RAID RAID Mukesh N Tekwani
Chapters 17 & 18 6e, 13 & 14 5e: Design/Storage/Index
Database Implementation Issues
Disk storage Index structures for files
CSE 451: Operating Systems Spring 2005 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
UNIT IV RAID.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Autumn 2004 Redundant Arrays of Inexpensive Disks (RAID) Hank Levy 1.
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
RAID RAID Mukesh N Tekwani April 23, 2019
CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
Database Implementation Issues
Presentation transcript:

Data Management for Decision Support Session-5 Prof. Bharat Bhasker

Server Hardware Architecture Disk Technology -- RAID The DISK (i.e. I/O ) speed has not kept pace with the CPU speed I/O throughput is the weakest link in the chain Greatest Possibility of Failure => loss of data What is required? A Robust reliable, possibly failsafe storage mechanism Devices with better I/O throughput

Server Hardware Architecture Disk Technology -- RAID Redundant Array of Independent Disks –Cheap (Small) disks can be combined to offer large storage –Plug and Play –Hot Swappable –Reliability and Availability –Disk Block Access = Seek Time + Block Transfer Time

Server Hardware Architecture Disk Technology -- RAID RAID- I Disk Mirroring/ Shadowing Based on VMS shadowing - uses two disks in place of one Both disk contain exact same copy of the data It’s a constant backup/shadow/mirror require twice the disk drive VMS model has common failure point RAID-1 has independent drive/controller/power

Server Hardware Architecture Disk Technology -- RAID RAID- 3 Data Stripping for fault tolerance Doesn’t require twice the disk for backup/mirroring Based on Parity drive I.e. one extra drive for recreating the data Assume five drives for data then RAID-3 needs 6 drives Stripping done at byte/bit level ? 42

Server Hardware Architecture Disk Technology -- RAID RAID- 4 Data Stripping for fault tolerance Stripping done at Block level Better performance Assume five drives for data then RAID-3 needs 6 drives Parallel Reads from Multiple heads ? 42

Server Hardware Architecture Disk Technology -- RAID RAID- 5 Data Stripping for fault tolerance Stripping done at Block/record segments level but parity is rotated In RAID 3/4 all drives used for reading/writing RAID 5 ability to read as many drives as it needs at the same time for different individual read/write requests

Data Organizations Operations on organized data –Find (Locate) –Read (Get) –FindNext –Delete –Insert –Modify –Findall –Find Ordered

Data Organizations Unordered File Organization Find - Average b/2 O(b) Read - O(1) Insert (1) Modify O(b) Delete O(b) A v b x c d w e

Data Organizations Ordered File Organization Find - O(log b) Read - O(1) Insert O(b) Modify O(log b) Delete O(log b) A b c d f t u v

Data Organizations Primary Index an ordered file with fixed record length and two fields- key field and block pointer field. Primary index is built on ordering key field. A b c d t A f j t f j

Data Organizations Assume 30,000 records Blocksize =1024 bytes and R =100 bytes Each block can store 1024/ records. Total block b = 3000 In Ordered files log (3000) = 12 block accesses Ordering Key =9 bytes and Block pointer 6 bytes Primary Index R = 15 bytes records per block 1024/15 = 68 Blocks required to hold 3000 entries 3000/68 = 45 blocks log2 (45) = 6 block accesses + 1 for data block

Data Organizations Clustering Index an ordered file with fixed record length and two fields- key field and block pointer field. Primary index is built on file ordered on a non-key field. A A A d t A d j t d d j j

Data Organizations Secondary Index an ordered file with fixed record length and two fields- non ordering field and block pointer field. Secondary index is built on non ordering field (dense). A b c d t A b c d x e e t

Data Organizations Assume 30,000 records Blocksize =1024 bytes and R =100 bytes Each block can store 1024/ records. Total block b = 3000 In Unordered files 3000/2 = 1500 block accesses Ordering Key =9 bytes and Block pointer 6 bytes Primary Index R = 15 bytes records per block 1024/15 = 68 Each record requires an entry Blocks required to hold entries 30000/68 = 442 blocks log2 (442) = 9 block accesses + 1 for data block = 10 Block accesses

Data Organizations Multi Data Pointers for Duplicate handling Multi Level By creating a primary index on top of the base level secondary index 442 blocks of ordered data can be addressed by primary key mechanism of 68 entries per block 442/68 = 7 log (7) = 3 for locating the block in the secondary level + 1 for secondary level + 1 for data = 5 block accesses