File Processing : Storage Media 2015, Spring Pusan National University Ki-Joune Li.

Slides:



Advertisements
Similar presentations
Professor Michael J. Losacco CIS 1110 – Using Computers Storage Chapter 6.
Advertisements

CpSc 3220 File and Database Processing Lecture 1 Course Overview File Storage Basics.
CS4432: Database Systems II Data Storage - Lecture 2 (Sections 13.1 – 13.3) Elke A. Rundensteiner.
Data Storage Lecture 3 CSCI 1405, CSCI 1301 Introduction to Computer Science Fall 2009.
McGraw-Hill/Irwin Copyright © 2008 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Secondary Storage.
1 Classification of Physical storage Media Speed with which data can be accessed Cost per unit of data Reliability  data loss on power failure or system.
CS4432: Database Systems II Lecture 2 Timothy Sutherland.
Secondary Storage Chapter Distinguish between primary and secondary storage Compare internal and external hard disks Describe ways to improve hard-disk.
1 CS222: Principles of Database Management Fall 2010 Professor Chen Li Department of Computer Science University of California, Irvine Notes 01.
©Silberschatz, Korth and Sudarshan11.1Database System Concepts Chapter 11: Storage and File Structure Overview of Physical Storage Media Magnetic Disks.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
Data Storage Technology
Chapter 8 Secondary Storage.
3.1 Storage. Overview Compare the following storage devices in terms of storage capacity, cost, speed of data retrieval and suitability for specific purposes:
What is Data Storage? ‘Storing’ data, we mean putting the data in a known place. ‘ Writing ’ data or ‘ saving ’ data are other ways of saying ‘storing’
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Storing Data: Disks & Files
STORAGE DEVICES AND MEDIA BY: T.MINH. Backing Up Data Refers to the copying of files and data to different medium in case of problem with the main storage.
Understanding Computers
McGraw-Hill/Irwin Copyright © 2008 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 7 Secondary Storage.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
1 Database Systems Storage Media Asma Ahmad 21 st Apr, 11.
Lecture 11: DMBS Internals
Chapter 4: Business Hardware Oz (5 th edition). Storage Media Data an programs must be stored on nonvolatile medium –Data is retained even when not powered.
Physical Storage and File Organization COMSATS INSTITUTE OF INFORMATION TECHNOLOGY, VEHARI.
Lecture 8 of Advanced Databases Storage and File Structure Instructor: Mr.Ahmed Al Astal.
Chapter 10 Storage & File Structure. n Overview of Physical Storage Media n Magnetic Disks n Tertiary Storage n Storage Access n File Organization n Organization.
SCUHolliday13–1 Schedule Today: u Data Storage and Indexing u Read Sections Next u Query Evaluation.
INFO1 – Practical problem solving in the digital world
Computer Organisation 1 Secondary Storage Sébastien Piccand
1 Secondary Storage Management Submitted by: Sathya Anandan(ID:123)
Storing Data: Electronic Filing Cabinets What You Will Learn Difference between memory and storage How storage media are categorized Measuring a storage.
Secondary Storage Chapter 8 Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. 8-1.
AS Computing Storage devices. Primary storage  A computer’s main memory (RAM) is known as primary memory.  Primary memory is volatile; when the computer.
1 Storage and File Structure. 2 Classification of Physical Storage Media Speed with which data can be accessed Cost per unit of data Reliability  data.
TheTeacher Computing Data Storage Computing. TheTeacher Computing Primary Storage One of the fundamental properties of a computer is that it can store.
Overview of Physical Storage Media
1/14/2005Yan Huang - CSCI5330 Database Implementation – Storage and File Structure Storage and File Structure.
SCUHolliday13–1 Schedule Today: u Data Storage and Indexing u Read Sections Next u Query Evaluation.
Storing data – Storage devices and media. What is a storage device?  A storage device is any device used in a computer to store information.  A storage.
SECONDARY STORAGE DEVICES. Agenda of Today’s Lecture  Introduction to Hardware  Types of hardware devices  Storage Devices  Secondary Storage devices.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Lecture 9: Storage and File Structure Overview of Physical Storage Media Magnetic Disks.
11.1Database System Concepts. 11.2Database System Concepts Now Something Different 1st part of the course: Application Oriented 2nd part of the course:
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
CS4432: Database Systems II Data Storage 1. Storage in DBMSs DBMSs manage large amounts of data How does a DBMS store and manage large amounts of data?
©Silberschatz, Korth and Sudarshan11.1Database System Concepts Chapter 11: Storage and File Structure Overview of Physical Storage Media Magnetic Disks.
Em Spatiotemporal Database Laboratory Pusan National University File Processing : Storage Media 2004, Spring Pusan National University Ki-Joune Li.
CS 101 – Sept. 28 Main vs. secondary memory Examples of secondary storage –Disk (direct access) Various types Disk geometry –Flash memory (random access)
Storage devices 1. Storage Storage device : stores data and programs permanently its retained after the power is turned off. The most common type of storage.
Section 13.1 – Secondary storage management (Former Student’s Note)
Memory The term memory is referred to computer’s main memory, or RAM (Random Access Memory). RAM is the location where data and programs are stored (temporarily),
DMBS Internals I February 24 th, What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
DMBS Architecture May 15 th, Generic Architecture Query compiler/optimizer Execution engine Index/record mgr. Buffer manager Storage manager storage.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
Main Memory Main memory – –a collection of storage locations, –each with a unique identifier called the address. Word- –Data are transferred to and from.
1 Lecture 16: Data Storage Wednesday, November 6, 2006.
Copyright © 2003 Prentice Hall, Inc 1 Chapter 4 Storing Data: Electronic Filing Cabinets NEXT SLIDE.
Data Storage and Querying in Various Storage Devices.
Storage Overview of Physical Storage Media Magnetic Disks RAID
Storage and Disks.
Lecture 16: Data Storage Wednesday, November 6, 2006.
Database Management Systems (CS 564)
Introduction to Database
Disks and Files DBMS stores information on (“hard”) disks.
File Processing : Storage Media
Lecture 11: DMBS Internals
File Processing : Storage Media
Storage and File Structure
Section 13.1 – Secondary storage management (Former Student’s Note)
Presentation transcript:

File Processing : Storage Media 2015, Spring Pusan National University Ki-Joune Li

STEMPNU Major Functions of Computer Computation Storage Communication Presentation

STEMPNU Storage of Data Major Challenges  How to store and manage a large amount of data Example : more than 100 peta bytes for EOS Project  How to represent sophisticated data

STEMPNU Modeling and Representation of Real World Example  Building DB about Korean History  Very complicated and Depending on viewpoint Database Course : 2010 Fall semester Real World Computer World

STEMPNU Managing Large Volume of Data Large Volume of Data  Cost for Storage Media Not very important and negligible  Processing Time Comparison between main memory and disk access time  RAM : several nanoseconds (10 -9 sec)  Disk : several milliseconds (10 -3 sec) Time is the most valuable resource Example  Retrieving a piece of data from 100 peta bytes DB

STEMPNU Managing Large Volume of Data Management of Data  Secure Management From hacking From any kinds of disasters  Consistency of Data Example  Failure during a flight reservation transaction  Concurrent transaction

STEMPNU Goals of File Systems To provide with 1. efficient Data Structures for storing large and complex data 2. Access Methods for rapid search 3. Query Processing Methods 4. Robust Management of Transactions

STEMPNU Memory Hierarchy Large Data Volume  Not be stored in main memory  But in secondary memory Memory Hierarchy Cache Memory 256 K bytes Main Memory 1G bytes Secondary Memory 100 G bytes Tertiary Memory 100 Tera bytes Faster Cheaper

STEMPNU Flash Memory Non-Volatile  Data survives power failure, but  Data can be written at a location only once, but location can be erased and written to again Can support only a limited number of write/erase cycles. Erasing of memory has to be done to an entire bank of memory Speed  Reads are roughly as fast as main memory  But writes are slow (few microseconds), erase is slower Cost per unit of storage roughly similar to main memory Widely used in embedded devices such as digital cameras

STEMPNU Optical Storage Non-volatile :  data is read optically from a spinning disk using a laser  CD-ROM (800 MB), DVD (4.7 to 17 GB), CD-R, DVD-R  CD-RW, DVD-RW, and DVD-RAM Speed  Reads and writes are slower than with magnetic disk Juke-box systems  Large numbers of removable disks,  Few drives, and  Mechanism for automatic loading/unloading of disks  For storing large volumes of data

STEMPNU Tape Non-volatile  Primarily Used for backup Speed  Sequential access : much slower than disk Cost  Very high capacity (40 to 300 GB tapes available)  Tape can be removed from drive  Drives are expensive Tape jukeboxes  hundreds of terabytes to even a petabyte

STEMPNU Data Access with Secondary Memory Main Memory Access Request Get Data If in main memory Disk If not in main memory Access to Disk Load on main memory Get Data Hit Ratio r h = n h / n a How to increase hit ratio ?

STEMPNU Why Hit Ratio is so important ? Example for(int i=0;i<1000;i++) Nbytes=read(fd,buf,100); 1000 disk accesses ? 1000 * sec = 10 sec1000 * sec = sec when r h = 0 when r h = 1

STEMPNU Physical Structure of Disk 512 bytes 200~400 sectors 2 * n DF

STEMPNU Disk Access Time t = t S + t R + t T, where  t S : Seek Time Time to reposition the head over the correct track Average seek time is 1/2 the worst case seek time 4 to 10 milliseconds on typical disks  t R : Rotational Latency Time to reposition the head over the correct sector Average rotational latency : ½ r (to find index point) + ½ r = r In case of rpm : r =1*60sec/15000 = 4 msec  t T : Transfer Time Time to transfer data from disk to main memory via channel Proportional to the number of sectors to read Real transfer time is negligible

STEMPNU Block-Oriented Disk Access Example for(int i=0;i<1000;i++) Nbytes=read(fd,buf,10); 1000 times 10 bytes Buffer in main memory 1024 bytes 10 times 100 times 1 block (e.g bytes) Number of Disk Accesses

STEMPNU Disk Block Unit of Disk Access Block Size  Normally multiple of sectors  1K, 4K, 16K or 64K bytes depending on configuration Why not large block ?  Limited by the size of available main memory  Too large : unnecessary accesses of sectors e.g. only 100 bytes, when block size is given as 64K  1 block : 128 sectors (about ½ track, ½ rotation, 2 msec)  Too wasteful