IT 21003 Database Administration Section 07. Space Management Managing Space: An Introduction  Organizing database storage is a major responsibility.

Slides:



Advertisements
Similar presentations
Introduction to Database Systems1 Records and Files Storage Technology: Topic 3.
Advertisements

©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part C Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Quick Review of Apr 10 material B+-Tree File Organization –similar to B+-tree index –leaf nodes store records, not pointers to records stored in an original.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree.
Oracle Architecture. Instances and Databases (1/2)
School of Computing and Management Sciences © Sheffield Hallam University The slowest task in any record retrieval is reading from disk (It can be 500.
Primer on Structure& Storage Analysis Primer on Structure & Storage Analysis This presentation is supposed to give a simple and brief overview for storage.
Basic Storage Concepts and Settings
1 - Oracle Server Architecture Overview
METU Department of Computer Eng Ceng 302 Introduction to DBMS Disk Storage, Basic File Structures, and Hashing by Pinar Senkul resources: mostly froom.
Harvard University Oracle Database Administration Session 2 System Level.
Harvard University Oracle Database Administration Session 5 Data Storage.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Chapter 13 Disk Storage, Basic File Structures, and Hashing.
Computer Organization and Architecture
1.1 CAS CS 460/660 Introduction to Database Systems File Organization Slides from UC Berkeley.
Oracle Database Administration Database files Logical database structures.
IT Database Administration Section 03. Tablespaces and the DBA  Important for DBAs – logical units of database storage Made up of physical operating-system.
CHP - 9 File Structures. INTRODUCTION In some of the previous chapters, we have discussed representations of and operations on data structures. These.
9/11/2015ISYS366 - Week051 ISYS366 – Week 5-6 Database Tuning - User and Rollback Data Spaces, Recovery, Backup.
Database Administration TableSpace & Data File Management
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 17 Disk Storage, Basic File Structures, and Hashing.
Oracle9i Database Administrator: Implementation and Administration 1 Chapter 9 Index Management.
7202ICT Database Administration Lecture 7 Managing Database Storage Part 2 Orale Concept Manuel Chapter 3 & 4.
Chapter 5 Operating System Support. Outline Operating system - Objective and function - types of OS Scheduling - Long term scheduling - Medium term scheduling.
Extents, segments and blocks in detail. Database structure Database Table spaces Segment Extent Oracle block O/S block Data file logical physical.
9 Storage Structure and Relationships. 9-2 Objectives Listing the different segment types and their uses Controlling the use of extents by segments Stating.
Oracle9i Database Administrator: Implementation and Administration 1 Chapter 7 Basic Table Management.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Cleansing Ola Ekdahl IT Mentors 9/12/08.
Sizing Basics  Why Size?  When to size  Sizing issues:  Bits and Bytes  Blocks (aka pages) of Data  Different Data types  Row Size  Table Sizing.
Database structure and space Management. Database Structure An ORACLE database has both a physical and logical structure. By separating physical and logical.
Oracle 10g Database Administrator: Implementation and Administration Chapter 7 Basic Table Management.
Database structure and space Management. Segments The level of logical database storage above an extent is called a segment. A segment is a set of extents.
Storage Structures. Memory Hierarchies Primary Storage –Registers –Cache memory –RAM Secondary Storage –Magnetic disks –Magnetic tape –CDROM (read-only.
File Structures. 2 Chapter - Objectives Disk Storage Devices Files of Records Operations on Files Unordered Files Ordered Files Hashed Files Dynamic and.
Computer Systems Week 14: Memory Management Amanda Oddie.
Week 3 Lecture 2 Basic Storage Concepts and Settings.
14 Copyright © 2006, Oracle. All rights reserved. Tuning Block Space Usage.
Week 4 Lecture 2 Advanced Table Management. Learning Objectives  Create tables with large object (LOB) columns and tables that are index-organized 
Harvard University Oracle Database Administration Session 6 Object Storage.
Oracle 10g Database Administrator: Implementation and Administration Chapter 5 Basic Storage Concepts and Settings.
Managing Tablespaces and Data Files
© Janice Regan, CMPT 300, May CMPT 300 Introduction to Operating Systems File systems.
File Systems cs550 Operating Systems David Monismith.
10 Managing Rollback Segments Objectives Planning the number and size of rollback segments Creating rollback segments using appropriate storage.
Informationsteknologi Wednesday, October 3, 2007Computer Systems/Operating Systems - Class 121 Today’s class Memory management Virtual memory.
11.1 Silberschatz, Galvin and Gagne ©2005 Operating System Principles 11.5 Free-Space Management Bit vector (n blocks) … 012n-1 bit[i] =  1  block[i]
Database structure and space Management. Database Structure An ORACLE database has both a physical and logical structure. By separating physical and logical.
Virtual Memory Pranav Shah CS147 - Sin Min Lee. Concept of Virtual Memory Purpose of Virtual Memory - to use hard disk as an extension of RAM. Personal.
Chapter 5 Record Storage and Primary File Organizations
COMP091 – Operating Systems 1 Memory Management. Memory Management Terms Physical address –Actual address as seen by memory unit Logical address –Address.
IT Database Administration Section 05. Rollback Segments: An Introduction  What are they?  Space within a database used to store “before” images.
Select Operation Strategies And Indexing (Chapter 8)
Memory management The main purpose of a computer system is to execute programs. These programs, together with the data they access, must be in main memory.
Chapter 2 Memory and process management
Module 11: File Structure
CHP - 9 File Structures.
Database structure and space Management
Chapter 11: File System Implementation
Database Tuning - User and Rollback Data Spaces, Recovery, Backup
Storage Structure and Relationships
Oracle 10g Database Administrator: Implementation and Administration
Lecture 10: Buffer Manager and File Organization
Disk Storage, Basic File Structures, and Buffer Management
Chapter 11: File System Implementation
Physical Database Structure .
Introduction To Oracle 10g
Storage Structure and Relationships
Managing Tables.
Chapter 11: File System Implementation
Presentation transcript:

IT Database Administration Section 07

Space Management Managing Space: An Introduction  Organizing database storage is a major responsibility for the DBA  Need to ensure tablespaces are set up with appropriate default storage definitions  Objects should be created for optimal space usage and performance  Situation must be monitored continually  Garbage collection on Oracle requires DBA assistance or intervention  DBAs need to understand how extents and blocks are used  The use of extents and blocks can be controlled by the DBA using a variety of techniques

Space Management Database Storage Relationships – E-R Diagram Tablespace Segment Datafile Extent Block LogicalPhysical Logical – Kinda Physical

Space Management Structure of Data Segments  The space occupied by a database object is called a segment  Segment types can be:  Data  Index  Rollback  Temporary  Each segment is contained within a tablespace and consists of one or more extents of contiguous data blocks

Space Management Structure of Data Segments  Tablespace: tbspc1 Segment 1 Extent1 of Seg1Extent2 of Seg1Extent3 of Seg1 Segment 2 Extent1 of Seg2Extent2 of Seg2Extent3 of Seg2 Segment 3 Extent1 of Seg3Extent2 of Seg3 Oracle Data Blocks

Space Management Specifying Storage Parameters for Data Segments  INITIAL Size of first extent Size of first extent Default is five Oracle blocks Default is five Oracle blocks  NEXT Size of the second extent to be allocated Size of the second extent to be allocated Default is five Oracle blocks Default is five Oracle blocks  PCTINCREASE  Factor by which the next extent is larger than the most recent extent to be allocated  Default is 50%

Space Management Specifying Storage Parameters for Data Segments  MINEXTENTS Minimum number of extents to be used by the segment Minimum number of extents to be used by the segment Default is 1 Default is 1  MAXEXTENTS Maximum number of extents that can be allocated Maximum number of extents that can be allocated Default depends on block size Default depends on block size Maximum value is 2,147,483,645 or approximately 2GB Maximum value is 2,147,483,645 or approximately 2GB  INITRANS  Minimum number of transactions that have space allocated within each block  Space reserved for the Interested Transaction List (ITL)  Default is 1 for Tables  Default is 2 for Indexes  Maximum is 255

Space Management Specifying Storage Parameters for Data Segments  MAXTRANS Maximum number of simultaneous transactions that can consume space in a block at any one time Maximum number of simultaneous transactions that can consume space in a block at any one time Default value is 255 Default value is 255 Maximum is 255 Maximum is 255  FREELISTS  Number of segment freelists  Default value is 1

Space Management Sizing Extents  Tablespaces are made up of datafiles The storage within a datafile is regarded as contiguous The storage within a datafile is regarded as contiguous Datafiles need not be contiguous with each other Datafiles need not be contiguous with each other  A possible scenario  Tablespace size is 40MB  Made up of three datafiles, 20MB, 10MB, 10MB  Size of largest possible extent is 20MB, not 40MB, because extents cannot span datafiles

Space Management Sizing Extents  Storage structures are allowed unlimited numbers of extents  Not allowed for objects within the SYSTEM tablespace  Limited number of extents based on DB_BLOCK_SIZE

Space Management Sizing Extents  To size extents we need the following information: How will the table be used? How will the table be used? How will it grow (insert activity)? How will it grow (insert activity)? Volatility? Volatility? Types of queries? Types of queries?  Reasons for allowing tables to occupy multiple extents  Access using an index  Involves a direct lookup on a table, no need for blocks to be contiguous  Full-Table scans in a mult-iuser environment  No real impact because disk arm will be servicing requests across all users  Full-Table scans in a single-user environment  Performance impact, but not if extents are reasonably large

Space Management Sizing Extents  Extents should be sized in multiples of  DB_BLOCK_SIZE * DB_FILE_MULTIBLOCK_READ_COUNT (DBFMRC)  Oracle brings DBFMRC blocks into memory in one fetch  Example:  DB_BLOCK_SIZE is 4KB  DBFMRC is set at the default of 8  4KB * 8 = 32KB  Extents should therefore be in multiples of 32KB

Space Management Sizing Extents  Beware of setting DBFMRC too high  32-Bit OS’s will bring in a maximum of 64KB of data in one fetch  Setting DBFMRC such that DBFMRC * DBS is greater than 64KB has no effect on performance  64-Bit OS’s can cope with higher values  Many systems run with DB_BLOCK_SIZE = 8KB or 4KB and DBFMRC = 8  4KB * 8 = 32KB  8KB * 8 = 64KB

Space Management Large Extents  Can be more difficult to manage  If a table extends, there may well be enough total free space within the tablespace for the amount of storage required  But not enough contiguous storage for the new large extent  An error is returned  Tablespace needs to be reorganized or extended  Reduce the frequency of dynamic extension Possible performance improvement because dynamic allocation of space is an expensive activity, generating a lot of recursive SQL Possible performance improvement because dynamic allocation of space is an expensive activity, generating a lot of recursive SQL  A tablespace can be dedicated (designed) for a single objects  If the object has equal-sized extents of 10MB, a sensible size for the tablespace could be 100MB plus the size of one Oracle data block per datafile  Accounts for the one-block overhead for each file header

Space Management Initial Extents  The first extent for each object has a one-block overhead, containing no data, known as the segment header block  Segment header is the first block and contains extent information for all other extents belonging to the segment (extent map)  Finite number of entries can fit in a header block  Actual number depends on block size  Tables defined with UNLIMITED EXTENTS have chained header blocks to hold the extent information  On highly concurrent systems, transactions may queue for latches on the header block

Space Management Inserting New Rows  Extents are typically allocated when an INSERT operation cannot find enough space in the existing blocks  A transaction maintains a list of free blocks it has accessed  These are blocks that have been used by the transaction and still have space available for inserts  If the new insert cannot fit in any of these blocks, the segment freelist of blocks is searched  If no blocks are available on the freelist, Oracle uses a new, unused block from the current extent; if no blocks are available, it resorts to dynamic extentsion  This means allocating a new extent

Space Management Inserting New Rows: Allocation of New Extents  The size of the new extent is determined by the current values of NEXT and PCTINCREASE  Formula is:  (1 + PCTINCREASE/100) * NEXT  Oracle searches the free-space map for an extent of free space the required size rounded up to the nearest five blocks  If an extent of the exact required size is not found, Oracle searches the free-space map for a piece of free space larger than that required  If it finds a large piece, it splits the free space as follows:  If the piece is within five blocks of the required size, the whole amount is allocated  If it is larger, the free space is split and the required amount is allocated, leaving behind a smaller piece of free space

Space Management Free-Space Allocation  If a new extent of 28MB is required, and a piece of free space of size 40MB is found Used Extent Free-Space Extent of 40MB Used Extent New Extent 28MB Free Used Extent Free Space is split in appropriate fashion

Space Management Free-Space Allocation  If no piece of free space large enough can be found  Free-space map is searched and adjacent free extents coalesced in an attempt to produce a large enough piece of contiguous space  SMON coalesces free-space fragments periodically  Oracle then searches the free-space map looking for  An exact matching size extent  A larger piece of free space and acts as described before  If no space of the required size is available, Oracle returns an error if AUTOEXTEND is off for all datafiles in the tablespace

Space Management Free-Space Allocation  If a new extent is allocated, Oracle updates the segment header, free-space map, and Data Dictionary accordingly  Oracle locally managed tablespaces are managed by bitmaps  No management of free-space map because all extents are equally sized

Space Management Coalescing Free Space Used (Table A) Free Used (Table B) Free Used (Table A) Table B gets dropped Used (Table A) FreeFreeFree 10MB 20MB 8MB Free-space extents coalesced during a search for free space Used (Table A) Free 38MB

Space Management Controlling Space Usage within Data Blocks  Each segment has a list of free blocks maintained automatically by server processes affecting the segment A segment freelist is a set of blocks available for new rows to be inserted A segment freelist is a set of blocks available for new rows to be inserted  Two parameters are used to control which blocks appear on the freelist  PCTFREE specifies the amount of space in each blocks that is unavailable for inserts  Reserved for updates of existing rows already within the block  PCTUSED determines how much data should be removed from a block before the block is put back on the freelist

Space Management Setting PCTFREE and PCTUSED  PCTFREE + PCTUSED <= 100%  PCTFREE large (~40), PCTUSED small (~20)  Blocks remain off the freelist for long periods  Can lead to waste of space  Saves processing costs because blocks do not thrash on and off the freelist  Allows space for updates  Fewer rows per block that can impact full-table scans but reduce possible block contention PCTFREE PCTUSED

Space Management Setting PCTFREE and PCTUSED  PCTFREE small (~5), PCTUSED very large (~90)  Gives efficient use of space due to high block occupancy  Leads to high processing costs because blocks are frequently moved on and off the freelist  Not much space for updates of existing rows PCTFREE PCTUSED

Space Management Setting PCTFREE and PCTUSED  PCTFREE large (~40), PCTUSED large (~50)  Gives poor use of space due to low block occupancy  Allows space for updates  Processing costs could be high because blocks are frequently moved on and off the freelist PCTFREE PCTUSED

Space Management Setting PCTFREE and PCTUSED  PCTFREE small (~10), PCTUSED fairly small (~40)  These are the default values  Gives relatively efficient use of space due to high block occupancy  Not much space for large-scale updates  Processing costs are low because blocks are not frequently moved on and off the freelist PCTFREE PCTUSED

Space Management Setting PCTFREE and PCTUSED  PCTFREE small (~10), PCTUSED fairly small (~40)  The Block accepts both updates and inserts until it reaches 10 percent free; it then allows only updates  Once the block has dropped back down to 40% used, the block allows inserts again

Space Management Setting PCTFREE and PCTUSED  PCTFREE can determine the free space in an index when it is created  Can be set high to prevent excessive block splitting and reorganization  Useful for volatile indexes  Cannot be reset and has no effect as the index is updated  Use purely as an initial setting or starting point  Oracle manages the occupancy of index blocks such that they remain, on average, 70 percent full, thus overriding the PCTFREE setting  PCTUSED cannot be specified for indexes

Space Management Regaining Free Space: Deallocating Extents  Oracle maintains a High Water Mark (HWM) for each segment in every table The point to which actual data has grown within the segment The point to which actual data has grown within the segment Cannot return space in blocks below the HWM to free space without a reorganization Cannot return space in blocks below the HWM to free space without a reorganization  Possible to reclaim space allocated to a segment but never used  This is space above the HWM  ALTER INDEX sales_ind DEALLOCATE UNUSED;  The minimum size of the remaining space must still satisfy the size determined by INITIAL and MINEXTENTS  ALTER TABLE orders DEALLOCATE UNUSED KEEP 20M;  The KEEP clause can override MINEXTENTS and INITIAL, causing the values to be reset accordingly

Space Management Regaining Free Space Below the HWM  The dbms_space_unused_space procedure can be used only to return information about unused blocks and the position of the HWM  Deleting rows does not regain space – HWM remains unchanged Must use Export/Import to reorganize the table storage Must use Export/Import to reorganize the table storage  Create a new table as follows:  Create new_orders As Select * From orders;  Drop Table orders;  Rename new_orders To orders;  Causes a loss of GRANTS, etc on the original table  Needs space for the new copy

Space Management ROWID  ROWIDs are pseudo data types used by Oracle to uniquely address rows in the database Have a number of components Have a number of components  Shown as 18-Character strings using base-64 encoding  Extended ROWID Format FFFBBBBBBSSS Data Object Number Relative file Number Block Number Slot NumberAAAASRAAFAAAAAACAAD Example ROWID Number

Space Management ROWIDs  Datafiles have absolute and relative addresses Observe file_id and relative_fno in the dba_files table Observe file_id and relative_fno in the dba_files table  A ROWID does not contain the absolute file address  Datafiles are identified only within a particular tablespace  The object number determines the tablespace, so that ROWID can uniquely define the file  Object Number + Relative File Number  Relative addressing allows many more files to be used by an Oracle database  Can have 64 million operating-files

Space Management Data Block Format Fixed Header Transaction Entry Row Directory PCTFREE PCTUSED

Space Management Block Organization  Header  Average size is about bytes  Made up of a fixed header and variable header (transaction entries)  Row Directory  Two per row  Contains address of start of each row within the block

Space Management INITRANS and MAXTRANS  Setting INITRANS reserves space for transaction entries in the block This space is not available for data storage This space is not available for data storage If INITRANS is too low, transactions may be placed in the header If INITRANS is too low, transactions may be placed in the header  Setting MAXTRANS allows the specified number of concurrent transactions to place their entries in the block only if there is enough free space

Space Management Row Storage LastNameFirstNameMiddleNameAddressState SmithJosephAdam 123 State Street Ohio Row Overhead Number of Columns (5) Column 1 Length (15) Column 1 Data (SMITH) Column 2 Length (15) Column 2 Data (JOSEPH)… Table Columns Row

Space Management Row Storage  Row Overhead is made up of:  Row Header  Offset in transaction list  Number of Columns in the row that contain data  Rows with > 255 columns will have additional row overhead information

Space Management Storage Patterns for Data Types  VARCHAR and VARCHAR2 1 byte per alphanumeric character 1 byte per alphanumeric character Maximum is 4,000 bytes Maximum is 4,000 bytes  CHAR Fixed Length Fixed Length Maximum is 2,000 bytes Maximum is 2,000 bytes  DATE 7 bytes for all date formats 7 bytes for all date formats  NUMBER  Up to 38 digits of precision  Stored as a single base-100 digit per byte  Therefore no rounding errors in financial data

Space Management Storage Patterns for Data Types  ROWID 10 bytes 10 bytes  LOB Locaters about 20 bytes Locaters about 20 bytes  VARRAY User defined User defined  User-Defined Types

Space Management Updating Rows  When an update causes a change in the length of a data item  Oracle builds a whole new row in the block  The space occupied by the original row is labeled as free space  This is known as block fragmentation  Oracle delays reorganization of storage within the block until no more space is available within the block  If an update does not change the length of a data item, the row is updated in situ (in place)  No block re-organization is necessary

Space Management Row Chaining and Row Migration  Ordinarily, Oracle does one fetch for the retrieval of a single row  A row that occupies multiple blocks impacts performance because more than one I/O may be required to fetch the row  Typically happens with rows containing LOB columns  Known as Row Chaining  Not much can be done about this

Space Management Row Chaining and Row Migration  Updates to rows may not fit within the free space in the block  In this case the whole row is migrated into another block, leaving behind a pointer to the chained block  This Row Migration causes extra disk I/O per row because the ROWID of the migrated row is not changed  Indexes still point to the original block  PCTFREE can be used to control row migration by reserving space for updates to existing rows in blocks

Space Management Detecting and Handling Row Migration  Row Migration can and should be avoided  ANALYZE can be used to detect chained or migrated rows  The script utlchain.sql needs to be run in the account used to perform the analysis  Builds the chained_rows table  Use Analyze Table student List Chained Rows;  This puts data into the chained_rows table  To see the details of the chained_rows table  Select *  From chained_rows  Where table_name = ‘STUDENT’;

Space Management Eliminating Migrated Rows  This is a hard way to do it  Create a temporary table and copy into it all the rows found in the chained_rows table  Create Table student_mig As  Select * From student  Where Rowid In (Select head_rowid From chained_rows);  Delete the migrated rows from the original table  Delete From student Where Rowid In  (Select head_rowid From chained_rows);  Now insert the rows into the original table from the temporary table  Insert Into student Select * From student_mig;

Space Management Row Chaining and Row Migration  Can also be observed in user_tables  Need to run this statement first  Analyze Table table_name Compute | Estimate Statistics;  This populates columns in the user_tables view  Tables and Indexes can be analyzed this way  Analyzing tables with the COMPUTE option takes longer than a full table scan It performs a sort and requires temporary storage It performs a sort and requires temporary storage  Statistics are found in:  user_tables  user_indexes  user_tab_columns

Space Management Analyzing Objects  To Analyze you need to own the object or have the privilege Analyze Any system privilege  Analyzing will invalidate any currently parsed SQL statements on the table  Oracle9i has dbms_stats that can be used to analyze and monitor tables

Space Management Estimating Space Requirements  Tables created with badly designed storage parameters cause performance problems like: Frequent Dynamic Extension Frequent Dynamic Extension Large numbers of extents causing problems for full- table scans Large numbers of extents causing problems for full- table scans Extents too large to fit into any free-space pattern Extents too large to fit into any free-space pattern Row Chaining and Migration Row Chaining and Migration Over-allocation of space leading to the purchase of unnecessary hardware Over-allocation of space leading to the purchase of unnecessary hardware  A practical approach is to preload the table with some sample data, analyze the table, and discover the number of rows per block  Calculate a 5% overhead for safety

Space Management Questions?