Chapter 8 Physical Database Design

Slides:



Advertisements
Similar presentations
Technology Guide 3 Data and Database T3-1. IT for Management Prof. Efraim Turban T3-2 File Management Hierarchy of data for a computer-based file Record.
Advertisements

Chapter 15 Creating Database Forms and Reports Introduction Forms Reports.
Chapter 5: Database Forms and Reports
Topic Denormalisation S McKeever Advanced Databases 1.
© Copyright 2011 John Wiley & Sons, Inc.
© Copyright 2011 John Wiley & Sons, Inc.
ACCOUNTING INFORMATION SYSTEMS
Chapter Chapter 13-2 Chapter 13 Data Modeling Introduction An Overview of Databases Steps in Creating a Database Using Rea Creating Database Tables.
PowerPoint Presentation for Dennis, Wixom & Tegarden Systems Analysis and Design Copyright 2001 © John Wiley & Sons, Inc. All rights reserved. Slide 1.
IS 4420 Database Fundamentals Chapter 6: Physical Database Design and Performance Leon Chen.
Chapter 14 Organizing and Manipulating the Data in Databases
Organizing Data & Information
Chapter 101 Information Technology For Management 6 th Edition Turban, Leidner, McLean, Wetherbe Lecture Slides by L. Beaubien, Providence College John.
Chapter 5 The Relational Database Model: Introduction
Chapter 3: Data Modeling
Chapter 3 Data Modeling Fundamentals of Database Management Systems by
Introduction to Databases
Chapter 7 Logical Database Design
Chapter 4 The Database Management System Concept
PowerPoint Presentation for Dennis & Haley Wixom, Systems Analysis and Design Copyright 2000 © John Wiley & Sons, Inc. All rights reserved. Slide 1 Systems.
Chapter 6 The Relational Database Model: Additional Concepts
Introduction to Databases Chapter 1: Introducing Data and Data Management.
Chapter 6 Physical Database Design. Introduction The purpose of physical database design is to translate the logical description of data into the technical.
4-1 Coding Complete COBOL Programs: The PROCEDURE DIVISION Chapter 4.
4-1 COBOL for the 21 st Century Nancy Stern Hofstra University Robert A. Stern Nassau Community College James P. Ley University of Wisconsin-Stout (Emeritus)
Chapter 2 Simple File Storage and Retrieval
Chapter 3 The Database Management System Concept
CSC271 Database Systems Lecture # 30.
4-1 COBOL for the 21 st Century Nancy Stern Hofstra University Robert A. Stern Nassau Community College James P. Ley University of Wisconsin-Stout (Emeritus)
Chapter 10 Object-Oriented Database Management
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
Introduction to Databases Chapter 7: Data Access and Manipulation.
1 © Prentice Hall, 2002 Physical Database Design Dr. Bijoy Bordoloi.
Chapter 8 Physical Database Design
PowerPoint Presentation for Dennis & Haley Wixom, Systems Analysis and Design, 2 nd Edition Copyright 2003 © John Wiley & Sons, Inc. All rights reserved.
Introduction to Information Technology, 2 nd Edition Turban, Rainer & Potter © 2003 John Wiley & Sons, Inc. 5-1 Introduction to Information Technology.
Fundamentals of Database Management Systems, 2nd ed
TM 7-1 Copyright © 1999 Addison Wesley Longman, Inc. Physical Database Design.
Chapter 4: Organizing and Manipulating the Data in Databases
Physical Database Design Chapter 6. Physical Design and implementation 1.Translate global logical data model for target DBMS  1.1Design base relations.
Today’s Agenda  Any questions about the assignment (due Mon)?  Quiz  Quiz review  Homework for Friday:  Watch the two videos on the Coursera db website.
Data and its manifestations. Storage and Retrieval techniques.
PowerPoint Presentation for Dennis, Wixom & Tegarden Systems Analysis and Design Copyright 2001 © John Wiley & Sons, Inc. All rights reserved. Slide 1.
7 - 1 Chapter 7: Data Analysis for Modeling PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with.
PowerPoint Presentation for Dennis & Haley Wixom, Systems Analysis and Design Copyright 2000 © John Wiley & Sons, Inc. All rights reserved. Slide 1 Systems.
Chapter 6 1 © Prentice Hall, 2002 The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited) Project Identification and Selection Project Initiation.
Chapter Chapter 13-2 Accounting Information Systems, 1 st Edition Data and Databases.
Core Concepts of ACCOUNTING INFORMATION SYSTEMS Moscove, Simkin & Bagranoff John Wiley & Sons, Inc. Developed by: Marianne Bradford, Ph.D. Bryant College.
Copyright © 2000 John Wiley & Sons, Inc. All rights reserved
Slide 1-1 Chapter 1 Terms Information Systems Overview Introduction to Information Systems Judith C. Simon.
Database Management COP4540, SCS, FIU Physical Database Design (ch. 16 & ch. 3)
Core Concepts of ACCOUNTING INFORMATION SYSTEMS Moscove, Simkin & Bagranoff John Wiley & Sons, Inc. Developed by: S. Bhattacharya, Ph.D. Florida Atlantic.
Chapter 4-1. Chapter 4-2 Chapter 4: Data Modeling Introduction An Overview of Databases Steps in Creating a Database Using REA Creating Database Tables.
Programming Logic and Design Fourth Edition, Comprehensive Chapter 16 Using Relational Databases.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Lesson 18: Configuring Security for Mobile Devices MOAC : Configuring Windows 8.1.
Slide 6-1 Chapter 6 System Software Considerations Introduction to Information Systems Judith C. Simon.
Chapter 13 Client/Server Database and Distributed Database Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis.
Chapter 5-1. Chapter 5-2 Chapter 5: Organizing and Manipulating the Data in Databases Introduction Normalization Validating the Data in Databases Extracting.
Slide 11-1 Chapter 11 Terms Information Resource Management Strategies Introduction to Information Systems Judith C. Simon.
Introduction to Business Information Systems by Mark Huber, Craig Piercy, Patrick McKeown, and James Norrie Tech Guide D: The Details of SQL, Data Modelling,
Slide 6-1 Chapter 6 Terms System Software Considerations Introduction to Information Systems Judith C. Simon.
Systems Analysis and Design
Physical Changes That Don’t Change the Logical Design
Physical Database Design and Performance
CHAPTER 5: PHYSICAL DATABASE DESIGN AND PERFORMANCE
Systems Analysis and Design 5th Edition Chapter 8. Architecture Design
Physical Database Design
Systems Analysis and Design
ACCOUNTING INFORMATION SYSTEMS
Presentation transcript:

Chapter 8 Physical Database Design Fundamentals of Database Management Systems by Mark L. Gillenson, Ph.D. University of Memphis Presentation by: Amita Goyal Chin, Ph.D. Virginia Commonwealth University John Wiley & Sons, Inc.

Chapter Objectives Describe the concept of physical database design. List and describe the inputs to the physical database design process. Perform physical database design and improve database performance using a variety of techniques ranging from adding indexes to denormalization.

Database Performance Factors Affecting Application and Database Performance Application Factors Need for Joins Need to Calculate Totals Data Factors Large Data Volumes Database Structure Factors Lack of Direct Access Clumsy Primary Keys Data Storage Factors Related Data Dispersed on Disk Business Environment Factors Too Many Data Access Operations Overly Liberal Data Access

Physical Database Design The process of modifying a database structure to improve the performance of the run-time environment. We are going to modify the third normal form tables produced by the logical database design techniques to make the applications that will use them run faster.

Inputs to Physical Database Design Physical database design starts where logical database design ends. The well structured relational tables produced by the conversion from ERDs or by the data normalization process form the starting point for physical database design.

More Inputs to Physical Database Design Inputs Into the Physical Database Design Process The Tables Produced by the Logical Database Design Process Business Environment Requirements Response Time Requirements Throughput Requirements Data Characteristics Data Volume Assessment Data Volatility Application Characteristics Application Data Requirements Application Priorities Operational Requirements Data Security Concerns Backup and Recovery Concerns Hardware and Software Characteristics DBMS Characteristics Hardware Characteristics

The Tables Produced by the Logical Database Design Process Form the starting point of the physical database design process. Reflect all of the data in the business environment. Are likely to be unacceptable from a performance point of view and must be modified in physical database design.

Business Environment Requirements Response Time Requirements Throughput Requirements

Business Environment Requirements: Response Time Requirements Response time is the delay from the time that the Enter Key is pressed to execute a query until the result appears on the screen. What are the response time requirements?

Business Environment Requirements: Throughput Requirements Throughput is the measure of how many queries from simultaneous users must be satisfied in a given period of time by the application set and the database that supports it.

Data Characteristics Data Volume Assessment Data Volatility How much data will be in the database? Roughly how many records is each table expected to have? Data Volatility Refers to how often stored data is updated.

Application Characteristics What is the nature of the applications that will use the data? Which applications are the most important to the company? Which data will be accessed by each application?

Application Characteristics Application Data Requirements Application Priorities

Application Characteristics: Data Requirements Which database tables does each application require for its processing? Do the applications require that tables be joined? How many applications and which specific applications will share particular database tables? Are the applications that use a particular table run frequently or infrequently?

Application Characteristics: Priorities When a modification to a table proposed during physical design that’s designed to help the performance of one application hinders the performance of another application, which of the two applications is the more critical to the company?

Operational Requirements: Data Security, Backup and Recovery Protecting data from theft or malicious destruction and making sure that sensitive data is accessible only to those employees of the company who have a “need to know.” Backup and Recovery Being able to recover a table or a database that has been corrupted or lost due to hardware or software failure to the recovery of an entire information system after a natural disaster.

Hardware and Software Characteristics DBMS Characteristics For example, exact nature of indexes, attribute data type options, and SQL query features, which must be known and taken into account during physical database design. Hardware Characteristics Processor speeds and disk data transfer rates.

Physical Database Design Techniques Physical Design Categories and Techniques That DO NOT Change the Logical Design Adding External Features Adding Indexes Adding Views Reorganizing Stored Data Clustering Files Splitting a Table into Multiple Tables Horizontal Partitioning Vertical Partitioning Splitting-Off Large Text Attributes

Physical Database Design Techniques Physical Design Categories and Techniques That DO Change the Logical Design Changing Attributes in a Table Substituting Foreign Keys Adding Attributes to a Table Creating New Primary Keys Storing Derived Data Combining Tables Combine Tables in One-to-One relationships Alternative for Repeating Groups Denormalization Adding New Tables Duplicating Tables Adding Subset Tables

Adding External Features Doesn’t change the logical design at all. There is no introduction of data redundancy.

Adding External Features Adding Indexes Adding Views

Adding External Features: Adding Indexes Which attributes or combinations of attributes should you consider indexing in order to have the greatest positive impact on the application environment? Attributes that are likely to be prominent in direct searches Primary keys Search attributes Attributes that are likely to be major players in operations, such as joins, SQL SELECT ORDER BY clauses and SQL SELECT GROUP BY clauses.

Adding External Features: Adding Indexes What potential problems can be caused by building too many indexes? Indexes are wonderful for direct searches. But when the data in a table is updated, the system must take the time to update the table’s indexes, too.

General Hardware Company With Some Indexes

Adding External Features: Adding Views Doesn’t change the logical design. No data is physically duplicated. An important device in protecting the security and privacy of data.

Reorganizing Stored Data Doesn’t change the logical design. No data is physically duplicated. Clustering Files Houses related records together on a disk.

Reorganizing Stored Data: Clustering Files The salesperson record for salesperson 137, Baker, is followed on the disk by the customer records for customers 0121, 0933, 1047, and 1826.

Splitting a Table Into Multiple Tables Horizontal Partitioning Vertical Partitioning Splitting-Off Large Text Attributes

Splitting a Table Into Multiple Tables: Horizontal Partitioning The rows of a table are divided into groups, and the groups are stored separately on different areas of a disk or on different disks. Useful in managing the different groups of records separately for security or backup and recovery purposes. Improve data retrieval performance. Disadvantage: retrieval of records from more than one partition can be more complex and slower.

Splitting a Table Into Multiple Tables: Horizontal Partitioning

Splitting a Table Into Multiple Tables: Vertical Partitioning The separate groups, each made up of different columns of a table, are created because different users or applications require different columns. Each partition must have a copy of the primary key.

Splitting a Table Into Multiple Tables: Vertical Partitioning The Salesperson table

Splitting a Table Into Multiple Tables: Splitting Off Large Text Attributes A variation on vertical partitioning involves splitting off large text attributes into separate partitions. Each partition must have a copy of the primary key.

Changing Attributes in a Table Changes the logical design. Substituting a Foreign Key Substitute an alternate key (Salesperson Name, assuming it is a unique attribute) as a foreign key. Saves on the number of performance-slowing joins.

Adding Attributes to a Table Creating New Primary Keys Storing Derived Data

Adding Attributes to a Table: Creating New Primary Keys Changes the logical design. In a table with no single attribute primary key, indexing a multi-attribute key would likely be clumsy and slow. Create a new serial number attribute primary key for the table.

Adding Attributes to a Table: Creating New Primary Keys The current two-attribute primary key of the CUSTOMER EMPLOYEE table can be replaced by one, new attribute.

Adding Attributes to a Table: Storing Derived Data Calculate answers to certain queries once and store them in the database.

Combining Tables If two tables are combined into one, then there must surely be situations in which the presence of the new single table allows us to avoid joins that would have been necessary when there were two tables. Combination of Tables in One-to-One Relationships Alternatives for Repeating Groups Denormalization

Combining Tables: Combination of Tables in One-to-One Relationships Advantage: if we ever have to retrieve detailed data about a salesperson and his office in one query, it can now be done without a join.

Combining Tables: Combination of Tables in One-to-One Relationships Disadvantages: the tables are no longer logically as well as physically independent. retrievals of salesperson data alone or of office data alone could be slower than before. storage of data about unoccupied offices is problematic and may require a reevaluation of which field should be the primary key.

Combining Tables: Alternatives for Repeating Groups If repeating groups are well controlled, they can be folded into one table.

Combining Tables: Denormalization It may be necessary to take pairs of related, third normal form tables and to combine them, introducing possibly massive data redundancy. Unsatisfactory response times and throughput may mandate eliminating run-time joins.

Combining Tables: Denormalization Since a salesperson can have several customers, a particular salesperson’s data will be repeated for each customer he has.

Combining Tables: Denormalization

Adding New Tables Duplicating Tables Adding Subset Tables Duplicate tables and have different applications access the duplicates. Adding Subset Tables Duplicate only those portions of a table that are most heavily accessed. Assign subsets to different applications to ease the performance crunch.

Good Reading Bookstores: Problem Assume that Good Reading’s headquarters frequently needs to quickly find the details of a book, based on either its book number or its title, together with details about its publisher. If a join takes too long, resulting in unacceptable response times, throughput, or both, what are the possibilities in terms of physical design that can improve the situation?

Good Reading Bookstores: Solutions The Book Number attribute and the Book Title attributes in the PUBLISHER table can each have an index built on them to provide direct access, since the problem says that books are going to be searched for based on one of these two attributes. The two join attributes—the Publisher Name attribute of the PUBLISHER table and the Publisher Name attribute of the BOOK table—can each have an index built on them to help speed up the join operation.

Good Reading Bookstores: Solutions If the DBMS permits it, the two tables can be clustered, with the book records associated with a particular publisher stored near that publisher’s record on the disk. The two tables can be denormalized, with the appropriate publisher data being appended to each book record (and the PUBLISHER table being eliminated).

“Copyright 2004 John Wiley & Sons, Inc. All rights reserved “Copyright 2004 John Wiley & Sons, Inc. All rights reserved. Reproduction or translation of this work beyond that permitted in Section 117 of the 1976 United States Copyright Act without express permission of the copyright owner is unlawful. Request for further information should be addressed to the Permissions Department, John Wiley & Sons, Inc. The purchaser may make back-up copies for his/her own use only and not for distribution or resale. The Publisher assumes no responsibility for errors, omissions, or damages caused by the use of these programs or from the use of the information contained herein.”