Physical Design CS 543 – Data Warehousing. CS 543 - Data Warehousing (Sp 2007-2008) - Asim LUMS2 Physical Design Steps 1. Develop standards 2.

Slides:



Advertisements
Similar presentations
CS 346 – April 4 Mass storage –Disk formatting –Managing swap space –RAID Commitment –Please finish chapter 12.
Advertisements

Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
Database Performance Tuning and Query Optimization
SQL Server Accelerator for Business Intelligence (SSABI)
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
Physical Database Design Chapter 5 G. Green 1. Agenda Purpose Activities Fields Records Files 2.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
© Copyright 2011 John Wiley & Sons, Inc.
Database Systems: A Practical Approach to Design, Implementation and Management International Computer Science S. Carolyn Begg, Thomas Connolly Lecture.
Chapter Physical Database Design Methodology Software & Hardware Mapping Logical Design to DBMS Physical Implementation Security Implementation Monitoring.
Chapter 6 Methodology Conceptual Databases Design Transparencies © Pearson Education Limited 1995, 2005.
IS 4420 Database Fundamentals Chapter 6: Physical Database Design and Performance Leon Chen.
Dimensional Modeling – Part 2
Components and Architecture CS 543 – Data Warehousing.
Physical Database Monitoring and Tuning the Operational System.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Chapter 9 Database Design
Telecommunication Case Study CS 543 – Data Warehousing.
Information systems and databases Database information systems Read the textbook: Chapter 2: Information systems and databases FOR MORE INFO...
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Chapter 17 Methodology – Physical Database Design for Relational Databases Transparencies © Pearson Education Limited 1995, 2005.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
Team Dosen UMN Physical DB Design Connolly Book Chapter 18.
Redundant Array of Independent Disks
Chapter 6: Physical Database Design and Performance
Chapter 6 Physical Database Design. Introduction The purpose of physical database design is to translate the logical description of data into the technical.
Systems analysis and design, 6th edition Dennis, wixom, and roth
Chapters 17 & 18 Physical Database Design Methodology.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
1 © Prentice Hall, 2002 Physical Database Design Dr. Bijoy Bordoloi.
Lecture 9 Methodology – Physical Database Design for Relational Databases.
Methodology - Conceptual Database Design Transparencies
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
Methodology Conceptual Databases Design
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
TM 7-1 Copyright © 1999 Addison Wesley Longman, Inc. Physical Database Design.
Chapter 16 Methodology – Physical Database Design for Relational Databases.
1 © Prentice Hall, 2002 Chapter 6: Physical Database Design and Performance Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott,
Chapter 6 1 © Prentice Hall, 2002 The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited) Project Identification and Selection Project Initiation.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
Methodology: Conceptual Databases Design
DATABASE MGMT SYSTEM (BCS 1423) Chapter 5: Methodology – Conceptual Database Design.
10/10/2012ISC239 Isabelle Bichindaritz1 Physical Database Design.
Methodology - Conceptual Database Design
Database Management COP4540, SCS, FIU Physical Database Design (ch. 16 & ch. 3)
Planning Warehouse Storage Chapter 9. Data Partitioning zBreaking up a data into separate physical units that can be handled independently zEase of: -
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Methodology – Physical Database Design for Relational Databases.
Building Dashboards SharePoint and Business Intelligence.
SYS364 Database Design Continued. Database Design Definitions Initial ERD’s Normalization of data Final ERD’s Database Management Database Models File.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
7 Strategies for Extracting, Transforming, and Loading.
Chapter 4 Logical & Physical Database Design
By N.Gopinath AP/CSE.  The data warehouse architecture is based on a relational database management system server that functions as the central repository.
Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall Chapter 9 Designing Databases 9.1.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Database Systems, 8 th Edition SQL Performance Tuning Evaluated from client perspective –Most current relational DBMSs perform automatic query optimization.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
Methodology – Physical Database Design for Relational Databases
Database Performance Tuning and Query Optimization
Introduction of Week 3 Assignment Discussion
國立臺北科技大學 課程:資料庫系統 fall Chapter 18
Physical Database Design
Chapter 11 Database Performance Tuning and Query Optimization
Presentation transcript:

Physical Design CS 543 – Data Warehousing

CS Data Warehousing (Sp ) - Asim LUMS2 Physical Design Steps 1. Develop standards 2. Create aggregates plan 3. Determine data partitioning 4. Establish clustering options 5. Prepare indexing strategy 6. Assign storage structures 7. Complete physical model

CS Data Warehousing (Sp ) - Asim LUMS3 Develop Standards IT standards include  Naming conventions for database and software  Procedures for documentation, information gathering, project organization, methodology, and process Standards are of greater significance in data warehousing projects because they are large and complex projects with non-technical end-users

CS Data Warehousing (Sp ) - Asim LUMS4 Create Aggregates Plan Requirements guide creation of aggregates or summary tables A comprehensive plan would  Identify key dimensions and their hierarchical levels that can be aggregated  Provide guidelines on when to include an aggregate table (e.g. based on some performance metric)  Establish monitoring of usage (types of queries and their performances)

CS Data Warehousing (Sp ) - Asim LUMS5 Determine the Data Partitioning Scheme Fact tables can become very large. It is essential that they are properly partitioned among different physical platforms to improve performance. The partitioning scheme would include  The fact tables and the dimension tables selected for partitioning  The type of partitioning for each table – horizontal or vertical  The number of partitions for each table  The criteria for dividing each table (for example, by product groups)  Descriptions of how to make queries aware of partitions

CS Data Warehousing (Sp ) - Asim LUMS6 Establish Clustering Options Establish physical location of data elements for quick access If data elements are read sequential most of the time, then they should be placed in adjacent locations on the disk

CS Data Warehousing (Sp ) - Asim LUMS7 Prepare an Indexing Strategy Adequate indexing can improve query performance significantly An indexing strategy would include  Indexes for each table  The sequence in which indexes will be created for each table  Create some indexes initially  Monitor performance and plan to add more indexes as need is felt

CS Data Warehousing (Sp ) - Asim LUMS8 Assign Storage Structures Determine how and where data is to be stored on the physical medium Storage structures include  File structures  Location of files on disk (e.g. blocking)  Planning for size and growth  Planning for data warehouse storage as well as other storage such as staging area and client desktops

CS Data Warehousing (Sp ) - Asim LUMS9 Key Physical Design Objectives Improve performance Ensure scalability Manage storage Provide ease of administration Design for flexibility

CS Data Warehousing (Sp ) - Asim LUMS10 From Logical Model to Physical Model

CS Data Warehousing (Sp ) - Asim LUMS11 Physical Model Components

CS Data Warehousing (Sp ) - Asim LUMS12 Logical Model and Physical Model

CS Data Warehousing (Sp ) - Asim LUMS13 Standards Naming of database objects  Components of object names  Word separators  Names in logical and physical model Naming of files and tables in the staging area  Indicate the process  Express the purpose Standards for physical files  Files holding source codes and scripts  Database files  Application documents

CS Data Warehousing (Sp ) - Asim LUMS14 Physical Storage Data Structures

CS Data Warehousing (Sp ) - Asim LUMS15 Optimizing Storage Set the correct block size Set the appropriate block usage parameters  Block percent free; block percent used Manage data migration Resolve dynamic extensions Employ file striping techniques

CS Data Warehousing (Sp ) - Asim LUMS16 Using RAID Technology Redundant array of inexpensive disks  Data mirroring  Data duplexing  Parity checking  Data striping Six levels of RAID implementations (RAID 0 to RAID 5)

CS Data Warehousing (Sp ) - Asim LUMS17 Estimating Storage Sizes For each database table, determine  Initial estimate of the number of rows  Average length of the row  Anticipated monthly increase in the number of rows  Initial size of the table in megabytes (MB)  Calculated table sizes in 6 months and in 12 months For all tables, determine  The total number of indexes  Space needed for indexes initially, in six months, and in 12 months Estimate  Temporary work space for sorting and merging  Temporary and permanent files in the staging area

CS Data Warehousing (Sp ) - Asim LUMS18 Performance Enhancement Techniques Data partitioning Data clustering Parallel processing Summary levels Referential integrity checks Initialization parameters Data arrays