Data Warehousing Design DT211/4. Designing Data Warehouses To begin a data warehouse project, we need to find answers for questions such as: – Which user.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Data Warehousing Design Transparencies
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Data Warehousing M R BRAHMAM.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
De-normalize if… Performance is unsatisfactory Table has a low update rate –(sacrifice flexibility) Table has a high query rate –(speed up retrieval)
Manajemen Basis Data Pertemuan 8 Matakuliah: M0264/Manajemen Basis Data Tahun: 2008.
Data Warehousing Design Transparencies
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
Data Warehousing Dale-Marie Wilson, Ph.D..
Telecommunication Case Study CS 543 – Data Warehousing.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Principles of Dimensional Modeling
Building a Data Warehouse with SQL Server Presented by John Sterrett.
Business Intelligence Process Grain of the Fact Table Dr. Chang Liu
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
Data Warehousing Concepts, by Dr. Khalil 1 Data Warehousing Design Dr. Awad Khalil Computer Science Department AUC.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Data Warehouse design models in higher education courses Patrizia Poščić, Associate Professor Danijela Subotić, Teaching Assistant.
1 Data Warehouses BUAD/American University Data Warehouses.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Data Warehousing.
MIS2502: Data Analytics Dimensional Data Modeling
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
UNIT-II Principles of dimensional modeling
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
Last Updated : 26th may 2003 Center of Excellence Data Warehousing Introductionto Data Modeling.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Lecture 14: Data Warehousing Modern Database Management 9 th Edition Jeffrey A. Hoffer, Mary.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Data Warehouse/Data Mart It’s all about the data.
Methodology Conceptual Databases Design
Conceptual Design & ERD Modelling
Operation Data Analysis Hints and Guidelines
Advanced Applied IT for Business 2
Methodology Conceptual Database Design
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse—Subject‐Oriented
Methodology – Monitoring and Tuning the Operational System
Summarized from various resources Modern Database Management
MIS2502: Data Analytics Dimensional Data Modeling
The DreamHome Case Study
Data Warehouse.
Star Schema.
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Competing on Analytics II
Dimensional Model January 14, 2003
MIS2502: Data Analytics Dimensional Data Modeling
CMPE 226 Database Systems April 11 Class Meeting
Data Warehouse and OLAP
Entity-Relationship Modeling
An Introduction to Data Warehousing
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Retail Sales is used to illustrate a first dimensional model
DWH – Dimesional Modeling
Chapter 13 The Data Warehouse
CSD305 Data Warehouse Design
Data Warehouse and OLAP
Presentation transcript:

Data Warehousing Design DT211/4

Designing Data Warehouses To begin a data warehouse project, we need to find answers for questions such as: – Which user requirements are most important and which data should be considered first? – Which data should be considered first? – Should the project be scaled down into something more manageable? – Should the infrastructure for a scaled down project be capable of ultimately delivering a full-scale enterprise- wide data warehouse? 2

Designing Data Warehouses Interviews provide the necessary information for the top-down view (user requirements) and the bottom-up view (which data sources are available and will be supported over the next few years) of the data warehouse. The database component of a data warehouse is described using a technique called dimensionality modelling. 3

Dimensionality modeling A logical design technique that aims to present the data in a standard, intuitive form that allows for high-performance access Every dimensional model (DM) is composed of one table with a composite primary key, called the fact table, and a set of smaller tables called dimension tables. 4

Fact and dimension tables for each business process of DreamHome 5

ER model of property sales business process of DreamHome 6

Star schema for property sales of DreamHome 7

Dimensionality modeling Star schema is a logical structure that has a fact table containing factual data in the centre, surrounded by dimension tables containing reference data, which can be denormalised. Facts are generated by events that occurred in the past, and are unlikely to change, regardless of how they are analyzed. Important to treat fact data as read-only reference data that will not change over time. Most useful fact tables contain one or more numerical measures, or ‘facts’ that occur for each record and are numeric and additive. 8

Dimensionality modeling Star schemas can be used to speed up query performance by denormalizing reference information into a single dimension table. For example: dimension tables (propertyfor sale, client, branch and staff) all have city region and county repeated. This avoids having to certain join tables 9

Database Design Methodology for Data Warehouses ‘Methodology’ includes following steps: – Choosing the process – Choosing the grain – Identifying and conforming the dimensions – Choosing the facts – Storing pre-calculations in the fact table – Choosing the duration of the database 10

Step 1: Choosing the process The process (function) refers to the subject matter of a particular data warehouse: to answer the most commercially important business questions. Identify the discrete business processes. For example: property sales; 11

ER model of an extended version of DreamHome 12 © Pearson Education Limited 1995, 2005

ER model of property sales business process of DreamHome 13

Step 2: Choosing the grain Decide what a record of the fact table is to represents: e.g. Property sales. Identify dimensions of the fact table. The grain decision for the fact table also determines the grain of each dimension table. Also include time as a core dimension, which is always present in star schemas. 14

Star schema for property sales of DreamHome 15

Step 3: Identifying and conforming the dimensions Dimensions set the context for asking questions about the facts in the fact table. Clientbuyer: clientno., client name, city, region, county. If any dimension occurs in two data marts (subset of the data warehouse) 16

Star schemas for property sales and property advertising 17

Step 4: Choosing the facts The grain of the fact table determines which facts can be used in the data mart. Facts should be numeric and additive. Unusable facts include: – non-numeric facts – non-additive facts – fact at different granularity from other facts in table 18

Star schema for property sales of DreamHome 19

Property rentals with fact table corrected 20

Step 5: Storing pre-calculations in the fact table Once the facts have been selected each should be re- examined to determine whether there are opportunities to use pre-calculations. For example in the lease fact table: total revenue = total rent - (client allowance + staff commission) Important to decide on duration of fact table: mostly 1 to 2 years. If it is too large could mean that access to older data is difficult; problems in reading data 21

Fact and dimension tables for each business process of DreamHome 22

Dimensional model (fact constellation) for the DreamHome data warehouse 23