Dimensional model. What do we know so far about … FACTS? “What is the process measuring?” Fact types:  Numeric Additive Semi-additive Non-additive (avg,

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Lecture 3 Themes in this session Basics of the multidimensional data model and star- join schemata The process of, and specific design issues in, multidimensional.
Tips and Tricks for Dimensional Modeling
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Chapter 3 The Relational Model Transparencies © Pearson Education Limited 1995, 2005.
Data Warehousing Design Transparencies
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Telecommunication Case Study CS 543 – Data Warehousing.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 17 Methodology – Physical Database Design for Relational Databases Transparencies © Pearson Education Limited 1995, 2005.
Team Dosen UMN Physical DB Design Connolly Book Chapter 18.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
CS 345: Topics in Data Warehousing Thursday, October 7, 2004.
Business Intelligence
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Chapter 4 The Relational Model Pearson Education © 2014.
Relational Model Session 6 Course Name: Database System Year : 2012.
Chapter 4 The Relational Model.
Chapter 3 The Relational Model Transparencies Last Updated: Pebruari 2011 By M. Arief
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
RAJIKA TANDON DATABASES CSE 781 – Database Management Systems Instructor: Dr. A. Goel.
Lecture 9 Methodology – Physical Database Design for Relational Databases.
Dimensional Modeling Chapter 2. The Dimensional Data Model An alternative to the normalized data model Present information as simply as possible (easier.
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
Data Warehousing Concepts, by Dr. Khalil 1 Data Warehousing Design Dr. Awad Khalil Computer Science Department AUC.
Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Bus Architecture. Value Chain Identifies the natural logical flow of an organization’s primary activities Operational source systems produce snapshots.
INVENTORY CASE STUDY. Introduction Optimized inventory levels in stores can have a major impact on chain profitability: minimize out-of-stocks reduce.
Normalized model vs dimensional model
Basic Model: Retail Grocery Store
Methodology – Physical Database Design for Relational Databases.
1 Data Warehousing Lecture-15 Issues of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
ISQS 3358, Business Intelligence Supplemental Notes on the Term Project Zhangxi Lin Texas Tech University 1.
UNIT-II Principles of dimensional modeling
Creating the Dimensional Model
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Data Warehousing (Kimball, Ch.5-12) Dr. Vairam Arunachalam School of Accountancy, MU.
The Relational Model. 2 Relational Model Terminology u A relation is a table with columns and rows. –Only applies to logical structure of the database,
Chapter 16. Insurance 서울시립대학교 인공지능 연구실 G 조찬연 The Data Warehouse Toolkit 1 /35.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Data modeling. Presentation by – Anupama Vudaru, Phani Kondapalli Content by – Prathibha Madineni, Subrahmanyam Kolluri October 2010.
Data Warehousing DSCI 4103 Dr. Mennecke Chapter 2.
Dimensional Modeling Primer Chapter 1 Kimball & Ross.
Chapter 4 The Relational Model Pearson Education © 2009.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Data Warehouse/Data Mart It’s all about the data.
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
Operation Data Analysis Hints and Guidelines
Lecture 2 The Relational Model
Applying Data Warehouse Techniques
Overview and Fundamentals
Dimensional Model January 14, 2003
Retail Sales is used to illustrate a first dimensional model
Applying Data Warehouse Techniques
Chapter 4 Summary Query.
Retail Sales is used to illustrate a first dimensional model
Applying Data Warehouse Techniques
Data warehouse architecture CIF, DM Bus Matrix Star schema
Retail Sales is used to illustrate a first dimensional model
Dimensional Model January 16, 2003
Applying Data Warehouse Techniques
Examines blended and separate transaction schemas
Review of Major Points Star schema Slowly changing dimensions Keys
Applying Data Warehouse Techniques
Dmytro Polishchuk BI Developer DB Best Technologies
Presentation transcript:

Dimensional model

What do we know so far about … FACTS? “What is the process measuring?” Fact types:  Numeric Additive Semi-additive Non-additive (avg, count..)  Textual (rarely) Derived facts Fact tables  90% of database (many rows, few columns)  contain FKs to dimensions PKs  Many to many between dimensions Fact tables types:  Transaction fact tables  tbc

What do we know so far about … DIMENSIONS? “How do business people describe the data resulting from the business process measurement events?” Dimension tables:  10% of database (many columns, few rows) Flags and Indicators as Textual Attributes Attributes with Embedded Meaning Numeric Values as Attributes or Facts

More about FACTS… NO null FKs in fact tables  WHY? Referential integrity violated No join on null keys It’s ok to have nulls as metrics in fact tables  they’re properly handled in aggregate functions such as SUM, MIN, MAX, COUNT, and AVG which do the “right thing” with nulls.  Substituting a zero instead would improperly skew these aggregated calculations

More about DIMENSIONS… NO null values for attributes (use unknown or not applicable instead)  WHY? Null values disappear in pull-down menus of possible attribute values special syntax is required to identify them If users sum up facts by grouping on a fully populated dimension attribute, and then alternatively, sum by grouping on a dimension attribute with null values, they’ll get different query results.

More about DIMENSIONS… Degenerate Dimensions (DD)  Operational transaction control numbers such as order numbers, invoice numbers, and bill-of-lading numbers usually give rise to empty dimensions and are represented as degenerate dimensions in transaction fact tables. The degenerate dimension is a dimension key without a corresponding dimension table.

Retail Schema in Action

Retail Schema Extensibility frequent shopper program New dimension attributes New dimensions New measured facts

More about FACTS… Factless Fact Tables What products were on promotion but did not sell?

Dimension and Fact Table Keys Dimension Table Surrogate Keys  Every join between dimension and fact tables in the data warehouse should be based on meaningless integer surrogate keys. You should avoid using a natural key as the dimension table’s primary key. Fact Table Surrogate Keys  PK of a fact table typically consists of a subset of the table’s FKs and/or degenerate dimension.

Inventory Business Process Inventory Periodic Snapshot

Inventory Business Process Inventory Transactions

Inventory Business Process Inventory Accumulating Snapshot

Fact Table Types

Data Warehouse Bus Architecture By defining a standard bus interface for the DW/BI environment, separate dimensional models can be implemented by different groups at different times. The separate business process subject areas plug together and usefully coexist if they adhere to the standard.

Data Warehouse Bus Matrix

Slowly Changing Dimension (SCD) Type 0: Retain Original Type 1: Overwrite  easy to implement, but it does not maintain any history of prior attribute values.

Slowly Changing Dimension (SCD) Type 2: Add New Row  the primary workhorse technique for accurately tracking slowly changing dimension attributes.

Slowly Changing Dimension (SCD) Type 3: Add New Attribute  The type 3 slowly changing dimension technique enables you to see new and historical fact data by either the new or prior attribute values, sometimes called alternate realities.

Dimensional model Goals: user understandability, query performance, resilience to change Atomic data Adherence to bus architecture

Case study – Babes-Bolyai University 3-5 persons teams create a dimensional model of data available at UBB consider one business process identify different types of facts and dimensions