Tips and Tricks for Dimensional Modeling

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Data Warehousing Design Transparencies
Dimensional Modeling.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
C6 Databases.
Navigator Management Partners LLC Business Analysis Professional Development Day – Sep 2014 How to understand and deliver requirements to your Business.
Technical BI Project Lifecycle
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
An Introduction to Dimensional Data Warehouse Design Presented by Joseph J. Sarna Jr. JJS Systems, LLC.
Dimensional Modeling – Part 2
Data Warehousing Design Transparencies
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Lecture 5 CS.456 DATABASE DESIGN.
ETL By Dr. Gabriel.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Business Intelligence
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
Dimensional model. What do we know so far about … FACTS? “What is the process measuring?” Fact types:  Numeric Additive Semi-additive Non-additive (avg,
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
Data Warehousing Concepts, by Dr. Khalil 1 Data Warehousing Design Dr. Awad Khalil Computer Science Department AUC.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
Data Warehouse design models in higher education courses Patrizia Poščić, Associate Professor Danijela Subotić, Teaching Assistant.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
1 Data Warehouses BUAD/American University Data Warehouses.
Building the cube – Chapter 9 & 10 Let’s be over with it.
BI Terminologies.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Dimensional Modeling Primer Chapter 1 Kimball & Ross.
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
Dimensional Modelling
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
UNIT-II Principles of dimensional modeling
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Creating the Dimensional Model
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Data Warehousing (Kimball, Ch.5-12) Dr. Vairam Arunachalam School of Accountancy, MU.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
Data warehousing theory and modelling techniques Graduate course on dimensional modelling.
Chapter 16. Insurance 서울시립대학교 인공지능 연구실 G 조찬연 The Data Warehouse Toolkit 1 /35.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Event Title Event Date. Module 02—Introduction to Dimensional Modeling Techniques Name Title Microsoft Corporation.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
Data warehouse and OLAP
Components of A Successful Data Warehouse
Data Warehouse.
Applying Data Warehouse Techniques
Overview and Fundamentals
Dimensional Model January 14, 2003
CMPE 226 Database Systems April 11 Class Meeting
Applying Data Warehouse Techniques
Retail Sales is used to illustrate a first dimensional model
Applying Data Warehouse Techniques
Data warehouse architecture CIF, DM Bus Matrix Star schema
Retail Sales is used to illustrate a first dimensional model
Applying Data Warehouse Techniques
Applying Data Warehouse Techniques
Dmytro Polishchuk BI Developer DB Best Technologies
Presentation transcript:

Tips and Tricks for Dimensional Modeling By Shawn Jackson

Overview Set of techniques and concepts used in data warehouse design Intended to support end-user queries and is oriented around understandability and performance Uses the concepts of facts (measures) and dimensions (context) Facts are typically (but not always) numerical values that can be aggregated Dimensions are groups of hierarchies and descriptors that define the facts

Star Schema

Snowflake Schema

Kimball University: 10 Essential Rules of Dimensional Modeling (#1-5) Load detailed atomic data into dimensional structures Store data at the lowest grain Use summary tables/views to improve performance as necessary Structure dimensional models around business processes Fact tables should be based on a business event Complement single process fact tables with consolidated fact tables that combine metrics from multiple processes at the same level of detail Ensure that every fact table has an associated date dimension table Ensure that all facts in a single fact table are at the same grain or level of detail Resolve many-to-many relationships in fact tables

Kimball University: 10 Essential Rules of Dimensional Modeling (#6-10) Resolve many-to-one relationships in dimension tables Store report labels and filter domain values in dimension tables Don’t store codes and descriptions in the fact table Make sure the full description of the code is in the dimension table Make certain that dimension tables use a surrogate key Create conformed dimensions to integrate data across the enterprise Date dimension is a common example Single version of the truth Continuously balance requirements and realities to deliver a DW/BI solution that's accepted by business users and that supports their decision-making

Slowly Changing Dimensions Type 0 Type 1 Type 2 Type 3 Type 4 Type 6

SCD Type 0 Rows are added but never changed Missing true business / natural key Typically are only used in derived dimensions Type 0 attributes are more common Supplier Key Name 123 Acme Supply Co 124 Acme Supply Company

SCD Type 1 Rows can be updated or added based upon business key Historical information is not tracked Supplier_Key Supplier_Code Supplier_Name Supplier_State 123 ABC Acme Suply Co CA Supplier_Key Supplier_Code Supplier_Name Supplier_State 123 ABC Acme Supply Co CA Supplier_Key Supplier_Code Supplier_Name Supplier_State 123 ABC Acme Supply Co IL

SCD Type 2 Rows are only added A version number or effective dates are used to keep track of history Supplier Key Code Name State Start Date End 123 ABC Acme Supply Co CA 01-Jan-2000 21-Dec-2004 124 IL 22-Dec-2004

SCD Type 3 Rows are updated but not added Historical information is preserved through extra columns Supplier Key Code Name Original / Prior Supplier State Effective Date Current State 123 ABC Acme Supply Co CA 22-Dec-2004 IL

SCD Type 4 Combination of type 1 and type 2 dimensions Rows are updated in the type 1 table and added in the type 2 table Supplier Supplier_key Supplier_Code Supplier_Name Supplier_State 123 ABC Acme Supply Co IL Supplier History Supplier HistKey Supplier Key Code Name State Start Date End 1001 123 ABC Acme Supply Co CA 01-Jan-2000 21-Dec-2004 1002 IL 22-Dec-2004

SCD Type 6 / hybrid Combines type 1, 2 and 3 in one table Supplier Key Code Name Current State Prior Start Date End Flag 123 ABC Acme Supply Co NY CA 01-Jan-2000 21-Dec-2004 N 124 IL 22-Dec-2004 03-Feb-2008 125 04-Feb-2008 Y

Roleplaying Dimensions Recycled for multiple applications within the same database Date dimension is commonly used (sale date, delivery date) Can be used to get different views of data

Roleplaying Example

Factless Fact Tables Tracking events Many to many joins