June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)

Slides:



Advertisements
Similar presentations
Author: Graeme C. Simsion and Graham C. Witt Chapter 11 Logical Database Design.
Advertisements

Dimensional Modeling.
Tips and Tricks for Dimensional Modeling
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Cognos 8 Training Session
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
Business Information Warehouse Business Information Warehouse.
James Serra – Data Warehouse/BI/MDM Architect
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
An Introduction to Dimensional Data Warehouse Design Presented by Joseph J. Sarna Jr. JJS Systems, LLC.
Dimensional Modeling – Part 2
How to build your own… Super Model Dimensional Modelling for Analysis Services Darren Gosbell Principal Consultant - James & Monroe
Data Warehousing Design Transparencies
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Dimensional Modeling II Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Chapter 13 The Data Warehouse
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Tanvi Madgavkar CSE 7330 FALL Ralph Kimball states that : A data warehouse is a copy of transaction data specifically structured for query and analysis.
Principles of Dimensional Modeling
Lecture 5 CS.456 DATABASE DESIGN.
ETL Design and Development Michael A. Fudge, Jr.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Business Intelligence
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
Dimensional model. What do we know so far about … FACTS? “What is the process measuring?” Fact types:  Numeric Additive Semi-additive Non-additive (avg,
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
Data Warehousing Concepts, by Dr. Khalil 1 Data Warehousing Design Dr. Awad Khalil Computer Science Department AUC.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Best Practices in Higher Education Student Data Warehousing Forum Northwestern University October 21-22, 2003 FIRST QUESTIONS Emily Thomas Stony Brook.
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
Physical Design Michael A. Fudge, Jr.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Data Warehousing (Kimball, Ch.5-12) Dr. Vairam Arunachalam School of Accountancy, MU.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Data Warehouse/Data Mart It’s all about the data.
DATA WAREHOUSING TECHNIQUES ROUNDTABLE Kathy Bronson Trevyn Bowden Clackamas Communtiy College 7/2016 Information Technology Forest Grove, Oregon NWEUG.
Advanced Applied IT for Business 2
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Applying Data Warehouse Techniques
Overview and Fundamentals
Inventory is used to illustrate:
CMPE 226 Database Systems April 11 Class Meeting
Data Warehouse and OLAP
Applying Data Warehouse Techniques
An Introduction to Data Warehousing
Data Warehousing: Data Models and OLAP operations
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Applying Data Warehouse Techniques
Data warehouse architecture CIF, DM Bus Matrix Star schema
Retail Sales is used to illustrate a first dimensional model
Data Warehouse.
Applying Data Warehouse Techniques
Review of Major Points Star schema Slowly changing dimensions Keys
Applying Data Warehouse Techniques
Data Warehouse and OLAP
Review of Major Points Star schema Slowly changing dimensions Keys
Presentation transcript:

June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)

Agenda What is DW? DW architecture How to design a DW? Advantages of DW Disadvantages of DW Questions and Answers

Agenda What is DW? DW architecture How to design a DW? Advantages of DW Disadvantages of DW Questions and Answers

What is DW? Be a relational database organized to hold information in a structure that best supports reporting and analysis Subject oriented Time variant Non-volatile: stable Integrated: from multi source

Agenda What is DW? DW architecture How to design a DW? Advantages of DW Disadvantages of DW Questions and Answers

DW architecture

DW architecture (cont)

Data marts

DW architecture (cont) I.Slowly changing dimensions (SCD) II.Dimension (table) types III.Fact (table) types IV.Scheme types

DW architecture (cont) I.Slowly changing dimensions (SCD) – There are 6 types of SCD, they are type 0,1,2,3,4,6 – The most common SCDs are Types 1, 2, and 3

DW architecture (cont) I.1. SCD Type 0 – Do not manage a slowly changing dimension. – Values remain as they were at the time the dimension record was first entered. – Surrogate key is not required. – Be easy to maintain.

DW architecture (cont) I.2. SCD Type 1 – Do not track historical data at all. – Overwrite old data with new data. – Surrogate key is not required. – Be easy to maintain.

DW architecture (cont) I.3. SCD Type 2 – Create multiple records for a given natural key with separate surrogate key – New record is inserted each time a change is made – Should not be used if dimensional model is subject to change

DW architecture (cont) I.4. SCD Type 3 – It tracks changes using separate columns – Type 3 has limited history preservation depending on the number of columns designated for storing historical data

DW architecture (cont) I.5. SCD Type 4 – There are two tables storing data of a dimension One table keeps the current data Additional table is used to keep a record of changes

DW architecture (cont) I.6. SCD Type 6 (Hybrid) – Type 6 combines the approaches of types 1, 2 and 3 (1+2+3=6)

DW architecture (cont) II. Dimensions Types – Conformed dimension – Junk dimension – Degenerate dimension – Role-playing dimension

DW architecture (cont) II.1. Conformed dimension – Be a dimension table which can be shared by multiple fact tables – Have a single meaning or content through data warehouse

DW architecture (cont) II.2. Junk dimension – Be a collection of random/miscellaneous attributes that are unrelated to any particular dimension and don't fit into tight star scheme to: Reduce number of dimension tables Decrease number of columns in fact tables

DW architecture (cont) II.3 Degenerate dimension – Any values in the fact table that don’t join to dimensions are either considered degenerate dimensions or measures – Degenerate dimension can be rolled up from Facts within SSAS

DW architecture (cont) II.4. Role-playing dimension – Is a dimension which can play different roles in a fact table For example Date dimension which can be used for Order date, Shipping date, Due date...

DW architecture (cont) III. Fact Types – Consist of the measurements, metrics or facts of business process (the one contains no measures or facts is factless fact table) – Measure types: Additive type: can be summed up through all of the dimensions Semi Additive type: can be summed up for some of the dimensions Non Additive type: can not be summed up for any of the dimensions – Fact data can be at detail level or aggregated – Fact types: Transactional: the most detailed data held in the table Periodic snapshots: it takes a 'picture of the moment – period of time' of data from the transactional table Accumulating snapshots: store the activity of a process from beginning to end

DW architecture (cont) IV. Scheme Types – Start Scheme – SnowFlake Scheme

DW architecture (cont) IV.1. Star schema Design

DW architecture (cont) IV.1. Advantages of Star schema – be very easy to understand, even for non technical business managers – provide better performance and smaller query times – be easily extensible and will handle future changes easily

DW architecture (cont) IV.2. Snowflake schema Design

DW architecture (cont) IV.2. Advantages of Snowflake schema – Save storage space – Reduce processing time in some cases

Agenda What is DW? DW architecture How to design a DW? Advantages of DW Disadvantages of DW Questions and Answers

How to design a DW? Analyze requirements*** Identify the Grain Identify the Dimensions Identify the Facts

How to design a DW ? (cont) Recommendation – Use surrogate key instead of natural key, PK is required in SSAS – Should not create foreign key with check option – Avoid NULL value of surrogate key on fact table Should assign a special surrogate key – Pay much consider on many-to-many relationship

How to design a DW ? (cont) Demo Star Scheme Demo SnowFlake Scheme

Agenda What is DW? DW architecture How to design a DW? Advantages of DW Disadvantages of DW Questions and Answers

Advantage of DW Enhances end-user access to a wide variety of data. Increases data consistency. Increases productivity and decreases computing costs.

Advantage of DW (cont) Be able to combine data from different sources, in one place. Can record historical information for data source tables that are not set up to save an update history.

Agenda What is DW? DW architecture How to design a DW? Advantages of DW Disadvantages of DW Questions and Answers

Disadvantage of DW Extracting, cleaning and loading data could be time consuming. Data warehousing project scope might increase. Problems with compatibility with systems already in place e.g. transaction processing system.

Disadvantage of DW (cont) Providing training to end-users, who end up not using the data warehouse. Security could develop into a serious issue, especially if the data warehouse is web accessible. A data warehouse is a HIGH maintenance system.

Agenda What is DW? DW architecture How to design a DW? Advantage of DW Disadvantage of DW Questions and Answers

QA Thank you for your attendant

Reference The Data Warehouse Toolkit book by Ralph Kimball Building the Data Warehouse book by W. H. Inmon Data Warehouse Design Solutions by Christopher Adamson and Michael Venerable OLAP-and-data-warehouse-design-concepts/a32p1