1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Dimensional Modeling II Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.

Slides:



Advertisements
Similar presentations
Information Systems Today: Managing in the Digital World
Advertisements

Dimensional Modeling.
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Cognos 8 Training Session
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
MIS 451 Building Business Intelligence Systems
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Data Warehousing M R BRAHMAM.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Decision Support and Data Warehouse. Decision supports Systems Components Data management function –Data warehouse Model management function –Analytical.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Dimensional Modeling – Part 2
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Data Staging Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Dimensional Modeling I Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Physical Data Warehouse Design Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) The Data Warehouse Lifecycle Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Dimensional Modeling VI Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Data Warehousing (Kimball, Ch.2-4) Dr. Vairam Arunachalam School of Accountancy, MU.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of Business Dimensional.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Data Warehouse Toolkit Introduction. Data Warehouse Bill Inmon's paradigm: Data warehouse is one part of the overall business intelligence system. An.
Principles of Dimensional Modeling
Lecture 5 CS.456 DATABASE DESIGN.
Data warehousing theory and modelling techniques Building Dimensional Models.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
DWH – Dimesional Modeling PDT Genči. 2 Outline Requirement gathering Fact and Dimension table Star schema Inside dimension table Inside fact table STAR.
Dimensional model. What do we know so far about … FACTS? “What is the process measuring?” Fact types:  Numeric Additive Semi-additive Non-additive (avg,
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
1 Data Warehouses BUAD/American University Data Warehouses.
Bus Architecture. Value Chain Identifies the natural logical flow of an organization’s primary activities Operational source systems produce snapshots.
BI Terminologies.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Normalized model vs dimensional model
Basic Model: Retail Grocery Store
The University of Akron Dept of Business Technology Computer Information Systems The Relational Model: Concepts 2440: 180 Database Concepts Instructor:
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
UNIT-II Principles of dimensional modeling
Creating the Dimensional Model
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Fact Table The fact table stores business events. The attributes explain the conditions of the entity at the time the business event happened.
MIS 451 Building Business Intelligence Systems Logical Design (1)
Data Warehousing (Kimball, Ch.5-12) Dr. Vairam Arunachalam School of Accountancy, MU.
Advanced Data Modeling. Heterogeneous Mapping Heterogeneous Mapping is the ability of MSTR7 tools to join on unlike column names. Heterogeneous Mapping.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Data Warehousing.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Warehousing DSCI 4103 Dr. Mennecke Chapter 2.
Last Updated : 26th may 2003 Center of Excellence Data Warehousing Introductionto Data Modeling.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Decision Support System ISYS 363. Decision supports Systems Components Data management function –Data warehouse Model management function –Analytical.
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
Information Systems Today: Managing in the Digital World
Lecture-34 DWH Implementation: Goal Driven Approach (2)
Data warehouse and OLAP
Data Warehouse.
Star Schema.
Inventory is used to illustrate:
Retail Sales is used to illustrate a first dimensional model
Retail Sales is used to illustrate a first dimensional model
Dimensional Model January 16, 2003
DWH – Dimesional Modeling
Presentation transcript:

1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Dimensional Modeling II Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of Business

2 Technical Architecture Design Product Selection & Installation End-User Application Specification End-User Application Development The Business Dimensional Lifecycle Project Planning Business Requirement Definition Business Requirement Definition Deployment Maintenance and Growth Project Management Dimensional Modeling Physical Design Data Staging Design & Development

3 Outline Table structure, types, characteristics and terminology Design steps Dimensional models with varying types of fact and dimension tables

4 Types of Facts Transactional facts (transactions or line items in transactions) Snapshots Factless facts

5 Types of Dimensions Role playing dimensions Heterogeneous dimensions Slowly changing dimensions Large dimensions Many-to-many dimensions

6 Keys and Attributes Primary key - a column whose value uniquely identifies each row (record) in the table. Attributes – columns in a table that are not designated as the primary key. Foreign key – a non-primary-key attribute for a table that corresponds to a primary key of another table.

7 Attributes in DW tables Dimension Table –One Primary Key –Dimension Attributes Fact table –Primary key --- A collection of primary keys from all its associated dimension tables All warehouse keys in fact table are foreign keys referring to its associated dimension tables All/part of warehouse keys in fact table form the primary key of fact table –Fact Attributes

8 Attributes in DW tables Data warehouse keys generated by the system

9 Keys and Grain Keys –Primary or natural keys (from source systems) –Warehouse or synthetic keys (generated by a data warehouse tool) Grain –The level of detail of fact measures described in the DW, e.g., sales transactions from order line items by order date, product and customer

10 Single-Fact-Table Data Warehouse Design Decisions 1.The business questions in focus and source information systems* 2.The grain of the fact table 3.The dimensions tables and keys 4.The fact attributes and dimension attributes *All DW attributes must be mapped to or derived from source attributes

11 Single-Fact-Table Data Warehouse Design Decisions 1.The business questions in focus and source information systems 2.The grain of the fact table 3.The dimensions tables and keys 4.The fact attributes and dimension attributes

12 Sample Business Questions Report Sales in terms of – (total) amt, (total) qty and (avg.) price Report Sales by PRODUCT name and/or category name Report Sales by CUSTOMER name, city and/or or state Report Sales by ORDER date, month, year, holiday, special event or other time constraints Report using a combination of the measures and constraints

13 Relational Schema of B.com B2B System Orders ( Order_No, SID, BID, CID, Order_date) OrderLine (Order_No, Line_ID, PID, Actual_Del_Date, Target_Del_Date, Arrival_Date, Shipping_Fee, Tax, Quantity, Unit_Price,Defect_on_arrival) Delivery ( SID, CID, Unit_shipping_fee, UNIT_DEL_TIME) Contract ( CID, Contract_Name, Payment_term, Payment_num) Payment ( PaymentID, OrderNO, Pay_Amount, Date)

14 Relational Schema of B.com B2B System Category ( CAT_ID, CAT_Name) Product ( PID, CAT_ID, P_Weight, P_Life, P_Name) Supplier ( SID, S_Name, S_City, S_State, S_Country) Product_Supply ( PID, SID, Unit_Price, Quantity_in_Stock, Production_in_Week) Buyer ( BID, B_Name, CityID, B_Type) Buyer_City ( CityID, C_Name, C_State, C_Country, C_Tax)

15 Single-Fact-Table Data Warehouse Design Decisions 1.The business questions in focus and source information systems 2.The grain of the fact table 3.The dimensions tables and keys 4.The fact attributes and dimension attributes

16 Grain of the Fact Table  Type of fact table: transactional facts  Potential grains: order or orderline  Constraints: order date, product, customer  Grain: sales from orderline (by order date, product, and customer)

17 Single-Fact-Table Data Warehouse Design Decisions 1.The business questions in focus and source information systems 2.The grain of the fact table 3.The dimensions tables and keys 4.The fact attributes and dimension attributes

18 Dimension Tables and Keys Key dimension tables jointly make up the primary key for a fact table

19 Single-Fact-Table Data Warehouse Design Decisions 1.The business questions in focus and source information systems 2.The grain of the fact table 3.The dimensions tables and keys 4.The fact attributes and dimension attributes

20 Determine Fact Attributes

21 Types of Fact Attributes Additive fact attributes can be added along any dimension.

22 Types of Fact Attributes Non-additive fact attributes cannot be added along any dimension.

23 Types of Fact Attributes Semi-additive fact attributes can be added along some dimensions.

24 Time Dimension Data warehouse needs an explicit time dimension table instead of just a time attribute (e.g, ORDERDATE). Save computation effort and improve query performance Complex queries regarding calendar calculation are hidden from end users of data warehouse.

25 Time Dimension Besides the time attribute, time dimension table includes the following additional attributes: –Day_of_week (1-7); Day_number_in_month (1- 31); –Day_number_in_year (1-365) –Week_number (1-52); month (1-12), Quarter (1- 4) –Holiday_flag (y/n) –Fiscal_quarter, Fiscal_year

26 Determine Dimension Attributes

27 Avoid Snowflake Designs

28 Avoid Snowflake Design Snowflake structure

29 Avoid Snowflake Schemas Tradeoff of avoiding snowflake –Advantage: improve query performance and easy of understanding –Disadvantage: require more storage space