Review of Major Points Star schema Slowly changing dimensions Keys

Slides:



Advertisements
Similar presentations
Dimensional Modeling By Dr. Gabriel.
Advertisements

Dimensional Modeling.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
The Operating Cycle and Merchandising Operations 6.
Accounting for Merchandising Businesses
4.03 Solve Related Mathematical Problems. Opening Cash Fund The opening cash drawer contains the coins and currency for the day’s business The till is.
Dimensional Modeling Business Intelligence Solutions.
1 9 Ch3, Hachim Haddouti Adv. DBS and Data Warehouse CSC5301 Ch3 Hachim Haddouti Hachim Haddouti.
© Ron McFadyen1 Many-to-one-to-many We need information that can only be obtained by accessing two fact tables through a common dimension … drilling across.
12.2 Cash and Trade Discounts
Data Warehousing (Kimball, Ch.2-4) Dr. Vairam Arunachalam School of Accountancy, MU.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
CS 345: Topics in Data Warehousing Thursday, October 7, 2004.
Business Intelligence
Hachim Haddouti, adv. DBMS & DW CSC5301, Ch4 Adv. DBMS & DW CH 4 Hachim Haddouti.
Dimensional model. What do we know so far about … FACTS? “What is the process measuring?” Fact types:  Numeric Additive Semi-additive Non-additive (avg,
FINANCIAL ACCOUNTING Tools for Business Decision-Making KIMMEL  WEYGANDT  KIESO  TRENHOLM  IRVINE CHAPTER 5: Merchandising Operations.
Sept Ron McFadyen1 Section 10.1 Domain Models Domain Model: a visual representation of conceptual classes or real-world objects in a domain.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
INVENTORY CASE STUDY. Introduction Optimized inventory levels in stores can have a major impact on chain profitability: minimize out-of-stocks reduce.
Reports. Report Summary Warehouse Reports Returned Material Serial Numbers Not Found This report list the serial numbers of material returned which were.
Basic Model: Retail Grocery Store
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
UNIT-II Principles of dimensional modeling
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Chapter 24 Stock Handling and Inventory Control Section 24.1 The Stock Handling Process Section 24.2 Inventory Control Section 24.1 The Stock Handling.
4.03 Solve Related Mathematical Problems. Opening Cash Fund The opening cash ________ contains the coins and currency for the day’s business The _____.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 12 Merchandise Purchases and Accounts Payable.
Chapter 5 Inventories and Cost of Goods Sold
4.03 Solve Related Mathematical Problems
View Integration and Implementation Compromises
Merchandising Activities
Data warehouse and OLAP
5 Accounting for Merchandising Operations
Star Schema.
Applying Data Warehouse Techniques
Inventories and Cost of Goods Sold.
Assignment 2 Due Thursday Feb 9, 2006
Overview and Fundamentals
Dimensional Model January 14, 2003
Inventory is used to illustrate:
Retail Sales is used to illustrate a first dimensional model
Ch. 15: Accounting for Purchases and Cash Payments
Applying Data Warehouse Techniques
Typically data is extracted from multiple sources
Welcome Back Glencoe Accounting.
Assignment 2 Due Thursday Feb 9, 2006
Retail Sales is used to illustrate a first dimensional model
Applying Data Warehouse Techniques
Warehouse Architecture
Data warehouse architecture CIF, DM Bus Matrix Star schema
Dimensional Modeling.
Factless Facts: Occurrences of Relationships or Events
Retail Sales is used to illustrate a first dimensional model
Role Playing Dimensions (p )
Dimensional Model January 16, 2003
The Purchasing Process
Applying Data Warehouse Techniques
Aggregate improvement Lost, shrunken, and collapsed Ralph Kimball
Examines blended and separate transaction schemas
Transaction fact table (figure 7.2)
Assignment 1 Due Thursday Jan 19, 2006
Many aggregates can be defined for one base star schema
Applying Data Warehouse Techniques
Review of Major Points Star schema Slowly changing dimensions Keys
Page 37 Figure 2.3, with attributes excluded
Presentation transcript:

Review of Major Points Star schema Slowly changing dimensions Keys Fact & Factless Additive, semi-additive, non-additive Degenerate dimensions Snowflaking Outriggers Extensibility Design process Slowly changing dimensions Type 1, 2, 3, hybrids Data warehouse architecture Bus Matrix Conformed dimensions Conformed facts Fact table types periodic snapshot accumulating snapshot transaction February 2004 91.4904 Ron McFadyen

Fact table normalization Role playing Junk dimension Ch 5 Ideas Fact table normalization Role playing Junk dimension Principles for handling: Units of measure, Currency Header fact allocations to line items Fact table types Lag calculations Additive 1s and 0s February 2004 91.4904 Ron McFadyen

Fact table normalization Consider …(FKs) Order qty Gross dollar amount Order deal discount dollar amount Net order dollar amount See figure 5.2 These facts will be densely populated in the fact table (if not 100%) February 2004 91.4904 Ron McFadyen

Fact table normalization Consider …(FKs) Fact type (FK) measure The concept of normalizing the fact table means that we reduce the fact table to one measure and add a new dimension for fact type (qty, gross, discount, net) No motivation for normalizing this schema (because the facts are densely populated February 2004 91.4904 Ron McFadyen

There is one underlying or base table Each role is seen as a view Role playing A schema exhibits role-playing if a single dimension appears more than once in the schema There is one underlying or base table Each role is seen as a view Date dimension found in every schema Consider a Northwind Dimensional Model with 3 date dimensions: February 2004 91.4904 Ron McFadyen

Role playing Shipper Customer Employee Product Supplier Ordered Date Order facts Quantity Discount amount Gross amount Net amount Employee Product Supplier Ordered Date Requested Date Shipped Date February 2004 91.4904 Ron McFadyen

Junk dimension If, after assigning attributes to fact and dimension tables, there are a number of miscellaneous attributes left over, then if these attributes are lumped into one dimension the dimension is called a junk dimension e.g. (similar to figure 5.5 Key Payment type Payment type group Commission indicator … 1 cash cash commissionable … 2 cash cash Non-commissionable … 3 Discover card credit commissionable … 4 Discover card credit Non-commissionable … 5 Mastercard credit commissionable … … … … … … February 2004 91.4904 Ron McFadyen

A fact references the appropriate combination Junk dimension A fact references the appropriate combination Key Payment type Payment type group Commission indicator … 1 cash cash commissionable … 2 cash cash Non-commissionable … 3 Discover card credit commissionable … 4 Discover card credit Non-commissionable … 5 Mastercard credit commissionable … … … … … … February 2004 91.4904 Ron McFadyen

A global company may handle several currencies Recommendation: Store all facts in a local and standardized currency – this simplifies any analysis that end-users perform Order facts Quantity Discount amount local Discount amount Cdn Gross amount local Gross amount Cdn Net amount local Net amount Cdn Local currency Order facts Quantity Discount amount Gross amount Net amount February 2004 91.4904 Ron McFadyen

Currencies To allow conversions (easily) to any currency, utilize a currency conversion fact table Order facts Quantity Discount amount local Discount amount Cdn Gross amount local Gross amount Cdn Net amount local Net amount Cdn Local currency key Currency conversion facts Date key Source currency key Destination currency key Source/Destination exch rate Destination/Source exch rate February 2004 91.4904 Ron McFadyen

What SQL would report the total gross amount in US dollars? Currencies What SQL would report the total gross amount in US dollars? Order facts Quantity Discount amount local Discount amount Cdn Gross amount local Gross amount Cdn Net amount local Net amount Cdn Local currency key Currency conversion facts Date key Source currency key Destination currency key Source/Destination exch rate Destination/Source exch rate February 2004 91.4904 Ron McFadyen

Allocating facts to a lower granularity fact table e.g. suppose the Order has a shipping charge. Where should that be stored? If we are creating an order line fact table, we could try to allocate the shipping charge to each line item. The allocation formula may not be easily determined. Without an allocation formula, analysts cannot explore the relationship between products and shipping charges Order facts Order shipping charges captured/known at the level of the order Quantity Discount amount Gross amount Net amount Shipping amount formula February 2004 91.4904 Ron McFadyen

Assignment 2 Due Friday Feb 13, 2004 Use DTS to perform an initial load of a Star Schema for a Northwind Dimensional database Some details Create your own copy of Northwind to use as source data Use surrogate keys for each dimension PK of fact table is catenation of all FKs and DDs Sample stored procedure for populating the Date dimension Initial load of fact table may assume only one record in a dimension per natural key Use a view to facilitate loading the fact table with its FKs and facts February 2004 91.4904 Ron McFadyen

Assignment 2 Order (DD) Shipper Customer Employee Product Supplier Order facts Quantity Discount amount Gross amount Net amount Employee Product Supplier Ordered Date Requested Date Shipped Date February 2004 91.4904 Ron McFadyen

Fact tables: Transaction, Periodic Snapshot, Accumulating Snapshot Figure 5.8 Shipment Invoice Line Item Transaction fact table Figure 5.10 Order Fulfillment Accumulating fact table Figure 5.8 One row per invoice line item Figure 5.10 One row per item ordered – I.e. per order line item A row is updated by ETL as the line item progresses through manufacturing, shipping, invoicing, payment Fundamental difference ? February 2004 91.4904 Ron McFadyen

Figure 5.8 Shipment Invoice Line Item Transaction fact table Facts table Invoice date key Requested ship date key Actual ship date key Product key … Quantity shipped Shipment Line Item On-Time Count Shipment Line Item Complete Count Ship Line Item Damage Free Count Satisfaction metrics (see pages 127-128) Additive 1s and 0s What % of shipments to customers are on-time? February 2004 91.4904 Ron McFadyen

Figure 5.10 Order Fulfillment Accumulating fact table Facts table Order day key Backlog date key … Product key Order quantity Shipment quantity Order to manufacturing release lag Many dates Lags could be presented in a view (see page 130) What is this view? February 2004 91.4904 Ron McFadyen