3 Copyright © 2006, Oracle. All rights reserved. Business, Logical, and Dimensional Modeling.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Dimensional Modeling.
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Chapter 10: Designing Databases
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Data Warehousing M R BRAHMAM.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Designing the data warehouse / data marts Methodologies and Techniques.
Exploiting the DW data DW is a platform for creating a wide array of reports It solves data feed problems, but does not lead to specific decision support.
Modeling the Data Warehouse Chapter 7. Data Warehouse Database Design Phases zDefining the business model (conceptual model) zCreating the dimensional.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
Principles of Dimensional Modeling
Data warehousing theory and modelling techniques Building Dimensional Models.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
DWH – Dimesional Modeling PDT Genči. 2 Outline Requirement gathering Fact and Dimension table Star schema Inside dimension table Inside fact table STAR.
Datawarehousing Concepts | 7.0 9/7/2015 Datawarehousing Concepts.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Systems analysis and design, 6th edition Dennis, wixom, and roth
Data Warehouse Chapter 11. Multiple Files Problem Added complexity of multiple source files Start simple Multiple Source files Extracted data Logic to.
Multi-Dimensional Databases & Online Analytical Processing This presentation uses some materials from: “ An Introduction to Multidimensional Database Technology,
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
OnLine Analytical Processing (OLAP)
Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009.
ISQS 3358, Business Intelligence Dimensional Modeling Zhangxi Lin Texas Tech University 1 1.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
BI Terminologies.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
DIMENSIONAL MODELING MIS2502 Data Analytics. So we know… Relational databases are good for storing transactional data But bad for analytical data What.
Basic Model: Retail Grocery Store
3 Defining the Business and Logical Models. Designing the Conceptual Model Phase I: Functional and Nonfunctional Requirements.
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
UNIT-II Principles of dimensional modeling
Shilpa Seth.  Multidimensional Data Model Concepts Multidimensional Data Model Concepts  Data Cube Data Cube  Data warehouse Schemas Data warehouse.
Creating the Dimensional Model
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
ISAM 5931: Data Warehousing & Data Mining Group Project submitted by : Mudassar Hakim & Gaurav Wadhwani.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Data Warehousing DSCI 4103 Dr. Mennecke Chapter 2.
ITEC 3220M Using and Designing Database Systems Instructor: Prof. Z.Yang Course Website: c3220m.htm Office: TEL.
Last Updated : 26th may 2003 Center of Excellence Data Warehousing Introductionto Data Modeling.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
4 Copyright © Oracle Corporation, All rights reserved. Modeling the Data Warehouse.
1 Copyright © 2006, Oracle. All rights reserved. Defining OLAP Concepts.
Event Title Event Date. Module 02—Introduction to Dimensional Modeling Techniques Name Title Microsoft Corporation.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
3 Copyright © 2006, Oracle. All rights reserved. Building an Analytic Workspace.
7 Copyright © 2006, Oracle. All rights reserved. Defining a Relational Dimensional Model.
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
9 Copyright © 2006, Oracle. All rights reserved. Summary Management.
ISQS 3358, Business Intelligence Dimensional Modeling
Data warehouse and OLAP
Dimensional Model January 14, 2003
Retail Sales is used to illustrate a first dimensional model
ISQS 3358, Business Intelligence Dimensional Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Retail Sales is used to illustrate a first dimensional model
Dimensional Modeling.
MIS2502: Data Analytics Dimensional Data Modeling
Retail Sales is used to illustrate a first dimensional model
Dimensional Model January 16, 2003
DWH – Dimesional Modeling
Data Warehousing.
Presentation transcript:

3 Copyright © 2006, Oracle. All rights reserved. Business, Logical, and Dimensional Modeling

Copyright © 2006, Oracle. All rights reserved Objectives After completing this lesson, you should be able to do the following: Discuss data warehouse environment data structures Discuss data warehouse database design phases: –Defining the business model –Defining the logical model –Defining the dimensional model

Copyright © 2006, Oracle. All rights reserved Data Warehouse Modeling Issues Among the main issues that data warehouse data modelers face are: Different data types Many ways to use warehouse data Many ways to structure the data Multiple modeling techniques Planned replication Large volumes of data

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Data Warehouse Environment Data Structures The data modeling structures that are commonly found in a data warehouse environment are: Third normal form (3NF) Star schema Snowflake schema

Copyright © 2006, Oracle. All rights reserved Star Schema Model Product Table Product_id Product_disc,... Time Table Day_id Month_id Year_id,... Sales Fact Table Product_id Store_id Item_id Day_id Sales_amount Sales_units,... Item Table Item_id Item_desc,... Store Table Store_id District_id,... Central fact table Denormalized dimensions

Copyright © 2006, Oracle. All rights reserved Snowflake Schema Model Time Table Week_id Period_id Year_id Dept Table Dept_id Dept_desc Mgr_id Mgr Table Dept_id Mgr_id Mgr_name Product Table Product_id Product_desc Item Table Item_id Item_desc Dept_id Sales Fact Table Item_id Store_id Product_id Week_id Sales_amount Sales_units Store Table Store_id Store_desc District_id District Table District_id District_desc

Copyright © 2006, Oracle. All rights reserved Snowflake Schema Model Can be used directly by some tools Is more flexible to change Provides for speedier data loading Can become large and unmanageable Degrades query performance Has more complex metadata CountryStateCountyCity

Copyright © 2006, Oracle. All rights reserved Data Warehouse Design Phases Phase 1: –Defining the business models Phase 2: –Defining the logical model Phase 3: –Defining the dimensional model Phase 4: –Defining the physical model

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Phase 1: Defining the Business Model Performing strategic analysis Creating the business model Documenting metadata

Copyright © 2006, Oracle. All rights reserved Performing Strategic Analysis Identify crucial business processes. Understand business processes. Prioritize and select the business processes to implement. Business benefit LowHigh Low High Feasibility

Copyright © 2006, Oracle. All rights reserved Creating the Business Model Defining business requirements Determining granularity Documenting metadata

Copyright © 2006, Oracle. All rights reserved Existing metadataProduction ERD model Interviews to collect business requirements Research Business Requirements Drive the Design Process Primary input Secondary input

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Using a Business Process Matrix Sample of business process matrix Promotions Channels Products Times (Date) Inventory Customers ReturnsSales Business Processes Business Dimensions

Copyright © 2006, Oracle. All rights reserved Identifying Business Measures and Dimensions The attribute is perceived as constant or discrete: Products Promotions Customers Countries Channels Times The attribute varies continuously: Sales Quantity sold Units sold Cost Measures Dimensions

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Determining Granularity YEAR? QUARTER? MONTH? WEEK? DAY? TIMES Product category? Product subcategory? Product name? Product desc? Product item? PRODUCTS

Copyright © 2006, Oracle. All rights reserved Identifying Business Definitions and Rules Credit Rating Meaning A+ 0 bad checks or bank credit failures A 1 bad check or bank credit failures B 2 bad checks or bank credit failures C3 or more bad checks or bank credit failures Customer Rule 1 A customer with a credit rating of A or above will receive a 10% discount on any order totaling $500 (U.S.) or more. Rule 2 A customer with a credit rating of A or above will receive a 5% discount on any order totaling $250 (U.S.) but less than $500. … Rule 5 A customer with a credit rating of C will not receive any discounts on purchases. Order …

Copyright © 2006, Oracle. All rights reserved Documenting Metadata Documenting metadata should include: Documenting the design process Documenting the development process Providing a record of changes Recording enhancements over time

Copyright © 2006, Oracle. All rights reserved Business Metadata Elements Name of the measure Business dimensions –Dimension attributes Sample data Business definition and rules

Copyright © 2006, Oracle. All rights reserved Metadata Documentation Approaches Automated –Data modeling tools –ETL tools –End-user tools Manual

Copyright © 2006, Oracle. All rights reserved Phase 2: Designing the Logical Model Entity Relationship Modeling (ERM) uses entity relationship diagram (ERD): Each CUSTOMER belongs to one COUNTRY. Each COUNTRY can have many CUSTOMERS. Customers Cust_name Country_Id Cust_Addr … Countries Country_id Name Region ISO_Code … Belongs to have Entity Attributes Relationship

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Phase 3: Defining the Dimensional Model Identify fact tables: –Translate business measures into fact tables. –Analyze source system information for additional measures. Identify dimension tables. Link fact tables to the dimension tables. Model the time dimension.

Copyright © 2006, Oracle. All rights reserved Star Dimensional Modeling

Copyright © 2006, Oracle. All rights reserved Advantages of Using a Star Dimensional Model Supports multidimensional analysis Creates a design that improves performance Enables optimizers to yield better execution plans Parallels end-user perceptions Provides an extensible design Broadens the choices for data access tools

Copyright © 2006, Oracle. All rights reserved Fact Table Characteristics Fact tables: Contain numerical metrics of the business Hold large volumes of data Grow quickly Can contain base, derived, and summarized data Are typically additive Are joined to dimension tables through foreign keys that reference primary keys in the dimension tables What are factless fact tables? Sales (Fact Table) PROD_ID CUST_ID TIME_ID CHANNEL_ID PROMO_ID QUANTITY_SOLD AMOUNT_SOLD...

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved More on Factless Fact Tables Emp_FK Sal_FK Age_FK Ed_FK Grade_FK Grade dimension Grade_PK Education dimension Ed_PK Employee dimension Emp_PK Salary dimension Sal_PK Age dimension Age_PK

Copyright © 2006, Oracle. All rights reserved Identifying Base and Derived Measures Business Measures Quantity sold Amount sold Profit Sales Fact Table Base Derived Business measures Facts (Base, Derived) Quantity sold Amount sold Profit

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Fact Table Measures Fact table measures can be: Nonadditive: Cannot be added along any dimension Semiadditive: Added along some dimensions Additive: Added across all dimensions

Copyright © 2006, Oracle. All rights reserved Dimension Table Characteristics Dimension tables: Contain textual information that represents the attributes of the business Contain relatively static data Are joined to a fact table through a foreign key reference

Copyright © 2006, Oracle. All rights reserved Translating Business Dimensions into Dimension Tables Business dimensions Dimension tables Customers Products Channels Countries Promotions Product_status Prod_List_price Products Dimension Table Prod_Id Prod_desc Prod_Subcategory Prod_name Prod_Category Prod_category_id Prod_Pack_Size … Prod_Status Prod_Weight _class Prod_Category_Desc

Copyright © 2006, Oracle. All rights reserved Slowly Changing Dimensions

Copyright © 2006, Oracle. All rights reserved Slowly Changing Dimension (SCD): An Example Where Product_key is a calculated number stored within the database Business Dimension Product Products: SCD Product weight, Product package size varying over time Product_status Prod_List_price Products Dimension Table Prod_Id (Natural) Prod_desc Prod_Subcategory Prod_name Prod_Category Prod_category_id Prod_Pack_Size … Prod_Status Prod_Weight_class Prod_Category_Desc Id (Surrogate) Prod_Eff_From Prod_Eff_To

Copyright © 2006, Oracle. All rights reserved Types of Database Keys Primary keys (PKs) Foreign keys (FKs) Composite keys Surrogate keys

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Using Time in the Data Warehouse Defining standards for time is critical. Aggregation based on time is complex.

Copyright © 2006, Oracle. All rights reserved Time Dimension Time dimension is critical to the data warehouse. Choose the right granularity for the Time dimension. Where should the element of time be stored? Time dimension Sales fact Current dimension grain Future dimension grain Fiscal month Fiscal year Fiscal quarter Fiscal week Day

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Identify Hierarchies for Dimensions State/Province Region Country City Geography hierarchy Fiscal year Fiscal quarter Fiscal month Fiscal time Fiscal date Calendar year Calendar quarter Calendar month Calendar time Calendar date Multiple time hierarchies

Copyright © 2006, Oracle. All rights reserved State 5State1State 2 Region 2 Country2Country 4 Data Drilling State 4 Group Market Hierarchy Region 1 Country1 State 6State 3 Country3 City1City2

Copyright © 2006, Oracle. All rights reserved Using Data Modeling Tools Tools with a GUI enable definition, modeling, and reporting. Avoid a mix of modeling techniques caused by the following: –Development pressures –Developers with lack of knowledge –No strategy Determine a strategy for data modeling. Write and publish data models formally. Make the data models available electronically.

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved Summary In this lesson, you should have learned about: Data warehouse environment data structures Data warehouse database design phases: –Defining the business model –Defining the logical model –Defining the dimensional model

Copyright © 2006, Oracle. All rights reserved Practice 3-1: Overview This practice covers the following topics: Identifying the facts, measures, hierarchies, and slowly changing dimensions based on the RISD scenario given Exploring viewlet-based demonstrations provided for modeling concepts, and answering the questions in these interactive viewlets

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved

Copyright © 2006, Oracle. All rights reserved