Lecture 5 CS.456 DATABASE DESIGN.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Dimensional Modeling.
Tips and Tricks for Dimensional Modeling
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Alternative Database topology: The star schema
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Exploiting the DW data DW is a platform for creating a wide array of reports It solves data feed problems, but does not lead to specific decision support.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Dimensional Modeling II Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
Telecommunication Case Study CS 543 – Data Warehousing.
Database – Part 2a Dr. V.T. Raja Oregon State University.
Principles of Dimensional Modeling
Data warehousing theory and modelling techniques Building Dimensional Models.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
DWH – Dimesional Modeling PDT Genči. 2 Outline Requirement gathering Fact and Dimension table Star schema Inside dimension table Inside fact table STAR.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Business Intelligence Process Grain of the Fact Table Dr. Chang Liu
Hachim Haddouti, adv. DBMS & DW CSC5301, Ch4 Adv. DBMS & DW CH 4 Hachim Haddouti.
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Relational Databases Database Driven Applications Retrieving Data Changing Data Analysing Data What is a DBMS An application that holds the data manages.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
Bus Architecture. Value Chain Identifies the natural logical flow of an organization’s primary activities Operational source systems produce snapshots.
Customer Order Order Number Date Cust ID Last Name First Name State Amount Tax Rate Product 1 ID Product 1 Description Product 1 Quantity Product 2 ID.
BI Terminologies.
System Design System Design - Mr. Ahmad Al-Ghoul System Analysis and Design.
Slide 1 Chapter 03 The Relational Model and Normalization.
1 Database & DBMS The data that goes into transaction processing systems (TPS), also goes to a database to be stored and processed later by decision support.
Normalized model vs dimensional model
MIS2502: Data Analytics Dimensional Data Modeling
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
UNIT-II Principles of dimensional modeling
Quiz questions. 1 A data structure that is made up of fields and records? Table.
Creating the Dimensional Model
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Fact Table The fact table stores business events. The attributes explain the conditions of the entity at the time the business event happened.
Advanced Data Modeling. Heterogeneous Mapping Heterogeneous Mapping is the ability of MSTR7 tools to join on unlike column names. Heterogeneous Mapping.
ISAM 5931: Data Warehousing & Data Mining Group Project submitted by : Mudassar Hakim & Gaurav Wadhwani.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
DO NOT COPY --CONFIDENTIAL Homework 6b Partial Key Generating a Fact table for Henry Books BCIS 4660 Dr. Nick Evangelopoulos Spring 2012.
* Database is a group of related objects * Objects can be Tables, Forms, Queries or Reports * All data reside in Tables * A Row in a Table is a record.
Howard Paul. Sequential Access Index Files and Data File Random Access.
Houston Petroleum Valve Company Data-Mining Project Data Modeling Phase Fouad Alibrahim Mohammad H. Monakes University of Houston Clear Lake University.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
Order Database – ER Diagram
PRINCIPLES OF DIMENSIONAL MODELING
Star Schema.
Assignment 2 Due Thursday Feb 9, 2006
Dimensional Model January 14, 2003
Retail Sales is used to illustrate a first dimensional model
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Relational Data Modeling
Applying Data Warehouse Techniques
Assignment 2 Due Thursday Feb 9, 2006
Retail Sales is used to illustrate a first dimensional model
Applying Data Warehouse Techniques
Dimensional Modeling.
Retail Sales is used to illustrate a first dimensional model
Dimensional Model January 16, 2003
DWH – Dimesional Modeling
Applying Data Warehouse Techniques
Examines blended and separate transaction schemas
Applying Data Warehouse Techniques
Review of Major Points Star schema Slowly changing dimensions Keys
Page 37 Figure 2.3, with attributes excluded
Presentation transcript:

Lecture 5 CS.456 DATABASE DESIGN

Characteristics related to Fact tables Concatenated key Data Grain Fully Additive Measures Semi additive Measures Table Deep, Not Wide Sparse Data Degenerate Dimensions.

Concatenated Keys in Fact table In fact table we keep measurements. A row in fact table relates to combination of rows from all multiple dimension tables. A row in fact table must be identified using primary keys of all dimension tables. Primary key of fact table must be concatenation of primary keys of all dimension tables.

Concatenated Keys in Fact table Example: Record of “Sales Analysis” fact table should contain: Primary key value from Product table (Product_ID). Primary key value from Time table (Date). Primary key value from Customer table (Customer_ID). Primary key value from Sales Representative table (Sales_Rep_ID). Sales_Analysis_ID is key attribute of fact table which should store a concatenated value of all primary key attributes from dimension tables.

Data Grain Level of detail for measurements/metrics in a fact table. Quantity Ordered for a Particular product Certain date Specific customer Sales representative Data to be stored may be at different detail levels.

Table Deep not Wide Fact table contains fewer attributes than dimension table. Number of records in a fact table is very large than dimension table. 3 products 5 customers 30 days 10 sales representatives Number of rows in fact table will be 4500 Much larger than dimension table

Sparse Data Some rows of fact table will not contain values for some dimension values. Value of order totals for calendar dates on holiday will not contain any value. There can be such data in which there will be more null values in rows of fact table.

Degenerate Dimensions Some attributes of operational database are not related to either fact measures or dimension values. But they are required to be stored in data warehouse. Such attributes are called degenerate dimensions. Example: order_no, order_line_no, invoice_no Average number of products per order Products are required to be related to particular order numbers.

Star Schema Keys Primary Key Don’t use primary keys of operational database tables as keys of dimension tables. Product_code in operational systems is in following format: Store warehouse code||product id Data warehouse contains historic data If product’s store warehouse is changed after some time Using product_code for dimension table will cause problem.

Star Schema Keys Avoid built-in meanings in primary keys of dimension table. Customer_code representing a customer associated with a company Use Surrogate Keys Surrogate keys are system generated keys/sequence numbers. Keep keys of operational system as additional attributes in dimension tables.

Star Schema Keys Foreign keys Primary Keys of Fact table: Each dimension table has a one to many relationship with central fact table. Primary key of a dimension table must be a foreign key in fact table. Primary Keys of Fact table: A single compound key whose length is total length of all dimension tables. A key should be concatenation of all primary keys of dimension tables. A generated primary key independent of keys of dimension tables.

DW Design Steps 1. Choose the business process(es) to model Sales 2. Choose the granularity of the business process Items by Store by Promotion by Day Low granularity is needed 3. Choose the dimensions Time, Store, … 4. Choose the measures Dollar_sales, unit_sales, dollar_cost, customer_count

Select Lower(‘SQL Course’) ,Upper(‘sql course’) from dual;