Data warehousing theory and modelling techniques Building Dimensional Models.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
C6 Databases.
Technical BI Project Lifecycle
Copyright © 2015 Pearson Education, Inc. Database Design Chapters 17 and
3/5/2009Computer systems1 Analyzing System Using Data Dictionaries Computer System: 1. Data Dictionary 2. Data Dictionary Categories 3. Creating Data Dictionary.
Dimensional Modeling Business Intelligence Solutions.
Chapter 3 Database Management
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Dimensional Modeling II Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
Telecommunication Case Study CS 543 – Data Warehousing.
Database – Part 2a Dr. V.T. Raja Oregon State University.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Get to know Access 2007 Access does data Access does data. All kinds of data. From customer contacts, billing hours and inventory to diet and exercise.
APPENDIX C DESIGNING DATABASES
Leaving a Metadata Trail Chapter 14. Defining Warehouse Metadata Data about warehouse data and processing Vital to the warehouse Used by everyone Metadata.
ETL Design and Development Michael A. Fudge, Jr.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
ISQS 3358, Business Intelligence Creating Data Marts Zhangxi Lin Texas Tech University 1.
Business Intelligence
Data Governance Data & Metadata Standards Antonio Amorin © 2011.
ระบบฐานข้อมูลขั้นสูง (Advanced Database Systems) Lecturer AJ. Suwan Janin Phone:
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Information Systems Today (©2006 Prentice Hall) 3-1 CS3754 Class Note 12 Summery of Relational Database.
Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
1 Data Warehouses BUAD/American University Data Warehouses.
Bus Architecture. Value Chain Identifies the natural logical flow of an organization’s primary activities Operational source systems produce snapshots.
ISQS 6339, Data Management and Business Intelligence Cubism – Bells and Whistles Zhangxi Lin Texas Tech University 1.
Data Management Console Synonym Editor
Information Systems & Databases 2.2) Organisation methods.
ETL Extract. Design Logical before Physical Have a plan Identify Data source candidates Analyze source systems with data- profiling tools Receive walk-through.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
BI Terminologies.
CIS 210 Systems Analysis and Development Week 6 Part II Designing Databases,
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
1 CSE 2337 Introduction to Data Management Textbook: Chapter 1.
Normalized model vs dimensional model
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
ISQS 3358, Business Intelligence Supplemental Notes on the Term Project Zhangxi Lin Texas Tech University 1.
Organizing Data and Information
Creating the Dimensional Model
Fact Table The fact table stores business events. The attributes explain the conditions of the entity at the time the business event happened.
Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin APPENDIX C DESIGNING DATABASES APPENDIX C DESIGNING DATABASES.
3/6: Data Management, pt. 2 Refresh your memory Relational Data Model
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
Entity Relationship Diagram (ERD). Objectives Define terms related to entity relationship modeling, including entity, entity instance, attribute, relationship.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Lecture 5 Data Model Design Jeffery S. Horsburgh Hydroinformatics Fall 2012 This work was funded by National Science Foundation Grant EPS
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
Intro to MIS – MGS351 Databases and Data Warehouses
ARIS Extension Pack TOGAF April 2016
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
© The McGraw-Hill Companies, All Rights Reserved APPENDIX C DESIGNING DATABASES APPENDIX C DESIGNING DATABASES.
Data Warehouse.
Database Management  .
Databases and Data Warehouses Chapter 3
Dimensional Model January 14, 2003
An Introduction to Data Warehousing
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Dimensional Model January 16, 2003
eSeries Entities By Julie Ladner
Presentation transcript:

Data warehousing theory and modelling techniques Building Dimensional Models

1. Matrix Method for getting started Build the Matrix Use the four-step method 1.2

2. Managing the dimensional Modelling Project. 2.1 Data Warehouse Bus Architecture Matrix Fact Table Diagram Fact Table Detail Dimension Table Detail Steps for the Modelling Team Identifying the sources using a Data Modelling Tool 2.7 Summary SummarySummary

Build the Matrix 1.1 n Table Rows : Data Marts Table Rows Table Rows n Table Columns : Dimensions Table Columns Table Columns ExampleExample Data Warehouse Bus Architecture Matrix Example

1.1 Build the Matrix List the Data Marts (rows) n Example: data marts for a telephone co. –customer billing statements –scheduled service and installation orders –trouble reports –marketing promotions –call detail for a billing perspective –customer inventory –labor and payroll –...

1.1 Build the Matrix List the Dimensions (columns) n Example: Dimensions for the telephone co. in data mart: Customer billing statement –Time (date of billing) –Customer –Service –Rate category –Local service provider –...

Data Warehouse Bus Architecture Matrix

Use the four-step Method to design Each Fact Table 1.2 n Step 1. Choose the Data Mart n Step 2. Declare the Grain n Step 3. Choose the Dimensions n Step 4. Choose the Facts Example: Telephone co. Example: Telephone co.Example:

Four-steps e.g.Telephone co. n 1. Data Mart: Customer billing n 2. Grain: the individual line item on each monthly customer bill n 3. Dimensions: Time, Customer, Service, Promotion n 4. Facts: Line item amount, Line item quantity

Fact Table Diagram 2.2 n The fact table Diagram: –Names the fact table –Clearly states its grain –Shows dimensions to which it is connected –Shows all the other dimensions without connections n Example : fact table diagram for the telephone billing line item Example n Example : Supporting information Example : Example :

The telephone billing Fact Table Diagram

Dimension information Table n Supporting information for the Fact Table Diagram includes the Name and Description of each dimension

Fact Table Detail 2.3 n Complete list of all the facts available through the fact table n List includes: –actual facts in the physical table –derived facts presented through views –facts calculated from first two groups n Example: Customer billing Line item fact table detail for the telephone co Example:

Customer billing Line item Fact table detail

Dimension Table Detail 2.4 n Shows attributes in a single dimension n Shows explicit grain of the dimension n Shows the approximate cardinality of each dimension attribute n Shows hierarchies and relationships between the attributes n Example: Time dimension Table detail Example: n Example: Dimension attribute detail descriptions Example:

Time Dimension Table detail diagram

Dimension attribute detail descriptions n Documentation: Full descriptive information to support the diagram –Attribute Name, description, cardinality –Slowly Changing Policy, Sample Values n Example: Time Dimension attribute detail descriptions Example: n Example: Many-to-many relationships and slowly changing dimension attributes Example: n Example: Correlated attributes Example:

Time Dimension attribute detail descriptions

Many-to-many relationships and slowly changing dimension attributes

Correlated attributes

Steps for the Dimensional Modelling Team 2.5 n Create the Initial draft: data marts, dimensions, data matrix and diagrams n Track Base Facts n Track derived Facts (Example) Example n Present initial design to rest of IS team n Select some key users to work on project n Present to Business users

Derived Fact worksheet

Identifying the Sources for Each Fact Table and Dimension Table 2.6 n Source. Name of the source system. n Business owner. Name of the primary contact within the business who is responsible for this data. n IS owner. Name of the person who is responsible. n Platform. Operating environment where system runs. n Location. The actual location of the system. The city and the specific machine where this system runs. n Description. A brief description of what system does n Example: Data Source Definitions Example: n Example: Mapping data from source to Target Example:

Data Source definitions

Mapping data from source to Target Source-to-target data map: n Foundation for the development of the data staging process n To document specifically where the data can be located n Example: Sample Source-to-target data map Example:

Sample Source-to-target data map (1/2)

Sample Source-to-target data map (2/2)  Table name. The name of the logical table in the data warehouse.  Column name. The name of the logical column in the warehouse.  Data type. The data type of the logical column (char, number, date).  Length. The length of the field of the logical column.  Target column description. A description of the logical column.  Source system. The name of the source system where data feeds the target logical column.  Source table/file. The name of the specific table or file where data feeds the target logical column.  Source column/field. The name of the specific column or field where data feeds the target logical column.  Data transform. Notes about any transformations that are required to translate the source information into the format required by the target column.

Using a Data Modelling Tool 2.7 n Used to develop the physical data model n Preferably one that stores your model’s structure in a relational database

Summary (1/2) n Process used to apply dimensional modelling techniques n Bus Architecture Matrix to lay out the data marts and dimensions n Four-step method to design a single data mart n Diagramming techniques n Data sourcing and mapping

Summary (2/2) n In this section we focused on the DATA n Next: different path of lifecycle Technical architecture

Supporting Templates collected on CD-Rom n Template 7.1 Data Mart Matrix n Template 7.2 Dimensional Model Document n Template 7.3 Derived Fact Worksheet n Template 7.4 Logical table design n Template 7.5 Data Source Definition Document n Template 7.6 Source to Target Data Map