Data warehousing theory and modelling techniques Graduate course on dimensional modelling.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Dimensional Modeling.
Tips and Tricks for Dimensional Modeling
BY LECTURER/ AISHA DAWOOD DW Lab # 4 Overview of Extraction, Transformation, and Loading.
Copyright © C. J. Date 2005page 97 S#Y S1DURINGS3DURING [d04:d10][d08:d10] S2DURINGS4DURING [d02:d04][d04:d10] [d08:d10] WITH ( EXTEND T2 ADD ( COLLAPSE.
MIS 451 Building Business Intelligence Systems
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Chapter 5 - Tuesday.
Master Data for SCM (1) Master Data for Demand Planning & Fulfillment Processes EGN 5623 Enterprise Systems Optimization (Professional MSEM) Fall, 2012.
Prof. Navneet Goyal Computer Science Department BITS, Pilani
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
© Ron McFadyen1 Many-to-one-to-many We need information that can only be obtained by accessing two fact tables through a common dimension … drilling across.
MIS 451 Building Business Intelligence Systems Logical Design (5) – Aggregate.
Dimensional Modeling – Part 2
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Designing a Data Warehouse
Data warehousing theory and modelling techniques Building Dimensional Models.
Copyright © 2007 Pearson Education Canada 1 Chapter 12: Audit Sampling Concepts.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Business Intelligence
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
Data Warehousing Seminar Chapter 5. Data Warehouse Design Methodology Data Warehousing Lab. HyeYoung Cho.
May-04Accounting1 Compiere Training Accounting, Tax Performance Analysis.
Master Data for SCM (1) Master Data in Demand Planning & Fulfillment Processes EGN 5346 Logistics Engineering (MSEM, Professional) Fall, 2013.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
© 2007 by Prentice Hall 1 Introduction to databases.
Information Systems Today (©2006 Prentice Hall) 3-1 CS3754 Class Note 12 Summery of Relational Database.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
THE STUDY & EVALUATION OF INTERNAL CONTROL. Definition Professional Standards Data-Oriented  Small, simple systems  Weaker controls System-Oriented.
1 Data Warehousing Lecture-14 Process of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
1 Relational Databases and SQL. Learning Objectives Understand techniques to model complex accounting phenomena in an E-R diagram Develop E-R diagrams.
BI Terminologies.
Some OLAP Issues CMPT 455/826 - Week 9, Day 2 Jan-Apr 2009 – w9d21.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Hachim Haddouti, adv. DBMS & DW CSC5301, Ch5 Chapter 5: The Value Chain Adv. DBMS & DW Hachim Haddouti.
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
Master Data for SCM (1) Master Data in Demand Planning & Fulfillment Processes EGN 5623 Enterprise Systems Optimization (Professional MSEM) Fall, 2011.
Modeling Issues for Data Warehouses CMPT 455/826 - Week 7, Day 1 (based on Trujollo) Sept-Dec 2009 – w7d11.
1 Data Modeling : ER Model…. Lecture Summary so for …. ER Model Entity, Attribute, Key Relationships Instance Diagram.
UNIT-II Principles of dimensional modeling
1 On-Line Analytic Processing Warehousing Data Cubes.
Building a Data Warehouse for Business Reporting Presented by – Arpit Desai Faculty Advisor – Dr. Meiliu Lu CSC Department – Spring 2006 California State.
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Copyright 2010, The World Bank Group. All Rights Reserved. Producer prices, part 1 Introduction Business Statistics and Registers 1.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
S T A T I S T I K A U S T R I A Quality Assessment of register-based Statistics A Quality Framework Manuela LENK Directorate.
Data modeling. Presentation by – Anupama Vudaru, Phani Kondapalli Content by – Prathibha Madineni, Subrahmanyam Kolluri October 2010.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Data Warehousing DSCI 4103 Dr. Mennecke Chapter 2.
INCREMENTAL AGGREGATION After you create a session that includes an Aggregator transformation, you can enable the session option, Incremental Aggregation.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
The Data Large Number of Workbooks Each Workbook has multiple worksheets Transaction worksheets have large (LARGE) number of lines (millions of records.
On-Line Analytic Processing
Star Schema.
Overview and Fundamentals
Competing on Analytics II
MIS2502: Data Analytics The Information Architecture of an Organization Acknowledgement: David Schuff.
Retail Sales is used to illustrate a first dimensional model
Data warehouse architecture CIF, DM Bus Matrix Star schema
Retail Sales is used to illustrate a first dimensional model
Dimensional Model January 16, 2003
Data Warehousing Concepts
Review of Major Points Star schema Slowly changing dimensions Keys
Presentation transcript:

Data warehousing theory and modelling techniques Graduate course on dimensional modelling

1. EXTENDED DIMENSION TABLE DESIGNS Many-to-many dimensions Many-to-many traps Role-playing dimensions Organisation and parts hierarchies Time stamping the changes Building an audit dimension conclusion: too few or too many dimensions

2. EXTENDED FACT TABLE DESIGNS Facts of differing granularity and allocating Time of day Multiple units of measurement Value band reporting 2.4.

1.1. Many-to-many dimensions: if one of the dimensions has many values

n Solution: –A bridge table between the fact table and the many-to-many dimension –we must generalise the original diagnoses key in the fact table to be a special diagnoses key and we use a group key and a weighting factor which sums up to 1.00

Many-to-many dimensions: if one of the dimensions has many values 1.1.

Many-to-many traps: be aware of different cardinality when a dimension is attached to several fact tables 1.2.

Role-playing dimensions 1.3. A role in a data warehouse is a situation where a single dimension appears several times in the same fact table. Then the underlying dimension may exist as a single physical table, but each of the roles must be presented in a separately labelled view. A role in a data warehouse is a situation where a single dimension appears several times in the same fact table. Then the underlying dimension may exist as a single physical table, but each of the roles must be presented in a separately labelled view. E.g. the telecommunications industry of a single call we might register:  source system provider  local switch provider  long distance provider  added value service provider E.g. the telecommunications industry of a single call we might register:  source system provider  local switch provider  long distance provider  added value service provider

1.4. Organisation and parts hierarchies

Organisation and parts hierarchies 1.4.

1.5. Time stamping n E.g. human resources environment of a large enterprise with employees n 3 kinds of queries against the HR data: 1. summary statistics of the entire employee data base (monthly) 2.to profile the employee population at any moment in time (month end or not) 3.we demand that every employee transaction can be represented distinctly: we thus want to see every transaction on a given employee, with the correct transaction sequence and the correct timing of each transaction 3 = detailed transaction theory or fundamental truth

Time stamping e.g. an employee transaction dimension time stamped with current and next transaction dates and times 1.5.

Building an audit dimension during the extract process in the data staging phase 1.6.

2.1. Facts of differing granularity and allocating E.g. shipment invoice shipment invoiceshipment invoice n Individual fact records should be on the lowest atomic level n When faced with facts of differing granularity, try to force all the facts to the lowest level n When allocating facts to the lowest level is impossible, the higher level facts have to be presented in separate tables n High-level plans or forecasts: make sure that the actual aggregates exist at the same level of the plans –1. Aggregate table and plan table share exactly the same dimensions 1. => combine them in 1 single physical table –2. Aggregate table and plan table have different dimensions => combination in 1 single physical table is impossible

2.1. Facts of differing granularity and allocating

2.1. Facts of differing granularity and allocating: high level plans or forecasts

Facts of differing granularity and allocating: high level plans or forecasts 2.1. The dimension plan version makes it impossible to combine with an aggregate table

Time of day 2.2.

2.3. Multiple units of measure The wrong design when fact table quantities need to be expressed in several units of measure

Multiple units of measure 2.3. The recommended design for multiple units of measure

Value band reporting 2.4. n Report of form: balance number of total of range accounts balances $ $ $