Copyright © Starsoft Inc, 2000 1 Data Warehouse Architecture By Slavko Stemberger.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Cognos 8 Training Session
Supervisor : Prof . Abbdolahzadeh
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
C6 Databases.
Data Warehousing M R BRAHMAM.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
Organizing Data & Information
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) The Data Warehouse Lifecycle Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Principles of Dimensional Modeling
ETL Design and Development Michael A. Fudge, Jr.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Datawarehousing Concepts | 7.0 9/7/2015 Datawarehousing Concepts.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
7.1 Managing Data Resources Chapter 7 Essentials of Management Information Systems, 6e Chapter 7 Managing Data Resources © 2005 by Prentice Hall.
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
1 Data Warehouses BUAD/American University Data Warehouses.
Data Warehousing.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
MIS2502: Data Analytics The Information Architecture of an Organization.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Data resource management
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
Ayyat IT Group Murad Faridi Roll NO#2492 Muhammad Waqas Roll NO#2803 Salman Raza Roll NO#2473 Junaid Pervaiz Roll NO#2468 Instructor :- “ Madam Sana Saeed”
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Foundations of Business Intelligence: Databases and Information Management.
Data Warehousing Multidimensional Analysis
3/6: Data Management, pt. 2 Refresh your memory Relational Data Model
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Warehousing DSCI 4103 Dr. Mennecke Chapter 2.
SQL Server Analysis Services Understanding Unified Dimension Model (UDM)
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Data Warehouses and OLAP 1.  Review Questions ◦ Question 1: OLAP ◦ Question 2: Data Warehouses ◦ Question 3: Various Terms and Definitions ◦ Question.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
7 Copyright © 2006, Oracle. All rights reserved. Defining a Relational Dimensional Model.
What is a database? (a supplement, not a substitute for Chapter 1…) some slides copied/modified from text Collection of Data? Data vs. information Example:
Intro to MIS – MGS351 Databases and Data Warehouses
Data warehouse.
Decision Support System by Simulation Model (Ajarn Chat Chuchuen)
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Databases and Data Warehouses Chapter 3
Overview and Fundamentals
MANAGING DATA RESOURCES
Data Warehouse and OLAP
Unidad II Data Warehousing Interview Questions
An Introduction to Data Warehousing
Data Warehouse Architecture
MIS2502: Data Analytics The Information Architecture of an Organization Acknowledgement: David Schuff.
Data Warehousing: Data Models and OLAP operations
Retail Sales is used to illustrate a first dimensional model
Data Warehousing Concepts
Data Warehouse and OLAP
Presentation transcript:

Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger

Copyright © Starsoft Inc, Some Acronyms/Terms OLAP –On-line Analytical Processing ROLAP –Relational OLAP OLTP –On-Line Transaction Processing (operational system)

Copyright © Starsoft Inc, Some Acronyms/Terms Metadata –Data about data (data dictionary) Source System –An operational system that provides data for the data warehouse MOLAP –Multidimensional OLAP

Copyright © Starsoft Inc, Some Acronyms/Terms Data Warehouse –A queryable source of data Data Mart –A logical subset of a data warehouse Data Staging Area –An intermediate storage location used for ETL ETL –Extract, Transform and Load

Copyright © Starsoft Inc, Data Structures/Databases Hierarchical DB Network DB Relational DB O-O DB Dimensional DB Flat Files

Copyright © Starsoft Inc, Modeling Methods Dimensional Object Oriented (O-O) Entity-Relationship (E-R)

Copyright © Starsoft Inc, Entity-Relationship Modeling Instantaneous snapshot of the business Removed data redundancy (eliminates update anomalies) Shows detail relationships Complex network of entities can be difficult for end-users to understand Used for operational system

Copyright © Starsoft Inc, Dimensional Modeling Data duplication is allowed (in the dimensions) Query based Easier for users to understand –Not as much detail shows as in E-R Used in data warehouses

Copyright © Starsoft Inc, Dimensional Models Star Schema Snowflake Schema The “Cube”

Copyright © Starsoft Inc, The “Cube” Logical structure of ALL data warehouses Can be implemented physically in an RDB like Oracle Some view this as limited to data marts

Copyright © Starsoft Inc, Star Schema Easy to understand Flexible in type of questions that can be asked Supports very large data warehouses There is data redundancy (in the dimensions)

Copyright © Starsoft Inc, Snowflake Schema “Normalized” star schema More complex than the star schema - harder to understand and work with Solves some problems that cannot be done with star schema

Copyright © Starsoft Inc, Dimension Tables Each variable has a set of known, relatively small, set of values dimensions per data warehouse/data mart is the norm A set of independent variables that affect an observation

Copyright © Starsoft Inc, Dimension Tables (cont…) Some numeric values are descriptive –Numeric descriptive values should be suspect of being facts e.g. standard product price may be a fact because it can change and one can ask “what was the average standard price of the product over the last 12 months” Columns are descriptive and usually textual

Copyright © Starsoft Inc, Dimension Tables (cont…) Time dimension keys may be/should be assigned in the order of the dates in the fact table - this allows physical partitioning In general avoid “smart” keys - they should be meaningless Avoid production keys Dimension keys should be meaningless surrogate keys

Copyright © Starsoft Inc, Dimension Tables - Granularity Keep the grain of the data as small as possible (as detail as possible) –This makes the warehouse more resistant to change –It is easier to add attributes to existing dimensions –superior results in data mining operations Definition: The level of detail of the data

Copyright © Starsoft Inc, Dimension Tables - “Types” Degenerate “Junk” Other Time

Copyright © Starsoft Inc, Dimension Tables - Time Must be consistent across all fact tables Create partial attributes year, month and day and their concatenations (year + month, year + month + day, year + week, …) –Without the concatenations, it is difficult to ask for time ranges All data marts and warehouses have at least one time dimension

Copyright © Starsoft Inc, Dimension Tables - Degenerate Usually a control document id such as order number, invoice number, etc No value in creating a physical table Put the id into the fact table Dimensions with only one attribute

Copyright © Starsoft Inc, Dimension Tables - “Junk” Possible Actions: –Put the these flags into the fact table –Make each one into a dimension –Drop them from the design –Create one dimension with all combinations of these flags Given: Leftover flags and text attributes

Copyright © Starsoft Inc, Fact Tables Degenerate dimension keys (if they exist) Facts –Additive –Semi-additive –Non-additive –None (factless tables) Dimension keys

Copyright © Starsoft Inc, Facts - Additive Can be added across all combination of dimensions Examples: sales in dollars or units These are measures of activity

Copyright © Starsoft Inc, Facts - Semi-additive/non- additive Some may be added across some dimensions but not others –e.g. Bank Balance Some may not be added at all –e.g. Temperature These are measures of intensity

Copyright © Starsoft Inc, Closing Other things to look at –Mutating dimensions –Hierarchical data (e.g. product structures) –Security –Data Loading –Cleansing –etc.