Defining Data Warehouse Structures Data Warehouse Data Access End User Data Access Data Sources Staging Area Data Marts Data Extract, Transform, and Load.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Dimensional Modeling.
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
OCS Infotech Proprietary & Confidential Typical BI solution Architecture.
Data Warehousing M R BRAHMAM.
Dimensional Modeling Business Intelligence Solutions.
2/10/05Salman Azhar: Database Systems1 On-Line Analytical Processing Salman Azhar Warehousing Data Cubes Data Mining These slides use some figures, definitions,
Exploiting the DW data DW is a platform for creating a wide array of reports It solves data feed problems, but does not lead to specific decision support.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
DWH – Dimesional Modeling PDT Genči. 2 Outline Requirement gathering Fact and Dimension table Star schema Inside dimension table Inside fact table STAR.
Override the title Chris Harrington
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
Atlanta Microsoft Database Forum Introduction to Data Warehousing Concepts Brian Thomas Solution Builders, Inc. Presented by March 8, 2004
Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Module 1: Introduction to Data Warehousing and OLAP
BI Terminologies.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
DIMENSIONAL MODELING MIS2502 Data Analytics. So we know… Relational databases are good for storing transactional data But bad for analytical data What.
MIS2502: Data Analytics Dimensional Data Modeling
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
DW-2: Designing a Data Warehousing System 용 환승 이화여자대학교
Implementing Calculations Using MDX. Drinks Tea Lemon Earl Grey Coffee Columbian Dimension Family Relationships  Drinks is the Parent of Tea and Coffee.
UNIT-II Principles of dimensional modeling
Shilpa Seth.  Multidimensional Data Model Concepts Multidimensional Data Model Concepts  Data Cube Data Cube  Data warehouse Schemas Data warehouse.
Data Warehousing Multidimensional Analysis
An inside look into Retail Sales & Purchases. Refresh: (About US Census Bureau) Agency of the Federal Statistical System Accumulates and reports on American.
ISAM 5931: Data Warehousing & Data Mining Group Project submitted by : Mudassar Hakim & Gaurav Wadhwani.
SQL Server Analysis Services Understanding Unified Dimension Model (UDM)
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Event Title Event Date. Module 02—Introduction to Dimensional Modeling Techniques Name Title Microsoft Corporation.
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
Extending and Creating Dynamics AX OLAP Cubes
Dimensional Modelling III
Chapter 13 Business Intelligence and Data Warehouses
Data warehouse and OLAP
Chapter 13 The Data Warehouse
MIS2502: Data Analytics Dimensional Data Modeling
Data Warehouse.
Star Schema.
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Databases & Data Warehouses
Overview and Fundamentals
Dimensional Model January 14, 2003
MIS2502: Data Analytics Dimensional Data Modeling
Implementing Data Models & Reports with Microsoft SQL Server
Data warehouse Design Using Oracle
Data Warehouse and OLAP
Chapter 4 The Relational Model Pearson Education © 2009.
Chapter 4 The Relational Model Pearson Education © 2009.
MIS2502: Data Analytics The Information Architecture of an Organization Acknowledgement: David Schuff.
MIS2502: Data Analytics Dimensional Data Modeling
The Relational Model Transparencies
Dimensional Modeling.
MIS2502: Data Analytics Dimensional Data Modeling
Chapter 4 The Relational Model Pearson Education © 2009.
Retail Sales is used to illustrate a first dimensional model
DWH – Dimesional Modeling
Chapter 4 The Relational Model Pearson Education © 2009.
Data Warehouse and OLAP
Data Warehousing.
Presentation transcript:

Defining Data Warehouse Structures Data Warehouse Data Access End User Data Access Data Sources Staging Area Data Marts Data Extract, Transform, and Load

Data Warehouse Structures Overview End User Data Access Data Marts OLAP Data Warehouse

The Star SchemaEmployee_DimEmployee_Dim EmployeeKey EmployeeID... EmployeeID... Time_DimTime_Dim TimeKey TheDate... TheDate... Product_DimProduct_Dim ProductKey ProductID... ProductID... Customer_DimCustomer_Dim CustomerKey CustomerID... CustomerID... Shipper_DimShipper_Dim ShipperKey ShipperID... ShipperID... Sales_Fact TimeKey EmployeeKey ProductKey CustomerKey ShipperKey TimeKey EmployeeKey ProductKey CustomerKey ShipperKey Sales Amount Unit Sales... Sales Amount Unit Sales...

Describing Dimensions Describing Business Entities Containing Attributes That Provide Context to Numeric Data Presenting Data Organized into Hierarchies

Identifying Primary Keys Primary Keys Identify Uniqueness Are the Dimension Columns Referenced in the Fact Table Two Candidates Application Keys (app suffix) Are Source Data Values That Identify Uniqueness Surrogate Keys (key suffix) Are System-generated Integers That Identify Uniqueness product_id_app product_dim_key

Defining Hierarchies Understanding Benefits of Hierarchies Allow end users to view data at different levels of summarization Provide drill down / drill up paths of analysis Drill Down Drill Up Implementing Hierarchies Denormalized star schema dimensions Normalized snowflake dimensions

Snowflaked Dimension Tables Defines Hierarchies by Using Multiple Dimension Tables Is More Normalized than a Single Table Dimension

Describing Fact Tables Inventory Data By Inventory Date, Product, and Warehouse Sales Data By Product, Customer, and Order Date

Identifying Fact Table Components Dimension Tables customer_dimcustomer_dim 201 ALFI Alfreds product_dimproduct_dim Chai Sales_fact Table customer_key product_key time_key quantity_sales amount_sales Foreign Keys ,789 The grain of the sales_fact table is defined by the lowest level of detail stored in each dimension associated with the fact table The grain of the sales_fact table is sales data by customer ID, product ID, and order date 134 1/1/2000 time_dimtime_dim Measures

Defining Foreign Keys time_dim_key product_dim_key customer_dim_key product_key customer_key order_date_key FOREIGN KEY Constraint FOREIGN KEY Constraint FOREIGN KEY Constraint Physically implement the relationship between FK columns of the fact table and PKs of dimension tables Enforce referential integrity between the dimension tables and the fact table

Relational Schemas and OLAP End User Data Access Data Marts OLAP Data Warehouse OLAP

OLAP Database Components Numeric Measures Data values or facts that users analyze Dimensions Business categories that provide context to numeric measures Sourced from columns in star schema dimensions Members are organized into hierarchies Cubes Combine dimensions and measures into one conceptual model Logical storage medium for an OLAP database

Relational Dimensions vs. OLAP Dimensions REGION West CA OR East MA NY REGION West East STATEREGION CAWest ORWest MAEast NYEast OLAP Relational

OLAP Dimension Fundamentals Year Quarter Month 1999 Q1 Jan 1999 Q1 Feb … … … Time Dimension Table Month Quarter Year RELATIONAL OLAP

Drinks Tea Lemon Earl Grey Coffee Columbian Dimension Family Relationships  Drinks is the Parent of Tea and Coffee  Tea and Coffee are Children of Drinks  Tea and Coffee are Siblings  Tea and Columbian are Descendants of Drinks  Tea and Drinks are Ancestors of Earl Grey  Lemon and Columbian are Cousins  All are dimension Members Drinks Tea Lemon Earl Grey Coffee Columbian

Cube Measures Are the Numeric Values of Principal Interest Correspond to a Fact Table’s Facts (or Measures) Intersect All Dimensions at All Levels Are Aggregated at All Levels of Detail Form a Dimension

The Cube Measures Dimension Products Dimension Time Dimension Ave Units Sales Dollars Sales Units Net Price Q4 Q3 Q2 Apples Cherries Grapes Q1 Melons 2002

Measures Dimension Products Dimension Q4 Q3 Q2 Time Dimension Apples Cherries Grapes Q1 Melons Querying a Cube Ave Units Sales Dollars Sales Units Net Price 1000

Apples Measures Dimension Products Dimension Q4 Q3 Q2 Q1 Time Dimension Melons Cherries Grapes Defining a Cube Slice Ave Units Sales Dollars Sales Units Net Price