Aggregate Improvement and Lost, shrunken, and collapsed

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

Jennifer Widom On-Line Analytical Processing (OLAP) Introduction.
Prof. Navneet Goyal Computer Science Department BITS, Pilani
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
OLAP. Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming, analytic queries.
© Ron McFadyen1 Many-to-one-to-many We need information that can only be obtained by accessing two fact tables through a common dimension … drilling across.
March 2010ACS-4904 Ron McFadyen1 Aggregate management References: Lawrence Corr Aggregate improvement
March Ron McFadyen1 Using Rational Rose to create a database.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
1 Lecture 10: More OLAP - Dimensional modeling
Lab3 CPIT 440 Data Mining and Warehouse.
By N.Gopinath AP/CSE. Two common multi-dimensional schemas are 1. Star schema: Consists of a fact table with a single table for each dimension 2. Snowflake.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Business Intelligence Instructor: Bajuna Salehe Web:
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:
20.5 Data Cubes Instructor : Dr. T.Y. Lin Chandrika Satyavolu 222.
Ahsan Abdullah 1 Data Warehousing Lecture-7De-normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
BI Terminologies.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Winter 2006Winter 2002 Keller, Ullman, CushingJudy Cushing 19–1 Warehousing The most common form of information integration: copy sources into a single.
OLAP On Line Analytic Processing. OLTP On Line Transaction Processing –support for ‘real-time’ processing of orders, bookings, sales –typically access.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
The Data Warehouse Chapter Operational Databases = transactional database  designed to process individual transaction quickly and efficiently.
Copyright © Archer Decision Sciences, Inc. Our Model Store DimensionProduct Dimension District Region Total Brand Manufacturer Total StoresProducts.
By A Sai Krishna Geethika Lokanadham Mithun Rajanna KV Kumar Data warehousing for Risk Analysis.
Data warehousing for Profit Analysis By A Sai Krishna Geethika Lokanadham Mithun Rajanna KV Kumar.
To SSAS or not to SSAS, that is the question Ayman Senior PFE - Microsoft.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Data Warehouses and OLAP 1.  Review Questions ◦ Question 1: OLAP ◦ Question 2: Data Warehouses ◦ Question 3: Various Terms and Definitions ◦ Question.
Building Data ware House
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
On-Line Analytic Processing
Data warehouse and OLAP
Data Warehouse.
Assignment 2 Due Thursday Feb 9, 2006
Dimensional Model January 14, 2003
Inventory is used to illustrate:
Generalization.
On-Line Analytical Processing (OLAP)
Order Database – ER Diagram
INFS 3220 Systems Analysis & Design
Typically data is extracted from multiple sources
Physical Storage Indexes Partitions Materialized views March 2004
Data Warehousing: Data Models and OLAP operations
Physical Storage Indexes Partitions Materialized views March 2006
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Minidimension Example
Assignment 2 Due Thursday Feb 9, 2006
Retail Sales is used to illustrate a first dimensional model
Warehouse Architecture
Physical Storage Indexes Partitions Materialized views March 2005
Data warehouse architecture CIF, DM Bus Matrix Star schema
DATA CUBES E0 261 Jayant Haritsa Computer Science and Automation
Point-in-time balances Physical database Aggregation ETL Architecture
Retail Sales is used to illustrate a first dimensional model
Role Playing Dimensions (p )
Dimensional Model January 16, 2003
Aggregate improvement Lost, shrunken, and collapsed Ralph Kimball
Examines blended and separate transaction schemas
Review of Major Points Star schema Slowly changing dimensions Keys
Assignment 1 Due Thursday Jan 19, 2006
Many aggregates can be defined for one base star schema
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Summary of Transformation Rules
Review of Major Points Star schema Slowly changing dimensions Keys
Page 37 Figure 2.3, with attributes excluded
Recursive Relationship
Data Base.
Presentation transcript:

Aggregate Improvement and Lost, shrunken, and collapsed Aggregate management References: Aggregate Improvement and Lost, shrunken, and collapsed Aggregates exist for performance reasons Consider aggregates as a way to speed queries Indispensable to warehouses, just as indexes are indispensable to OLTP databases. Use aggregate navigators Rules of thumb: Double storage requirement by factor of two to allow for aggregations Aggregates should be 10-20 times smaller than source fact/aggregate table Try to accelerate some queries by 1,000 times Both by Lawrence Corr March 2004 91.4904 Ron McFadyen

Dimensions in a aggregate schema can be characterized as one of: Aggregate management Dimensions in a aggregate schema can be characterized as one of: Lost Shrunken Collapsed March 2004 91.4904 Ron McFadyen

Aggregate management Lost dimension A lost dimension is one that disappears – it is in the base schema, but it does not appear in the aggregate schema e.g. March 2004 91.4904 Ron McFadyen

E.g. a month dimension replaces the date dimension Aggregate management Shrunken dimension A shrunken dimension is one where a rollup occurs, or where a dimension is replaced by a higher dimension in a snowflake design E.g. a month dimension replaces the date dimension How is this type of dimension materialized? What should be the name of the Primary Key? March 2004 91.4904 Ron McFadyen

The collapsed dimension does not exist as a table Aggregate management Collapsed dimension A collapsed dimension does not appear as a dimension in the aggregate, rather one or more of its attributes appear in the aggregate’s fact table The collapsed dimension does not exist as a table e.g. March 2004 91.4904 Ron McFadyen