Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.

Slides:



Advertisements
Similar presentations
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Advertisements

Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.
Multidimensional Data. Many applications of databases are "geographic" = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Multidimensional Data Rtrees Bitmap indexes. R-Trees For “regions” (typically rectangles) but can represent points. Supports NN, “where­am­I” queries.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Jennifer Widom On-Line Analytical Processing (OLAP) Introduction.
Implementation & Computation of DW and Data Cube.
OLAP. Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming, analytic queries.
Data Warehousing and OLAP
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 29 Overview of Data Warehousing and OLAP.
Lab3 CPIT 440 Data Mining and Warehouse.
By N.Gopinath AP/CSE. Two common multi-dimensional schemas are 1. Star schema: Consists of a fact table with a single table for each dimension 2. Snowflake.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
CS346: Advanced Databases
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Data Warehousing.
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
Data Warehousing and OLAP
On-Line Analytic Processing Chetan Meshram Class Id:221.
CS 345: Topics in Data Warehousing Tuesday, October 19, 2004.
1 Cube Computation and Indexes for Data Warehouses CPS Notes 7.
OnLine Analytical Processing (OLAP)
ITCS 6163 Lecture 5. Indexing datacubes Objective: speed queries up. Traditional databases (OLTP): B-Trees Time and space logarithmic to the amount of.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.
Prof. Bayer, DWH, Ch.4, SS Chapter 4: Dimensions, Hierarchies, Operations, Modeling.
Data Warehousing.
BI Terminologies.
Data Warehousing and OLAP. Warehousing ► Growing industry: $8 billion in 1998 ► Range from desktop to huge:  Walmart: 900-CPU, 2,700 disk, 23TB Teradata.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Dr. N. MamoulisAdvanced Database Technologies1 Topic 6: Data Warehousing & OLAP Defined in many different ways, but not rigorously. A decision support.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Winter 2006Winter 2002 Keller, Ullman, CushingJudy Cushing 19–1 Warehousing The most common form of information integration: copy sources into a single.
Fox MIS Spring 2011 Data Warehouse Week 8 Introduction of Data Warehouse Multidimensional Analysis: OLAP.
UNIT-II Principles of dimensional modeling
1 On-Line Analytic Processing Warehousing Data Cubes.
Data Warehousing Multidimensional Analysis
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University.
Data Warehousing.
1 Database Systems, 8 th Edition 1 Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: –How business intelligence.
Indexing OLAP Data Sunita Sarawagi Monowar Hossain York University.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
SF-Tree and Its Application to OLAP Speaker: Ho Wai Shing.
SQL Server Analysis Services Understanding Unified Dimension Model (UDM)
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
An Overview of Data Warehousing and OLAP Technology
Data Warehouses and OLAP 1.  Review Questions ◦ Question 1: OLAP ◦ Question 2: Data Warehouses ◦ Question 3: Various Terms and Definitions ◦ Question.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Data Warehousing COMP3017 Advanced Databases Dr Nicholas Gibbins –
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
On-Line Analytic Processing
Chapter 13 The Data Warehouse
Chapter 5: Advanced SQL Database System concepts,6th Ed.
Data Warehouse.
On-Line Analytical Processing (OLAP)
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Introduction of Week 9 Return assignment 5-2
Chapter 13 The Data Warehouse
Presentation transcript:

Advanced Querying OLAP Part 2

Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation Data model: –Multidimensional cube

Context Operations: –Roll-up, drill-down, –Pivot, –Slice and dice. Implementation: –ROLAP –MOLAP

Outline Examples of decision support queries Data Cubes –Conceptual data model –Typical operations SQL:1999 support for OLAP Implementation –ROLAP vs MOLAP –Indexing structures

MOLAP Not on top of relational database –most popular design –specialized data structures Multicubes vs Hypercubes –Not all subcubes are materialized

Multicubes User identifies set of sparse attributes S, and a set of dense attributes D. Index tree is constructed on sparse dimensions. Each leaf points to a multidimensional array indexed by D.

Example product, store are sparse dimensions date and customer-type are dense … … 1timeretregTotal 1/1/ /1/ … Total timeretregTotal 1/1/ /1/ … Total prod. p store s1 prod. p store s2 prod. p

Example product, store are sparse dimensions date and customer-type are dense … … 1timeretregTotal 1/1/ /1/ … Total timeretregTotal 1/1/ /1/ … Total prod. p store s1 prod. p store s2 prod. p E.g., B-tree, R-tree, …

Example product, store are sparse dimensions date and customer-type are dense … … 1timeretregTotal 1/1/ /1/ … Total timeretregTotal 1/1/ /1/ … Total prod. p store s1 prod. p store s2 prod. p E.g., B-tree, R-tree, … 2D array Direct access

Example product, store are sparse dimensions date and customer-type are dense … … 1timeretregTotal 1/1/ /1/ … Total timeretregTotal 1/1/ /1/ … Total prod. p store s1 prod. p store s2 prod. p E.g., B-tree, R-tree, … 2D array Direct access Linked list

Queries Efficiency depends on: –does index on sparse dimensions fit into memory? –Type of queries: Restrictions on all dimensions Restrictions only on dense Restrictions only on some sparse and dense

Queries Selection on all attributes: (p,s1,ret,all) … … 1timeretregTotal 1/1/ /1/ … Total timeretregTotal 1/1/ /1/ … Total prod. p store s1 prod. p store s2 prod. p

Queries Only on dense attributes: (-,-,ret,”2/1/07”) … … 1timeretregTotal 1/1/ /1/ … Total timeretregTotal 1/1/ /1/ … Total prod. p store s1 prod. p store s2 prod. p

Queries Only some sparse and dense attributes: (-,s1,ret,”2/1/07”) … … 1timeretregTotal 1/1/ /1/ … Total timeretregTotal 1/1/ /1/ … Total prod. p store s1 prod. p store s2 prod. p

Specialized Indexing Structures B-trees, (not covered) Bitmapped indices, Join indices, Spatial data structures (covered later)

Index Structures Indexing principle:  mapping key values to records for associative direct access  Most popular indexing techniques in relational database: B+-trees  For multi-dimensional data, a large number of indexing techniques have been developed: R-trees

Bitmap Indexes Bitmap index: indexing technique that has attracted attention in multi-dimensional DB implementation table

Bitmap Indexes The index consists of bitmaps: bitmaps Index on a particular column Index consists of a number of bit vectors - bitmaps Each value in the indexed column has a bit vector (bitmaps) The length of the bit vector is the number of records in the base table The i-th bit is set if the i-th row of the base table has the value for the indexed column

 Index on a particular column  Index consists of a number of bit vectors - bitmaps  Each value in the indexed column has a bit vector (bitmaps)  The length of the bit vector is the number of records in the base table  The i-th bit is set if the i-th row of the base table has the value for the indexed column Bitmap Indexes

Bitmap Index... age index bit maps data records Query: Get people with age = 20 and name = “fred” List for age = 20: List for name = “fred”: Answer is intersection: Suited well for domains with small cardinality

 With efficient hardware support for bitmap operations (AND, OR, XOR, NOT), bitmap index offers better access methods for certain queries  e.g., selection on two attributes  Some commercial products have implemented bitmap index  Works poorly for high cardinality domains since the number of bitmaps increases  Difficult to maintain - need reorganization when relation sizes change (new bitmaps) Bitmap Index – Summary

“Combine” SALE, PRODUCT relations In SQL: SELECT * FROM SALE, PRODUCT Join

Join Indexes join index

Join Indexes  Traditional indexes: value  rids. Join indices: tuples in the join  to rids in the source tables.  Data warehouse:  values of dimensions of star schema  rows in fact table.  Join indexes can span multiple dimensions

OLAP - Summary Data warehouse is a specialized database to support analytical queries = OLAP queries Data cube as conceptual model Implementation of Data Cube –View selection problem –Explosion problem –ROLAP vs. MOLAP –Indexing structures