© 1999 FORWISS General Research Report Implementation and Optimization Issues of the ROLAP Algebra F. Ramsak, M.S. (UIUC) Dr. V. Markl Prof. R. Bayer,

Slides:



Advertisements
Similar presentations
Vorlesung Datawarehousing Table of Contents Prof. Rudolf Bayer, Ph.D. Institut für Informatik, TUM SS 2002.
Advertisements

Dimensional Modeling.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Databases and Information Systems 1 Prof. Dr. Stefan Böttcher Fakultät EIM, Institut für Informatik Universität Paderborn WS 2009 / 2010 Contents: selectivity.
Relational Algebra, Join and QBE Yong Choi School of Business CSUB, Bakersfield.
Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.
Manajemen Basis Data Pertemuan Matakuliah: M0264/Manajemen Basis Data Tahun: 2008.
Foundations of Relational Implementation n Defining Relational Data n Relational Data Manipulation n Relational Algebra.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification.
CMPT 354, Simon Fraser University, Fall 2008, Martin Ester 52 Database Systems I Relational Algebra.
Database Systems: Design, Implementation, and Management Tenth Edition
Midterm Review Lecture 14b. 14 Lectures So Far 1.Introduction 2.The Relational Model 3.Disks and Files 4.Relational Algebra 5.File Org, Indexes 6.Relational.
CS263 Lecture 19 Query Optimisation.  Motivation for Query Optimisation  Phases of Query Processing  Query Trees  RA Transformation Rules  Heuristic.
Coloring Away Communication in Parallel Query Optimization Waqar Hasan, Rajeev Motwani Stanford University Παυλάτος Χρήστος
Dimensional Modeling – Part 2
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
Chapter 7 Advanced SQL Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Database Systems More SQL Database Design -- More SQL1.
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates
Query Processing Presented by Aung S. Win.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
1 CUBE: A Relational Aggregate Operator Generalizing Group By By Ata İsmet Özçelik.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
CSE314 Database Systems More SQL: Complex Queries, Triggers, Views, and Schema Modification Doç. Dr. Mehmet Göktürk src: Elmasri & Navanthe 6E Pearson.
CS 345: Topics in Data Warehousing Tuesday, October 19, 2004.
Network Aware Resource Allocation in Distributed Clouds.
Database Management 9. course. Execution of queries.
Master Thesis Defense Jan Fiedler 04/17/98
FEN  Concepts and terminology  Operations (relational algebra)  Integrity constraints The relational model.
Copyright © 2004 Pearson Education, Inc.. Chapter 15 Algorithms for Query Processing and Optimization.
Query Optimization Chap. 19. Evaluation of SQL Conceptual order of evaluation – Cartesian product of all tables in from clause – Rows not satisfying where.
© 1998 FORWISS FORWISS Oracle Measurement Results Prof. Bayer, PhD Dipl.-Inform. Volker Markl Roland Pieringer.
© 1999 FORWISS FORWISS MISTRAL und DWH 6-2 Processing Relational Queries Using the Multidimensional Access Method UB-Tree Prof. R. Bayer, Ph.D. Dr. Volker.
Prof. Bayer, DWH, Ch.4, SS Chapter 4: Dimensions, Hierarchies, Operations, Modeling.
Speccast 1 Leon Poutievski Dr. Ken Calvert Dr. Jim Griffioen {leon, calvert, University of Kentucky Laboratory for Advanced Networking.
© 2000 FORWISS, 1 MISTRAL Processing Relational Queries Using a Multidimensional Access Method.
FEN Introduction to the database field:  The Relational Model Seminar: Introduction to relational databases.
NoDB: Querying Raw Data --Mrutyunjay. Overview ▪ Introduction ▪ Motivation ▪ NoDB Philosophy: PostgreSQL ▪ Results ▪ Opportunities “NoDB in Action: Adaptive.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
© 1999 FORWISS FORWISS MISTRAL Performance of TPC-D Benchmark and Datawarehouses Prof. R. Bayer, Ph.D. Dr. Volker Markl Dept. of Computer Science, Technical.
603 Database Systems Senior Lecturer: Laurie Webster II, M.S.S.E.,M.S.E.E., M.S.BME, Ph.D., P.E. Lecture 17 A First Course in Database Systems.
Fine Grained Authorization Through Predicated Grants Surajit Chaudhuri, Tanmoy Dutta, S. Sudarshan (ICDE 2007) Presented By: Ahmad Abusalah
Prof. Bayer, DWH, Ch.5, SS Chapter 5. Indexing for DWH D1Facts D2.
1 Exercise Sheet 3 Exercise 7.: ROLAP Algebra Assume that a fact table SalesCube has 3 hierarchies with attributes  ear , Month M, Productgroup P and.
Module 4 Database SQL Tuning Section 3 Application Performance.
Chapter 18 Query Processing. 2 Chapter - Objectives u Objectives of query processing and optimization. u Static versus dynamic query optimization. u How.
Indexes and Views Unit 7.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Multiplication Facts Table of Contents 0’s 1’s 2’s 3’s 4’s 5’s 6’s 7’s 8’s 9’s 10’s.
Query Processing – Query Trees. Evaluation of SQL Conceptual order of evaluation – Cartesian product of all tables in from clause – Rows not satisfying.
1 Exercise Sheet 3 Exercise 7.: ROLAP Algebra Assume that a fact table SalesCube has 3 hierarchies with attributes  ear , Month M, Productgroup P and.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 15 – Query Optimization.
03/02/20061 Evaluating Top-k Queries Over Web-Accessible Databases Amelie Marian Nicolas Bruno Luis Gravano Presented By: Archana and Muhammed.
1 Relational Factor Graphs Lin Liao Joint work with Dieter Fox.
22 Copyright © 2009, Oracle. All rights reserved. Filtering Requests in Oracle Business Intelligence Answers.
Aggregator Stage : Definition : Aggregator classifies data rows from a single input link into groups and calculates totals or other aggregate functions.
1 Overview of Query Evaluation Chapter Outline  Query Optimization Overview  Algorithm for Relational Operations.
1 Minggu 6, Pertemuan 12 Query Processing Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
SQL: Interactive Queries (2) Prof. Weining Zhang Cs.utsa.edu.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
More SQL: Complex Queries, Triggers, Views, and Schema Modification
Chapter 13 The Data Warehouse
DATA CUBE Advanced Databases 584.
Database.
Chapter 4: Dimensions, Hierarchies, Operations, Modeling
CLUSTER BY: A NEW SQL EXTENSION FOR SPATIAL DATA AGGREGATION
Database management concepts
Prof. R. Bayer, Ph.D. Dr. Volker Markl
Presentation transcript:

© 1999 FORWISS General Research Report Implementation and Optimization Issues of the ROLAP Algebra F. Ramsak, M.S. (UIUC) Dr. V. Markl Prof. R. Bayer, Ph.D.

© 1999 FORWISS Contents l Motivation l ROLAP Algebra Recap l Optimization Issues – Handling of Restrictions – Aggregation Networks l Future Work & Summary

© 1999 FORWISS Example DW Model

© 1999 FORWISS User‘s View of a Report Grouping combinations used to fill pivot table: (1){Y,Q,R,N} (2){Y,Q,R} (3) {Y,Q} (4) {Y,R,N}(5){Y,R} (6){Y} (7) {R,N}(8) {R} (9){} = ALL

© 1999 FORWISS POT: Pivot Organized Tuples l We may also write for POT(R,G,F). G,FG,F (R)(R)

© 1999 FORWISS POT-Example POT(R,{{A},{A,B}},{sum(D)}) yields the table: AB Sum( D ) a 1 ALL*** … n ALL*** a 1 b 1 …… a n …… a 1 b m …… a n b m a

© 1999 FORWISS POT Extension: Group Filtering l Filtering of generated groups (like with the HAVING clause in SQL) with H containing a predicate H[g] for each grouping g in G

© 1999 FORWISS Group Filtering Example l Report Years, Product-Group sales totals and sales/year for PGs with less than 10 Mio sales

© 1999 FORWISS Straight Forward SQL Generation l POT(R,{{A},{A,B}},{sum(D)}) maps directly to: SELECT A, ‘ALL’, sum(D) FROM R GROUP BY A UNION SELECTA, B, sum(D) FROM R GROUP BY A,B l Disadvantages: – Efficient execution depends on optimizer of underlying DBMS – no UB-Tree support on SQL interface guaranteed

© 1999 FORWISS Handling Restrictions l Semantic of ALL value l Pushing Restrictions Down – Pushing Through POT: Restrictions on all groups – Pushing down inside POT: Restrictions on individual groups may be pushed down (i.e., before grouping) if they do not contain constraints on the aggregation results

© 1999 FORWISS )(Salessum   )(, ALLpad MonthYear  MonthYear,  )(Salessum  FACT   )(, ALLpad MonthYear  ) ( 

© 1999 FORWISS )( Salessum   )(, ALLpad MonthYear  MonthYear,  )( Salessum  FACT   )(, ALLpad MonthYear  ) ( 

© 1999 FORWISS Aggregation Networks l Efficient generation of multiple groups – Usage of previous generated (more finer) groups instead of fact table as starting point – Only one access to the fact table for multiple groups l Problems: Size of aggregation nets – Hierarchy semantic reduces aggregation nets significantly l UB-Tree & Tetris techniques have high potential to optimize aggregation nets – Grouping requires sorting – Sorted writing of large temporary results saves additional processing time

© 1999 FORWISS Example of an Aggregation Network (Year, Month, Productgroup) (Year)(Productgroup) (Month) (Year, Month)(Year, Productgroup)(Month, Productgroup) ( )

© 1999 FORWISS Aggregation Net with Hierarchies (Year, Month, Productgroup) (Year)(Productgroup) (Year, Month)(Year, Productgroup) ( ) Tetris: sort according to Y Sort according to PG (or sorted writing+Scan)

© 1999 FORWISS POT and Aggregation Nets

© 1999 FORWISS Optimization Issues of AggregationNets l Find minimal spanning tree for the specified groupings – Vertices: groupings – Edge weights: cost of computing new group# l Cost factors: – Group size – Required sorting –...

© 1999 FORWISS Optimization Issues of Aggregation Nets (Year, Month, Productgroup) (Year)(Productgroup) (Year, Month)(Year, Productgroup) ( ) C1C1 C2C2 C5C5 C4C4 C3C3 C6C6 C7C7

© 1999 FORWISS Summary and Future Work l Aggregation networks have a very potential to speed up POT operations l Standard grouping/aggregation algorithms may benefit significantly from UB-Tree/Tetris techniques l Upon availability of resources: Implementation of basic ROLAP algebra processing as part of a master thesis