© 1999 FORWISS FORWISS MISTRAL und DWH 6-2 Processing Relational Queries Using the Multidimensional Access Method UB-Tree Prof. R. Bayer, Ph.D. Dr. Volker.

Slides:



Advertisements
Similar presentations
Vorlesung Datawarehousing Table of Contents Prof. Rudolf Bayer, Ph.D. Institut für Informatik, TUM SS 2002.
Advertisements

Tuning: overview Rewrite SQL (Leccotech)Leccotech Create Index Redefine Main memory structures (SGA in Oracle) Change the Block Size Materialized Views,
DBMS 2001Notes 4.2: Hashing1 Principles of Database Management Systems 4.2: Hashing Techniques Pekka Kilpeläinen (after Stanford CS245 slide originals.
Hashing and Indexing John Ortiz.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree.
INDEXING AND HASHING.
Query Execution Professor: Dr T.Y. Lin Prepared by, Mudra Patel Class id: 113.
IS 4420 Database Fundamentals Chapter 6: Physical Database Design and Performance Leon Chen.
Data Indexing Herbert A. Evans. Purposes of Data Indexing What is Data Indexing? Why is it important?
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part A Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Chapter 3: Data Storage and Access Methods
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
1 Lecture 20: Indexes Friday, February 25, Outline Representing data elements (12) Index structures (13.1, 13.2) B-trees (13.3)
Query Execution Professor: Dr T.Y. Lin Prepared by, Mudra Patel Class id: 113.
What is Where? u Getting Started With Geographic Information Systems u Chapter 5.
1 9 Data Warehouse CSC5301 Hachim Haddouti. 2 9 About Me u Hachim Haddouti, born in 1969, married, one baby 9 weeks u Ph.D. in Computer Science (Database.
Chapter 8 Physical Database Design. McGraw-Hill/Irwin © 2004 The McGraw-Hill Companies, Inc. All rights reserved. Outline Overview of Physical Database.
1 CS143: Index. 2 Topics to Learn Important concepts –Dense index vs. sparse index –Primary index vs. secondary index (= clustering index vs. non-clustering.
CS4432: Database Systems II
Chapter 61 Chapter 6 Index Structures for Files. Chapter 62 Indexes Indexes are additional auxiliary access structures with typically provide either faster.
Indexing. Goals: Store large files Support multiple search keys Support efficient insert, delete, and range queries.
Cloud Computing Lecture Column Store – alternative organization for big relational data.
ASP.NET Programming with C# and SQL Server First Edition
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
Oracle Data Block Oracle Concepts Manual. Oracle Rows Oracle Concepts Manual.
1 © Prentice Hall, 2002 Physical Database Design Dr. Bijoy Bordoloi.
1 Chapter 12: Indexing and Hashing Indexing Indexing Basic Concepts Basic Concepts Ordered Indices Ordered Indices B+-Tree Index Files B+-Tree Index Files.
TM 7-1 Copyright © 1999 Addison Wesley Longman, Inc. Physical Database Design.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
1 CPS216: Advanced Database Systems Notes 04: Operators for Data Access Shivnath Babu.
Chapter 6 1 © Prentice Hall, 2002 The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited) Project Identification and Selection Project Initiation.
ICS 321 Fall 2011 Overview of Storage & Indexing (i) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 11/9/20111Lipyeow.
Efficiently Processing Queries on Interval-and-Value Tuples in Relational Databases Jost Enderle, Nicole Schneider, Thomas Seidl RWTH Aachen University,
© 2000 FORWISS, 1 MISTRAL Processing Relational Queries Using a Multidimensional Access Method.
CS 405G: Introduction to Database Systems 21 Storage Chen Qian University of Kentucky.
© 1999 FORWISS FORWISS MISTRAL Performance of TPC-D Benchmark and Datawarehouses Prof. R. Bayer, Ph.D. Dr. Volker Markl Dept. of Computer Science, Technical.
Query Execution Section 15.1 Shweta Athalye CS257: Database Systems ID: 118 Section 1.
Reporter : Yu Shing Li 1.  Introduction  Querying and update in the cloud  Multi-dimensional index R-Tree and KD-tree Basic Structure Pruning Irrelevant.
C-Store: Data Model and Data Organization Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY May 17, 2010.
Creating and Maintaining Geographic Databases. Outline Definitions Characteristics of DBMS Types of database Relational model SQL Spatial databases.
Prof. Bayer, DWH, Ch.5, SS Chapter 5. Indexing for DWH D1Facts D2.
© 1999 FORWISS General Research Report Implementation and Optimization Issues of the ROLAP Algebra F. Ramsak, M.S. (UIUC) Dr. V. Markl Prof. R. Bayer,
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 7 (Part II) INTRODUCTION TO STRUCTURED QUERY LANGUAGE (SQL) Instructor.
Indexes and Views Unit 7.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Chapter 8 Physical Database Design. Outline Overview of Physical Database Design Inputs of Physical Database Design File Structures Query Optimization.
Lecture 10 Creating and Maintaining Geographic Databases Longley et al., Ch. 10, through section 10.4.
Indexing OLAP Data Sunita Sarawagi Monowar Hossain York University.
DMBS Internals I February 24 th, What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
CS411 Database Systems Kazuhiro Minami 10: Indexing-1.
Prof. Bayer, DWH, Ch.6, SS Chapter 6: UB-tree for Multidimensional Indexing Note: all relational databases are multidimensional: a tuple in a relation.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8.
7 1 Database Systems: Design, Implementation, & Management, 7 th Edition, Rob & Coronel 7.6 Advanced Select Queries SQL provides useful functions that.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8 Jianping Fan Dept of Computer Science UNC-Charlotte.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Multidimensional Access Structures COMP3017 Advanced Databases Dr Nicholas Gibbins –
Module 11: File Structure
CPS216: Data-intensive Computing Systems
Indexing Structures for Files and Physical Database Design
Indexing Goals: Store large files Support multiple search keys
COMP 430 Intro. to Database Systems
Chapter 11: Indexing and Hashing
CPSC-310 Database Systems
Database Design and Programming
Indexing 4/11/2019.
Chapter 11: Indexing and Hashing
Prof. R. Bayer, Ph.D. Dr. Volker Markl
Chapter 11: Indexing and Hashing
Index Structures Chapter 13 of GUW September 16, 2019
Presentation transcript:

© 1999 FORWISS FORWISS MISTRAL und DWH 6-2 Processing Relational Queries Using the Multidimensional Access Method UB-Tree Prof. R. Bayer, Ph.D. Dr. Volker Markl Dept. of Computer Science, Technical University Munich and Bavarian Research Center for Knowledgebased Systems (FORWISS)

© 1999 FORWISS FORWISS Agenda 1. Concept of the UB-Tree 2. Range Query Algorithm 3. Tetris Algorithm

© 1999 FORWISS FORWISS Next Generation DB-Applications - huge databases - multidimensional - complex queries Examples: - Datawarehouses - geographic informationsystems - time series

© 1999 FORWISS FORWISS B-Tree and UB-Tree B-Tree invented in 1969, published in 1970/72 enabling technology for all commercial DBMS for 25 years UB-Tree invented in 1996 enabling technology for next generation DB applications

© 1999 FORWISS FORWISS UB-Tree Access method (TransBase HC) enable next generations DB-applications!! buying patterns of consumers geographic data and time series datamining mobile phone calls 10 million users * 10 calls/day = 100 million records caller callee time duration geographic location ~ 100 Bytes/call  10 GB/day = 3.6 TeraByte/year

© 1999 FORWISS FORWISS Design Goals l clustering tuples on disk pages while preserving spatial proximity l efficient incremental organization l logarithmic worst-case guarantees for insertion, deletion and point queries l efficient handling of range queries l good average memory utilization  revolution in DB-applications

© 1999 FORWISS FORWISS Z-regions/UB-Trees A Z-region [  :  ] is the space covered by an interval on the Z-curve and is defined by two Z-addresses  and . UB-Tree partitioning: 0.1] 1.1.1] 2.1] 3] 4] Z-region [0.1 : 1.1.1] point data creating the UB-Tree on the left for a page capacity of 2 points

© 1999 FORWISS FORWISS UB-Tree Insertion 1/2/3/4 Note: partioning is not unique!!

© 1999 FORWISS FORWISS UB-Tree Insertion 18/19

© 1999 FORWISS FORWISS Multidimensional Range Query SELECT * FROM table WHERE (A 1 BETWEEN a 1 AND b 1 ) AND (A 2 BETWEEN a 2 AND b 2 ) AND..... (A n BETWEEN a n AND b n )

© 1999 FORWISS FORWISS Comparison of the Rangequery Performance ideal case s 1 *s 2 *P multidimensional index s 1  *s 2  *P multiple B-Trees, bitmap indexes s 1 *I 1 +s 2 *I 2 +s 1 *s 2 *T compound primary B-Tree s 1 *P

© 1999 FORWISS FORWISS Example of a Range Query rangequery(ql,qh) =  := Z(ql);  := Z(qh) while  <  p := read page(  ) output intersection(p,ql,qh)  := getnext( ,ql,qh) getnext( ,ql,qh) = l := last-intersection-in-region( ,ql,qh) f := first-intersection-in- brother/father(l,ql,qh)) return alpha(f)

© 1999 FORWISS FORWISS Two Visualized Range-Queries

© 1999 FORWISS FORWISS Range Queries in sparsely and densely populated parts of the Universe

© 1999 FORWISS FORWISS Range Queries and Growing Databases 1000 tuples tuples

© 1999 FORWISS FORWISS Summary UB-Trees ü 50%-83% storage utilization, dynamic updates ü Efficient Z-address calculation (bit-interleaving) ü Logarithmic performance for the basic operations ü Efficient range query algorithm (bit-operations) ü Prototype UB/API above RDBMS (Oracle 8, Informix, DB2 UDB, TransBase, soon: MS SQL 7.0) using ESQL/C ? Patent application ? most DBMS applications benefit

© 1999 FORWISS FORWISS Sorting Query Boxes SELECT * FROM table WHERE (A 1 BETWEEN a 1 AND b 1 ) AND (A 2 BETWEEN a 2 AND b 2 ) AND..... (A n BETWEEN a n AND b n ) ORDER BY A i, A j, A k,... (GROUP BY A i, A j, A k,...)

© 1999 FORWISS FORWISS The Tetris Algorithm sort direction

© 1999 FORWISS FORWISS Summary Tetris l Combines sort process and evaluation of multi-attribute restrictions in one processing step l I/O-time linear w.r. to result set size l temporary storage sublinear w.r. to result set size l Sorting no longer a “blocking operation” ? Patent application ?  speedup for all DB operations