Prof. Bayer, DWH, Ch.6, SS 20021 Chapter 6: UB-tree for Multidimensional Indexing Note: all relational databases are multidimensional: a tuple in a relation.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

Implementation of Relational Operations (Part 2) R&G - Chapters 12 and 14.
Multidimensional Data
1 Lecture 8: Data structures for databases II Jose M. Peña
2-dimensional indexing structure
Spatial indexing PAMs (II).
BTrees & Bitmap Indexes
Data Indexing Herbert A. Evans. Purposes of Data Indexing What is Data Indexing? Why is it important?
2010/3/81 Lecture 8 on Physical Database DBMS has a view of the database as a collection of stored records, and that view is supported by the file manager.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods.
PROCESS IN DATA SYSTEMS PLANNING DATA INPUT DATA STORAGE DATA ANALYSIS DATA OUTPUT ACTIVITIES USER NEEDS.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
Techniques and Data Structures for Efficient Multimedia Similarity Search.
Self-Balancing Search Trees Chapter 11. Chapter 11: Self-Balancing Search Trees2 Chapter Objectives To understand the impact that balance has on the performance.
1 Geometric index structures April 15, 2004 Based on GUW Chapter , [Arge01] Sections 1, 2.1 (persistent B- trees), 3-4 (static versions.
Self-Balancing Search Trees Chapter 11. Chapter Objectives  To understand the impact that balance has on the performance of binary search trees  To.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
O RTHOGONAL R ANGE S EARCHING الهه اسلامی فروردین 92, 1.
1 Database Tuning Rasmus Pagh and S. Srinivasa Rao IT University of Copenhagen Spring 2007 February 8, 2007 Tree Indexes Lecture based on [RG, Chapter.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
E.G.M. PetrakisB-trees1 Multiway Search Tree (MST)  Generalization of BSTs  Suitable for disk  MST of order n:  Each node has n or fewer sub-trees.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Indexing and Hashing (emphasis on B+ trees) By Huy Nguyen Cs157b TR Lee, Sin-Min.
Indexing structures for files D ƯƠ NG ANH KHOA-QLU13082.
Trees for spatial indexing
CHP - 9 File Structures. INTRODUCTION In some of the previous chapters, we have discussed representations of and operations on data structures. These.
Indexing. Goals: Store large files Support multiple search keys Support efficient insert, delete, and range queries.
Index Structures for Files Indexes speed up the retrieval of records under certain search conditions Indexes called secondary access paths do not affect.
Database Management 8. course. Query types Equality query – Each field has to be equal to a constant Range query – Not all the fields have to be equal.
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Indexing for Multidimensional Data An Introduction.
1 CPS216: Advanced Database Systems Notes 04: Operators for Data Access Shivnath Babu.
1 B-Trees & (a,b)-Trees CS 6310: Advanced Data Structures Western Michigan University Presented by: Lawrence Kalisz.
© 1999 FORWISS FORWISS MISTRAL und DWH 6-2 Processing Relational Queries Using the Multidimensional Access Method UB-Tree Prof. R. Bayer, Ph.D. Dr. Volker.
1 CMSC 341 Extensible Hashing Chapter 5, Section 6 (pp. 200 – 203)
METU Department of Computer Eng Ceng 302 Introduction to DBMS Indexing Structures for Files by Pinar Senkul resources: mostly froom Elmasri, Navathe and.
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
Physical Database Design I, Ch. Eick 1 Physical Database Design I About 25% of Chapter 20 Simple queries:= no joins, no complex aggregate functions Focus.
Prof. Bayer, DWH, Ch.4, SS Chapter 4: Dimensions, Hierarchies, Operations, Modeling.
© 2000 FORWISS, 1 MISTRAL Processing Relational Queries Using a Multidimensional Access Method.
COSC 2007 Data Structures II Chapter 15 External Methods.
2IL50 Data Structures Fall 2015 Lecture 9: Range Searching.
1 Tree Indexing (1) Linear index is poor for insertion/deletion. Tree index can efficiently support all desired operations: –Insert/delete –Multiple search.
Prof. Bayer, DWH, Ch.5, SS Chapter 5. Indexing for DWH D1Facts D2.
B-Tree – Delete Delete 3. Delete 8. Delete
1 CPS216: Data-intensive Computing Systems Operators for Data Access (contd.) Shivnath Babu.
Database Systems Laboratory The Pyramid-Technique: Towards Breaking the Curse of Dimensionality Stefan Berchtold, Christian Bohm, and Hans-Peter Kriegal.
1 CPS216: Advanced Database Systems Notes 05: Operators for Data Access (contd.) Shivnath Babu.
Chapter 3.2 Basic Concepts of the MDD-Model
Index in Database Unit 12 Index in Database 大量資料存取方法之研究 Approaches to Access/Store Large Data 楊維邦 博士 國立東華大學 資訊管理系教授.
Indexing and B+-Trees By Kenneth Cheung CS 157B TR 07:30-08:45 Professor Lee.
Prof. Bayer, DWH, Ch.7, SS20021 Chapt. 7 Multidimensional Hierarchical Clustering Fig. 3.1 Hierarchies in the `Juice and More´ schema Year (3) Month (12)
File Processing : Multi-dimensional Index 2015, Spring Pusan National University Ki-Joune Li.
CS411 Database Systems Kazuhiro Minami 10: Indexing-1.
Multidimensional Access Methods Ho Hoang Nguyen Nguyen Thanh Trong Dao Vu Quoc Trung Ngo Phuoc Huong Thien DATABASE.
ITEC 2620M Introduction to Data Structures Instructor: Prof. Z. Yang Course Website: ec2620m.htm Office: TEL 3049.
Multidimensional Access Structures COMP3017 Advanced Databases Dr Nicholas Gibbins –
DS.H.1 Hashing Chapter 5 Overview The General Idea Hash Functions Separate Chaining Open Addressing Rehashing Extendible Hashing Application Example: Geometric.
Spatial Data Management
CHP - 9 File Structures.
Indexing ? Why ? Need to locate the actual records on disk without having to read the entire table into memory.
Spatial Indexing I Point Access Methods.
Orthogonal Range Searching and Kd-Trees
CPSC-310 Database Systems
Multidimensional Indexes
File Processing : Multi-dimensional Index
Chapter 10.1: UB-tree for Multidimensional Indexing
Chapter 6: UB-tree for Multidimensional Indexing
Presentation transcript:

Prof. Bayer, DWH, Ch.6, SS Chapter 6: UB-tree for Multidimensional Indexing Note: all relational databases are multidimensional: a tuple in a relation with m attributes is considered as a point in m- dimensional space. Parts Supplies Suppliers P#P#, S# S# Chapter 6.1 Introduction

Prof. Bayer, DWH, Ch.6, SS Stadium competes Team A Team B S# A# B# A#, S#, B# The relation competes resulting from the relationship is 3-dimensional just considering keys, even more with additional attributes

Prof. Bayer, DWH, Ch.6, SS Geographic Data: weather stations: ( X, Y, Z, time, temp, humidity, wind velocity,...) Fundamental Problem: how to partition multidimensional space for fast search, insertion, deletion? 3 UB-tree relies on basic concepts: area address region

Prof. Bayer, DWH, Ch.6, SS Areas and Addresses Definition: An area A is a special subspace of the hypercube universe constructed as follows: partition the m-dim cube into 2 m subcubes of equal size and number them 1,2,...,2 m 1. at level 1 take the first a 1 subcubes 2. at level 2 take the first a 2 subcubes of subcube a 1 +1 of level 1 3. at level 3 take the first a 3 subcubes of subcube a 2 +1 of level 2 and in general: k. at level k take the first a k subcubes of subcube a k-1 +1 of level k-1 etc.

Prof. Bayer, DWH, Ch.6, SS Definition: The address of an area A is the sequence of subcube numbers a 1.a a k = alpha (A) =  (A) according to the preceding definition. Note: Lexicographic increase of address makes area bigger Theorem:  Area(  )  Area(  ) Concept of region Def. If  < , then region [  :  ] =  [  :  ] := Area (  ) - Area(  )

Prof. Bayer, DWH, Ch.6, SS Note: for an increasing sequence of addresses  1 <  2 <  3 <...<  k we define corresponding regions  1 := (  :  1 ]  2 := (  1 :  2 ]...  k := (  k-1 :  k ] which can also be represented as the increasing sequence:  1 ;  2 ;  3 ;...;  k (which will later be stored in a B-tree index)

Prof. Bayer, DWH, Ch.6, SS Alternative definition: an area is the set of points on an initial interval of the space filling Z-curve Examples: areas with addresses

Prof. Bayer, DWH, Ch.6, SS Idea 1: store data in region  j on a disk page P j Idea 2: store addresses  1 ;  2 ;  3 ;...;  k in a B-tree or B*-tree or Prefix B-tree P 1, …, P k are the leaves of the B-tree Fundamental Question: How to split spatial region, if the leaf page of a B-tree must be split?

Prof. Bayer, DWH, Ch.6, SS Points and Coordinates point p = (x 1, x 2,..., x m ) At a certain resolution, p is a small square (pixel) Def: Address alpha(p) = address ot that area, whose last point is p Def: cart(  ) = Cartesian coordinate of point p with address  Lemma: cart (alpha (p)) = p cart (alpha (x 1, x 2,..., x m )) = (x 1, x 2,..., x m )