Chapter 6: UB-tree for Multidimensional Indexing

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

Vorlesung Datawarehousing Table of Contents Prof. Rudolf Bayer, Ph.D. Institut für Informatik, TUM SS 2002.
Temporal Databases S. Srinivasa Rao April 12, 2007
Implementation of Relational Operations (Part 2) R&G - Chapters 12 and 14.
Augmenting Data Structures Advanced Algorithms & Data Structures Lecture Theme 07 – Part I Prof. Dr. Th. Ottmann Summer Semester 2006.
Multidimensional Data
Spatial Indexing I Point Access Methods. PAMs Point Access Methods Multidimensional Hashing: Grid File Exponential growth of the directory Hierarchical.
2-dimensional indexing structure
1 Partition Filter Set for Power- Efficient Packet Classification Authors: Haibin Lu, MianPan Publisher: IEEE GLOBECOM 2006 Present: Chen-Yu Lin Date:
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods.
Self-Balancing Search Trees Chapter 11. Chapter 11: Self-Balancing Search Trees2 Chapter Objectives To understand the impact that balance has on the performance.
1 Geometric index structures April 15, 2004 Based on GUW Chapter , [Arge01] Sections 1, 2.1 (persistent B- trees), 3-4 (static versions.
Self-Balancing Search Trees Chapter 11. Chapter Objectives  To understand the impact that balance has on the performance of binary search trees  To.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
O RTHOGONAL R ANGE S EARCHING الهه اسلامی فروردین 92, 1.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Trees for spatial indexing
CHP - 9 File Structures. INTRODUCTION In some of the previous chapters, we have discussed representations of and operations on data structures. These.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Indexing for Multidimensional Data An Introduction.
R ++ -tree: an efficient spatial access method for highly redundant point data Martin Šumák, Peter Gurský University of P. J. Šafárik in Košice.
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
Prof. Bayer, DWH, Ch.4, SS Chapter 4: Dimensions, Hierarchies, Operations, Modeling.
Reporter : Yu Shing Li 1.  Introduction  Querying and update in the cloud  Multi-dimensional index R-Tree and KD-tree Basic Structure Pruning Irrelevant.
Prof. Bayer, DWH, Ch.5, SS Chapter 5. Indexing for DWH D1Facts D2.
Database Systems Laboratory The Pyramid-Technique: Towards Breaking the Curse of Dimensionality Stefan Berchtold, Christian Bohm, and Hans-Peter Kriegal.
Indexing and B+-Trees By Kenneth Cheung CS 157B TR 07:30-08:45 Professor Lee.
File Processing : Multi-dimensional Index 2015, Spring Pusan National University Ki-Joune Li.
Prof. Bayer, DWH, Ch.6, SS Chapter 6: UB-tree for Multidimensional Indexing Note: all relational databases are multidimensional: a tuple in a relation.
ITEC 2620M Introduction to Data Structures Instructor: Prof. Z. Yang Course Website: ec2620m.htm Office: TEL 3049.
DS.H.1 Hashing Chapter 5 Overview The General Idea Hash Functions Separate Chaining Open Addressing Rehashing Extendible Hashing Application Example: Geometric.
Presenters : Virag Kothari,Vandana Ayyalasomayajula Date: 04/21/2010.
Fast Subsequence Matching in Time-Series Databases.
Spatial Data Management
Data Indexing Herbert A. Evans.
CPS216: Data-intensive Computing Systems
CHP - 9 File Structures.
Indexing Goals: Store large files Support multiple search keys
Multidimensional Access Structures
Indexing ? Why ? Need to locate the actual records on disk without having to read the entire table into memory.
Multiway range trees: scalable IP lookup with fast updates
Spatial Indexing I Point Access Methods.
The Quad tree The index is represented as a quaternary tree
Orthogonal Range Searching and Kd-Trees
Chapter 11: Indexing and Hashing
Quadtrees 1.
Chapter 4: Dimensions, Hierarchies, Operations, Modeling
G-trees Region identifier: string of bits built as follows
Multidimensional Indexes
B-Tree.
B+Trees The slides for this text are organized into chapters. This lecture covers Chapter 9. Chapter 1: Introduction to Database Systems Chapter 2: The.
CPS216: Advanced Database Systems
File Processing : Multi-dimensional Index
Multiway Search Tree (MST)
File Organization.
Lecture 20: Indexes Monday, February 27, 2006.
Tree-Structured Indexes
Prof. R. Bayer, Ph.D. Dr. Volker Markl
Chapter 11: Indexing and Hashing
Slides based on those originally by : Parminder Jeet Kaur
Chapt. 7 Multidimensional Hierarchical Clustering
Chapter 10.1: UB-tree for Multidimensional Indexing
ICOM 5016 – Introduction to Database Systems
Ch. 16: Sweep-Zones Basic Question: Is it possible to compute nearest neighbors in expected time O(n*log(n)) ??? Basic Idea: Generalize sweep-lines to.
Unit 12 Index in Database 大量資料存取方法之研究 Approaches to Access/Store Large Data 楊維邦 博士 國立東華大學 資訊管理系教授.
Presentation transcript:

Chapter 6: UB-tree for Multidimensional Indexing Chapter 6.1 Introduction Note: all relational databases are multidimensional: a tuple in a relation with m attributes is considered as a point in m-dimensional space. Supplies Parts Suppliers S# P# P#, S# Prof. Bayer, DWH, Ch.6, SS 2000

The relation competes resulting from the relationship is Stadium S# competes Team A Team B A# B# A#, S#, B# The relation competes resulting from the relationship is 3-dimensional just considering keys, even more with additional attributes Prof. Bayer, DWH, Ch.6, SS 2000

Geographic Data: weather stations: ( X, Y, Z, time, temp, humidity, wind velocity, ...) Fundamental Problem: how to partition multidimensional space for fast search, insertion, deletion? 3 UB-tree relies on basic concepts: area address region Prof. Bayer, DWH, Ch.6, SS 2000

Areas and Addresses Definition: An area A is a special subspace of the hypercube universe constructed as follows: partition the m-dim cube into 2m subcubes of equal size and number them 1,2,...,2m 1. at level 1 take the first a1 subcubes 2. at level 2 take the first a2 subcubes of subcube a1+1 of level 1 3. at level 3 take the first a3 subcubes of subcube a2+1 of level 2 and in general: k. at level k take the first ak subcubes of subcube ak-1+1 of level k-1 etc. Prof. Bayer, DWH, Ch.6, SS 2000

according to the preceding definition. Definition: The address of an area A is the sequence of subcube numbers a1.a2. ... .ak = alpha (A) = a (A) according to the preceding definition. Note: Lexicographic increase of address makes area bigger Theorem: a < b <==> Area(a)  Area(b) Concept of region Def. If a < b , then region [a : b ] = r [a : b ] := Area (b) - Area(a) Prof. Bayer, DWH, Ch.6, SS 2000

Note: for an increasing sequence of addresses a1 < a2 < a3 < ... < ak we define corresponding regions r1 := (0 : a1] r2 := (a1 : a2] ... rk := (ak-1 : ak] which can also be represented as the increasing sequence: a1 ; a2 ; a3 ; ... ;ak (which will later be stored in a B-tree index) Prof. Bayer, DWH, Ch.6, SS 2000

??? Definitions for area from Japan paper??? Alternative definition: an area is the set of points on an initial interval of the space filling Z-curve ??? Definitions for area from Japan paper??? Examples: areas with addresses 2 3.2.1 2.1.3.1.2 Prof. Bayer, DWH, Ch.6, SS 2000

Idea 1: store data in region rj on a disk page Pj Idea 2: store addresses a1 ; a2 ; a3 ; ... ;ak in a B-tree or B*-tree or Prefix B-tree P1, …, Pk are the leaves of the B-tree Fundamental Question: How to split spatial region, if the leaf page of a B-tree must be split? Prof. Bayer, DWH, Ch.6, SS 2000

Points and Coordinates point p = (x1, x2, ... , xm) At a certain resolution, p is a small square (pixel) Def: Address alpha(p) = address ot that area, whose last point is p Def: cart() = Cartesian coordinate of point p with address  Lemma: cart (alpha (p)) = p cart (alpha (x1, x2, ... , xm)) = (x1, x2, ... , xm) Prof. Bayer, DWH, Ch.6, SS 2000