File Processing : Multi-dimensional Index 2015, Spring Pusan National University Ki-Joune Li.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part C Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Multidimensional Indexing
Chapter 11 Indexing and Hashing (2) Yonsei University 2 nd Semester, 2013 Sanghyun Park.
Access Methods for Advanced Database Applications.
File Processing : Hash 2015, Spring Pusan National University Ki-Joune Li.
Multidimensional Data
Preferential top-k search over local data dissertation thesis RNDr. Martin Šumák supervisor: doc. RNDr. Stanislav Krajči, PhD. consultant: RNDr. Peter.
Spatial Indexing I Point Access Methods. PAMs Point Access Methods Multidimensional Hashing: Grid File Exponential growth of the directory Hierarchical.
2-dimensional indexing structure
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Multiple-key indexes Index on one attribute provides pointer to an index on the other. If V is a value of the first attribute, then the index we reach.
Spatial Indexing SAMs. Spatial Access Methods PAMs Grid File kd-tree based (LSD-, hB- trees) Z-ordering + B+-tree R-tree Variations: R*-tree, Hilbert.
Accessing Spatial Data
Spatial Indexing SAMs.
Multi-dimensional Indexes
Spatial Information Systems (SIS) COMP Spatial access methods: Indexing.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Chapter 3: Data Storage and Access Methods
Spatial Indexing I Point Access Methods.
Techniques and Data Structures for Efficient Multimedia Similarity Search.
1 Geometric index structures April 15, 2004 Based on GUW Chapter , [Arge01] Sections 1, 2.1 (persistent B- trees), 3-4 (static versions.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Spatial Indexing SAMs. Spatial Access Methods PAMs Grid File kd-tree based (LSD-, hB- trees) Z-ordering + B+-tree R-tree Variations: R*-tree, Hilbert.
 Spatial data requires special data structures, similar to B-trees.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
1 CS143: Index. 2 Topics to Learn Important concepts –Dense index vs. sparse index –Primary index vs. secondary index (= clustering index vs. non-clustering.
R-TREES: A Dynamic Index Structure for Spatial Searching by A. Guttman, SIGMOD Shahram Ghandeharizadeh Computer Science Department University of.
Graphics Application Lab Myoung-Ah Kang, Sylvie Servigne,Ki-Joune Li, Robert Laurini CIKM’ 99 Kim Hyong-Jun, Yoon Tai-Jin GA Lab. Indexing Field Values.
Indexing and Hashing.
R-Trees: A Dynamic Index Structure for Spatial Data Antonin Guttman.
Indexing and Hashing (emphasis on B+ trees) By Huy Nguyen Cs157b TR Lee, Sin-Min.
R-Trees Extension of B+-trees.  Collection of d-dimensional rectangles.  A point in d-dimensions is a trivial rectangle.
Trees for spatial indexing
Trees for spatial data representation and searching
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Database management Systems, 3ed, R. Ramakrishnan and J. Gehrke1 Spatial Data Management Chapter 28.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Indexing for Multidimensional Data An Introduction.
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
Mehdi Mohammadi March Western Michigan University Department of Computer Science CS Advanced Data Structure.
12.1 Chapter 12: Indexing and Hashing Spring 2009 Sections , , Problems , 12.7, 12.8, 12.13, 12.15,
File Processing : Index and Hash 2015, Spring Pusan National University Ki-Joune Li.
Em Spatiotemporal Database Laboratory Pusan National University File Processing : Index and Hash 2004, Spring Pusan National University Ki-Joune Li.
R-Tree. 2 Spatial Database (Ia) Consider: Given a city map, ‘index’ all university buildings in an efficient structure for quick topological search.
Antonin Guttman In Proceedings of the 1984 ACM SIGMOD international conference on Management of data (SIGMOD '84). ACM, New York, NY, USA.
1 File Processing : File Organization and File Systems 2015, Spring Pusan National University Ki-Joune Li.
Spatial Databases - Indexing
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
Grid Files Multi-dimensional Index Structures. Jaruloj Chongstitvatana 2006Grid Files 2 Properties of Grid Files  Support multi-dimensional data, but.
Chapter 11 Indexing And Hashing (1) Yonsei University 1 st Semester, 2016 Sanghyun Park.
ITEC 2620M Introduction to Data Structures Instructor: Prof. Z. Yang Course Website: ec2620m.htm Office: TEL 3049.
Jun-Ki Min. Slide  Such a multi-level index is a form of search tr ee ◦ However, insertion and deletion of new index entrie s is a severe problem.
Jeremy Iverson & Zhang Yun 1.  Chapter 6 Key Concepts ◦ Structures and access methods ◦ R-Tree  R*-Tree  Mobile Object Indexing  Questions 2.
Multidimensional Access Structures COMP3017 Advanced Databases Dr Nicholas Gibbins –
Em Spatiotemporal Database Laboratory Pusan National University File Processing : Hash 2004, Spring Pusan National University Ki-Joune Li.
CS422 Principles of Database Systems Indexes Chengyu Sun California State University, Los Angeles.
Spatial Data Management
Multidimensional Access Structures
Spatial Indexing I Point Access Methods.
Database Management Systems (CS 564)
CPSC-310 Database Systems
Multidimensional Indexes
Spatial Indexing I R-trees
File Processing : Index and Hash
2018, Spring Pusan National University Ki-Joune Li
File Processing : Multi-dimensional Index
Presentation transcript:

File Processing : Multi-dimensional Index 2015, Spring Pusan National University Ki-Joune Li

STEMPNU Multi-Dimensional Index Multi-Attributes Query vs. Single Attribute Query  Single Attribute : Only ONE attribute to specify query condition Example : Find Students whose record is in [3.5, 4.5]  Multi-Attributes : Several attributes Example : Find students whose height is greater than 180 cm and weight is less than 70 Kg Each attribute corresponds to a dimension  Multi-Attribute Query : Multi-Dimensional Query

STEMPNU Processing Multi-dimensional Queries Example : Find students whose height > 180 cm and weight < 70 Kg Method 1 : Using a B+-tree  Step 1 : Apply B+-tree to search student taller than 180 cm  Step 2 : Search students lighter than 70 Kg from the result of step 1  Height and Weight or Weight and Height ? 180 < 70 Result

STEMPNU Processing Multi-dimensional Queries Method 2 : Using Two B+-trees  Step 1 : Result 1 ← Students taller than 180 cm by B+-tree  Step 2 : Result 2 ← Students lighter than 70 Kg by B+-tree  Step 3 : Result ← Result 1  Result 2 Comparison of Method 1 and Method  < 70 Result ==

STEMPNU Processing Multi-dimensional Queries Method 3 : Unified Index for Several Attributes  One index for several attributes  Multi-Dimensional Space  Two approaches Extending B+-tree Extending Dynamic Hashing Index for Height and Weight Weight Height

STEMPNU block pointer... Block Pointer Array Extending Hashing : Grid Approach Weight Height Query Fixed Variable Fixed Grid Method Grid File

STEMPNU Extending Hashing : Grid File Directory (x 1, y 1 )(x 2, y 2 )Block Pointer Query

STEMPNU Problem 1: Dead Space Query No objects in this query area 5 block accesses Dead Space  Empty space with no objects How to reduce dead space

STEMPNU Minimum Bounding Rectangle Query MBR (Minimum Bounding Rectangle) Only 1 Disk Access

STEMPNU Problem 2: Non-Point Object Where to store this object

STEMPNU Minimum Bounding Rectangle MBR (Minimum Bounding Box)  Two dimensional geometric simplification of objects  Not the Whole space,  only in the region occupied by objects (X 1min, X 2min ) (X 1max, X 2max )

STEMPNU Extending B+-tree : R-tree B+-tree vs. R-tree  B+-tree : Interval (1-D rectangle)  R-tree : Multi-Dimensional Interval (Rectangle) R-tree : Rectangle B+-tree  Each Node MBR (Minimum Bounding Rectangle) instead of Interval (or Delimiter)  No Linked-List for External Nodes  A certain amount of overlapping is indispensable

STEMPNU Extending B+-tree : R-tree Example Query Root

STEMPNU Upward Split like B-tree Split MBR in the case of overflow  Line sweeping : Compare Cost-X and Cost-Y  Splitting Line New MBR

STEMPNU Splitting Strategy 50:50 Split Instead of 50:50 split, other cost measures  Area,  Perimeter  Overlapping Area Good SplitBad Split 1. Make them as COMPACT as possible 2. Preserve spatial proximity as possible

STEMPNU R*-tree: An Improvement of R-tree Re-Insertion Strategy on Overflow Most Popular Index for Multi-Dimensional Index Newly Inserted Object Delete and Re-Insert this Overflow Re-Inserted Object More Compact