R-Trees and applications Efficient grid queries in the Dynamo reservoir modelling software.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
1 Top-k Spatial Joins
Danzhou Liu Ee-Peng Lim Wee-Keong Ng
Access Methods for Advanced Database Applications.
Course Outline Introduction in algorithms and applications Parallel machines and architectures Overview of parallel machines, trends in top-500 Cluster.
Preferential top-k search over local data dissertation thesis RNDr. Martin Šumák supervisor: doc. RNDr. Stanislav Krajči, PhD. consultant: RNDr. Peter.
1 Lecture 8: Data structures for databases II Jose M. Peña
Spatial Mining.
Indexing Network Voronoi Diagrams*
2-dimensional indexing structure
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Spatial Access Methods Chapter 26 of book Read only 26.1, 26.2, 26.6 Dr Eamonn Keogh Computer Science & Engineering Department University of California.
Accessing Spatial Data
Liang, Introduction to Java Programming, Eighth Edition, (c) 2011 Pearson Education, Inc. All rights reserved Chapter Trees and B-Trees.
Spatial Information Systems (SIS) COMP Spatial access methods: Indexing.
Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis.
Lars Arge1, Mark de Berg2, Herman Haverkort3 and Ke Yi1
A New Point Access Method based on Wavelet Trees Nieves R. Brisaboa, Miguel R. Luaces, Diego Seco Database Laboratory University of A Coruña A Coruña,
Chapter 3: Data Storage and Access Methods
NPS Introduction to GIS: Lecture 1
1 Lecture 20: Indexes Friday, February 25, Outline Representing data elements (12) Index structures (13.1, 13.2) B-trees (13.3)
1 Database indices Database Systems manage very large amounts of data. –Examples: student database for NWU Social Security database To facilitate queries,
B + -Trees (Part 1). Motivation AVL tree with N nodes is an excellent data structure for searching, indexing, etc. –The Big-Oh analysis shows most operations.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
1 Database Tuning Rasmus Pagh and S. Srinivasa Rao IT University of Copenhagen Spring 2007 February 8, 2007 Tree Indexes Lecture based on [RG, Chapter.
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates
R-Trees: A Dynamic Index Structure for Spatial Data Antonin Guttman.
Indexing. Goals: Store large files Support multiple search keys Support efficient insert, delete, and range queries.
ICS 220 – Data Structures and Algorithms Week 7 Dr. Ken Cosh.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Storage CMSC 461 Michael Wilson. Database storage  At some point, database information must be stored in some format  It’d be impossible to store hundreds.
Trees for spatial data representation and searching
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Database management Systems, 3ed, R. Ramakrishnan and J. Gehrke1 Spatial Data Management Chapter 28.
1 B Trees - Motivation Recall our discussion on AVL-trees –The maximum height of an AVL-tree with n-nodes is log 2 (n) since the branching factor (degree,
1 SD-Rtree: A Scalable Distributed Rtree Witold Litwin & Cédric du Mouza & Philippe Rigaux.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Indexing for Multidimensional Data An Introduction.
CSC 213 – Large Scale Programming. Problems with Search Trees  Great at organizing information for searching  Processing is maintained at consistent.
Indexing for Multidimensional Data An Introduction.
Optimal insert methods of geographical information to Spatio- temporal DB Final Presentation Industrial Project June 17,2012 Students: Michael Tsalenko.
R-Tree. 2 Spatial Database (Ia) Consider: Given a city map, ‘index’ all university buildings in an efficient structure for quick topological search.
Antonin Guttman In Proceedings of the 1984 ACM SIGMOD international conference on Management of data (SIGMOD '84). ACM, New York, NY, USA.
1 Tree Indexing (1) Linear index is poor for insertion/deletion. Tree index can efficiently support all desired operations: –Insert/delete –Multiple search.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of CHAPTER 12: Multi-way Search Trees Java Software Structures: Designing.
CompSci 100E 39.1 Memory Model  For this course: Assume Uniform Access Time  All elements in an array accessible with same time cost  Reality is somewhat.
Spatial and Geographic Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria University (Karachi Campus)
Spatial Database 2/5/2011 Reference – Ramakrishna Gerhke and Silbershatz.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Spatial and Geographic Databases. Spatial databases store information related to spatial locations, and support efficient storage, indexing and querying.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
CompSci Memory Model  For this course: Assume Uniform Access Time  All elements in an array accessible with same time cost  Reality is somewhat.
Maths & Technologies for Games Spatial Partitioning 2
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
File Processing : Multi-dimensional Index 2015, Spring Pusan National University Ki-Joune Li.
A Spatial Index Structure for High Dimensional Point Data Wei Wang, Jiong Yang, and Richard Muntz Data Mining Lab Department of Computer Science University.
Multidimensional Access Methods Ho Hoang Nguyen Nguyen Thanh Trong Dao Vu Quoc Trung Ngo Phuoc Huong Thien DATABASE.
1 Introduction to Spatial Databases Donghui Zhang CCIS Northeastern University.
1 R-Trees Guttman. 2 Introduction Range queries in multiple dimensions: Computer Aided Design (CAD) Geo-data applications Support special data objects.
Advanced Database Aggregation Query Processing
Spatial Data Management
B-Trees 7/5/2018 4:26 AM Presentation for use with the textbook Data Structures and Algorithms in Java, 6th edition, by M. T. Goodrich, R. Tamassia, and.
Multiway search trees and the (2,4)-tree
Chapter Trees and B-Trees
Chapter Trees and B-Trees
Efficient Distance Computation between Non-Convex Objects
Multidimensional Search Structures
NPS Introduction to GIS: Lecture 1 Based on NIMC and Other Sources.
Presentation transcript:

R-Trees and applications Efficient grid queries in the Dynamo reservoir modelling software

R-Tree Project A.N. Yzelman (UU) Supervisors A.N. Swart (Alten) G.L.G. Sleijpen (UU) Project Owners Eric Haesen (Alten) Hans Molenaar (Shell)

Overview Problem: Querying large spatial databases. Application in Dynamo reservoir simulation software. R-Trees. - Example - Testing - Implementation

Querying spatial data Needed in areas of: 1. Image processing (Ray-tracing / CAD) 2. Geographical information systems (e.g., TomTom) 3. Robotics

Example The San Francisco road network

Typical Queries Containment “What objects are contained in...” Neighbours “What objects are close to...” Intersection “What objects cross...”

The Naive Approach -> Test all objects May take enormous amount of time: nodes for San Francisco Map Conclusion: need a clever data structure

Trees Example: Binary Search tree Each node has max. two children

Trees Searching takes O(log(n)) time (2048 entries take twice the time of 1024 entries!) Insertion/Deletion may be slower Main issue for spatial data: need an ordering

Dynamo Dynamo uses high resolution grids Queries: neighbours, intersections, overlap Currently uses a bisection method, which may be sub-optimal Dynamo may benefit from a custum taylored R-Tree implementation

Example Grid

R-Trees Consider a simple two-dimensional case:

R-Trees Add Bounding Boxes

R-Trees Add Bounding Boxes

R-Trees Add Bounding Boxes

R-Trees Add Bounding Boxes

R-Trees Add Bounding Boxes

R-Trees Add Bounding Boxes

R-Tree Properties Every node contains between m and M data elements unless it is the root. All leaves appear at the same level, and contain all the data elements. The bounding boxes used tightly encloses the objects within.

R-Tree Properties Every node contains between m and M data elements unless it is the root. m and M may be chosen with memory or disk cache size in mind. In the latter case, R-trees reduce to the well known B-trees.

Subtleties By the previous example, an R-tree can be built in many different ways.

R-Trees

Variants By the previous example, we saw an R- tree can be built in many different ways. Leads to many different R-tree variations.

Variants By the previous example, we saw an R- tree can be built in many different ways. Leads to many different R-tree variations. For example: R*-tree, PR-tree, Hilbert-tree, R+-tree, and more...

Testing Use real life geographical data supplied by Shell. Test several R-tree variations. Tune tree parameters to the specifics of Shell data. Quantify results in terms of query time, memory usage and query response quality.

Implementation Object-Oriented C++ Easily extended “Plug & Play” solution for spatial data storage Generic design targeted at n-dimensional polygons in different spaces

Questions?