Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Spatial Query Processing for High Resolutions Hans-Peter Kriegel,

Slides:



Advertisements
Similar presentations
Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
Advertisements

Phillip Dickens, Department of Computer Science, University of Maine. In collaboration with Jeremy Logan, Postdoctoral Research Associate, ORNL. Improving.
1 Chapter 5 : Query Processing and Optimization Group 4: Nipun Garg, Surabhi Mithal
Min LuTIMBER: A Native XML DB1 TIMBER: A Native XML Database Author: H.V. Jagadish, etc. Presenter: Min Lu Date: Apr 5, 2005.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
DBMS 2001Notes 4.2: Hashing1 Principles of Database Management Systems 4.2: Hashing Techniques Pekka Kilpeläinen (after Stanford CS245 slide originals.
Emre Yenier Rethinking Database System Architecture: Towards a Self-tuning RISC-style Database System.
Efficient Density-Based Clustering of Complex Objects Stefan Brecheisen, Hans-Peter Kriegel, Martin Pfeifle University of Munich Institute for Computer.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
1 Foundations of Software Design Fall 2002 Marti Hearst Lecture 18: Hash Tables.
INSTITUTE FOR INFORMATICS DATABASE GROUP Region of Interest Queries in CT Scans Matthias Schubert 1 Joint work with Alexander Cavallaro 2, Franz Graf 1,
Spatio-Temporal Databases
ACM GIS An Interactive Framework for Raster Data Spatial Joins Wan Bae (Computer Science, University of Denver) Petr Vojtěchovský (Mathematics,
Database management concepts Database Management Systems (DBMS) An example of a database (relational) Database schema (e.g. relational) Data independence.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
Chapter 3: Data Storage and Access Methods
Google Bigtable A Distributed Storage System for Structured Data Hadi Salimi, Distributed Systems Laboratory, School of Computer Engineering, Iran University.
1 Lecture 20: Indexes Friday, February 25, Outline Representing data elements (12) Index structures (13.1, 13.2) B-trees (13.3)
Hans-Peter Kriegel, Martin Pfeifle, Marco Pötke, Thomas Seidl A Cost Model for Interval Intersection Queries on RI-Trees Institute for Computer Science.
Alternative: Bitmap Indexing Imagine the following query in huge table Find customers living in London, with 2 cars and 3 children occupying a 4 bed house.
San Diego, 06/12/03 San Diego, 06/12/03 Martin Pfeifle, Database Group, University of Munich Using Sets of Feature Vectors for Similarity Search on Voxelized.
Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science.
CPSC-608 Database Systems Fall 2011 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes #12.
Fast Set Intersection in Memory Bolin Ding Arnd Christian König UIUC Microsoft Research.
Compressing Multiresolution Triangle Meshes Emanuele Danovaro, Leila De Floriani, Paola Magillo, Enrico Puppo Department of Computer and Information Sciences.
Introduction. 
XML as a Boxwood Data Structure Feng Zhou, John MacCormick, Lidong Zhou, Nick Murphy, Chandu Thekkath 8/20/04.
Dept. of Electrical Engineering and Computer Science, Northwestern University Context-Aware Optimization of Continuous Query Maintenance for Trajectories.
Approximate Encoding for Direct Access and Query Processing over Compressed Bitmaps Tan Apaydin – The Ohio State University Guadalupe Canahuate – The Ohio.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
PowerPoint Presentation for Dennis, Wixom, & Tegarden Systems Analysis and Design with UML, 4th Edition Copyright © 2009 John Wiley & Sons, Inc. All rights.
PMLAB Finding Similar Image Quickly Using Object Shapes Heng Tao Shen Dept. of Computer Science National University of Singapore Presented by Chin-Yi Tsai.
Exploiting Type and Space in a Main Memory Query Engine Thomas Schwarz Matthias Grossmann, Daniela Nicklas, Bernhard Mitschang Universität Stuttgart, Institute.
Efficiently Processing Queries on Interval-and-Value Tuples in Relational Databases Jost Enderle, Nicole Schneider, Thomas Seidl RWTH Aachen University,
Graph Indexing: A Frequent Structure- based Approach Alicia Cosenza November 26 th, 2007.
Shape-based Similarity Query for Trajectory of Mobile Object NTT Communication Science Laboratories, NTT Corporation, JAPAN. Yutaka Yanagisawa Jun-ichi.
A Study of Balanced Search Trees: Brainstorming a New Balanced Search Tree Anthony Kim, 2005 Computer Systems Research.
Leonardo Guerreiro Azevedo Geraldo Zimbrão Jano Moreira de Souza Approximate Query Processing in Spatial Databases Using Raster Signatures Federal University.
Mining Document Collections to Facilitate Accurate Approximate Entity Matching Presented By Harshda Vabale.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Full-Text Support in a Database Semantic File System Kristen LeFevre & Kevin Roundy Computer Sciences 736.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
PowerPoint Presentation for Dennis, Wixom, & Tegarden Systems Analysis and Design with UML, 5th Edition Copyright © 2015 John Wiley & Sons, Inc. All rights.
Multidimensional Access Methods Ho Hoang Nguyen Nguyen Thanh Trong Dao Vu Quoc Trung Ngo Phuoc Huong Thien DATABASE.
Completeness Criteria for Object- Relational Database Systems by Won Kim April 2002 Sang Ho Lee School of Computing, Soongsil University
Advanced Database Aggregation Query Processing
A Case Study in Building Layered DHT Applications
Spatio-Temporal Databases
CS522 Advanced database Systems
CS422 Principles of Database Systems Course Overview
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
CPSC-608 Database Systems
TT-Join: Efficient Set Containment Join
SCOPE: Scalable Consistency in Structured P2P Systems
Database management concepts
Spatio-Temporal Databases
Query Processing B.Ramamurthy Chapter 12 11/27/2018 B.Ramamurthy.
Joining Interval Data in Relational Databases
Team Project, Part II NOMO Auto, Part II IST 210 Section 4
Selected Topics: External Sorting, Join Algorithms, …
Database management concepts
Database Design and Programming
CPSC-608 Database Systems
CPSC-608 Database Systems
Prof. R. Bayer, Ph.D. Dr. Volker Markl
Efficient Processing of Top-k Spatial Preference Queries
Course Instructor: Supriya Gupta Asstt. Prof
Presentation transcript:

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Spatial Query Processing for High Resolutions Hans-Peter Kriegel, Martin Pfeifle, Marco Pötke, Thomas Seidl 8 th International DASFAA-Conference March, 2003, Kyoto, Japan Database Group Institute for Computer Science University of Munich, Germany

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Outline of the Talk 1.) Introduction 2.) RI-Tree 3.) HRI-Approach 4.) Experimental Evaluation 5.) Conclusions

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich 1.) Introduction 2.) RI-Tree 3.) HRI-Approach 4.) Experimental Evaluation 5.) Conclusions  Spatial Databases  Voxelized Objects  High Resolutions Outline of the Talk

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Spatial Database Management Systems collision queries System Requirements:  Effectivity  Efficiency  Scalability  Concurrency Control  Recovery complex box queries spatialobjects Spatial Database Management Systems (based on extensible ORDBMS)

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Voxelized Spatial Objects triangulated objects voxel sequence 1.) linearization of the data space – grid-approximation – space filling curve interval sequence 2.) interval sequence – bottom-up or top-down – size-bound or error-bound

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Query Processing for High Resolutions Filter - Step Candidate Set for Application Specific Refinement Step Spatial - DB Refinement Step ORDBMSORDBMS CADCAD Result e.g. RI-Tree High Resolution HRI-Approach

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich 1.) Introduction 2.) RI-Tree 3.) HRI-Approach 4.) Experimental Evaluation 5.) Conclusions  Relational Interval Tree [VLDB 00] [VLDB 00]  Extensible Indexing Outline of the Talk

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Relational Interval Tree (RI-Tree) 3a3a 15 a 12 c 5c5c 15 a A B C D 7b7b 1b1b 13 d  Foundation: Interval Tree [ Edelsbrunner 1980 ]

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich D  Foundation: Interval Tree [ Edelsbrunner 1980 ]  1. Idea: Virtualization of the Primary Structure 3a3a 15 a 12 c 5c5c 15 a b7b 1b1b 13 d h – 1 root = 2 h–1 Relational Interval Tree (RI-Tree)

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich  Foundation: Interval Tree [ Edelsbrunner 1980 ]  1. Idea: Virtualization of the Primary Structure  2. Idea: Managing of the Secondary Structure by 2 B + -trees nodelower id bacdbacd nodeupper id bcadbcad lowerIndexupperIndex Relational Interval Tree (RI-Tree) root = 2 h– a3a 15 a 12 c 5c5c 15 a 7b7b 1b1b 13 d A B C D

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich 1. Procedural Step 2. Declarative Step RI-Tree: Intersection Query

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich 1. Procedural Step 16 = root 24 = fork Declarative Step – posting one single SQL-statement – number of I/O-accesses: O(h·log b n + r/b)  arithmetic traversal of the primary structure  collecting the visited nodes in transient tables  number of I/0-accesses: 0 select id from upperIndex i, :leftNodes left where i.node = left.node and i.upper >= :Q.lower union all select id from lowerIndex i, :rightNodes right where i.node = right.node and i.lower <= :Q.upper union all select id from upperIndex i where i.node between :Q.lower and :Q.upper upper = = lower query Q RI-Tree: Intersection Query

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich User-defined Index Structure [VLDB 00] [SSTD 01] Extensible Indexing Framework Object-relational interface for index maintenance and querying functions. Relational Implementation Mapping to built-in indexes (B + -trees); SQL-based query processing Physical Implementation Block-Manager, Caches, Locking, Logging, … User-defined Cost Model [SSDBM 02] Extensible Optimization Framework Object-relational interface for selectivity estimation and cost prediction functions. Relational Implementation Mapping to built-in statistics facilities; SQL-based evaluation of cost model Declarative Embedding Object-relational DML and DDL RI-Tree: Integration into an ORDBMS

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Outline of the Talk 1.) Introduction 2.) RI-Tree 3.) HRI-Approach 4.) Experimental Evaluation 5.) Conclusions  Grey Intervals  Storage of Grey Intervals in an Intervals in an ORDBMS ORDBMS  Intersection Queries based on based on Grey Intervals Grey Intervals

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Grey Intervals A voxelized “real-world” object Black object interval sequence (obtained from encoding voxels via a space filling curve) Grey object interval sequence (obtained from grouping black intervals together) O grey = ( id,   (2, 5), (7, 7) ,  (12, 12), (15, 15),(18, 18) ,  (22, 29) ,  (36, 36), (39, 39),(42, 42) ,  (46, 47)  )  (2, 5), (7, 7)  Grey Interval

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich +- bit-orientedapproachoffset-orientedapproach Storage of a Grey Object Interval Sequence O (L) w1w1 w1w2w3w4w1w2w3w4 O (n * log L) w2w2 w3w3 w4w4 table schema GreyIntervals (id, intervalsequence) BLOB aggregated information H(I grey ), D(I grey ), G(I grey )

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Multi-Step Query Processing for Intersection Queries A 1 Q 1 A 3 Q 2 A 2 Q 1 B 3 Q 1 B 2 Q 1 C 1 Q 1 C 2 Q 1 C 2 Q 2 C 3 Q 2 D 1 Q 1 A 1 Q 1 A 3 Q 2 C 1 Q 1 DB Query Q A 1 Q 1 A 3 Q 2 A 2 Q 1 + B 3 Q 1 C 1 Q 1 C 2 Q 1 B 2 Q 1 ? result set Interval Index 1. filter step FAST GREY TEST 2. filter step BLOB TEST 3. filter step D1 Q1D1 Q1 A C B Q 1 Q 2 A 1 A 2 A 3 A B 1 B 2 B 3 B C 1 C 2 C 3 C 4 C D 1 D 2 D E 1 E... B 3 Q 1 + +

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich 2. filter step: FAST-GREY-TEST black interval + grey interval black interval covers the starting or end point of the grey interval grey interval + grey interval share the same starting or end point grey interval + grey interval number of white cells is smaller than the intersecting area... black interval + black interval L grey interval + other interval grey interval has only two black cells and the other interval is completely included in this interval grey interval + grey interval grey intervals have only two black cells and the intervals have no common starting or end point intersection no intersection long black interval + grey interval maximum gap of the grey interval smaller than the length of the black interval  intersection test based on the aggregated information of the grey intervals.

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich 3. filter step: BLOB-TEST  intersection test based on the examination of the black interval sequence... I2I2 I1I1 runtime analysis runtime analysis bit-oriented approach  finding the starting point A of the interlacing area  testing the L voxels L oLoL A O (1) O (L)

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich 3. filter step: BLOB-TEST... L I2I2 I1I1 w 1 w 2 w 3 w 4 w 5 w 6 w 7 w 8 A w6w6 w4w4 w7w7 n 1 = 8 n L1 = 1 n L2 = 3 runtime analysis runtime analysis offset-oriented approach  finding the starting point of the interlacing area  testing the n L1 resp. n L2 intervals O (log n 1 ) O (n L1 +n L2 )  intersection test based on the examination of the black interval sequence bit-oriented approach  finding the starting point A of the interlacing area  testing the L voxels O (1) O (L)

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich SQL-Statement SELECT candidates.id FROM (SELECT db.id AS id, table (AggInfos(db.intervalsequence, q.intervalsequence)) AS ctable FROM GreyIntervals db, :GreyQueryIntervals q WHERE intersects (hull(db.intervalsequence), hull(q.intervalsequence)) GROUP BY db.id )candidates WHERE EXISTS (SELECT 1 FROM GreyIntervals db, :GreyQueryIntervals q, candidates.ctable ctable WHERE db.rowid = ctable.dbrowid AND q.rowid = ctable.qrowid AND blobintersection (db.intervalsequence, q.intervalsequence) ) intersects (hull(db.intervalsequence), hull(q.intervalsequence)) table (AggInfos(db.intervalsequence, q.intervalsequence)) blobintersection (db.intervalsequence, q.intervalsequence)

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Outline of the Talk 1.) Introduction 2.) RI-Tree 3.) HRI-Approach 4.) Experimental Evaluation 5.) Conclusions

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Experimental Evaluation  Examination of the HRI approach based on different MAXGAP Parameters: ,000 10, ,000 1,000,000  Comparison between the HRI approach and the spatial variant of the RI-tree [SSTD 01] CAR PLANE approx. 200 parts approx parts approx. 7 million intervals approx. 9 million intervals resolution: 33 bit ( ) resolution: 42 bit ( )

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Experiments CAR  number of interval decreases with increasing MAXGAP - parameter  average interval length increases with increasing MAXGAP - parameter - Interval Distribution

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Experiments – Secondary Storage  With the HRI method we can improve the storage requirement by an order of magnitude. (RI-Tree)

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Experiments – Runtime for collision queries PLANE CAR

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Experiments – Runtime for collision queries PLANE CAR RI-tree huge part (PLANE) RI-tree: s HRI: 2.2 s (Maxgap=10,000)

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Outline of the Talk 1.) Introduction 2.) RI-Tree 3.) HRI-Approach 4.) Experimental Evaluation 5.) Conclusions

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich What are the advantages of the HRI approach?  good secondary storage utilization Conclusions What is the HRI approach?  the HRI approach is a multi-step index structure suitable for spatial query processing for high resolutions  small main memory footprint  improved query response time behaviour

Kyoto, 03/26/03 Kyoto, 03/26/03 Martin Pfeifle, Database Group, University of Munich Any questions? ? ? ? ? ? ? ?