Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.

Slides:



Advertisements
Similar presentations
The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
Advertisements

Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Nearest Neighbor Queries using R-trees
Indexing and Range Queries in Spatio-Temporal Databases
Nearest Neighbor Queries using R-trees Based on notes from G. Kollios.
© The McGraw-Hill Companies, Inc., Chapter 2 The Complexity of Algorithms and the Lower Bounds of Problems.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
1 Voronoi Diagrams. 2 Voronoi Diagram Input: A set of points locations (sites) in the plane.Input: A set of points locations (sites) in the plane. Output:
1 NNH: Improving Performance of Nearest- Neighbor Searches Using Histograms Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research) Chen Li (UC Irvine)
Advanced Topics in Algorithms and Data Structures Lecture 7.1, page 1 An overview of lecture 7 An optimal parallel algorithm for the 2D convex hull problem,
Reverse Furthest Neighbors in Spatial Databases Bin Yao, Feifei Li, Piyush Kumar Florida State University, USA.
Efficient Reverse k-Nearest Neighbors Retrieval with Local kNN-Distance Estimation Mike Lin.
Query Processing in Databases Dr. M. Gavrilova.  Introduction  I/O algorithms for large databases  Complex geometric operations in graphical querying.
17. Computational Geometry Chapter 7 Voronoi Diagrams.
Nearest Neighbor Search in Spatial and Spatiotemporal Databases
2-dimensional indexing structure
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Liang Jin (UC Irvine) Nick Koudas (AT&T) Chen Li (UC Irvine)
Spatial Indexing for NN retrieval
Spatial Indexing SAMs. Spatial Access Methods PAMs Grid File kd-tree based (LSD-, hB- trees) Z-ordering + B+-tree R-tree Variations: R*-tree, Hilbert.
2 -1 Chapter 2 The Complexity of Algorithms and the Lower Bounds of Problems.
Spatial Indexing SAMs.
I/O-Algorithms Lars Arge Spring 2009 April 28, 2009.
Spatial Queries Nearest Neighbor and Join Queries.
Spatial Queries Nearest Neighbor Queries.
I/O-Algorithms Lars Arge Aarhus University March 6, 2007.
I/O-Algorithms Lars Arge University of Aarhus March 7, 2005.
B + -Trees (Part 1) COMP171. Slide 2 Main and secondary memories  Secondary storage device is much, much slower than the main RAM  Pages and blocks.
1 -1 Chapter 1 Introduction Why to study algorithms? Sorting problem: To sort a set of elements into increasing or decreasing order. 11, 7, 14,
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
The Complexity of Algorithms and the Lower Bounds of Problems
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Efficient Distance Computation between Non-Convex Objects By Sean Quinlan Presented by Sean Augenstein and Nicolas Lee.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
B-trees and kd-trees Piotr Indyk (slides partially by Lars Arge from Duke U)
Bin Yao Spring 2014 (Slides were made available by Feifei Li) Advanced Topics in Data Management.
A Quantitative Analysis and Performance Study For Similar- Search Methods In High- Dimensional Space Presented By Umang Shah Koushik.
Influence Zone: Efficiently Processing Reverse k Nearest Neighbors Queries Presented By: Muhammad Aamir Cheema Joint work with Xuemin Lin, Wenjie Zhang,
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.
Spatial Query Processing Spatial DBs do not have a set of operators that are considered to be basic elements in a query evaluation. Spatial DBs handle.
Chapter 12 Query Processing (1) Yonsei University 2 nd Semester, 2013 Sanghyun Park.
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
Clustering of Uncertain data objects by Voronoi- diagram-based approach Speaker: Chan Kai Fong, Paul Dept of CS, HKU.
1 / 41 Convex Hulls in 3-space Jason C. Yang. 2 / 41 Problem Statement Given P: set of n points in 3-space Return: –Convex hull of P: CH (P) –Smallest.
Lecture 3: External Memory Indexing Structures (Contd) CS6931 Database Seminar.
B+ Trees: An IO-Aware Index Structure Lecture 13.
Internal and External Sorting External Searching
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.
Spatial Queries Nearest Neighbor and Join Queries Most slides are based on slides provided By Prof. Christos Faloutsos (CMU) and Prof. Dimitris Papadias.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Heaps, Heap Sort, and Priority Queues. Background: Binary Trees * Has a root at the topmost level * Each node has zero, one or two children * A node that.
Spatial Queries Nearest Neighbor and Join Queries.
Indexing Goals: Store large files Support multiple search keys
A Unified Framework for Efficiently Processing Ranking Related Queries
Query Processing in Databases Dr. M. Gavrilova
KD Tree A binary search tree where every node is a
Nearest Neighbor Queries using R-trees
Advanced Topics in Data Management
R-tree: Indexing Structure for Data in Multi-dimensional Space
Computational Geometry Capter:1-2.1
Chapter 12 Query Processing (1)
Kinetic Collision Detection for Convex Fat Objects
Chapter 1 Introduction.
Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research)
Donghui Zhang, Tian Xia Northeastern University
Presentation transcript:

Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu

Introduction Related Work Algorithms - PFC (Progressive Furthest Cell) - CHFC (Convex Hull Furthest Cell) Experiment Discussion

 Assume you live at p1 (p2, p3), where would you prefer to build a chemical factory among q1~q3?

 Let P={p1, p2, p3}  Q={q1, q2, q3}  fn(p1, Q)=q3  fn(p2, Q)=q1  fn(p3, Q)=q1  BRFN(q1,Q,P)={p2, p3}  BRFN(q2,Q,P)={}  BRFN(q3,Q,P)={p1} Build the chemical factory here

 Problem: Given query point q, data set P (and Q), Compute MRFN(q, P) and BRFN(q, Q, P).

 MBR  MBR (Minimum Bounding Rectangles) has 3 important distances to a point:  Min Distance  Max Distance  Minmax Distance

 R-tree  R-tree is an index data structure.  In R-trees, points are grouped into MBRs, which are recursively grouped into MBRs in higher levels of the tree.

 Range query  Range query: retrieves all points that locates within the query window.  R-tree based algorithms proves to be efficient to deal with range queries.

 How to compute the MRFN of a given query point?  BFS (Brute-Force Search)  PFC (Progressive Furthest Cell)  Main Idea: 1. Find the cell (region) in which all reverse furthest neighbors of the query point located 2. Perform a range query with the cell How to compute?

 FVC (Furthest Voronoi Cell)

 FVC Example: query point = q1 fvc(q1, P)

 PFC (Progressive Furthest Cell) Algorithm  Points and MBRs are stored in a priority queue L with their minmaxdist sorted in decreasing order.  Two vectors Vc and Vp are also maintained:  Vc: Furthest neighbor candidates  Vp: Disqualifying points

 PFC – mechanism e is a point e is an MBR fvc(q)={} e ∈ fvc(q) e ∩ fvc(q)={} e ∩ fvc(q)≠{} c ∩ fvc(q)≠{} c ∩ fvc(q)={} At last, we update fvc(q) using Vp and then filter points in Vc using fvc(q)

 Example: L={p1, R1} Vc={} Vp={} L={R1} Vc={p1} Vp={} fvc(q)

 Example: L={p3} Vc={p1} Vp={p2} L={} Vc={p1, p3} Vp={p2} fvc(q)

 Example: MRFN(q)={p3} fvc(q) Finally, we use all points in Vp (i.e. p2) to update fvc(q). Then, we perform a range query using the updated fvc(q). The result is {p3} 。

 Efficiency of PFC  PFC makes fvc(q) quickly shrink. If the query point does not have any reverse furthest neighbors, Φ will quickly be reported.  However, it is still not efficient enough.  Improvement: CHFC algorithm.

 Convex Hull  The Convex Hull of a set of points P is the smallest convex polygon that fully contains P.  Denoted as C P.

 Lemma: Given a point set P and its convex hull Cp, for a point q, let p*=fn(q, P), then p* ∈ C P.  fvc(p, P)=fvc(p, C P )

 CHFC (Convex Hull Furthest Cell)  Given a set of points P and a query point p: Compute CP ∪ {p} Compute fvc(p, P) using CP ∪ {p} Perform a range query with fvc(p, P)

 BRFN  BRFN (Bichromatic Reverse Furthest Neighbor) can be found in the same way as MRFN.  The only one difference is, we compute fvc(q, Q, P) will Q, can perform range query in P.

 Efficiency of CHFC:  For most (but not all) cases, |C P | << |P|. That is, the number of points considered are likely to be greatly reduced.  Difficulty: How to compute and update C P when |P| is very large and even |C P | cannot fit into memory.

 Computing Convex Hull  Convex hulls can be found in either a distance-first or a depth-first manner.  Distance-first approach is optimal in the number of page accesses, and the complexity is O(nlogn).  Depth-first algorithms can run in O(n) time for worst case, but not optimal in disk accessing.

 Updating Convex Hull  Inserting new points:  Lemma: P is a point set.  If point q is contained by C P,C P ∪ {q} =C P  Otherwise,  C P ∪ {q} =C Cp ∪ {q}

 Updating Convex Hull  Deleting points:  Points or MBRs with the largest perpendicular distance to p l p r are added into C P first, until there is no points outside the current convex hull.

 External Convex Hull Computing  Existing algorithms can found 2-Dimensional convex hulls with I/Os.  However, when convex hulls are still too large to fit into memory, we use Dudley’s approximate convex hull.

 CPU time & number of IOs

 Thank You!  Questions?