ISEE: Efficient k-Nearest-Neighbor Monitoring over Moving Obejcts [SSDBM 2007] Wei Wu, Kian-Lee Tan National University of Singapore.

Slides:



Advertisements
Similar presentations
The Optimal-Location Query
Advertisements

Lazy Updates: An Efficient Technique to Continuously Monitoring Reverse kNN Presented By: Ying Zhang Joint work with Muhammad Aamir Cheema, Xuemin Lin,
Multi-Guarded Safe Zone: An Effective Technique to Monitor Moving Circular Range Queries Presented By: Muhammad Aamir Cheema 1 Joint work with Ljiljana.
Nearest Neighbor Search
Spatio-temporal Databases
Spatial Join Queries. Spatial Queries Given a collection of geometric objects (points, lines, polygons,...) organize them on disk, to answer point queries.
Efficient access to TIN Regular square grid TIN Efficient access to TIN Let q := (x, y) be a point. We want to estimate an elevation at a point q: 1. should.
1 Top-k Spatial Joins
Nearest Neighbor Queries using R-trees
Searching on Multi-Dimensional Data
Nearest Neighbor Queries using R-trees Based on notes from G. Kollios.
Multidimensional Data
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
CircularTrip: An Effective Algorithm for Continuous kNN Queries Muhammad Aamir Cheema Database Research Group, The School of Computer Science and Engineering,
A (1+  )-Approximation Algorithm for 2-Line-Center P.K. Agarwal, C.M. Procopiuc, K.R. Varadarajan Computational Geometry 2003.
Efficient Reverse k-Nearest Neighbors Retrieval with Local kNN-Distance Estimation Mike Lin.
Nearest Neighbor Search in Spatial and Spatiotemporal Databases
Spatial Indexing for NN retrieval
On Reducing Communication Cost for Distributed Query Monitoring Systems. Fuyu Liu, Kien A. Hua, Fei Xie MDM 2008 Alex Papadimitriou.
Computer Science Spatio-Temporal Aggregation Using Sketches Yufei Tao, George Kollios, Jeffrey Considine, Feifei Li, Dimitris Papadias Department of Computer.
Spatial Queries Nearest Neighbor and Join Queries.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Hash Tables1 Part E Hash Tables  
Spatial Queries Nearest Neighbor Queries.
Hash Tables1 Part E Hash Tables  
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Outline Who am I? What is research? My Research Higher studies opportunities in Australia Getting jobs in IT industry Presented by: Muhammad Aamir Cheema,
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Kyriakos Mouratidis, Spiridon Bakiras, Dimitris Papadias SIGMOD
Multi-dimensional Search Trees
Location-Based Services & Continuous kNN Query Processing Tai Do Data Systems Group, UCF Fall 2005.
Influence Zone: Efficiently Processing Reverse k Nearest Neighbors Queries Presented By: Muhammad Aamir Cheema Joint work with Xuemin Lin, Wenjie Zhang,
1 Sorting Algorithms Sections 7.1 to Comparison-Based Sorting Input – 2,3,1,15,11,23,1 Output – 1,1,2,3,11,15,23 Class ‘Animals’ – Sort Objects.
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
Clustering of Uncertain data objects by Voronoi- diagram-based approach Speaker: Chan Kai Fong, Paul Dept of CS, HKU.
Monitoring k-NN Queries over Moving Objects Xiaohui Yu University of Toronto Joint work with Ken Pu and Nick Koudas.
CS848 Similarity Search in Multimedia Databases Dr. Gisli Hjaltason Content-based Retrieval Using Local Descriptors: Problems and Issues from Databases.
CS4432: Database Systems II Query Processing- Part 2.
Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
Approximate NN queries on Streams with Guaranteed Error/performance Bounds Nick AT&T labs-research Beng Chin Ooi, Kian-Lee Tan, Rui National.
Continuous Reverse Nearest Neighbor Monitoring
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Spatio-Temporal Databases
1 Reverse Nearest Neighbor Queries for Dynamic Databases SHOU Yu Tao Jan. 10 th, 2003 SIGMOD 2000.
Da Yan, Raymond Chi-Wing Wong, and Wilfred Ng The Hong Kong University of Science and Technology.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Strategies for Spatial Joins
A Unified Algorithm for Continuous Monitoring of Spatial Queries
Spatial Queries Nearest Neighbor and Join Queries.
Influence sets based on Reverse Nearest Neighbor Queries
Spatial Indexing I Point Access Methods.
KD Tree A binary search tree where every node is a
Nearest Neighbor Queries using R-trees
Probabilistic Data Management
Distributed Probabilistic Range-Aggregate Query on Uncertain Data
RUM Conjecture of Database Access Method
Presented by: Mahady Hasan Joint work with
Continuous Density Queries for Moving Objects
The Skyline Query in Databases Which Objects are the Most Important?
3.4: Rectangle Collisions
Presentation transcript:

iSEE: Efficient k-Nearest-Neighbor Monitoring over Moving Obejcts [SSDBM 2007] Wei Wu, Kian-Lee Tan National University of Singapore

Problem Settings Given a query point q, continuously report k nearest objects of q Objects and query points move in an unpredictable fashion Objects and query points can be indexed in main memory

Related Work Divide space into conceptual rectangles Initialize heap with first level rectangles and c q For top entry e in heap unless mindist(e,q) > best_dist; If e is rectangle, enheap all cells in e If e is cell; check objects in it Conceptual Partitioning: An efficient Method for Continuous nearest neighbor monitoring Kyriakos Mouratidis, Marios Hadjieleftheriou, Dimitris Papadias [SIGMOD 2005]

Motivations CPM visits un-necessary cells during update If some nearest neighbor moves away from the query, the new answer cannot be among any cell c for which maxdist(c,q)< best_dist where best_dist is the distance of kth nearest neighbor from q All the shaded cells are visited Ideally, the update should start from the cells that intersect the circle

Motivations Heap size is large because rectangles do not approximate circles very well All the cells in any rectangle Rec are inserted in heap when r becomes greater than mindist(Rec,q) All cells in the four rectangles are enheaped. Ideally, only the cells that intersect the circle should be enheaped How to alleviate these problems? CircularTrip VOB

Visit Order Build (VOB) Each group L i G j has either 4 or 8 cells The cells in any group L i G j have similar min-dist from q The min-dist of a group L i G j from q is smaller than min-dist of L i+1 G j The min-dist of a group L i G j from q is smaller than min-dist of L i G j+1 Min-dist of L i G j is the minimum of all min-distances between the cells in L i G j from q.

Initial Computation Initially en-heap c q and group L 1 G 1 in heap with its min-dist For each de-heaped entry If it is a cell c Look in c for potential NNs store c in visit_list If it is a group L i G j en-heap all cells in L i G i with their min-distance en-heap next level group L i+1 G j with its min-dist if i=j en-heap next group of same level L i G j+1

Data Structure Each cell in grid stores object list and influence list best_NN stores the Nearest neighbors among the visited cells search heap H contains the cells and groups that were en-heaped but not de-heaped (Enables quick updates) visit_list stores min and max-distances of all the cells that were de- heaped

Update Handling If an object x moves inside the circle include x in result and delete current kth-NN (Update influence List) go backward (descending order) in visit_list deleting q from all the cells c for which min-dist(c,q) > new best_dist If a result object x moves outside the circle delete x from the result set start from the beginning of visit_list and skip the cells for which max-dist(c,q)< best_dist

Experiments

Memory Comparison with CPM Data structure of iSEE is same as CPM but CPM has a larger search heap visit_list of both CPM and iSEE contains same number of cells but iSEE also stores max-dist for cells Let r be the distance of kth NN from q CPM memory search heap 2.(4r 2 – πr 2 ) = 1.71r 2 visit_list 2(πr 2 ) Total: 8r 2 iSEE memory: visit_list 3(πr 2 ) Total: 3(πr 2 ) + search heap

iSEE vs CircularTrip CircularTrip doesn’t need any visit_list or search heap When a query moves, iSEE computes from scratch whereas CircularTrip uses previous information VOB is almost as expensive as CircularTrip (if two consecutive circularTrips visit some (at most 27%) cells twice VOB computes max-dist for each cell) Let C be the number of cells that are needed to be visited during computation, CircualrTrip computes distances for at most 1.27C cells whereas VOB computes distances for 2C cells. Update of influence list by iSEE is faster than CircularTrip because they store visit_list (but lazy update approach can be used in both of these algorithms) CircularTrip uses less memory (50%-85% of iSEE) and the running time of both algorithm is estimated to be similar

CircularTrip is more flexible ArcTrip returns all the cells that intersect a circle and lie in a specified angle range ArcTrip can be used to continuously monitor constrained nearest neighbor queries optimally (visiting minimal set of cells) Extension of iSEE for constrained NN queries makes it in-efficient Recall that six constrained NN queries are needed to be continuously monitored for continuous monitoring of RNN queries

CircularTrip is more flexible ArcTrip can be used to monitor constrained NN queries over irregular regions efficiently CircularTrip can also be used to efficiently monitor farthest neighbor queries Farthest neighbors in constrained regions can also be monitored For all algorithms, CircularTrip preserves its property that it needs no book-keeping information and visits minimum number of cells