CircularTrip: An Effective Algorithm for Continuous kNN Queries Muhammad Aamir Cheema Database Research Group, The School of Computer Science and Engineering,

Slides:



Advertisements
Similar presentations
Lazy Updates: An Efficient Technique to Continuously Monitoring Reverse kNN Presented By: Ying Zhang Joint work with Muhammad Aamir Cheema, Xuemin Lin,
Advertisements

Multi-Guarded Safe Zone: An Effective Technique to Monitor Moving Circular Range Queries Presented By: Muhammad Aamir Cheema 1 Joint work with Ljiljana.
1 Query Processing in Spatial Network Databases presented by Hao Hong Dimitris Papadias Jun Zhang Hong Kong University of Science and Technology Nikos.
Finding the Sites with Best Accessibilities to Amenities Qianlu Lin, Chuan Xiao, Muhammad Aamir Cheema and Wei Wang University of New South Wales, Australia.
Computational Geometry
Nearest Neighbor Search
Spatio-temporal Databases
Efficient Evaluation of k-Range Nearest Neighbor Queries in Road Networks Jie BaoChi-Yin ChowMohamed F. Mokbel Department of Computer Science and Engineering.
Counting Distinct Objects over Sliding Windows Presented by: Muhammad Aamir Cheema Joint work with Wenjie Zhang, Ying Zhang and Xuemin Lin University of.
Efficient access to TIN Regular square grid TIN Efficient access to TIN Let q := (x, y) be a point. We want to estimate an elevation at a point q: 1. should.
1 Top-k Spatial Joins
        iDistance -- Indexing the Distance An Efficient Approach to KNN Indexing C. Yu, B. C. Ooi, K.-L. Tan, H.V. Jagadish. Indexing the distance:
Nearest Neighbor Queries using R-trees
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Searching on Multi-Dimensional Data
Nearest Neighbor Queries using R-trees Based on notes from G. Kollios.
Click to edit Present’s Name SLICE: Reviving Regions-Based Pruning for Reverse k Nearest Neighbors Queries Shiyu Yang 1, Muhammad Aamir Cheema 2,1, Xuemin.
Multidimensional Data
Introduction to Spatial Database System Presented by Xiaozhi Yu.
1 NNH: Improving Performance of Nearest- Neighbor Searches Using Histograms Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research) Chen Li (UC Irvine)
Mario Rodriguez Revollo School of Computer Science, UCSP SlimSS-tree: A New Tree Combined SS- tree With Slim-down Algorithm Lifang Yang, Xianglin Huang,
Songhua Xing, Cyrus Shahabi and Bei Pan InfoLab University of Southern California Los Angeles, CA Continuous Monitoring.
Efficient Reverse k-Nearest Neighbors Retrieval with Local kNN-Distance Estimation Mike Lin.
Nearest Neighbor Search in Spatial and Spatiotemporal Databases
Asst. Prof. Yusuf Sahillioğlu
Spatio-temporal Databases Time Parameterized Queries.
Liang Jin (UC Irvine) Nick Koudas (AT&T) Chen Li (UC Irvine)
Spatial Indexing for NN retrieval
Computer Science Spatio-Temporal Aggregation Using Sketches Yufei Tao, George Kollios, Jeffrey Considine, Feifei Li, Dimitris Papadias Department of Computer.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
Spatial Queries Nearest Neighbor and Join Queries.
ISEE: Efficient k-Nearest-Neighbor Monitoring over Moving Obejcts [SSDBM 2007] Wei Wu, Kian-Lee Tan National University of Singapore.
Spatial Queries Nearest Neighbor Queries.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
A Unified Approach for Computing Top-k Pairs in Multidimensional Space Presented By: Muhammad Aamir Cheema 1 Joint work with Xuemin Lin 1, Haixun Wang.
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Outline Who am I? What is research? My Research Higher studies opportunities in Australia Getting jobs in IT industry Presented by: Muhammad Aamir Cheema,
Research Overview Kyriakos Mouratidis Assistant Professor School of Information Systems Singapore Management University
Computer Science and Engineering Loyalty-based Selection: Retrieving Objects That Persistently Satisfy Criteria Presented By: Zhitao Shen Joint work with.
VLDB '2006 Haibo Hu (Hong Kong Baptist University, Hong Kong) Dik Lun Lee (Hong Kong University of Science and Technology, Hong Kong) Victor.
Kyriakos Mouratidis, Spiridon Bakiras, Dimitris Papadias SIGMOD
1 Introduction to Spatial Databases Donghui Zhang CCIS Northeastern University.
Multi-dimensional Search Trees
RELAXED REVERSE NEAREST NEIGHBORS QUERIES Arif Hidayat Muhammad Aamir Cheema David Taniar.
Location-Based Services & Continuous kNN Query Processing Tai Do Data Systems Group, UCF Fall 2005.
Computer Science and Engineering Efficiently Monitoring Top-k Pairs over Sliding Windows Presented By: Zhitao Shen 1 Joint work with Muhammad Aamir Cheema.
1 L AZY U PDATES : A N E FFICIENT T ECHNIQUE T O C ONTINUOUSLY M ONITORING R EVERSE K NN (PVLDB’09) Presented By: Jing LI Supervisor: Nikos Mamoulis.
Influence Zone: Efficiently Processing Reverse k Nearest Neighbors Queries Presented By: Muhammad Aamir Cheema Joint work with Xuemin Lin, Wenjie Zhang,
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
Information Technology Influence Computation in Spatial Dabases Muhammad Aamir Cheema Faculty of Information Technology Monash University, Australia
Information Technology (Some) Research Trends in Location-based Services Muhammad Aamir Cheema Faculty of Information Technology Monash University, Australia.
Multi-dimensional Search Trees CS302 Data Structures Modified from Dr George Bebis.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
Approximate NN queries on Streams with Guaranteed Error/performance Bounds Nick AT&T labs-research Beng Chin Ooi, Kian-Lee Tan, Rui National.
Continuous Reverse Nearest Neighbor Monitoring
Location-based Spatial Queries AGM SIGMOD 2003 Jun Zhang §, Manli Zhu §, Dimitris Papadias §, Yufei Tao †, Dik Lun Lee § Department of Computer Science.
Spatial Queries Nearest Neighbor and Join Queries Most slides are based on slides provided By Prof. Christos Faloutsos (CMU) and Prof. Dimitris Papadias.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Strategies for Spatial Joins
A Unified Algorithm for Continuous Monitoring of Spatial Queries
Spatial Queries Nearest Neighbor and Join Queries.
A Unified Framework for Efficiently Processing Ranking Related Queries
KD Tree A binary search tree where every node is a
Nearest Neighbor Queries using R-trees
Spatio-temporal Pattern Queries
Introduction to Spatial Databases
Presented by: Mahady Hasan Joint work with
Presentation transcript:

CircularTrip: An Effective Algorithm for Continuous kNN Queries Muhammad Aamir Cheema Database Research Group, The School of Computer Science and Engineering, University of New South Wales (UNSW), Sydney Joined work with: Yidong Yuan (UNSW), Xuemin Lin (UNSW)

Problem Settings Given a query point q, continuously report k nearest objects of q according to Euclidean distance Objects and query points move in an unpredictable fashion Objects and query points can be indexed in main memory

Related Work Divide space into conceptual rectangles Initialize heap with first level rectangles and c q For top entry e in heap unless mindist(e,q) > dist k ; If e is rectangle, enheap all cells in e If e is cell; check objects in it Conceptual Partitioning: An efficient Method for Continuous nearest neighbor monitoring Kyriakos Mouratidis, Marios Hadjieleftheriou, Dimitris Papadias [SIGMOD 2005]

Motivations Data Point Updates If a NN moves away, the new answer cannot lie inside the circle All the shaded cells are visited Ideally, the update should start from the cells that intersect the circle

Motivations Rectangles do not approximate circles very well All the cells in any rectangle Rec are inserted in heap when r becomes greater than mindist(Rec,q) All cells in the four rectangles are enheaped. Ideally, only the cells that intersect the circle should be enheaped How to find all the cells that intersect a given circle? CircularTrip

CircularTrip Let n be the number of cells that intersect the circle, CircularTrip computes mindist of exactly n cells r

kNN Initial Computation r=0, dist k = ∞ (dist k = distance of kth NN) Repeat unless r ≥ dist k Call CircularTrip with radius Min(r=r+ δ, dist k ) (δ= cell width) Insert returned cells into heap according to ascending order of their mindist from q For each cell c in heap If mindist(c,q) ≥ dist k  RETURN RESULTS Find points in c and update distk and q.kNN accordingly q p1p1 p2p2 r r = δ r = 2.δ r = dist(p 1,q) dist k = ∞ dist k = dist(p 1,q) dist k = dist(p 2,q) q.kNN = p 1 q.kNN = Φ q.kNN = p 2

Continuous Monitoring q p2p2 p1p1 Incoming objects: any non-result object p entering inside the circle Insert p into q.kNN Outgoing objects: any result object leaving the circle delete p from q.kNN Hanlde all the object updates as mentioned above Case 1: q.kNN contains k or more than k objects Keep k closest objects and discard other Case 2: q.kNN contains less than k objects Same as initial computation except the starting radius is dist k dist k

Experiments Objects/queries moving on the road map of German city Oldenburg (Brinkhoff Generator) Number of objects = 100,000 Number of queries = 5000 Object agility= 50% (number of objects reporting update at each timestamp) Query agility= 30% Grid size = 256 X 256 (i.e., δ = 1/256 ) Effect of δ

Experiments Effect of k

Experiments Effect of N and n N = number of objects n = number of queries

Experiments Effect of object agility and speed

Questions

Space Usage CPM stores visit list and heap for each query