Influence Zone: Efficiently Processing Reverse k Nearest Neighbors Queries Presented By: Muhammad Aamir Cheema Joint work with Xuemin Lin, Wenjie Zhang,

Slides:



Advertisements
Similar presentations
Lazy Updates: An Efficient Technique to Continuously Monitoring Reverse kNN Presented By: Ying Zhang Joint work with Muhammad Aamir Cheema, Xuemin Lin,
Advertisements

Multi-Guarded Safe Zone: An Effective Technique to Monitor Moving Circular Range Queries Presented By: Muhammad Aamir Cheema 1 Joint work with Ljiljana.
Computer Science and Engineering Diversified Spatial Keyword Search On Road Networks Chengyuan Zhang 1,Ying Zhang 2,1,Wenjie Zhang 1, Xuemin Lin 3,1, Muhammad.
Finding the Sites with Best Accessibilities to Amenities Qianlu Lin, Chuan Xiao, Muhammad Aamir Cheema and Wei Wang University of New South Wales, Australia.
Ranking Outliers Using Symmetric Neighborhood Relationship Wen Jin, Anthony K.H. Tung, Jiawei Han, and Wei Wang Advances in Knowledge Discovery and Data.
Computer Science and Engineering Inverted Linear Quadtree: Efficient Top K Spatial Keyword Search Chengyuan Zhang 1,Ying Zhang 1,Wenjie Zhang 1, Xuemin.
Efficient Evaluation of k-Range Nearest Neighbor Queries in Road Networks Jie BaoChi-Yin ChowMohamed F. Mokbel Department of Computer Science and Engineering.
Counting Distinct Objects over Sliding Windows Presented by: Muhammad Aamir Cheema Joint work with Wenjie Zhang, Ying Zhang and Xuemin Lin University of.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Nearest Neighbor Queries using R-trees
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Click to edit Present’s Name SLICE: Reviving Regions-Based Pruning for Reverse k Nearest Neighbors Queries Shiyu Yang 1, Muhammad Aamir Cheema 2,1, Xuemin.
Jianzhong Qi Rui Zhang Lars Kulik Dan Lin Yuan Xue The Min-dist Location Selection Query University of Melbourne 14/05/2015.
CircularTrip: An Effective Algorithm for Continuous kNN Queries Muhammad Aamir Cheema Database Research Group, The School of Computer Science and Engineering,
Improving the Performance of M-tree Family by Nearest-Neighbor Graphs Tomáš Skopal, David Hoksza Charles University in Prague Department of Software Engineering.
By Groysman Maxim. Let S be a set of sites in the plane. Each point in the plane is influenced by each point of S. We would like to decompose the plane.
Convex Hulls in 3-space Jason C. Yang.
Reverse Furthest Neighbors in Spatial Databases Bin Yao, Feifei Li, Piyush Kumar Florida State University, USA.
Efficient Reverse k-Nearest Neighbors Retrieval with Local kNN-Distance Estimation Mike Lin.
Probabilistic Threshold Range Aggregate Query Processing over Uncertain Data Wenjie Zhang University of New South Wales & NICTA, Australia Joint work:
Query Processing in Databases Dr. M. Gavrilova.  Introduction  I/O algorithms for large databases  Complex geometric operations in graphical querying.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
SASH Spatial Approximation Sample Hierarchy
Quantile-Based KNN over Multi- Valued Objects Wenjie Zhang Xuemin Lin, Muhammad Aamir Cheema, Ying Zhang, Wei Wang The University of New South Wales, Australia.
On Efficient Spatial Matching Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Yufei Tao (the Chinese University of Hong Kong) Ada Wai-Chee.
1 Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) M. Tamer.
ISEE: Efficient k-Nearest-Neighbor Monitoring over Moving Obejcts [SSDBM 2007] Wei Wu, Kian-Lee Tan National University of Singapore.
Spatial Queries Nearest Neighbor Queries.
A Unified Approach for Computing Top-k Pairs in Multidimensional Space Presented By: Muhammad Aamir Cheema 1 Joint work with Xuemin Lin 1, Haixun Wang.
Presented by: Duong, Huu Kinh Luan March 14 th, 2011.
Outline Who am I? What is research? My Research Higher studies opportunities in Australia Getting jobs in IT industry Presented by: Muhammad Aamir Cheema,
Computer Science and Engineering Loyalty-based Selection: Retrieving Objects That Persistently Satisfy Criteria Presented By: Zhitao Shen Joint work with.
VLDB '2006 Haibo Hu (Hong Kong Baptist University, Hong Kong) Dik Lun Lee (Hong Kong University of Science and Technology, Hong Kong) Victor.
Top-k Similarity Join over Multi- valued Objects Wenjie Zhang Jing Xu, Xin Liang, Ying Zhang, Xuemin Lin The University of New South Wales, Australia.
1 Introduction to Spatial Databases Donghui Zhang CCIS Northeastern University.
RELAXED REVERSE NEAREST NEIGHBORS QUERIES Arif Hidayat Muhammad Aamir Cheema David Taniar.
Computer Science and Engineering Efficiently Monitoring Top-k Pairs over Sliding Windows Presented By: Zhitao Shen 1 Joint work with Muhammad Aamir Cheema.
1 L AZY U PDATES : A N E FFICIENT T ECHNIQUE T O C ONTINUOUSLY M ONITORING R EVERSE K NN (PVLDB’09) Presented By: Jing LI Supervisor: Nikos Mamoulis.
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Efficient Processing of Top-k Spatial Preference Queries
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
Clustering of Uncertain data objects by Voronoi- diagram-based approach Speaker: Chan Kai Fong, Paul Dept of CS, HKU.
Efficient Computing k-Coverage Paths in Multihop Wireless Sensor Networks XuFei Mao, ShaoJie Tang, and Xiang-Yang Li Dept. of Computer Science, Illinois.
1 / 41 Convex Hulls in 3-space Jason C. Yang. 2 / 41 Problem Statement Given P: set of n points in 3-space Return: –Convex hull of P: CH (P) –Smallest.
Monitoring k-NN Queries over Moving Objects Xiaohui Yu University of Toronto Joint work with Ken Pu and Nick Koudas.
Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
Information Technology Influence Computation in Spatial Dabases Muhammad Aamir Cheema Faculty of Information Technology Monash University, Australia
Information Technology (Some) Research Trends in Location-based Services Muhammad Aamir Cheema Faculty of Information Technology Monash University, Australia.
On Top-n Reverse Top-k Queries: Variants, Algorithms, and Applications 陳良弼 Arbee L.P. Chen National Chengchi University 9/21/2012 at NCHU.
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
STR: A Simple and Efficient Algorithm for R-Tree Packing.
A Spatial Index Structure for High Dimensional Point Data Wei Wang, Jiong Yang, and Richard Muntz Data Mining Lab Department of Computer Science University.
Solid Modeling Dr. Scott Schaefer.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Computer Science and Engineering Jianye Yang 1, Ying Zhang 2, Wenjie Zhang 1, Xuemin Lin 1 Influence based Cost Optimization on User Preference 1 The University.
Click to edit Present’s Name AP-Tree: Efficiently Support Continuous Spatial-Keyword Queries Over Stream Xiang Wang 1*, Ying Zhang 2, Wenjie Zhang 1, Xuemin.
Strategies for Spatial Joins
A Unified Algorithm for Continuous Monitoring of Spatial Queries
A Unified Framework for Efficiently Processing Ranking Related Queries
Stochastic Skyline Operator
Query Processing in Databases Dr. M. Gavrilova
TT-Join: Efficient Set Containment Join
Probabilistic Data Management
Efficient Subgraph Similarity All-Matching
Presented by: Mahady Hasan Joint work with
Jongik Kim1, Dong-Hoon Choi2, and Chen Li3
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

Influence Zone: Efficiently Processing Reverse k Nearest Neighbors Queries Presented By: Muhammad Aamir Cheema Joint work with Xuemin Lin, Wenjie Zhang, Ying Zhang University of New South Wales, Australia

Introduction Nearest Neighbor Query –Find the user that is closest to the query facility Reverse Nearest Neighbor Query (RNN) –Find every user for which the query facility is the closest facility u1u1 u2u2 u3u3 f1f1 f2f2 f3f3 q u3 is the nearest neighbor of qu1 and u2 are RNNs of q Nearest Neighbor Query –Find the user that is closest to the query facility Reverse k Nearest Neighbor Query (RkNN) –Find every user for which the query facility is one of the k closest facilities u1, u2 and u3 are R2NNs of q u4u4 Monochromatic RkNN queries are also supported

Preliminaries Half-space Pruning [VLDB04] –dist(u,f 2 ) < dist(u,q) so u cannot be RNN –The half-space that contains f 2 can be pruned Filtering –Repeat until there is no facility in the unpruned area Find a nearby facility in the unpruned space Prune by using the half-space Containment –The users that are contained in the unpruned space are the candidates Verification –For each candidate user u Verify it if no object is within range dist(u,q) q RNN query f1f1 f2f2 u f4f4 f5f5 u2u2 u1u1 u3u3 u4u4 f3f3

Preliminaries Half-space Pruning [VLDB04] –the space that is contained by k half-spaces can be pruned Filtering –Repeat until there is no facility in the unpruned area Find a nearby facility in the unpruned space Prune by using the half-space Containment –The users that are in the unpruned space are the candidate objects Verification –For each candidate user u Verify it if less than k facilities are within the range dist(u,q) f1f1 R2NN query q f2f2 f3f3 f5f5 f4f4 u2u2 u1u1 f6f6 u3u3 u4u4 u

Preliminaries Filtering –Repeat until there is no facility in the unpruned area Find a nearby facility in the unpruned space Prune by using the half-space Containment –The users that are in the unpruned space are the candidate objects Verification –For each candidate user u Verify it if less than k facilities are within the range dist(u,q) f1f1 R2NN query q f2f2 f3f3 f5f5 f4f4 f6f6 FINCH [VLDB08] approximates the unpruned region by a convex polygon

Preliminaries q f2f2 f3f3 f5f5 f4f4 u2u2 u1u1 f6f6 u3u3 u4u4 _ p Influence Zone Z k –An area such that for every p not in Z k, |C p | ≥ k and for every p’ in Z k, |C p’ | < k RkNN query –Return every user u for which |C u | < k OR every user u in Z k Notations For any point p, C p denotes the circle centered at p with radius r = dist(p,q) | C p | denotes the number of facilities inside C p p’ K =2

Advantage Prune the data space Candidates = objects in the unpruned space Verify each candidate object if q is one of its k nearest neighbors Pruning Containment Verification Compute influence zone Result = objects that are inside the influence zone Pruning Containment Existing AlgorithmsOur Algorithm

Naïve Approach q f2f2 f3f3 f5f5 f4f4 f6f6 _ For each facility f Draw the half-space between f and q Z k is the space that is pruned by at most k-1 half-spaces f5f5

Observation 1 A facility f can be ignored if it lies outside C p for every p inside current unpruned polygon q f2f2 f3f3 f5f5 f4f4 f6f6 Intuition If a facility f lies outside C p, the half-space b/w f and q cannot prune p If f lies outside every C p, the half-space of f cannot prune any p f7f7

Observation 2 A facility f can be ignored if it lies outside C p for every p on the boundary of current unpruned polygon q f2f2 f3f3 f5f5 f4f4 f6f6 Intuition For every p’ inside the polygon, there exists a p on the boundary such that C p contains C p’ f7f7 pp’

Observation 3 A facility f can be ignored if it lies outside C v for every vertex of current unpruned polygon q f2f2 f3f3 f5f5 f4f4 f6f6 Intuition C A U C B contains C C f7f7 q A B C p v v’

Observation 4 A facility f can be ignored if it lies outside C v for every convex vertex v of current unpruned polygon q f2f2 f3f3 f5f5 f4f4 f6f6 Intuition Any vertex v’ that is not a convex vertex lies inside the convex polygon and is not required to check for this reason f7f7 v’ The above pruning condition is tight

Algorithm Initialize Z k as the data universe Insert root of R-tree in heap While heap is not empty Deheap an entry e If e cannot be pruned If e is an intermediate node Insert children of e in the heap Else Use the half-space of e to update Z k If e lies outside every C v for every convex vertex of Z k then e can be pruned

Other highlights of our algorithm Observations to efficiently prune certain entries Efficient determination of convex vertices Prove that influence zone is always a star-shaped polygon Efficient containment checks are possible for star- shaped polygons

RkNN Processing Static RkNN queries Pruning Phase (compute Z k ) Containment Phase (return users inside Z k ) Continuous Bichromatic RkNN queries Compute Z k The users that enter Z k become RkNNs and the users that leave it are no more the RkNNs

Theoretical Analysis Area of Influence Zone = k/|F| Number of RkNNs = |U|. k / |F| IO cost of computing Z k = IO cost of RkNN queries = IO cost of computing Z k + r = S = number of facilities (i.e., |F|) f = fanout of the R-tree r = S = number of users (i.e., |U|) f = fanout of the R-tree

Experiments Snapshot RkNN Queries FINCH [1] (page size 4KB and number of buffers is 10) Real dataset containing 175,812 locations in North America Half of the randomly chosen points form the set of facilities The remaining half form the set of users [1] W. Wu, F. Yang, C. Y. Chan, K. L. Tan. FINCH: Evaluating Reverse k Nearest Neighbor Queries on Location Data. VLDB 2008

Experiments Snapshot RkNN Queries The facilities are from the real data set The users follow Normal distribution

Experiments Verification of Theoretical Analysis 100,000 facilities following Uniform distribution 100,000 users following Uniform distribution

Experiments Verification of Theoretical Analysis 100,000 facilities following Uniform distribution 100,000 users following Uniform distribution

Experiments Continuous RkNN Queries Moving objects and queries generated using Brinkhoff Generator [1] on road map of Texas Data space is 1000 Km X 1000 Km Our algorithm (InfZone) is compared with LazyUpdates [2] [1] T. Brinkhoff. A framework for generating network-based moving objects. GeoInformatica, [2] M. A. Cheema, X. Lin, Y. Zhang, W. Wang, W. Zhang. Lazy Updates: An Efficient Technique to Continuously Monitoring Reverse kNN. PVLDB, 2009.

Experiments

Thanks

Experiments Snapshot RkNN Queries

Experiments Snapshot RkNN Queries

Relationship with Voronoi