Jianzhong Qi Rui Zhang Lars Kulik Dan Lin Yuan Xue The Min-dist Location Selection Query University of Melbourne 14/05/2015.

Slides:



Advertisements
Similar presentations
The Optimal-Location Query
Advertisements

The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
Finding the Sites with Best Accessibilities to Amenities Qianlu Lin, Chuan Xiao, Muhammad Aamir Cheema and Wei Wang University of New South Wales, Australia.
Ranking Outliers Using Symmetric Neighborhood Relationship Wen Jin, Anthony K.H. Tung, Jiawei Han, and Wei Wang Advances in Knowledge Discovery and Data.
Computer Science and Engineering Inverted Linear Quadtree: Efficient Top K Spatial Keyword Search Chengyuan Zhang 1,Ying Zhang 1,Wenjie Zhang 1, Xuemin.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Reverse Furthest Neighbors in Spatial Databases Bin Yao, Feifei Li, Piyush Kumar Florida State University, USA.
Efficient Reverse k-Nearest Neighbors Retrieval with Local kNN-Distance Estimation Mike Lin.
Continuous Intersection Joins Over Moving Objects Rui Zhang University of Melbourne Dan Lin Purdue University Kotagiri Ramamohanarao University of Melbourne.
1 Efficient Subgraph Search over Large Uncertain Graphs Ye Yuan 1, Guoren Wang 1, Haixun Wang 2, Lei Chen 3 1. Northeastern University, China 2. Microsoft.
Indexing the imprecise positions of moving objects Xiaofeng Ding and Yansheng Lu Department of Computer Science Huazhong University of Science & Technology.
Effectively Indexing Uncertain Moving Objects for Predictive Queries School of Computing National University of Singapore Department of Computer Science.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Nearest Neighbor Search in Spatial and Spatiotemporal Databases
2-dimensional indexing structure
Quantile-Based KNN over Multi- Valued Objects Wenjie Zhang Xuemin Lin, Muhammad Aamir Cheema, Ying Zhang, Wei Wang The University of New South Wales, Australia.
Efficient Processing of Top-k Spatial Keyword Queries João B. Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, and Kjetil Nørvåg 1 SSTD 2011.
On Efficient Spatial Matching Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Yufei Tao (the Chinese University of Hong Kong) Ada Wai-Chee.
1 Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) M. Tamer.
High-Dimensional Similarity Search using Data-Sensitive Space Partitioning ┼ Sachin Kulkarni 1 and Ratko Orlandic 2 1 Illinois Institute of Technology,
An Incremental Refining Spatial Join Algorithm for Estimating Query Results in GIS Wan D. Bae, Shayma Alkobaisi, Scott T. Leutenegger Department of Computer.
Spatial Queries Nearest Neighbor Queries.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Research Overview Kyriakos Mouratidis Assistant Professor School of Information Systems Singapore Management University
09/07/2004Peer-to-Peer Systems in Mobile Ad-hoc Networks 1 Lookup Service for Peer-to-Peer Systems in Mobile Ad-hoc Networks M. Tech Project Presentation.
Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Michael Vassilakopoulos.
1 SD-Rtree: A Scalable Distributed Rtree Witold Litwin & Cédric du Mouza & Philippe Rigaux.
SUBSKY: Efficient Computation of Skylines in Subspaces Authors: Yufei Tao, Xiaokui Xiao, and Jian Pei Conference: ICDE 2006 Presenter: Kamiru Superviosr:
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Top-k Similarity Join over Multi- valued Objects Wenjie Zhang Jing Xu, Xin Liang, Ying Zhang, Xuemin Lin The University of New South Wales, Australia.
1 Introduction to Spatial Databases Donghui Zhang CCIS Northeastern University.
RELAXED REVERSE NEAREST NEIGHBORS QUERIES Arif Hidayat Muhammad Aamir Cheema David Taniar.
Computer Science and Engineering Efficiently Monitoring Top-k Pairs over Sliding Windows Presented By: Zhitao Shen 1 Joint work with Muhammad Aamir Cheema.
Influence Zone: Efficiently Processing Reverse k Nearest Neighbors Queries Presented By: Muhammad Aamir Cheema Joint work with Xuemin Lin, Wenjie Zhang,
Efficient Processing of Top-k Spatial Preference Queries
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Monitoring k-NN Queries over Moving Objects Xiaohui Yu University of Toronto Joint work with Ken Pu and Nick Koudas.
Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
Approximate NN queries on Streams with Guaranteed Error/performance Bounds Nick AT&T labs-research Beng Chin Ooi, Kian-Lee Tan, Rui National.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
1 Reverse Nearest Neighbor Queries for Dynamic Databases SHOU Yu Tao Jan. 10 th, 2003 SIGMOD 2000.
Rethinking Choices for Multi-dimensional Point Indexing You Jung Kim and Jignesh M. Patel University of Michigan.
Da Yan, Raymond Chi-Wing Wong, and Wilfred Ng The Hong Kong University of Science and Technology.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
1 Introduction to Spatial Databases Donghui Zhang CCIS Northeastern University.
Computer Science and Engineering Jianye Yang 1, Ying Zhang 2, Wenjie Zhang 1, Xuemin Lin 1 Influence based Cost Optimization on User Preference 1 The University.
A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research.
Keogh, E. , Chakrabarti, K. , Pazzani, M. & Mehrotra, S. (2001)
Tian Xia and Donghui Zhang Northeastern University
Progressive Computation of The Min-Dist Optimal-Location Query
Supporting Fault-Tolerance in Streaming Grid Applications
Introduction to Spatial Databases
Efficient Evaluation of k-NN Queries Using Spatial Mashups
Finding Fastest Paths on A Road Network with Speed Patterns
Fast Nearest Neighbor Search on Road Networks
Similarity Search: A Matching Based Approach
The Skyline Query in Databases Which Objects are the Most Important?
Efficient Processing of Top-k Spatial Preference Queries
Donghui Zhang, Tian Xia Northeastern University
Presentation transcript:

Jianzhong Qi Rui Zhang Lars Kulik Dan Lin Yuan Xue The Min-dist Location Selection Query University of Melbourne 14/05/2015

Outline.2.  Backgrounds  Algorithms  Sequential Scan Algorithm  Quasi-Voronoi Cell  Nearest Facility Circle  Maximum NFC Distance  Experiments  Conclusions

Motivation.3.  The min-dist location selection problem  Problem setting: a set of facilities serving a set of clients  If we want to set up a new facility, choose a location from a set of potential locations to minimize the average distance between the facilities and the clients  Motivating applications  Urban planning simulations: deploy public facilities  Multiple player online games: place players

Motivation: urban planning simulation.4. Modeling urban dynamics [1]

Motivation: online computer games.5. An online game example [2]

Problem Definition.6.  A set of clients, C  A set of existing facilities, F  A set of potential locations, P  Select a potential location for a new facility to minimize the average distance between a client and her nearest facility

Related Work.7.  The min-dist optimal location problem [3]  A set of clients C  A set of existing facilities F  A candidate region Q  Compute a location in Q for a new facility to minimize the average distance between a client and her nearest facility Q

Related Work.8. Location Optimization Problems ProblemOptim. Function Solution Space Distance Function Datasets [4]Max-infContinuousL2L2 C, F [5]Max-infDiscreteL2L2 C, F [6]Max-infContinuousL1L1 C, F [7]Max-infDiscreteL2L2 C, P [8]Max-infDiscreteL2L2 C, F, P [3]Min-distContinuousL1L1 C, F [9]Min-distContinuousNetworkC, F, E [10]Min-distDiscreteL2L2 C, P ProposedMin-distDiscreteL2L2 C, F, P

Algorithms: Problem Redefinition.9.  Larger distance reduction  smaller average client-facility distance  The influence Set of p, IS(p)    The distance reduction of p, dr(p)  IS(p 1 ) IS(p 2 )

Algorithms: Sequential Scan.10.  Sequential Scan Algorithm  Sequentially check all the potential locations  For every potential location p  Sequentially check all the clients, compute IS(p) and dr(p)  Report the one with the largest dr value  Drawback – repeated dataset accesses  Key algorithm design considerations  Restrict the search space for IS(p)  Share the computation for determining the influence sets of multiple potential locations

Algorithms: Quasi-Voronoi Cell.11.  A potential location’s surrounding existing facilities constraint its search space for IS The Quasi-Voronoi Cell (QVC) [11]

Algorithms: Nearest Facility Circle.12.  Constraint the search space from clients’ perspective  Nearest facility circle of a client c, NFC(c)  An R-tree on the NFCs  An R-tree on the potential locations  Synchronous traversal 

Algorithms: Maximum NFC Distance.13.  An index reduced version of NFC  NFC requires two R-trees to index the clients  One for the NFCs  The other for the clients  Inefficient to maintain with clients coming and leaving constantly  Key insight  Combine two R-trees together  A single value to describe a region that encloses the NFCs of the clients in an R-tree node N  The Maximum NFC Distance

Algorithms: Maximum NFC Distance.14.  Maximum NFC Distance (MND)  The largest distance between the points on the NFCs and the MBR of a node on the clients

Algorithms: Maximum NFC Distance.15.  Efficient MND Computation  Only requires checking four points per node  The four candidate furthest points (CFP): I v1, I v2, I h1, I h2 

Experiments: settings.16.  Hardware  2.66GHz Intel(R) Core(TM)2 Quad CPU,3GB RAM  Datasets  Synthetic datasets: Uniform, Gaussian, Zipfian  Real datasets: populated places and cultural landmarks in US and North America [13]  US: |C| = 15206, |F| = 3008, |P| = 3009  NA: |C| = 24493, |F| = 4601, |P| = 4602 ParameterValue Disk page size4KB Client set size10K, 50K, 100K, 500K, 1000K Existing facility set size0.1K, 0.5K, 1K, 5K, 10K Potential location set size1K, 5K, 10K, 50K, 100K  ; σ 2 (Gaussian distribution ) 0; 0.125, 0.25, 0,5, 1, 2 N; ∂ (Zipfian distribution)1000; 0.1, 0.3, 0.6, 0.9, 1.2

Experiments: dataset cardinality.17. MND is as good as NFC in running time and I/O. They both outperform SS and QVC by one order of magnitude.

Experiments: dataset cardinality.18. MND reduces 40% in index size compared to NFC

Experiments: data distribution.19.  Gaussian  Real MND shows the best overall performance

Conclusions.20.  A new location optimization problem  Urban simulation  Massively multiplayer online games  Two approaches from commonly used techniques  Quasi-Voronoi Cell  Nearest Facility Circle  A new approach MND  High efficiency  No additional index

Reference.21. [1] [2] [3] D. Zhang, Y. Du, T. Xia, and Y. Tao, “Progressive computation of the min-dist optimal-location query,” in VLDB, [4] S. Cabello, J. M. D´ıaz-B´a˜nez, S. Langerman, C. Seara, and I. Ventura, “Reverse facility location problems.” in CCCG, [5] T. Xia, D. Zhang, E. Kanoulas, and Y. Du, “On computing top-t most influential spatial sites.” in VLDB, [6] Y. Du, D. Zhang, and T. Xia, “The optimal-location query.” in SSTD, [7] Y. Gao, B. Zheng, G. Chen, and Q. Li, “Optimal-location-selection query processing in spatial databases,” TKDE, vol. 21, pp. 1162–1177, [8] J. Huang, Z. Wen, J. Qi, R. Zhang, J. Chen, and Z. He, “Top-k most influential locations selection,” in CIKM, [9] X. Xiao, B. Yao, and F. Li, “Optimal location queries in road network databases,” in ICDE, [10] [11] I. Stanoi, M. Riedewald, D. Agrawal, and A. E. Abbadi, “Discovery of influence sets in frequently updated databases,” in VLDB, [12]

Thank you! Jianzhong Qi