Cheng, Xie, Yiu, Chen, Sun UV-diagram: a Voronoi Diagram for uncertain data 26th IEEE International Conference on Data Engineering Reynold Cheng (University.

Slides:



Advertisements
Similar presentations
Voronoi-based Geospatial Query Processing with MapReduce
Advertisements

Quality Aware Privacy Protection for Location-based Services Zhen Xiao, Xiaofeng Meng Renmin University of China Jianliang Xu Hong Kong Baptist University.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Cleaning Uncertain Data with Quality Guarantees Reynold Cheng, Jinchuan Chen, Xike Xie 2008 VLDB Presented by SHAO Yufeng.
Cleaning Uncertain Data with Quality Guarantees Dr. Reynold Cheng Department of Computer Science The University of Hong Kong
Online Filtering, Smoothing & Probabilistic Modeling of Streaming Data In short, Applying probabilistic models to Streams Bhargav Kanagal & Amol Deshpande.
Data Engineering Research Group 4 faculty members Reynold Cheng David Cheung Ben Kao Nikos Mamoulis 20 research students (10 PhD, 10 MPhil)
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Cheng, Chen, Chen, Xie Evaluating Probability Threshold k- Nearest-Neighbor Queries over Uncertain Data Reynold Cheng (University of Hong Kong) Lei Chen.
Jianzhong Qi Rui Zhang Lars Kulik Dan Lin Yuan Xue The Min-dist Location Selection Query University of Melbourne 14/05/2015.
Cleaning Uncertain Data for Top-k Queries Luyi Mo, Reynold Cheng, Xiang Li, David Cheung, Xuan Yang The University of Hong Kong {lymo, ckcheng, xli, dcheung,
Indexing the imprecise positions of moving objects Xiaofeng Ding and Yansheng Lu Department of Computer Science Huazhong University of Science & Technology.
LUDWIG- MAXIMILIANS- UNIVERSITY MUNICH DATABASE SYSTEMS GROUP DEPARTMENT INSTITUTE FOR INFORMATICS Probabilistic Similarity Queries in Uncertain Databases.
Voronoi-based Nearest Neighbor Search for Multi-Dimensional Uncertain Databases Peiwu Zhang Reynold Cheng Nikos Mamoulis Yu Tang University of Hong Kong.
U of Minnesota Spatial and Spatio-temporal Data Uncertainty: Modeling and Querying Mohamed F. Mokbel Department of Computer Science and Engineering University.
Effectively Indexing Uncertain Moving Objects for Predictive Queries School of Computing National University of Singapore Department of Computer Science.
Indexing Network Voronoi Diagrams*
Data Engineering Research Group 4 faculty members Reynold Cheng David Cheung Ben Kao Nikos Mamoulis 20 research students (10 PhD, 10 MPhil)
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Quantile-Based KNN over Multi- Valued Objects Wenjie Zhang Xuemin Lin, Muhammad Aamir Cheema, Ying Zhang, Wei Wang The University of New South Wales, Australia.
Location Privacy in Casper: A Tale of two Systems
1 Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) M. Tamer.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
An Incremental Refining Spatial Join Algorithm for Estimating Query Results in GIS Wan D. Bae, Shayma Alkobaisi, Scott T. Leutenegger Department of Computer.
Efficient Join Processing over Uncertain Data - By Reynold Cheng, et all. Presented By Lydia & Usha.
1 PSO-based Motion Fuzzy Controller Design for Mobile Robots Master : Juing-Shian Chiou Student : Yu-Chia Hu( 胡育嘉 ) PPT : 100% 製作 International Journal.
VLDB '2006 Haibo Hu (Hong Kong Baptist University, Hong Kong) Dik Lun Lee (Hong Kong University of Science and Technology, Hong Kong) Victor.
Reynold Cheng†, Eric Lo‡, Xuan S
VLDB 2012 Mining Frequent Itemsets over Uncertain Databases Yongxin Tong 1, Lei Chen 1, Yurong Cheng 2, Philip S. Yu 3 1 The Hong Kong University of Science.
Maximal Vector Computation in Large Data Sets The 31st International Conference on Very Large Data Bases VLDB 2005 / VLDB Journal 2006, August Parke Godfrey,
Nearest Neighbor Searching Under Uncertainty
Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection.
A Survey Based Seminar: Data Cleaning & Uncertain Data Management Speaker: Shawn Yang Supervisor: Dr. Reynold Cheng Prof. David Cheung
Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 Probabilistic Continuous Update.
Top-k Similarity Join over Multi- valued Objects Wenjie Zhang Jing Xu, Xin Liang, Ying Zhang, Xuemin Lin The University of New South Wales, Australia.
Wireless Sensor Networks In-Network Relational Databases Jocelyn Botello.
Dave McKenney 1.  Introduction  Algorithms/Approaches  Tiny Aggregation (TAG)  Synopsis Diffusion (SD)  Tributaries and Deltas (TD)  OPAG  Exact.
Clustering Moving Objects in Spatial Networks Jidong Chen, Caifeng Lai, Xiaofeng Meng, Renmin University of China Jianliang Xu, and Haibo Hu Hong Kong.
Efficient Processing of Top-k Spatial Preference Queries
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
Clustering of Uncertain data objects by Voronoi- diagram-based approach Speaker: Chan Kai Fong, Paul Dept of CS, HKU.
A New Spatial Index Structure for Efficient Query Processing in Location Based Services Speaker: Yihao Jhang Adviser: Yuling Hsueh 2010 IEEE International.
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
Efficient Computation of Combinatorial Skyline Queries Author: Yu-Chi Chung, I-Fang Su, and Chiang Lee Source: Information Systems, 38(2013), pp
Aggregate sum retrieval in sensor network by distributed prefix sum data cube Lok Hang Lee and Man Hon Wong The Chinese University of Hong Kong Department.
D-skyline and T-skyline Methods for Similarity Search Query in Streaming Environment Ling Wang 1, Tie Hua Zhou 1, Kyung Ah Kim 2, Eun Jong Cha 2, and Keun.
On Top-n Reverse Top-k Queries: Variants, Algorithms, and Applications 陳良弼 Arbee L.P. Chen National Chengchi University 9/21/2012 at NCHU.
Location-based Spatial Queries AGM SIGMOD 2003 Jun Zhang §, Manli Zhu §, Dimitris Papadias §, Yufei Tao †, Dik Lun Lee § Department of Computer Science.
Spatial Range Querying for Gaussian-Based Imprecise Query Objects Yoshiharu Ishikawa, Yuichi Iijima Nagoya University Jeffrey Xu Yu The Chinese University.
Anomaly Detection. Network Intrusion Detection Techniques. Ştefan-Iulian Handra Dept. of Computer Science Polytechnic University of Timișoara June 2010.
Shaoxu Song 1, Aoqian Zhang 1, Lei Chen 2, Jianmin Wang 1 1 Tsinghua University, China 2Hong Kong University of Science & Technology, China 1/19 VLDB 2015.
Presented by: Dardan Xhymshiti Fall  Type: Research paper  Authors:  International conference on Very Large Data Bases. Yoonjar Park Seoul National.
Data Engineering Research Group 4 faculty members David Cheung Ben Kao Nikos Mamoulis Reynold Cheng About 15 research students (12 PhD, 3 MPhil)
Density-based Place Clustering in Geo-Social Networks Jieming Shi, Nikos Mamoulis, Dingming Wu, David W. Cheung Department of Computer Science, The University.
2010 IEEE Global Telecommunications Conference (GLOBECOM 2010)
Query in Streaming Environment
Pervasive Data Access (PDA) Research Group
CS & CS Probabilistic Data Management
Spatio-temporal Pattern Queries
Chapter 4: Probabilistic Query Answering (2)
Probabilistic Data Management
IEEE ICDE 2008 Probabilistic Verifiers: Evaluating Constrained Nearest-Neighbor Queries over Uncertain Data Reynold Cheng Hong Kong Polytechnic University.
Probabilistic Data Management
CS & CS ST: Probabilistic Data Management
Uncertain Data Mobile Group 报告人:郝兴.
Data Engineering Research Group
Efficient Processing of Top-k Spatial Preference Queries
Fraction-Score: A New Support Measure for Co-location Pattern Mining
Presentation transcript:

Cheng, Xie, Yiu, Chen, Sun UV-diagram: a Voronoi Diagram for uncertain data 26th IEEE International Conference on Data Engineering Reynold Cheng (University of Hong Kong) Xike Xie (University of Hong Kong) Man Lung Yiu (Hong Kong Polytechnic University) Jinchuan Chen (Renmin University of China) Liwen Sun (University of Hong Kong)

2 Cheng, Xie, Yiu, Chen, Sun Voronoi Diagram

3Cheng, Xie, Yiu, Chen, Sun Voronoi Diagram Aggregate Query in Sensor Network [Shahabi06a] Spatial Skyline Query [Shahabi06b] Reverse Nearest Neighbor Query [Yiu07] Common Influence Join [Yiu08] Uncertain Data Clustering [Kao08]

4 Location Uncertainty [TDRP98,ISSD99,VLDB04]

5Cheng, Xie, Yiu, Chen, Sun UV-diagram (Uncertain Voronoi Diagram) (a)Voronoi Diagram. (b) UV-Diagram.

6 Probabilistic Nearest Neighbor Query [cheng04] INPUT 1.A query point called q 2.A set of n objects O 1,O 2,…, O n with uncertainty regions and pdfs OUTPUT A set of (O i,p i ) tuples –p i is the non-zero probability (qualification probability) that O i is the nearest neighbor of q O2O2 q f O1O1 O3O3 O4O4 O5O5 O6O6

7Cheng, Xie, Yiu, Chen, Sun Agenda Introduction –Basic Concepts Voronoi Diagram in Spatial Database Management Data Uncertainty –Applications of UV-diagram UV-diagram –Basic concepts of UV-diagram UV-edge, UV-cell, possible region, outer region… –Construction Initial region construction, I- and C- level pruning, UV-index construction –Results Conclusion Future work

8Cheng, Xie, Yiu, Chen, Sun UV-Diagram: an example Exponential number of UV-partitions can be generated! UV-cell

9Cheng, Xie, Yiu, Chen, Sun UV-cell We can use 3 UV-cells to represent 7 UV-partitions. The number of UV-cells equals to the number of objects.

10Cheng, Xie, Yiu, Chen, Sun Shape of a UV-cell Bisector Outer Region of O i w.r.t O j Inner Region of Oi w.r.t Oj UV-cell is the intersection of inner regions of O i w.r.t. all other objects

11 Basic Method Example: constructing U 1 Cheng, Xie, Yiu, Chen, Sun

12 Basic Method O1O1 O2O2 O3O3 Example: constructing U 1 Cheng, Xie, Yiu, Chen, Sun n-1 inner region has to be constructed! Pruning techniques Evaluating Ui requires expensive numerical calculations Reference objects Candidate Reference objects

13Cheng, Xie, Yiu, Chen, Sun UV-diagram (Uncertain Voronoi Diagram) (a)Voronoi Diagram. (b) UV-Diagram.

14 Cheng, Xie, Yiu, Chen, Sun Efficient Construction Index Level Pruning Computational Level Pruning Refinement Candidate Reference Objects C i Reference Objects F i UV-index Construction Initial Possible Region Construction Possible Region P i I ndex level Pruning C omputational level Pruning R efinement I ndex level Pruning C omputational level Pruning

15Cheng, Xie, Yiu, Chen, Sun Step 1: Generating a Possible Region

16Cheng, Xie, Yiu, Chen, Sun Step 1: Generating a Possible Region

17Cheng, Xie, Yiu, Chen, Sun Step 2,3: I- and C- Pruning O7O7 O1O1 O2O2 O3O3 O4O4 O5O5 O8O8 O6O6

18Cheng, Xie, Yiu, Chen, Sun Step 4. UV-index Construction Splitting Condition Overlap Checking PNN Query

19Cheng, Xie, Yiu, Chen, Sun Experiment Setup Uncertain DatasetSynthetic: 10k – 80k (30k def) Real dataset: 17k, 30k, 36k Uncertainty pdfGaussian (represented by 20 histogram bars)

20Cheng, Xie, Yiu, Chen, Sun Query Performance (ms)

21Cheng, Xie, Yiu, Chen, Sun Query Time’s Break-down (T q )

22Cheng, Xie, Yiu, Chen, Sun Query Performance (I/O)

23Cheng, Xie, Yiu, Chen, Sun Construction Time

24Cheng, Xie, Yiu, Chen, Sun Pruning Ratio

25Cheng, Xie, Yiu, Chen, Sun Real Dataset

26Cheng, Xie, Yiu, Chen, Sun Conclusion We propose UV-diagram, which is a variant of Voronoi Diagram for uncertain data. We introduce the concepts of UV-cell and reference objects to efficiently construct UV-diagram. We also propose an adaptive index for the UV-diagram.

27 Cheng, Xie, Yiu, Chen, Sun Future Work Use UV-diagram to support various types of queries - Continuous query, imprecise NN query, reverse NN query, etc.

28 THANKS! Q & A More discussions are welcome in the poster session! poster 28 Contact : Xike Xie Department of Computer Science The University of Hong Kong

29Cheng, Xie, Yiu, Chen, Sun Reference [shahabi06a] Mehdi Sharifzadeh, Cyrus Shahabi: The Spatial Skyline Queries. VLDB 2006: [Shahabi06b] Sharifzadeh, Mehdi and Shahabi, Cyrus: Utilizing Voronoi Cells of Location Data Streams for Accurate Computation of Aggregate Functions in Sensor Networks. Geoinformatica [Kao08] Clustering Uncertain Data using Voronoi Diagrams: Ben Kao; Sau Dan Lee; David Cheung; Wai-Shing Ho; K. F. chan. ICDM 2008 [Yiu07] Yiu, Man Lung and Mamoulis, Nikos. Reverse Nearest Neighbors Search in Ad Hoc Subspaces. TKDE 2007 [Yiu08] M. L. Yiu, N. Mamoulis, and P. Karras. Common Influence Join: A Natural Join Operation for Spatial Pointsets. In ICDE [Zheng06] B. Zheng, J. Xu, W.-C. Lee, and L. Lee, “Grid-partition index: a hybrid method for nearest-neighbor queries in wireless location-based services,” VLDB J., vol. 15, no. 1, pp. 21–39, [cheng04] R. Cheng, D. V. Kalashnikov, and S. Prabhakar, “Querying imprecisedata in moving object environments,” TKDE, vol. 16, no. 9, [TDRP98] P. A. Sistla, O. Wolfson, S. Chamberlain, and S. Dao,“Querying the uncertain position of moving objects,” in Temporal Databases: Research and Practice, [ICDCS07] S. Ganguly, M. Garofalakis, R. Rastogi, and K. Sabnani, “Streaming algorithms for robust, real-time detection of ddos attacks,” in ICDCS, [VLDB04a] A. Deshpande, C. Guestrin, S. Madden, J. Hellerstein, and W. Hong, “Model-driven data acquisition in sensor networks,” in Proc. VLDB, 2004 [Jooyandeh09] M. Jooyandeh, A. Mohades, and M. Mirzakhah, “Uncertain voronoi diagram,” Inf. Process. Lett., vol. 109, no. 13, pp. 709–712, [Sember08] J. Sember and W. Evans, “Guaranteed voronoi diagrams of uncertain sites,” in CCCG, 2008.