Uncertain Data Mobile Group 报告人:郝兴.

Slides:



Advertisements
Similar presentations
Ken C. K. Lee, Baihua Zheng, Huajing Li, Wang-Chien Lee VLDB 07 Approaching the Skyline in Z Order 1.
Advertisements

1 Query Processing in Spatial Network Databases presented by Hao Hong Dimitris Papadias Jun Zhang Hong Kong University of Science and Technology Nikos.
Finding the Sites with Best Accessibilities to Amenities Qianlu Lin, Chuan Xiao, Muhammad Aamir Cheema and Wei Wang University of New South Wales, Australia.
指導教授:陳良弼 老師 報告者:鄧雅文  Introduction  Related Work  Problem Formulation  Future Work.
Efficient Evaluation of k-Range Nearest Neighbor Queries in Road Networks Jie BaoChi-Yin ChowMohamed F. Mokbel Department of Computer Science and Engineering.
VLDB 2011 Pohang University of Science and Technology (POSTECH) Republic of Korea Jongwuk Lee, Seung-won Hwang VLDB 2011.
Counting Distinct Objects over Sliding Windows Presented by: Muhammad Aamir Cheema Joint work with Wenjie Zhang, Ying Zhang and Xuemin Lin University of.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Cleaning Uncertain Data with Quality Guarantees Reynold Cheng, Jinchuan Chen, Xike Xie 2008 VLDB Presented by SHAO Yufeng.
Cleaning Uncertain Data with Quality Guarantees Dr. Reynold Cheng Department of Computer Science The University of Hong Kong
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Cheng, Chen, Chen, Xie Evaluating Probability Threshold k- Nearest-Neighbor Queries over Uncertain Data Reynold Cheng (University of Hong Kong) Lei Chen.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
VLDB’2007 review Denis Mindolin. VLDB’07 program.
Cleaning Uncertain Data for Top-k Queries Luyi Mo, Reynold Cheng, Xiang Li, David Cheung, Xuan Yang The University of Hong Kong {lymo, ckcheng, xli, dcheung,
Indexing the imprecise positions of moving objects Xiaofeng Ding and Yansheng Lu Department of Computer Science Huazhong University of Science & Technology.
Voronoi-based Nearest Neighbor Search for Multi-Dimensional Uncertain Databases Peiwu Zhang Reynold Cheng Nikos Mamoulis Yu Tang University of Hong Kong.
U of Minnesota Spatial and Spatio-temporal Data Uncertainty: Modeling and Querying Mohamed F. Mokbel Department of Computer Science and Engineering University.
Ming Hua, Jian Pei Simon Fraser UniversityPresented By: Mahashweta Das Wenjie Zhang, Xuemin LinUniversity of Texas at Arlington The University of New South.
Effectively Indexing Uncertain Moving Objects for Predictive Queries School of Computing National University of Singapore Department of Computer Science.
Probabilistic Threshold Range Aggregate Query Processing over Uncertain Data Wenjie Zhang University of New South Wales & NICTA, Australia Joint work:
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Quantile-Based KNN over Multi- Valued Objects Wenjie Zhang Xuemin Lin, Muhammad Aamir Cheema, Ying Zhang, Wei Wang The University of New South Wales, Australia.
Cheng, Xie, Yiu, Chen, Sun UV-diagram: a Voronoi Diagram for uncertain data 26th IEEE International Conference on Data Engineering Reynold Cheng (University.
Efficient Join Processing over Uncertain Data - By Reynold Cheng, et all. Presented By Lydia & Usha.
Presented by: Duong, Huu Kinh Luan March 14 th, 2011.
Efficient Computation of the Skyline Cube Yidong Yuan School of Computer Science & Engineering The University of New South Wales & NICTA Sydney, Australia.
Reynold Cheng†, Eric Lo‡, Xuan S
SUBSKY: Efficient Computation of Skylines in Subspaces Authors: Yufei Tao, Xiaokui Xiao, and Jian Pei Conference: ICDE 2006 Presenter: Kamiru Superviosr:
Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach Wenjie Zhang, Xuemin Lin The University of New South Wales & NICTA Ming Hua,
1 Progressive Computation of Constrained Subspace Skyline Queries Evangelos Dellis 1 Akrivi Vlachou 1 Ilya Vladimirskiy 1 Bernhard Seeger 1 Yannis Theodoridis.
Nearest Neighbor Searching Under Uncertainty
Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection.
Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 Probabilistic Continuous Update.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Top-k Similarity Join over Multi- valued Objects Wenjie Zhang Jing Xu, Xin Liang, Ying Zhang, Xuemin Lin The University of New South Wales, Australia.
Efficient Processing of Top-k Spatial Preference Queries
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
Information Technology (Some) Research Trends in Location-based Services Muhammad Aamir Cheema Faculty of Information Technology Monash University, Australia.
D-skyline and T-skyline Methods for Similarity Search Query in Streaming Environment Ling Wang 1, Tie Hua Zhou 1, Kyung Ah Kim 2, Eun Jong Cha 2, and Keun.
On Top-n Reverse Top-k Queries: Variants, Algorithms, and Applications 陳良弼 Arbee L.P. Chen National Chengchi University 9/21/2012 at NCHU.
Finding skyline on the fly HKU CS DB Seminar 21 July 2004 Speaker: Eric Lo.
Bin Jiang, Jian Pei ICDE 2009 Online Interval Skyline Queries on Time Series 1.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Spatial Range Querying for Gaussian-Based Imprecise Query Objects Yoshiharu Ishikawa, Yuichi Iijima Nagoya University Jeffrey Xu Yu The Chinese University.
Presented by: Dardan Xhymshiti Fall  Type: Research paper  Authors:  International conference on Very Large Data Bases. Yoonjar Park Seoul National.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Probabilistic Skylines on Uncertain Data (VLDB2007) Jian Pei et al Supervisor: Dr Benjamin Kao Presenter: For Date: 22 Feb 2008 ??: the possible world.
Computer Science and Engineering Ranking Complex Objects in a Multi-dimensional Space Wenjie Zhang, Ying Zhang, Xuemin Lin The University of New South.
Computer Science and Engineering Jianye Yang 1, Ying Zhang 2, Wenjie Zhang 1, Xuemin Lin 1 Influence based Cost Optimization on User Preference 1 The University.
Click to edit Present’s Name AP-Tree: Efficiently Support Continuous Spatial-Keyword Queries Over Stream Xiang Wang 1*, Ying Zhang 2, Wenjie Zhang 1, Xuemin.
A Unified Algorithm for Continuous Monitoring of Spatial Queries
SIMILARITY SEARCH The Metric Space Approach
Progressive Computation of The Min-Dist Optimal-Location Query
Stochastic Skyline Operator
Probabilistic Data Management
Query in Streaming Environment
Clustering Uncertain Taxi data
Chapter 4: Probabilistic Query Answering (2)
Probabilistic Data Management
Junqi Zhang+ Xiangdong Zhou+ Wei Wang+ Baile Shi+ Jian Pei*
IEEE ICDE 2008 Probabilistic Verifiers: Evaluating Constrained Nearest-Neighbor Queries over Uncertain Data Reynold Cheng Hong Kong Polytechnic University.
Probabilistic Data Management
Distributed Probabilistic Range-Aggregate Query on Uncertain Data
Probabilistic n-of-N Skyline Computation over Uncertain Data Streams
Presented by: Mahady Hasan Joint work with
Range Queries on Uncertain Data
Data Engineering Research Group
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

Uncertain Data Mobile Group 报告人:郝兴

Paper List Querying Imprecise Data in Moving Object Environments. [TKDE 2004] Reynold Cheng, Dmitri V. Kalashnikov, and Sunil Prabhakar. Indexing Multi-Dimensional Uncertain Data with Arbitrary Probability Density Functions. [VLDB 2005 ] Yufei Tao, Reynold Cheng, Xiaokui Xiao, Wang Kay Ngai, Ben Kao, Sunil Prabhakar. Efficient Evaluation of Imprecise Location-Dependent Queries. [ICDE 2007] Jinchuan Chen, Reynold Cheng Preserving User Location Privacy in Mobile Data Management Infrastructures. [PET 2006] Reynold Cheng, Yu Zhang, Elisa Bertino, and Sunil Prabhakar. Probabilistic Spatial Queries on Existentially Uncertain Data. [SSTD 2005] Xiangyuan Dai, Man Lung Yiu, Nikos Mamoulis, Yufei Tao, and Michail Vaitis. Probabilistic Skylines on Uncertain Data. [VLDB 2007] Jian Pei, Bin Jiang, Xuemin Lin, Yidong Yuan

Efficient Evaluation of Imprecise Location-Dependent Queries Jinchuan Chen Reynold Cheng Department of Computing The Hong Kong Polytechnic University

Outline A Classification of ILDQ 3 methods: The Minkowski Sum Query-Data Duality Exploiting Probability Threshold

IPQ and IUQ A A q Point object R Uncertainty of Uncertain object Query issuer Uncertain object IPQ: Imprecise Location-Dependent Queries over Point Objects IUQ: Imprecise Location-Dependent Queries over Uncertain Objects

Method 1: The Minkowski Sum A U 利用Minkowski Sum找到最大的基于不确定区域U的最大查询框,然后利用这个最大的查询框来做pruning R B

Method 2: Query-Data Duality Point Point Object R w h R w h R Intuition: The role of query issuer and the point objects can be exchanged Lead to simpler evaluation of IPQ and IUQ Query Point Point Object

Query-Data Duality and IPQ Uncertainty of Query Issuer U Query at the point object! Enables simpler probability evaluation

Method 3: Probability Threshold p-expanded-query U R R Ф U

The p-bound [VLDB04] p p-bound 0.1 0.3 0.5 p p p Uncertainty region The p-bounds shrink with increasing value of p. In practice, a few p-bounds are pre-computed and stored p

Deriving p-expanded-query with p-bound p-bound (top) d d U p-expanded-query R Ф U p-bound (left)

Pruning Uncertain Objects for C-IUQ (1) Strategy 1: Use p-bound A Uncertain object U The concept of p-bound and p-expanded-query can also be used for C-IUQ. We have developed 3 strategies. The first one uses the p-bound, 2nd one uses p-expanded query, and the 3rd one uses both. Their details can be found in our paper. p-bound R Ф U

Pruning Uncertain Objects for C-IUQ (2) Strategy 2: Use p-expanded query A p-expanded-query U R Ф U

Pruning Uncertain Objects for C-IUQ (3) Strategy 3: Use both p-bound and p-expanded query Qp-expanded-query y-expanded-query If x  y < p, then A can be pruned. A U R Ф U x-bound Qp-bound

Probabilistic Spatial Queries on Existentially Uncertain Data Xiangyuan Dai (HKU), Man Lung Yiu (HKU),Nikos Mamoulis (HKU) Yufei Tao (CityU,HK) Michail Vaitis (U Aegean, GR)

Outline Introduction Definitions Evaluation of Probabilistic Queries - range queries - nearest neighbor queries

Introduction

Definitions We refer to Ex as existential probability or confidence of x. We identify two types of probabilistic spatial queries on existentially uncertain objects. - Thresholding query - Ranking query

Evaluation of Probabilistic Queries range queries 􀂾A depth-first search algorithm applied on the R-tree to retrieve the qualified objects 􀂾Let Px = Ex 􀂾Thresholding query: t is used to filter out objects with Px< t 􀂾Ranking query: a priority queue maintains the m results with the highest Px

Evaluation of Probabilistic Queries nearest neighbor queries Pfirst = 1 p7:Px=0.1 [not a result] Pfirst=1-0.1=0.9 p8:Px=0.9*0.2 = 0.18 [not a result] Pfirst = 0.9*(1-0.2)=0.72 p6:Px=0.72 x 0.1 = 0.072 [not a result] Pfirst = 0.72*(1-0.1)=0.648 p4,:Px=0.648 x 0.5 = 0.342 [result !!!] Pfirst = 0.648*(1-0.5)=0.342 p5:Px=0.342 x 0.9 = 0.308 [result !!!] Pfirst = 0.342*(1-0.9)=0.034 Since Pfirst = 0.034 < t = 0.3, the algorithm terminates! Pm = 0 Pm

Probabilistic Skylines on Uncertain Data Jian Pei Simon Fraser University, Canada Bin Jiang, Xuemin Lin, Yidong Yuan The University of New South Wales & NICTA, Australia

Outline Introduction Probabilistic Skyline Computation Bounding-Pruning-Refining Bottom-Up Method Top-Down Method

——Conventional Skylines Introduction ——Conventional Skylines n-dimensional numeric space D = (D1, …, Dn) Large values are preferable Two points, u dominates v (u ≻ v), if " Di (1 ≤ i ≤ n), u.Di ≥ v.Di $ Dj (1 ≤ j ≤ n), u.Dj > v.Dj Given a set of points S, skyline = {u | uÎS and u is not dominated by any other point} Example C ≻ B, C ≻ D skyline = {A, C, E}

——Skylines on Uncertain Data Introduction ——Skylines on Uncertain Data Example A set of object S = {A, B, C} Each instance takes equal probability (0.5) to appear Probabilistic Dominance Pr(A ≻ C) = 3/4 Pr(B ≻ C) = 1/2 Pr((A ≻ C) ∨ (B ≻ C)) = 1 Pr(C is in the skyline) ≠ (1 - Pr(A ≻ C)) × (1 - Pr(B ≻ C)) Probabilistic dominance ≠≻ Probabilistic skyline

Probabilistic Skyline Computation Bottom-Up Method " u at layer-k, $ u′at layer-(k-1), s.t., u′≻ u and Pr(u′) ³ Pr(u) max{Pr(u) | u is at layer-(k-1)} ³ max{Pr(u) | u is at layer-k}

Probabilistic Skyline Computation Top-Down Method Partition Tree Bounding with Partition Trees

Thank you Thank you