Presentation is loading. Please wait.

Presentation is loading. Please wait.

The

Similar presentations


Presentation on theme: "The "β€” Presentation transcript:

1 The 𝜏-Skyline for Uncertain Data
Haitao Wang (Utah State University) Wuzhou Zhang Presented by Matt Gibson (Duke University) (University of Texas at San Antonio)

2 Skyline 𝑝 dominates π‘ž, denoted as π‘β‰½π‘ž, if π‘₯ 𝑝 β‰₯π‘₯(π‘ž) and 𝑦 𝑝 β‰₯𝑦(π‘ž)
Here a point dominates itself for simplicity of discussion Given a set 𝑃 of (exact) points, a point π‘βˆˆπ‘ƒ is a skyline point of 𝑃 if 𝑝 is not dominated by any other point of 𝑃.

3 Uncertain Data 𝒫={ 𝑃 1 , …, 𝑃 𝑛 } : a set of 𝑛 uncertain points
𝑃 𝑖 = 𝑝 𝑖1 , …, 𝑝 π‘–π‘˜ Pr[ 𝑃 𝑖 =𝑝 𝑖𝑗 ]= 𝑀 𝑖𝑗 𝑗=1 π‘˜ 𝑀 𝑖𝑗 =1 For a location π‘βˆˆ 𝑃 𝑖 , we also use 𝑀(𝑝) to denote the probability of 𝑃 𝑖 being at 𝑝. Assume: 𝑃 𝑖 ’s are pairwise independent Set π‘š=π‘›π‘˜. 0.2 0.3 0.4 0.1

4 𝜏-Skyline of 𝑃 𝑖 Given a point π‘ž, the probability that 𝑃 𝑖 dominates π‘ž: 𝛿 𝑖 π‘ž = π‘βˆˆ 𝑃 𝑖 , π‘β‰½π‘ž 𝑀(𝑝) Given 𝜏∈ 0, 1 , the 𝜏-skyline region of 𝑃 𝑖 : 𝑅 𝑖 ={π‘žβˆ£ 𝛿 𝑖 π‘ž β‰€πœ} i.e., the set of points π‘ž such that the probability of 𝑃 𝑖 dominating π‘ž is at most 𝜏. The 𝜏-skyline of 𝑃 𝑖 , denoted by πœ‹ 𝑖 , is the boundary of 𝑅 𝑖 . 0-skyline of 𝑃 𝑖 corresponds to the (conventional) skyline of 𝑃 𝑖 .

5 𝜏-Skyline of 𝒫 Given 𝜏∈ 0, 1 , the 𝜏-skyline region of 𝒫= 𝑃 1 , …, 𝑃 𝑛 : 𝑅={π‘žβˆ£ 𝛿 𝑖 π‘ž β‰€πœ, βˆ€π‘–β‰€π‘›} i.e., the set of points π‘ž such that the probability of any 𝑃 𝑖 dominating π‘ž is at most 𝜏. The 𝜏-skyline of 𝒫, denoted by πœ‹, is the boundary of 𝑅. 0-skyline of 𝒫 corresponds to the (conventional) skyline of 𝑖=1 𝑛 𝑃 𝑖 . The 𝜏-skyline probability of each 𝑃 𝑖 of 𝒫 is defined to be the probability of 𝑃 𝑖 lying inside the 𝜏-skyline region of 𝒫.

6 Related Work 𝜌-skyline
Given π‘ž and 𝒫={ 𝑃 1 , …, 𝑃 𝑛 } , the skyline probability of π‘ž: 𝛼 π‘ž,𝒫 = 𝑖=1 𝑛 (1βˆ’ 𝛿 𝑖 (π‘ž) ) The skyline probability of 𝑃 𝑖 : 𝛼 𝑃 𝑖 ,𝒫 = 𝑗=1 π‘˜ 𝑀 𝑖𝑗 𝛼( 𝑝 𝑖𝑗 , 𝒫 ≠𝑖 ) where 𝒫 ≠𝑖 =π’«βˆ’ 𝑃 𝑖 . Given a parameter 𝜌, the goal is to compute 𝜌-skyline of 𝒫: { 𝑃 𝑖 βˆ£π›Ό 𝑃 𝑖 ,𝒫 β‰₯𝜌} First sub-quadratic: 𝑂( π‘š 5/3 poly(log 𝑛)) [Atallah et al. 2011] 𝑂( π‘š 3/2 ), and 𝑂(π‘šπ‘˜log π‘š) when π‘˜β‰ͺ𝑛 [Afshani et al. 2011] Heuristic algorithms [Pei et al. 2007]

7 Relate Work (cnt.) Other variants of probabilistic skylines 𝐾-skyband
[Lian et al. 2008] [Zhang et al. 2013] 𝐾-skyband Given a set 𝑃 of 𝑛 (exact) points and a parameter 𝐾≀𝑛, the 𝐾-skyband of 𝑃 asks for the set of points in 𝑃 which are dominated by at most 𝐾 points of 𝑃. 0-skyband of 𝑃 corresponds the conventional skyline of 𝑃. Our 𝜏-skyline of 𝑃 𝑖 can be used to answer the weighted skyband of 𝑃 𝑖 . As a byproduct, we obtain the first algorithm for computing the skyband of a set of weighted points!

8 Our Results We first show that
The 𝜏-skyline πœ‹ 𝑖 of 𝑃 𝑖 has complexity 𝑂 π‘˜ , and can be computed in 𝑂(π‘˜log π‘˜) time. Using this as a subroutine: The 𝜏-skyline πœ‹ of 𝒫 has complexity 𝑂 π‘š , and can be computed in 𝑂(π‘šlog π‘š) time, where π‘š=π‘›π‘˜. After which, The 𝜏-skyline probabilities of all 𝑃 𝑖 ’s can be computed in 𝑂(π‘šlog π‘š) time. Our method is very simple and easy to implement!

9 Our Algorithm We first compute, for each 𝑃 𝑖 of 𝒫, the 𝜏-skyline πœ‹ 𝑖 of 𝑃 𝑖 , in 𝑂(π‘˜log π‘˜) time. Sweeping horizontally vertically Two movements: move downwards move rightwards Zig-Zag path We then show that πœ‹ is the upper envelope of πœ‹ 1 , …, πœ‹ 𝑛 , and can be computed in 𝑂(π‘šlog π‘š) time, where π‘š=π‘›π‘˜.

10 Computing the 𝜏-skyline πœ‹ 𝑖 of 𝑃 𝑖
The goal is to find the set of points π‘ž such that the probability of 𝑃 𝑖 dominating π‘ž is at most 𝜏. Sweeping in both horizontal and vertical directions! As a preprocessing step, we sort all the locations of 𝑃 𝑖 into two sorted lists 𝐿 π‘₯ and 𝐿 𝑦 : 𝐿 π‘₯ by increasing π‘₯-coordinate 𝐿 𝑦 by decreasing 𝑦-coordinate Maintain two invariants during the sweeping process

11 Two Invariants During Sweeping
We say that π‘žβ€² strictly dominates π‘ž, denoted as π‘ž β€² β‰»π‘ž, if π‘₯ π‘ž β€² >π‘₯ π‘ž and 𝑦 π‘ž β€² >𝑦(π‘ž) Given a point π‘ž, the probability that 𝑃 𝑖 strictly dominates π‘ž: 𝛿 𝑖 + π‘ž = π‘βˆˆ 𝑃 𝑖 , π‘β‰»π‘ž 𝑀(𝑝) Let π‘ž be any point on the 𝜏-skyline πœ‹ 𝑖 of 𝑃 𝑖 . Two invariants: 𝛿 𝑖 π‘ž >𝜏 𝛿 𝑖 + π‘ž β‰€πœ

12 Let’s Do The Sweeping! Initially, we sweep vertically along the line π‘₯=βˆ’βˆž by scanning the sorted list 𝐿 𝑦 to find π‘ž=(βˆ’βˆž, 𝑦(𝑝)) for some π‘βˆˆ 𝐿 𝑦 such that two invariants hold. (trivial) Then we sweep horizontally. An event happens if Either π‘₯ π‘ž =π‘₯(𝑝) for some π‘βˆˆ 𝐿 π‘₯ Or 𝑦 π‘ž =𝑦(𝑝) for some π‘βˆˆ 𝐿 𝑦 At each event, we decide to move rightwards or downwards. Terminate when either 𝐿 π‘₯ or 𝐿 𝑦 becomes empty.

13 Analysis We have 𝑂 π‘˜ events: 𝐿 π‘₯ + 𝐿 𝑦 =2π‘˜ At each event, 𝑂(1) time:
Two very simple invariant checks We update 𝛿 𝑖 (π‘ž) and 𝛿 𝑖 + (π‘ž) 𝑂 π‘˜ events also implies 𝑂 π‘˜ complexity of πœ‹ 𝑖 Correctness is also easy to verify (see our paper) Theorem: The 𝜏-skyline πœ‹ 𝑖 of 𝑃 𝑖 has complexity 𝑂 π‘˜ , and can be computed in 𝑂(π‘˜log π‘˜) time.

14 Computing the 𝜏-skyline πœ‹ of 𝒫
An easy observation is that the 𝜏-skyline region 𝑅 of 𝒫 is the common intersection of the 𝜏-skyline region 𝑅 1 ,…, 𝑅 𝑛 , i.e., 𝑅= 𝑖=1 𝑛 𝑅 𝑖 And the 𝜏-skyline πœ‹ of 𝒫 is simply the upper envelope of πœ‹ 1 , …, πœ‹ 𝑛 . Equivalently, πœ‹ is the (conventional) skyline of the 𝑂(π‘›π‘˜) turning points of all the πœ‹ 𝑖 ’s. Theorem: The 𝜏-skyline πœ‹ of 𝒫 has complexity 𝑂 π‘š , and can be computed in 𝑂(π‘šlog π‘š) time, where π‘š=π‘›π‘˜.

15


Download ppt "The "

Similar presentations


Ads by Google