The

Slides:



Advertisements
Similar presentations
Two Segments Intersect?
Advertisements

Computational Geometry
On the Density of a Graph and its Blowup Raphael Yuster Joint work with Asaf Shapira.
Brute-Force Triangulation
2/3/15CMPS 3130/6130 Computational Geometry1 CMPS 3130/6130 Computational Geometry Spring 2015 Triangulations and Guarding Art Galleries II Carola Wenk.
Week 5 2. Cauchy’s Integral Theorem (continued)
One of the most important problems is Computational Geometry is to find an efficient way to decide, given a subdivision of E n and a point P, in which.
I. The Problem of Molding Does a given object have a mold from which it can be removed? object not removable mold 1 object removable Assumptions The object.
Intersections. Intersection Problem 3 Intersection Detection: Given two geometric objects, do they intersect? Intersection detection (test) is frequently.
Complexity 18-1 Complexity Andrei Bulatov Probabilistic Algorithms.
Point Location Computational Geometry, WS 2007/08 Lecture 5 Prof. Dr. Thomas Ottmann Algorithmen & Datenstrukturen, Institut für Informatik Fakultät für.
Lecture 6: Point Location Computational Geometry Prof. Dr. Th. Ottmann 1 Point Location 1.Trapezoidal decomposition. 2.A search structure. 3.Randomized,
Voronoi Diagrams.
DAST 2005 Week 4 – Some Helpful Material Randomized Quick Sort & Lower bound & General remarks…
Ch. 8 & 9 – Linear Sorting and Order Statistics What do you trade for speed?
Comment reconstruire le graphe de visibilité d’un polygone? (Reconstructing visibility graphs of polygons) Jérémie Chalopin, Shantanu Das LIF, Aix-Marseille.
UNC Chapel Hill M. C. Lin Point Location Reading: Chapter 6 of the Textbook Driving Applications –Knowing Where You Are in GIS Related Applications –Triangulation.
Triangulating a monotone polygon
Nattee Niparnan. Easy & Hard Problem What is “difficulty” of problem? Difficult for computer scientist to derive algorithm for the problem? Difficult.
Clipping: Clipping is a process of dividing an object into visible and invisible positions and displaying the visible portion and discarding the invisible.
UNC Chapel Hill M. C. Lin Linear Programming Reading: Chapter 4 of the Textbook Driving Applications –Casting/Metal Molding –Collision Detection Randomized.
RA PRESENTATION Sublinear Geometric Algorithms B 張譽馨 B 汪牧君 B 李元翔.
Center for Graphics and Geometric Computing, Technion 1 Computational Geometry Chapter 8 Arrangements and Duality.
Multiple Integration 14 Copyright © Cengage Learning. All rights reserved.
Center for Graphics and Geometric Computing, Technion 1 Computational Geometry Chapter 8 Arrangements and Duality.
4.1 – ONE-TO-ONE FUNCTIONS; INVERSE FUNCTIONS Target Goals: 1.Obtain the graph of the inverse function 2.Determine the inverse of a function.
 2004 SDU 1 Lecture5-Strongly Connected Components.
Giansalvo EXIN Cirrincione unit #4 Single-layer networks They directly compute linear discriminant functions using the TS without need of determining.
Center for Graphics and Geometric Computing, Technion 1 Computational Geometry Chapter 9 Line Arrangements.
The Matrix Equation A x = b (9/16/05) Definition. If A is an m  n matrix with columns a 1, a 2,…, a n and x is a vector in R n, then the product of A.
OBJECTIVES:  Find inverse functions and verify that two functions are inverse functions of each other.  Use graphs of functions to determine whether.
Computer Graphics CC416 Week 14 Filling Algorithms.
Polygon Triangulation
Computational Geometry
Computer Graphics CC416 Lecture 04: Bresenham Line Algorithm & Mid-point circle algorithm Dr. Manal Helal – Fall 2014.
Probabilistic Skylines on Uncertain Data (VLDB2007) Jian Pei et al Supervisor: Dr Benjamin Kao Presenter: For Date: 22 Feb 2008 ??: the possible world.
UNC Chapel Hill M. C. Lin Geometric Data Structures Reading: Chapter 10 of the Textbook Driving Applications –Windowing Queries Related Application –Query.
CMPS 3130/6130 Computational Geometry Spring 2017
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Probabilistic Algorithms
Week 5 2. Cauchy’s Integral Theorem (continued)
Introduction to Polygons
Largest Red-Blue Separating Rectangles
Functions and Their Graphs
Copyright © Cengage Learning. All rights reserved.
Fill Area Algorithms Jan
Haim Kaplan and Uri Zwick
Copyright © Cengage Learning. All rights reserved.
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Orthogonal Range Searching and Kd-Trees
Polygon Triangulation
Depth Estimation via Sampling
I. The Problem of Molding
Efficient Algorithms for the Weighted k-Center Problem on a Real Line
Quickest Visibility Queries in Polygonal Domains
On the Geodesic Centers of Polygonal Domains
Minimizing the Aggregate Movements for Interval Coverage
Haitao Wang Utah State University WADS 2017, St. John’s, Canada
An O(n log n)-Time Algorithm for the k-Center Problem in Trees
Range Queries on Uncertain Data
Joseph S.B. Mitchell, Stony Brook University
Danny Z. Chen1, Yan Gu2, Jian Li2, and Haitao Wang1
Aggregate-Max Nearest Neighbor Searching in the Plane
Haitao Wang Utah State University SoCG 2017, Brisbane, Australia
Visibility and Ray Shooting Queries in Polygonal Domains
Computational Geometry
Weak Visibility Queries of Line Segments in Simple Polygons
Further Equations and Techniques
Clustering.
Chapter 5: Morse functions and function-induced persistence
Presentation transcript:

The 𝜏-Skyline for Uncertain Data Haitao Wang (Utah State University) Wuzhou Zhang Presented by Matt Gibson (Duke University) (University of Texas at San Antonio)

Skyline 𝑝 dominates 𝑞, denoted as 𝑝≽𝑞, if 𝑥 𝑝 ≥𝑥(𝑞) and 𝑦 𝑝 ≥𝑦(𝑞) Here a point dominates itself for simplicity of discussion Given a set 𝑃 of (exact) points, a point 𝑝∈𝑃 is a skyline point of 𝑃 if 𝑝 is not dominated by any other point of 𝑃.

Uncertain Data 𝒫={ 𝑃 1 , …, 𝑃 𝑛 } : a set of 𝑛 uncertain points 𝑃 𝑖 = 𝑝 𝑖1 , …, 𝑝 𝑖𝑘 Pr[ 𝑃 𝑖 =𝑝 𝑖𝑗 ]= 𝑤 𝑖𝑗 𝑗=1 𝑘 𝑤 𝑖𝑗 =1 For a location 𝑝∈ 𝑃 𝑖 , we also use 𝑤(𝑝) to denote the probability of 𝑃 𝑖 being at 𝑝. Assume: 𝑃 𝑖 ’s are pairwise independent Set 𝑚=𝑛𝑘. 0.2 0.3 0.4 0.1

𝜏-Skyline of 𝑃 𝑖 Given a point 𝑞, the probability that 𝑃 𝑖 dominates 𝑞: 𝛿 𝑖 𝑞 = 𝑝∈ 𝑃 𝑖 , 𝑝≽𝑞 𝑤(𝑝) Given 𝜏∈ 0, 1 , the 𝜏-skyline region of 𝑃 𝑖 : 𝑅 𝑖 ={𝑞∣ 𝛿 𝑖 𝑞 ≤𝜏} i.e., the set of points 𝑞 such that the probability of 𝑃 𝑖 dominating 𝑞 is at most 𝜏. The 𝜏-skyline of 𝑃 𝑖 , denoted by 𝜋 𝑖 , is the boundary of 𝑅 𝑖 . 0-skyline of 𝑃 𝑖 corresponds to the (conventional) skyline of 𝑃 𝑖 .

𝜏-Skyline of 𝒫 Given 𝜏∈ 0, 1 , the 𝜏-skyline region of 𝒫= 𝑃 1 , …, 𝑃 𝑛 : 𝑅={𝑞∣ 𝛿 𝑖 𝑞 ≤𝜏, ∀𝑖≤𝑛} i.e., the set of points 𝑞 such that the probability of any 𝑃 𝑖 dominating 𝑞 is at most 𝜏. The 𝜏-skyline of 𝒫, denoted by 𝜋, is the boundary of 𝑅. 0-skyline of 𝒫 corresponds to the (conventional) skyline of 𝑖=1 𝑛 𝑃 𝑖 . The 𝜏-skyline probability of each 𝑃 𝑖 of 𝒫 is defined to be the probability of 𝑃 𝑖 lying inside the 𝜏-skyline region of 𝒫.

Related Work 𝜌-skyline Given 𝑞 and 𝒫={ 𝑃 1 , …, 𝑃 𝑛 } , the skyline probability of 𝑞: 𝛼 𝑞,𝒫 = 𝑖=1 𝑛 (1− 𝛿 𝑖 (𝑞) ) The skyline probability of 𝑃 𝑖 : 𝛼 𝑃 𝑖 ,𝒫 = 𝑗=1 𝑘 𝑤 𝑖𝑗 𝛼( 𝑝 𝑖𝑗 , 𝒫 ≠𝑖 ) where 𝒫 ≠𝑖 =𝒫− 𝑃 𝑖 . Given a parameter 𝜌, the goal is to compute 𝜌-skyline of 𝒫: { 𝑃 𝑖 ∣𝛼 𝑃 𝑖 ,𝒫 ≥𝜌} First sub-quadratic: 𝑂( 𝑚 5/3 poly(log 𝑛)) [Atallah et al. 2011] 𝑂( 𝑚 3/2 ), and 𝑂(𝑚𝑘log 𝑚) when 𝑘≪𝑛 [Afshani et al. 2011] Heuristic algorithms [Pei et al. 2007]

Relate Work (cnt.) Other variants of probabilistic skylines 𝐾-skyband [Lian et al. 2008] [Zhang et al. 2013] 𝐾-skyband Given a set 𝑃 of 𝑛 (exact) points and a parameter 𝐾≤𝑛, the 𝐾-skyband of 𝑃 asks for the set of points in 𝑃 which are dominated by at most 𝐾 points of 𝑃. 0-skyband of 𝑃 corresponds the conventional skyline of 𝑃. Our 𝜏-skyline of 𝑃 𝑖 can be used to answer the weighted skyband of 𝑃 𝑖 . As a byproduct, we obtain the first algorithm for computing the skyband of a set of weighted points!

Our Results We first show that The 𝜏-skyline 𝜋 𝑖 of 𝑃 𝑖 has complexity 𝑂 𝑘 , and can be computed in 𝑂(𝑘log 𝑘) time. Using this as a subroutine: The 𝜏-skyline 𝜋 of 𝒫 has complexity 𝑂 𝑚 , and can be computed in 𝑂(𝑚log 𝑚) time, where 𝑚=𝑛𝑘. After which, The 𝜏-skyline probabilities of all 𝑃 𝑖 ’s can be computed in 𝑂(𝑚log 𝑚) time. Our method is very simple and easy to implement!

Our Algorithm We first compute, for each 𝑃 𝑖 of 𝒫, the 𝜏-skyline 𝜋 𝑖 of 𝑃 𝑖 , in 𝑂(𝑘log 𝑘) time. Sweeping horizontally vertically Two movements: move downwards move rightwards Zig-Zag path We then show that 𝜋 is the upper envelope of 𝜋 1 , …, 𝜋 𝑛 , and can be computed in 𝑂(𝑚log 𝑚) time, where 𝑚=𝑛𝑘.

Computing the 𝜏-skyline 𝜋 𝑖 of 𝑃 𝑖 The goal is to find the set of points 𝑞 such that the probability of 𝑃 𝑖 dominating 𝑞 is at most 𝜏. Sweeping in both horizontal and vertical directions! As a preprocessing step, we sort all the locations of 𝑃 𝑖 into two sorted lists 𝐿 𝑥 and 𝐿 𝑦 : 𝐿 𝑥 by increasing 𝑥-coordinate 𝐿 𝑦 by decreasing 𝑦-coordinate Maintain two invariants during the sweeping process

Two Invariants During Sweeping We say that 𝑞′ strictly dominates 𝑞, denoted as 𝑞 ′ ≻𝑞, if 𝑥 𝑞 ′ >𝑥 𝑞 and 𝑦 𝑞 ′ >𝑦(𝑞) Given a point 𝑞, the probability that 𝑃 𝑖 strictly dominates 𝑞: 𝛿 𝑖 + 𝑞 = 𝑝∈ 𝑃 𝑖 , 𝑝≻𝑞 𝑤(𝑝) Let 𝑞 be any point on the 𝜏-skyline 𝜋 𝑖 of 𝑃 𝑖 . Two invariants: 𝛿 𝑖 𝑞 >𝜏 𝛿 𝑖 + 𝑞 ≤𝜏

Let’s Do The Sweeping! Initially, we sweep vertically along the line 𝑥=−∞ by scanning the sorted list 𝐿 𝑦 to find 𝑞=(−∞, 𝑦(𝑝)) for some 𝑝∈ 𝐿 𝑦 such that two invariants hold. (trivial) Then we sweep horizontally. An event happens if Either 𝑥 𝑞 =𝑥(𝑝) for some 𝑝∈ 𝐿 𝑥 Or 𝑦 𝑞 =𝑦(𝑝) for some 𝑝∈ 𝐿 𝑦 At each event, we decide to move rightwards or downwards. Terminate when either 𝐿 𝑥 or 𝐿 𝑦 becomes empty.

Analysis We have 𝑂 𝑘 events: 𝐿 𝑥 + 𝐿 𝑦 =2𝑘 At each event, 𝑂(1) time: Two very simple invariant checks We update 𝛿 𝑖 (𝑞) and 𝛿 𝑖 + (𝑞) 𝑂 𝑘 events also implies 𝑂 𝑘 complexity of 𝜋 𝑖 Correctness is also easy to verify (see our paper) Theorem: The 𝜏-skyline 𝜋 𝑖 of 𝑃 𝑖 has complexity 𝑂 𝑘 , and can be computed in 𝑂(𝑘log 𝑘) time.

Computing the 𝜏-skyline 𝜋 of 𝒫 An easy observation is that the 𝜏-skyline region 𝑅 of 𝒫 is the common intersection of the 𝜏-skyline region 𝑅 1 ,…, 𝑅 𝑛 , i.e., 𝑅= 𝑖=1 𝑛 𝑅 𝑖 And the 𝜏-skyline 𝜋 of 𝒫 is simply the upper envelope of 𝜋 1 , …, 𝜋 𝑛 . Equivalently, 𝜋 is the (conventional) skyline of the 𝑂(𝑛𝑘) turning points of all the 𝜋 𝑖 ’s. Theorem: The 𝜏-skyline 𝜋 of 𝒫 has complexity 𝑂 𝑚 , and can be computed in 𝑂(𝑚log 𝑚) time, where 𝑚=𝑛𝑘.