Ken C. K. Lee, Baihua Zheng, Huajing Li, Wang-Chien Lee VLDB 07 Approaching the Skyline in Z Order 1.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Adders Used to perform addition, subtraction, multiplication, and division (sometimes) Half-adder adds rightmost (least significant) bit Full-adder.
Using Matrices in Real Life
Advanced Piloting Cruise Plot.
1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
Chapter 1 The Study of Body Function Image PowerPoint
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 5 Author: Julia Richards and R. Scott Hawley.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
APWeb 2004 Hangzhou, China 1 Labeling and Querying Dynamic XML Trees Jiaheng Lu and Tok Wang Ling School of Computing National University of Singapore.
Reverse Spatial and Textual k Nearest Neighbor Search.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Chapter 1 Image Slides Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Introduction to Algorithms
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
My Alphabet Book abcdefghijklm nopqrstuvwxyz.
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 5 second questions
ZMQS ZMQS
1 Two dimensional thining Let P =(V, m, n, B) and P ' = (V, m, n, B - D) be digital pictures, where D B. Then we say that P' is obtained from P by deleting.
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
CS16: Introduction to Data Structures & Algorithms
Databasteknik Databaser och bioinformatik Data structures and Indexing (II) Fang Wei-Kleiner.
Chapter 10: Applications of Arrays and the class vector
Data Structures: A Pseudocode Approach with C
Data Structures Using C++
ABC Technology Project
Online Algorithm Huaping Wang Apr.21
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
© Charles van Marrewijk, An Introduction to Geographical Economics Brakman, Garretsen, and Van Marrewijk.
VOORBLAD.
15. Oktober Oktober Oktober 2012.
Name Convolutional codes Tomashevich Victor. Name- 2 - Introduction Convolutional codes map information to code bits sequentially by convolving a sequence.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
Constant, Linear and Non-Linear Constant, Linear and Non-Linear
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Squares and Square Root WALK. Solve each problem REVIEW:
Shortest Violation Traces in Model Checking Based on Petri Net Unfoldings and SAT Victor Khomenko University of Newcastle upon Tyne Supported by IST project.
© 2012 National Heart Foundation of Australia. Slide 2.
ITEC200 Week10 Sorting. pdp 2 Learning Objectives – Week10 Sorting (Chapter10) By working through this chapter, students should: Learn.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
25 seconds left…...
Januar MDMDFSSMDMDFSSS
Week 1.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
A SMALL TRUTH TO MAKE LIFE 100%
1 Unit 1 Kinematics Chapter 1 Day
PSSA Preparation.
Foundations of Data Structures Practical Session #7 AVL Trees 2.
Choosing an Order for Joins
How Cells Obtain Energy from Food
Distributed Computing 9. Sorting - a lower bound on bit complexity Shmuel Zaks ©
Traktor- og motorlære Kapitel 1 1 Kopiering forbudt.
Presentation transcript:

Ken C. K. Lee, Baihua Zheng, Huajing Li, Wang-Chien Lee VLDB 07 Approaching the Skyline in Z Order 1

Outline Introduction Preliminaries Skyline Processing in Z-Order ZSearch for Skyline Query ZUpdate for Skyline Updates Experiment Conclusion 2

Introduction Finding skyline points from very large in high dimensional space is expensive operation. Most of the work in the literature targets is improving performance of skyline query in high dimensional space. 3

Preliminaries Skyline Problems and Properties 4

(Cont.) Skyline Query Processing: 1.Divide and Conquer Algorithm. 2.Sorting-based Algorithm. 3.Hybrid Algorithm. 5

(Cont.) Divide and Conquer Algorithm: D&C divides a dataset into several small partitions and computes every partial skyline. The complete skyline is obtained by merging all partial skylines and removing dominated data points. Sorting-based Algorithm: SFS is devised based on an observation that by getting a dataset presorted according to a certain monotone scoring function such as sum of attributes. SFS sequentially scans the sorted dataset and keeps a set of skyline candidates. Dominance tests in SFS are based on an exhaustive scan on existing skyline candidates. 6

(Cont.) Hybrid Algorithm: Including Index, NN, and BBS. BBS is based on NN search, and adopts R-tree as its underlying index. 7

(Cont.) The nearest neighbor(NN): dominates is a skyline point. 8

(Cont.) The second NN: dominates not dominated by is another skyline. 9

(Cont.) are the skylines 10

(Cont.) BBS deletion an Insertion: Deletion: Need to found EDR(Exclusive Dominance Region). Insertion: Need compared with other skyline points. 11

Skyline Processing in Z-Order Skyline and Z-Order Curve For a d-dimensional space with as the coordinate value domain ranges, the Z-address of a data point contains dv bits, which can be considered as v d-bit groups. The i- th bit of a Z-address is contributed by the (i/d)-th bit of the (i%d)- th coordinate. 12

(Cont.) Instance : Z-address : (011111) 13

(Cont.) P1( ), P2( ), P3( ), P4( ) P5( ), P6( ), P7( ), P8( ) P9( ) 14

(Cont.) 15

(Cont.) Zbtree Index Structure 16

(Cont.) To facilitate data processing along a Z-order address sequence. To preserve data points in regions to enable efficient search space pruning. Assign a Z-address to all points, store them in a B + tree 17

ZSearch for Skyline Query RZ-Region Based Dominance Test 18

(Cont.) ZSearch Algorithm Use RZ-Region Based Dominance Test Input: ZBtree for source data set Local: a stack s Output: Skyline points 19

(Cont.) 20

ZUpdate for Skyline Updates Insertion: Insert Pinsert compared with skyline that Z-adress smaller than Pinsert. If Pinsert is be dominated, the skyline is the same. Else compared with skyline that Z-adress larger than Pinserts. 21

(Cont.) Deletion Find the points are only dominated by Pdel, and Pdel add to skyline set. Just comparing with the points Z-address larger than Pdels. 22

Experiment 23

(Cont.) 24

(Cont.) 25

Conclusion In this paper, we analyze the skyline problems and exploit the orderingand clustering properties of the Z-order curve which match perfectly well with the skyline processing strategies. The ZSearch algorithm scales very well in both dimensionality and cardinality. 26