Faster skyline searching using Hilbert R-tree

Slides:



Advertisements
Similar presentations
Introduction to Information Retrieval Introduction to Information Retrieval Lecture 7: Scoring and results assembly.
Advertisements

When Is Nearest Neighbors Indexable? Uri Shaft (Oracle Corp.) Raghu Ramakrishnan (UW-Madison)
Skyline Charuka Silva. Outline Charuka Silva, Skyline2  Motivation  Skyline Definition  Applications  Skyline Query  Similar Interesting Problem.
Using Trees to Depict a Forest
Ken C. K. Lee, Baihua Zheng, Huajing Li, Wang-Chien Lee VLDB 07 Approaching the Skyline in Z Order 1.
1. Find the cost of each of the following using the Nearest Neighbor Algorithm. a)Start at Vertex M.
The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
Finding the Sites with Best Accessibilities to Amenities Qianlu Lin, Chuan Xiao, Muhammad Aamir Cheema and Wei Wang University of New South Wales, Australia.
VLDB 2011 Pohang University of Science and Technology (POSTECH) Republic of Korea Jongwuk Lee, Seung-won Hwang VLDB 2011.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Reference-based Indexing of Sequence Databases Jayendra Venkateswaran, Deepak Lachwani, Tamer Kahveci, Christopher Jermaine University of Florida-Gainesville.
Fast Parallel Similarity Search in Multimedia Databases (Best Paper of ACM SIGMOD '97 international conference)
A Simple and Efficient Algorithm for R-Tree Packing Scott T. Leutenegger, Mario A. Lopez, Jeffrey Edgington STR Sunho Cho Jeonghun Ahn 1.
Modeling and Analysis of Random Walk Search Algorithms in P2P Networks Nabhendra Bisnik, Alhussein Abouzeid ECSE, Rensselaer Polytechnic Institute.
Fast Algorithm for Nearest Neighbor Search Based on a Lower Bound Tree Yong-Sheng Chen Yi-Ping Hung Chiou-Shann Fuh 8 th International Conference on Computer.
Presented by Arshad Jamal, Rajesh Dhania, Vinkal Vishnoi Active hashing and its application to image and text retrieval Yi Zhen, Dit-Yan Yeung, Published.
Design Of New Index Structures For The ISAT Algorithm By Biswanath Panda, Mirek Riedewald, Paul Chew, Johannes Gehrke.
Stabbing the Sky: Efficient Skyline Computation over Sliding Windows COMP9314 Lecture Notes.
1 Shooting Stars in the Sky:An Online Algorithm for Skyline Queries 作者: Donald Kossmann Frank Ramsak Steffen Rost 報告:黃士維.
Themis Palpanas1 VLDB - Aug 2004 Fair Use Agreement This agreement covers the use of all slides on this CD-Rom, please read carefully. You may freely use.
Original Tree:
1 Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) M. Tamer.
Scalable and Distributed Similarity Search in Metric Spaces Michal Batko Claudio Gennaro Pavel Zezula.
1 Continuous k-dominant Skyline Query Processing Presented by Prasad Sriram Nilu Thakur.
RRT-Connect path solving J.J. Kuffner and S.M. LaValle.
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
J Cheng et al,. CVPR14 Hyunchul Yang( 양현철 )
Figure 1.1 The observer in the truck sees the ball move in a vertical path when thrown upward. (b) The Earth observer views the path of the ball as a parabola.
Introduction Using time property and location property from lost items’ pictures, we construct the Lost and Found System which combined with image search.
Performance Tuning on Multicore Systems for Feature Matching within Image Collections Xiaoxin Tang*, Steven Mills, David Eyers, Zhiyi Huang, Kai-Cheung.
SUBSKY: Efficient Computation of Skylines in Subspaces Authors: Yufei Tao, Xiaokui Xiao, and Jian Pei Conference: ICDE 2006 Presenter: Kamiru Superviosr:
Garrett Poppe, Liv Nguekap, Adrian Mirabel CSUDH, Computer Science Department.
(Particle Swarm Optimisation)
K-Hit Query: Top-k Query Processing with Probabilistic Utility Function SIGMOD2015 Peng Peng, Raymond C.-W. Wong CSE, HKUST 1.
Efficient Metric Index For Similarity Search Lu Chen, Yunjun Gao, Xinhan Li, Christian S. Jensen, Gang Chen.
Introduction Use machine learning and various classifying techniques to be able to create an algorithm that can decipher between spam and ham s. .
Efficient Processing of Top-k Spatial Preference Queries
Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.
1 On Optimal Worst-Case Matching Cheng Long (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and.
Universitatea Politehnica Bucureşti - Facultatea de Automatică şi Calculatoare TOWARDS A SECURE DATA SHARING PEER-TO-PEER NETWORK BASED ON GEOMETRIC AND.
The σ-neighborhood skyline queries Chen, Yi-Chung; LEE, Chiang. The σ-neighborhood skyline queries. Information Sciences, 2015, 322: 張天彥 2015/12/05.
D-skyline and T-skyline Methods for Similarity Search Query in Streaming Environment Ling Wang 1, Tie Hua Zhou 1, Kyung Ah Kim 2, Eun Jong Cha 2, and Keun.
On Top-n Reverse Top-k Queries: Variants, Algorithms, and Applications 陳良弼 Arbee L.P. Chen National Chengchi University 9/21/2012 at NCHU.
Finding skyline on the fly HKU CS DB Seminar 21 July 2004 Speaker: Eric Lo.
Bin Jiang, Jian Pei ICDE 2009 Online Interval Skyline Queries on Time Series 1.
Graph Data Management Lab, School of Computer Science Branch Code: A Labeling Scheme for Efficient Query Answering on Tree
A configuration method for structured P2P overlay network considering delay variations Tomoya KITANI (Shizuoka Univ. 、 Japan) Yoshitaka NAKAMURA (NAIST,
STR: A Simple and Efficient Algorithm for R-Tree Packing.
ETRI Linear-Time Search in Suffix Arrays July 14, 2003 Jeong Seop Sim, Dong Kyue Kim Heejin Park, Kunsoo Park.
Property of Logarithms If x > 0, y > 0, a > 0, and a ≠ 1, then x = y if and only if log a x = log a y.
Presented by: Dardan Xhymshiti Fall  Type: Research paper  Authors:  International conference on Very Large Data Bases. Yoonjar Park Seoul National.
HKU CSIS DB Seminar Skyline Queries HKU CSIS DB Seminar 9 April 2003 Speaker: Eric Lo.
MULTIPLICATION STRATEGIES
Dense-Region Based Compact Data Cube
Fast nearest neighbor searches in high dimensions Sami Sieranoja
Query in Streaming Environment
Aziz Nasridinov and Young-Ho Park*
K Nearest Neighbor Classification
} 2x + 2(x + 2) = 36 2x + 2x + 4 = 36 4x + 4 = x =
Efficient Record Linkage in Large Data Sets
Color Image Retrieval based on Primitives of Color Moments
Solving Multi Step Equations
Warm Up – Monday Calculate the Redundancy of the above network.
Solving Multi Step Equations
Efficient Processing of Top-k Spatial Preference Queries
DATA INFRASTRUCTURE FOR BIG DATA SECURITY ANALYSIS APT attacks:
Internet of Things and Access Control
Provenance for Decision Making in Business
Can Embeddings Detect Heartbleed?
Color Image Retrieval based on Primitives of Color Moments
Presentation transcript:

Fengyao Yan fyan@email.sc.edu Faster skyline searching using Hilbert R-tree Improvement of the existing shooting star algorithm Fig. 1 An example of Skyline[1] Fig. 2 shoot stars algorithm for skyline searching[1] R-tree was invented in 80s and is extremely use for nearest neighbors searching, but It is not the fastest Data-structure. Security problem and your credentials to solve the problem – layman's terms only Importance of the problem and the impact of the technology you are proposing – layman's terms only Fig. 3 R-tree example [1] Kossmann, D., Ramsak, F. and Rost, S., 2002, August. Shooting stars in the sky: An online algorithm for skyline queries. In Proceedings of the 28th international conference on Very Large Data Bases (pp. 275-286). VLDB Endowment. Fengyao Yan fyan@email.sc.edu

Faster skyline searching using Hilbert R-tree Property of Hilbert Curve: If two points in a 2D space is close to each other, then they are close on a Hilbert curve (they have similar Hilbert value). Fig. 4 An example of Hilbert Curve Adopt Hilbert Curve and construct R-tree according to pre-calculated Hilbert Value The Hilbert R-tree is much faster than Original the R-tree. Fig. 4 An example of Hilbert R-tree R-tree Your Name Email

Conclusion Your Name Email   ED. 10^4 ED. 10^5 ED. 10^6 UEDL. 10^6 UEDM. 10^5 Brute-force 0.0026 0.0717 0.5109 0.0654 0.3712 R-tree 0.0172 0.0239 0.0279 0.0247 10.1514 Hilbert R-tree 0.0099 0.0158 0.0195 0.0205 0.3700 By adopting Hilbert R-tree the original Shooting Stars Algorithm shows 20%-1005% performance improvements Note that we are not modifying the original Shooting Stars algorithm, we improve the performance by using an improved data structure which is heavily used by the original Shooting Stars algorithm. ED: evenly distributed dataset UEDL: Unevenly distributed dataset less skyline points UEDM: Unevenly distributed dataset more skyline points Unit: seconds recap your contribution and emphasize your contributions Your Name Email