Xiao Zhang 1, Wang-Chien Lee 1, Prasenjit Mitra 1, 2, Baihua Zheng 3 1 Department of Computer Science and Engineering 2 College of Information Science.

Slides:



Advertisements
Similar presentations
Searching Trajectories by Locations – An Efficiency Study Zaiben Chen 1, Heng Tao Shen 1, Xiaofang Zhou 1, Yu Zheng 2, Xing Xie 2 1 The University of Queensland.
Advertisements

Mobility Increase the Capacity of Ad-hoc Wireless Network Matthias Gossglauser / David Tse Infocom 2001.
Spatio-temporal Databases
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Indexing and Range Queries in Spatio-Temporal Databases
Danzhou Liu Ee-Peng Lim Wee-Keong Ng
Similarity Search on Bregman Divergence, Towards Non- Metric Indexing Zhenjie Zhang, Beng Chi Ooi, Srinivasan Parthasarathy, Anthony K. H. Tung.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Nearest Neighbor Queries using R-trees Based on notes from G. Kollios.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
1 NNH: Improving Performance of Nearest- Neighbor Searches Using Histograms Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research) Chen Li (UC Irvine)
A Crowd-Enabled Approach for Efficient Processing of Nearest Neighbor Queries in Incomplete Databases Samia Kabir, Mehnaz Tabassum Mahin Department of.
Nearest Neighbor Search in Spatial and Spatiotemporal Databases
June 3, 2015Windows Scheduling Problems for Broadcast System 1 Amotz Bar-Noy, and Richard E. Ladner Presented by Qiaosheng Shi.
2-dimensional indexing structure
Spatio-temporal Databases Time Parameterized Queries.
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Liang Jin (UC Irvine) Nick Koudas (AT&T) Chen Li (UC Irvine)
Spatial Indexing for NN retrieval
Spatial Indexing SAMs. Spatial Access Methods PAMs Grid File kd-tree based (LSD-, hB- trees) Z-ordering + B+-tree R-tree Variations: R*-tree, Hilbert.
Spatial Queries Nearest Neighbor and Join Queries.
Spatial Information Systems (SIS) COMP Spatial access methods: Indexing.
An Incremental Refining Spatial Join Algorithm for Estimating Query Results in GIS Wan D. Bae, Shayma Alkobaisi, Scott T. Leutenegger Department of Computer.
Spatial Queries Nearest Neighbor Queries.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
R-Trees: A Dynamic Index Structure for Spatial Data Antonin Guttman.
DSI : A Fully Distributed Spatial Index for Location-based Wireless Broadcast Services Sungwon Jung Dept. of Computer Science and Engineering Sogang University.
Massively Distributed Database Systems Broadcasting - Data on air Spring 2014 Ki-Joune Li Pusan National University.
AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.
Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Michael Vassilakopoulos.
Power Save Mechanisms for Multi-Hop Wireless Networks Matthew J. Miller and Nitin H. Vaidya University of Illinois at Urbana-Champaign BROADNETS October.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
A Quantitative Analysis and Performance Study For Similar- Search Methods In High- Dimensional Space Presented By Umang Shah Koushik.
The Application of The Improved Hybrid Ant Colony Algorithm in Vehicle Routing Optimization Problem International Conference on Future Computer and Communication,
A Cyclic-Executive-Based QoS Guarantee over USB Chih-Yuan Huang,Li-Pin Chang, and Tei-Wei Kuo Department of Computer Science and Information Engineering.
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Efficient Processing of Top-k Spatial Preference Queries
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
Exact indexing of Dynamic Time Warping
Clustering of Uncertain data objects by Voronoi- diagram-based approach Speaker: Chan Kai Fong, Paul Dept of CS, HKU.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept. of Electronic.
Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
Location-based Spatial Queries AGM SIGMOD 2003 Jun Zhang §, Manli Zhu §, Dimitris Papadias §, Yufei Tao †, Dik Lun Lee § Department of Computer Science.
Massively Distributed Database Systems Broadcasting - Data on air Spring 2015 Ki-Joune Li Pusan National University.
DASFAA 2005, Beijing 1 Nearest Neighbours Search using the PM-tree Tomáš Skopal 1 Jaroslav Pokorný 1 Václav Snášel 2 1 Charles University in Prague Department.
ICDE-2006 Subramanian Arumugam Christopher Jermaine Department of Computer Science University of Florida 22nd International Conference on Data Engineering.
Continuous Monitoring of Spatial Queries in Wireless Broadcast Environments.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Strategies for Spatial Joins
SIMILARITY SEARCH The Metric Space Approach
Spatial Queries Nearest Neighbor and Join Queries.
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
On the Critical Total Power for k-Connectivity in Wireless Networks
KD Tree A binary search tree where every node is a
Nearest Neighbor Queries using R-trees
Spatio-temporal Pattern Queries
Chapter 4: Probabilistic Query Answering (2)
Efficient Processing of Top-k Spatial Preference Queries
Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research)
Presentation transcript:

Xiao Zhang 1, Wang-Chien Lee 1, Prasenjit Mitra 1, 2, Baihua Zheng 3 1 Department of Computer Science and Engineering 2 College of Information Science and Technology The Pennsylvania State University 3 School of Information Systems, Singapore Management University EDBT, Nantes, France, 03/28/2008

 Background  Problem Analysis  New TNN Algorithms  Optimization  Experiments  Conclusions & Future Work

 What is TNN? ◦ S is a set of banks ◦ R is a set of restaurants ◦ TNN distance = 5+1 = 6

 What is TNN?  Given a query point p and two datasets S and R, TNN returns a pair of objects (s, r) such that ∀(s’, r’)∈S×R, dis(p, s) + dis(s, r) ≤ dis(p, s’) + dis(s’, r’) where dis(p,s) is the Euclidean distance between p and s.  First proposed by Zheng, Lee and Lee [1]. [1] B. Zheng, K.C.Lee and W.-C.Lee. Transitive nearest neighbor search in mobile environments. SUTC 2006

 Server has all the data and broadcasts data in forms of radio signals in channels.  Mobile clients (cell phones and PDAs) tune in to broadcast channels, download necessary data and process queries.  Broadcast VS. on- demand ◦ Support an arbitrary number of mobile devices to have simultaneous access ◦ Efficient use of limited bandwidth ◦ Light workload on the server side

 Assumption: ◦ Zheng, Lee and Lee assumed a single broadcast channel. ◦ Based on existing technology (dual-mode, dual- standby cell phone), we assume multiple channels. ◦ A mobile client can access information in multiple channels simultaneously  Challenges : ◦ How to utilize the parallel processing ability of mobile clients to facilitate query processing? ◦ How to reduce access time? ◦ How to reduce energy consumption?

 1. We developed two new algorithms for TNN query in multi-channel access environment.  2. We proposed two new distance metrics (MinTransDist and MinMaxTransDist) so that our new algorithms efficiently reduce search cost.  3. We proposed an optimization technique to reduce energy consumption.

 1. Two broadcast channels, for S and R  2. 2-dim points  3. Air-indexing: R-tree [2]  4. Broadcast in depth-first order, in order to avoid back-tracking  5. (1, m) interleaving [3]  6. performance metrics (in # of pages): ◦ Access time ◦ Tune-in time [2] A. Guttman. R-trees: a dynamic index structure for spatial searching. inSigmod’84 [3] T.Imielinski, S.Viswanathan, and B.Badrinath. Data on air: organization and access. TKDE 1997

 Randomly choose ANY pair of objects (s’, r’ ), use the trans. dist. as a search range  Guarantee to enclose the answer pair (s, r)

 Theorem [1] : ◦ the transitive distance determined by any pair of objects (s, r) is an upper bound.  General ideas of answering TNN queries: ◦ Estimate: find a search range from the query point p by searching the index ◦ Filter:filter unqualified data objects in the search range determined earlier to find the pair of objects with minimum transitive distance.

 Deficiencies of existing algorithms: ◦ Approximate-TNN-Search:  Uses an equation to estimate the search range in the first step  Search range may be too large or too small ◦ Window-Based-TNN-Search:  Two sequential NN searches in estimation step  Search range estimation is done in sequential order  Large access time

 Algo 1: Double-NN-Search ◦ Issue two NN queries in estimation step ◦ p’s NN in S, and p’s NN in R ◦ (s 1, r 2 )

 Hybrid-NN-Search ◦ Increases interaction between two channels ◦ Uses result of the finished NN to guide the unfinished NN in order to reduce search range ◦ Uses new distance metrics to perform branch-and- bound ◦ Treat TNN distance as a whole

 NN in Channel 1 finishes first  Already found s=p.NN(S)  Looking for r 2, instead of r 1

 NN in channel 2 finishes first  Already found r=p.NN(R)  Looking for s 2 instead of s 1  Use new criteria when searching the index  Need new distance metrics for branch&bound

 MinTransDist: ◦ Lower bound for trans. dist. from p to an MBR to r.  MinMaxTransDist: ◦ Upper bound for trans. dist. from p to an MBR to r.  Details given in the paper.

 Algorithm description: ◦ If the two NN searches in both channels are not finished, follow the Double-NN algorithm ◦ If the NN search in Channel 1 (Dataset S) finishes first, let s=p.NN(S), use s as the new query point and perform NN on the remaining portion of R-tree for dataset R. ◦ If the NN search in Channel 2 (Dataset R) finishes first, change distance metrics, use MinTransDist and MinMaxTransDist to perform branch-and-bound. Find an s which can minimize the transitive distance.

 Updating and pruning strategy ◦ Use queue to keep potential MBRs, sorted based on their arrival time ◦ Case 2 (s=p.NN(S) finishes first):  Switch NN query point to the s  Initial upper bound update  If there is an intermediate result r’, update the upper bound with dis(p, s)+dis(s, r’ )  Scan the queue of MBRs and use dist. metr. in traditional NN queries.

 Updating and pruning strategy (cont.) ◦ Case 3 (r=p.NN(R) finishes first):  If there is an intermediate result s’, use dis(p, s’)+dis(s’, r) as the new upper bound  Then scan all the MBRs in the queue, use z=min Mi∈MBR_queue {MinMaxTransDist(p, M i, r)} to update the upper bound.  In traversal, use MinMaxTransDist to update the upper bound; use MinTransDist for pruning

 Example for pruning:

 Goal: reduce energy consumption  Analysis: ◦ Previous algorithms minimize the search range in the Estimate Step by issuing “exact” search ◦ Energy consumption in Filter Step is low ◦ Energy consumption in Estimate Step is high  Approach: ◦ use “approximate” search in Estimate Step to save energy in this step

 Approximate Search: ◦ Relax the pruning condition ◦ Use ratio of overlapping area to estimate the probability ◦ Compare the ratio with a threshold α

 How to determine α ?  factors: ◦ R-tree height and node depth  Use small α on the root and large α on leaves ◦ Difference in densities of the two datasets involved  Small α or 0 on the dataset with smaller density α01 exact searchapproximate search

 Dataset 1: ◦ 39,000 * 39,000 square region ◦ Densities: , , , , , , , ◦ # of points: 152, 382, 960, 2411, 6055, 15210, 38206,  Dataset 2: ◦ 39,000 * 39,000 square region ◦ # of points: 2,000 – 30,000 with 2,000 increment

 R-tree as air index  Broadcast in depth-first order  STR packing algorithm [3]  (1, m) interleaving [2]  1,000 query points generated for each of the experiments ParameterSize Index pointer2 bytes Coordinate4 bytes Data content1k bytes Page capacity64 – 512 bytes [3] S.Leutenegger, M.Lopez and J.Edginton. Str: a simple and efficient algorithm for r-tree packing. ICDE 1997 [2] T.Imielinski, S.Viswanathan, and B.Badrinath. Data on air: organization and access. TKDE 1997

 Algorithms with exact search: ◦ Access time: Double-NN and Hybrid-NN have the same access time, which is smaller than Window- Based ◦ 1.8 ≥ size(S) / size(R) ≥ 1 / 40

 Algorithms with exact search: ◦ Tune-in time: when 0.01 ≤ size(S)/size(R) ≤ 0.4 Hybrid-NN gives the best tune-in time

 ANN vs. eNN ◦ Improvement in tune-in time ranges from 11%-20%

 Hybrid algorithm with ANN:

 Double-NN and Hybrid-NN effectively reduce access time  Cases in which our algorithms reduces tune- in time are stated and discussed  Optimization technique effectively reduces tune-in time of all three algorithms

 Generalized TNN queries in broadcast environment: ◦ More than 2 datasets are involved ◦ Visiting order not specified ◦ Complete route query  Using new distance metrics in disk based environment

 Any questions?

 Def 1: (MinTransDist) ◦ Given two points p and r, and an MBR M S, MinTransDist(p, M S,r) finds a point s on M S such that MinTransDist(p, M S,r)=dis(p, s)+dis(s, r) and for any point s’≠ s, s’ ∈M S dis(p, s’)+dis(s’, r) ≥ MinTransDist(p, M S,r)

 Def 2: (MaxDist) ◦ Given two points p and r, and a line segment ℓ, MaxDist(p, ℓ, r) = max i=I,2 {dis(p, v i )+dis(v i, r), where v i, (i=1, 2) are the two end points of ℓ ◦ MaxDist(p, ℓ, r) gives a tight upper bound for all the transitive distances from p to any points on ℓ, to r. p r ℓ

 Def 3: (MinMaxTransDist) ◦ Given two points p and r, and an MBR M S, MinMaxTransDist(p, M S, r) = min 1≤i≤4 { MaxDist(p,ℓ i, r ) } where ℓ i (1≤i≤4) are the four sides of MBR M S  Lemma: ◦ Given a starting point p, an ending point r, and an MBR M S enclosing a point dataset S, ∃s ∈ S, such that dis(p, s)+dis(s, r) ≤ MinMaxTransDist(p, M S, r)