AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.

Slides:



Advertisements
Similar presentations
The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
Advertisements

Fast Algorithms For Hierarchical Range Histogram Constructions
Indexing and Range Queries in Spatio-Temporal Databases
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part C Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Pivoting M-tree: A Metric Access Method for Efficient Similarity Search Tomáš Skopal Department of Computer Science, VŠB-Technical.
Indexing Network Voronoi Diagrams*
2-dimensional indexing structure
--Presented By Sudheer Chelluboina. Professor: Dr.Maggie Dunham.
Deterministic Wavelet Thresholding for Maximum-Error Metrics Minos Garofalakis Bell Laboratories Lucent Technologies 600 Mountain Avenue Murray Hill, NJ.
Accessing Spatial Data
Spatio-Temporal Databases
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis.
1 R-Trees for Spatial Indexing Yanlei Diao UMass Amherst Feb 27, 2007 Some Slide Content Courtesy of J.M. Hellerstein.
Chapter 3: Data Storage and Access Methods
An Incremental Refining Spatial Join Algorithm for Estimating Query Results in GIS Wan D. Bae, Shayma Alkobaisi, Scott T. Leutenegger Department of Computer.
Spatial Queries Nearest Neighbor Queries.
R-tree Analysis. R-trees - performance analysis How many disk (=node) accesses we’ll need for range nn spatial joins why does it matter?
Spatio-Temporal Databases. Introduction Spatiotemporal Databases: manage spatial data whose geometry changes over time Geometry: position and/or extent.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science.
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
1 Indexing Large Trajectory Data Sets With SETI V.Prasad Chakka Adam C.Everspaugh Jignesh M.Patel University of Michigan Presented by Guangyue Jia.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
R-Trees: A Dynamic Index Structure for Spatial Data Antonin Guttman.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Michael Vassilakopoulos.
KNR-tree: A novel R-tree-based index for facilitating Spatial Window Queries on any k relations among N spatial relations in Mobile environments ANIRBAN.
The BIRCH Algorithm Davitkov Miroslav, 2011/3116
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
AAU Novel Approaches to the Indexing of Moving Object Trajectories Presented by YuQing Zhang  Dieter Pfoser Christian S. Jensen Yannis Theodoridis.
A Query Adaptive Data Structure for Efficient Indexing of Time Series Databases Presented by Stavros Papadopoulos.
CSIS7101 – Advanced Database Technologies Spatio-Temporal Data (Part 1) On Indexing Mobile Objects Kwong Chi Ho Leo Wong Chi Kwong Simon Lui, Tak Sing.
Fast Subsequence Matching in Time-Series Databases Author: Christos Faloutsos etc. Speaker: Weijun He.
Leonardo Guerreiro Azevedo Geraldo Zimbrão Jano Moreira de Souza Approximate Query Processing in Spatial Databases Using Raster Signatures Federal University.
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Reporter : Yu Shing Li 1.  Introduction  Querying and update in the cloud  Multi-dimensional index R-Tree and KD-tree Basic Structure Pruning Irrelevant.
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
Clustering of Uncertain data objects by Voronoi- diagram-based approach Speaker: Chan Kai Fong, Paul Dept of CS, HKU.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Performance Comparison of xBR-trees and R*-trees for Single Dataset Spatial Queries Performance Comparison of xBR-trees and R*-trees for Single Dataset.
R-trees: An Average Case Analysis. R-trees - performance analysis How many disk (=node) accesses we ’ ll need for range nn spatial joins why does it matter?
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
Spatio-Temporal Databases
ICDE-2006 Subramanian Arumugam Christopher Jermaine Department of Computer Science University of Florida 22nd International Conference on Data Engineering.
Spatio-Temporal Databases. Term Project Groups of 2 students You can take a look on some project ideas from here:
Rethinking Choices for Multi-dimensional Point Indexing You Jung Kim and Jignesh M. Patel University of Michigan.
Jeremy Iverson & Zhang Yun 1.  Chapter 6 Key Concepts ◦ Structures and access methods ◦ R-Tree  R*-Tree  Mobile Object Indexing  Questions 2.
A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research.
Mehdi Kargar Department of Computer Science and Engineering
Spatio-Temporal Databases
Spatial Indexing I Point Access Methods.
Database Management Systems (CS 564)
Spatio-temporal Pattern Queries
Spatial Online Sampling and Aggregation
James B. Orlin Presented by Tal Kaminker
Spatio-Temporal Databases
Distributed Probabilistic Range-Aggregate Query on Uncertain Data
The BIRCH Algorithm Davitkov Miroslav, 2011/3116
Database Systems (資料庫系統)
Efficient Cost Models for Spatial Queries Using R-Trees
R-trees: An Average Case Analysis
Efficient Processing of Top-k Spatial Preference Queries
Donghui Zhang, Tian Xia Northeastern University
Efficient Aggregation over Objects with Extent
Presentation transcript:

AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A. Nascimento

AAU 2 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 3 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 4 Introduction  Problem  Splitting trajectories optimally with the goal of minimizing the expected number of I/Os  Focus on  Spatio-temporal queries over historical trajectory data Using index structures that use MBRs  Past Solutions  A single MBR  Each line segment an MBR  Split trajectories and the resulting sub-trajectories independently by MBRs  Main contributions  An analytical cost model and a dynamic programming solution for splitting a given set of trajectories optimally (in terms of expected I/Os).  Another cost model and algorithm for segments updated incrementally

AAU 5 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 6 Background and Motivation  R-tree Review  Leaf nodes: MBRs of data objects and pointers to the object  Internal nodes: sequence of pairs of an MBR and a pointer to a child node  Why spilt trajectories?  Offer a great potential for improving the performance of sptatio-temporal range queries.  Splitting a trajectory → total volume of MBRs↓ → intersect range queries ↓ → data pages to be retrieved ↓  The actual amount of volume reduction depends on: the number of splits and split points

AAU 7 Background and Motivation  Trajectory splits and query sizes  Not only minimize the volume of trajectory approximations  But also take into account query sizes  Advantages: Include past solutions Offers a potential for tuning the index An average query size is not restricted to static datasets

AAU 8 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 9 Trajectory splitting Methods - Optimal Trajectory Splitting  Objectives  Derive an analytical cost model Estimate the expected number of I/Os Yielded by a given split of a trajectory and a given query size  Introduce an algorithm based on this cost model  A cost model for splitting trajectories  Definition1: a trajectory T= with p i =(x i, y i,t i ) T[u,v]=<p u,…p v ] T=T[1,t] T is split into m segments(1 ≦ m ≦ t-1) : T=(T[I,i 1 ],…,T[i m-1,t]) i 1,…i m-1 :split positions B T =(MBR(T[1,i 1 ]),…, MBR(T[i m-1,t]))  The set of MBR approximations of all possible decompositions of T into m segments Decomp(T,m)={(B 1,…,B m )}, B 1 =MBR(T[1,i 1 ]),…, B m =MBR(T[i m-1.t]}  q intersects B T → q intersects k segments →yield k I/Os →P(q∩B T ;k)  Definition2: E BT (q)= The overall expected number of I/Os.

AAU 10 Trajectory splitting Methods - Optimal Trajectory Splitting  A cost model for splitting trajectories  Lemma 1: P(q∩B i ): the probability that a q intersects the i th segment in B T  S: the area where a query q can fall  Extended MBR Ext q (B i )  Lemma 1:  Minimizing the performance means finding  Definition3:  The minimal number of I/Os over all possible splits:

AAU 11 Trajectory splitting Methods - Optimal Trajectory Splitting  Dynamic Programming Algorithm  Object: Solve Finds the best possible split of T for each value of m  Theorem 1  Proof ↓  Last segment (starting at u) is fixed by assumption, T[1,u] be split into m-1 segments  The sum of volumes of the extended MBRs for the first m-1segments is minimal  The whole sum to be minimal.  Consider all possible values of start positions u in the range 1<u<t for the last segment of T

AAU 12 Trajectory splitting Methods – Heuristic Trajectory Splitting  Objective Find a more efficient and incremental method can produce near optimal results for trajectories are updated continuously large datasets containing long trajectories  A Cost Model for Optimal Segment Size  “Constant-slope trajectories”: segments of equal size  Definition:  Lemma 2:  Definition: C: the number of elementary segments in B i

AAU 13 Trajectory splitting Methods – Heuristic Trajectory Splitting  A Cost Model for Optimal Segment Size  Theorem 2. Given a query q, and increments ( △ x, △ y, △ t), the function g has a global real minimum C opt with respect to c.  Theorem3. C opt is a solution to  Proof: C opt gives smaller f(s) values → ∑f(s i ) is minimal → By Lemma 2,Therorem3  Apply this model to an arbitrary trajectory: compute the average of increments( △ x, △ y, △ t)

AAU 14 Trajectory splitting Methods – Heuristic Trajectory Splitting  Linear Time Trajectory Splitting  Object: Apply the C opt method.  Determine a suitable number of points that should be buffered before applying the split policy.  Iteratively collecting points until becomes true.  Means at least one possible split will result in better I/O expection.  Linear time trajectory splitting Algorithm --- LinearSplit

AAU 15 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 16 Experimental Results  Comparison Between  OptimalSplit: dynamic programming-based algorithm  LinearSplit: linear time algorithm  NoSplit: approximate each trajectory by a single MBR  FullSplit: approximate each line segment of a trajectory individually by an MBR  HKTG: DPSplit, volume oriented split policy  Results  Number of Disk I/Os No matter varying query size and varying database size, the I/O performance of our algorithms is always significantly better than the others,. Except FullSplit in varying database size, but its performance degrades much faster with increasing database size.  Index Building Time 1. Varying query size: Our algorithms exhibit a good balance between trajectory splitting time and insertion time. As query size increases, our index building times decreases. 2. Varying Database Size Our algorithms scale linearly at much slower rate than all other ones. 3 4

AAU 17 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 18 Related Work  Most spatio-temporal index structures proposed are based on R-tree. ---Mokbel.. Saptio-temporal Access Methods  Three main approaches  Time is simply treated as an additional spatial dimension. ---Theodoridis,.. Spatio-Temporal Indexing for Large Multimedia Applications TB-tree: Insertion split strategy is oriented towards trajectory preservation. ---Pfoser,… Novel Approaches to the Indexing of Moving Object Trajectories Leads to Inefficient indices and leading to a high degree of overlap among the MBRS.  Time and space are treated differently within a combined indexing scheme. ---Chakke,.. Indexing Large Trajectory Sets with SETI SEB Tree ---Songs,… An Approach to Index Continuously Moving Objects They are not compatible with our cost models since they don’t use MBRs.  Time is also treated as differently from space and it is to have virtual and incrementally maintained 2-dimensional R-trees for each point in time. ---Nascimento,.. Towards Historical R-tree Suffers from a prohibitively large overhead when indexing very dynamic scenarios, not suite for trajectory data.

AAU 19 Related Work  Recent Work--- improve the first approach  Replace MBRs by different approximation Trim the corners of trajectories’ MBRs to obtain a bounding octagon prism. ---Zhu,…Discovering Similar Multidimensional Trajectories  Splitting trajectories Give a total number of allowed splits for a whole set of trajectories to reduce the amount of approximations’ empty space. ---Hadjieleftheriou,.. Efficient indexing of Spatiotemporal Objects

AAU 20 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 21 Conclusion  Split trajectories and take into account average query sizes  A cost model for predicating the number of data page accesses and a trajectory splitting algorithm  A linear time splitting algorithm  The algorithms scale well respect to database size for both query performance and index building time.  Future Works  Extending the cost model to better understand the effect of directory level page accesses.  Designing optimized split policies for directory pages of spatio-temporal indices.

AAU 22 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 23 Strengthens and Weaknesses  Strengthens  The proof of each lemma and theorem is quite clearly and detailed.  Good related work  Weaknesses  Some definition is not clear. S and Ext q (B i )  Less pseudecode. No pseudecode for dynamic Programming algorithm

AAU 24 Contents Introduction 1 Background and Motivation 2 Trajectory Splitting Methods 3 Experimental Results 4 Related Work 5 Conclusion 6 Strengths and Weaknesses 7 Relate to my Project 8

AAU 25 Relate to my Project  My project  Compare some methods to index the historical trajectories data by Oracle  Find a new method to improve the indexing in some aspects.  Relate…  Give me a method about the realm I’m researching.  Give me an optional orientation about my new method.

AAU Presented by YuQing Zhang

AAU 27 Questions?