On the Effect of Trajectory Compression in Spatio-temporal Querying Elias Frentzos, and Yannis Theodoridis Data Management Group, University of Piraeus.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
1 Computational Complexity Size Matters!. 2 Suppose there are several algorithms which can all be used to perform the same task. We need some way to judge.
Online Algorithm Huaping Wang Apr.21
A Survey of Web Cache Replacement Strategies Stefan Podlipnig, Laszlo Boszormenyl University Klagenfurt ACM Computing Surveys, December 2003 Presenter:
James Hays and Alexei A. Efros Carnegie Mellon University CVPR IM2GPS: estimating geographic information from a single image Wen-Tsai Huang.
On-Line Discovery of Hot Motion Paths D. Sacharidis 1, K. Patroumpas 1, M. Terrovitis 1, V. Kantere 1, M. Potamias 2, K. Mouratidis 3, T. Sellis 1 1 National.
AE1APS Algorithmic Problem Solving John Drake
Remote Real-Time Trajectory Simplification Ralph Lange, Tobias Farrell, Frank Dürr, Kurt Rothermel Institute of Parallel and Distributed Systems (IPVS)
1 Chapter 20: Statistical Tests for Ordinal Data.
Continuous Estimation in WLAN Positioning By Tilen Ma Clarence Fung.
1 ECE 776 Project Information-theoretic Approaches for Sensor Selection and Placement in Sensor Networks for Target Localization and Tracking Renita Machado.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Mining Frequent Spatio-temporal Sequential Patterns
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Detection Theory Chapter 12 Model Change Detection Xiang Gao January 18, 2011.
On Map-Matching Vehicle Tracking Data
PROBABILISTIC ASSESSMENT OF THE QSAR APPLICATION DOMAIN Nina Jeliazkova 1, Joanna Jaworska 2 (1) IPP, Bulgarian Academy of Sciences, Sofia, Bulgaria (2)
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Evaluating Search Engine
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
INTEGRALS Areas and Distances INTEGRALS In this section, we will learn that: We get the same special type of limit in trying to find the area under.
Maintenance of Discovered Association Rules S.D.LeeDavid W.Cheung Presentation : Pablo Gazmuri.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Active Learning with Support Vector Machines
Evaluating Hypotheses
Dieter Pfoser, LBS Workshop1 Issues in the Management of Moving Point Objects Dieter Pfoser Nykredit Center for Database Research Aalborg University, Denmark.
Tracking Moving Objects in Anonymized Trajectories Nikolay Vyahhi 1, Spiridon Bakiras 2, Panos Kalnis 3, and Gabriel Ghinita 3 1 St. Petersburg State University.
Probabilistic Skyline Operator over sliding Windows Wan Qian HKUST DB Group.
Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Energy-efficient Self-adapting Online Linear Forecasting for Wireless Sensor Network Applications Jai-Jin Lim and Kang G. Shin Real-Time Computing Laboratory,
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
GPS Trajectories Analysis in MOPSI Project Minjie Chen SIPU group Univ. of Eastern Finland.
GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.
Chapter Seven Advanced Shell Programming. 2 Lesson A Developing a Fully Featured Program.
AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.
Wavelet-Based Multiresolution Matching for Content-Based Image Retrieval Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung.
Time-focused density-based clustering of trajectories of moving objects Margherita D’Auria Mirco Nanni Dino Pedreschi.
Constructing Optimal Wavelet Synopses Dimitris Sacharidis Timos Sellis
1 Scalable Exploratory Data Mining of Distributed Geoscientific Data Authors : E.C Shek, R.R Muntz, E. Mesrobian and K. Ng by Sona Srinivasan.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
03/05/03© 2003 University of Wisconsin Last Time Tone Reproduction If you don’t use perceptual info, some people call it contrast reduction.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Easiest-to-Reach Neighbor Search Fatimah Aldubaisi.
CS654: Digital Image Analysis
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
Sequence Alignment.
Week Aug-24 – Aug-29 Introduction to Spatial Computing CSE 5ISC Some slides adapted from the book Computing with Spatial Trajectories, Yu Zheng and Xiaofang.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Histograms for Selectivity Estimation, Part II Speaker: Ho Wai Shing Global Optimization of Histograms.
VizTree Huyen Dao and Chris Ackermann. Introducing example
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Arrays and Loops. Learning Objectives By the end of this lecture, you should be able to: – Understand what a loop is – Appreciate the need for loops and.
Clustering Data Streams A presentation by George Toderici.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Computer Graphics Lecture 07 Ellipse and Other Curves Taqdees A. Siddiqi
1 Minimum Bayes-risk Methods in Automatic Speech Recognition Vaibhava Geol And William Byrne IBM ; Johns Hopkins University 2003 by CRC Press LLC 2005/4/26.
A paper on Join Synopses for Approximate Query Answering
Polygonal Curve Simplification
Fast Approximate Query Answering over Sensor Data with Deterministic Error Guarantees Chunbin Lin Joint with Etienne Boursier, Jacque Brito, Yannis Katsis,
Spatial Online Sampling and Aggregation
Virtual-Time Round-Robin: An O(1) Proportional Share Scheduler
Time Relaxed Spatiotemporal Trajectory Joins
Communication Driven Remapping of Processing Element (PE) in Fault-tolerant NoC-based MPSoCs Chia-Ling Chen, Yen-Hao Chen and TingTing Hwang Department.
Supporting Online Analytics with User-Defined Estimation and Early Termination in a MapReduce-Like Framework Yi Wang, Linchuan Chen, Gagan Agrawal The.
Presentation transcript:

On the Effect of Trajectory Compression in Spatio-temporal Querying Elias Frentzos, and Yannis Theodoridis Data Management Group, University of Piraeus ADBIS, October

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 2 Problem Statement Background Compressing Trajectories Related work on Error Estimation Estimating the Effect of Compression ST Querying Evaluating the Effect of Compression ST Querying Experimental Results On the performance On the quality Conclusions and Future Work Talk Outline

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 3 Problem Statement Background Compressing Trajectories Related work on Error Estimation Estimating the Effect of Compression ST Querying Evaluating the Effect of Compression ST Querying Experimental Results On the performance On the quality Conclusions and Future Work Talk Outline

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 4 Trajectory is the data obtained from moving point objects and can be seen as a string in the 3D space Trajectory compression is a very promising field since moving objects recording their position in time produce large amounts of frequently redundant data Existing work on trajectory compression is mainly driven by research advances in the fields of line generalization and time series compression. Our interest is in lossy compression techniques which eliminate some repeated or unnecessary information under well-defined error bounds. Problem Statement (1)

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 5 The objectives for trajectory compression are: To obtain a lasting reduction in data size; To obtain a data series that still allows various computations at acceptable (low) complexity; To obtain a data series with known, small margins of error, which are preferably parametrically adjustable. Our goal is to calculate the mean error introduced in query results over compressed trajectory data, which is by no means a trivial task We argue that this mean error can be used for deciding whether the compressed data are suitable for the user needs We restrict our discussion in a special type of spatiotemporal query, the timeslice queries Problem Statement (2)

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 6 Problem Statement Background Compressing Trajectories Related work on Error Estimation Estimating the Effect of Compression ST Querying Evaluating the Effect of Compression ST Querying Experimental Results On the performance On the quality Conclusions and Future Work Talk Outline

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 7 Methods exploiting line simplification algorithms for compressing a trajectory are based on the so called Synchronous Euclidean Distance (SED) SED is the distance between the sampled point P i (x i, y i, t i ) being under examination, and the point of the line (P s, P e ) where the moving object would lie, supposed it was moving on this line, at time instance t i determined by the point under examination Compressing Trajectories: SED P s (x s,y s,t s ) P e (x e,y e,t e ) P i (x i,y i,t i ) SED(P,P)

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 8 The TD-TR algorithm (Meratnia and By, EDBT 2004) is a spatiotemporal extension of the quite famous Top – Down Douglas – Peucker algorithm which was originally used in cartography The algorithm tries (and achieves) to preserve directional trends in the approximated line using a distance threshold The TD-TR algorithm uses SED instead of the perpendicular distance It is a batch algorithm since it requires the full line at its start Compressing Trajectories: TD-TR algorithm

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 9 Opening window (OW) algorithms anchor the start point of a potential segment, and then attempt to approximate the subsequent data series with increasingly longer segments. The algorithm also achieves to preserve directional trends in the approximated line using a distance threshold The OPW-TR algorithm (Meratnia and By, EDBT 2004) also uses SED instead of the perpendicular distance It can be used as an online algorithm Compressing Trajectories: OPW-TR algorithm

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 10 Problem Statement Background Compressing Trajectories Related work on Error Estimation Estimating the Effect of Compression ST Querying Evaluating the Effect of Compression ST Querying Experimental Results On the performance On the quality Conclusions and Future Work Talk Outline

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 11 The only relative work estimates the average value of the Synchronous Euclidean Distance (SED), also termed as Synchronous Error, between an original trajectory and its approximation. There is no obvious way on how to use it in order to determine the error introduced in query results Related work on Error Estimation

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 12 Problem Statement Background Compressing Trajectories Related work on Error Estimation Estimating the Effect of Compression in ST Querying Evaluating the Effect of Compression in ST Querying Experimental Results On the performance On the quality Conclusions and Future Work Talk Outline

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 13 Estimating the Effect of Compression in ST Querying: Preliminaries Our goal is to provide closed-form formulas that estimate the number of false hits introduced in query results over compressed trajectory datasets Among the query types executed against trajectory datasets, we focus on a special type or range query, the so-called timeslice query Two types of errors are introduced in query results when executing a timeslice query over a trajectory dataset false negatives are the trajectories which originally qualified the query but their compressed counterparts were not retrieved false positives are the compressed trajectories retrieved by the query while their original counterparts are not qualifying it

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 14 Estimating the Effect of Compression in ST Querying: Analysis (1) We first calculate AvgP i,P / AvgP i,N, which is the average probability of a single compressed trajectory to be retrieved as false positive / negative, regarding all possible timeslice query windows with sides a b We then sum-up these average probabilities of all dataset trajectories in order to produce the global average probability The error introduced in the position of a trajectory can be calculated as a function of time

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 15 W Estimating the Effect of Compression in ST Querying: Analysis (2) We calculate the average probability of a compressed trajectory T i to be retrieved as false positive / negative regarding a timeslice query window at timestamp t j The quantity of timeslice query windows that may retrieve a compressed trajectory as false positive / negative at timestamp t j can be extracted geometrically We distinguish among 4 cases, regarding the signs of δx and δy values Finally by integrating the area A i,j over all the timestamps inside the unit space we obtain AvgP i,P / AvgP i,N δy i, j >0 δx i, j <0 [0,1] [0,1], t j A i,j W

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 16 Estimating the Effect of Compression in ST Querying: Analysis (3) Summing up the average probabilities of all trajectories and performing the necessary calculations, we obtain: where

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 17 Problem Statement Background Compressing Trajectories Related work on Error Estimation Estimating the Effect of Compression in ST Querying Evaluating the Effect of Compression in ST Querying Experimental Results On the performance On the quality Conclusions and Future Work Talk Outline

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 18 Evaluating the Effect of Compression in ST Querying The evaluation of this formula is a costly operation O(n m); its calculation requires to process the entire original dataset along with its compressed counterpart However, any compression algorithm evaluating SED, need also to calculate δx i,k δy i,k in every timestamp As a consequence, the evaluation of the average error in the query results, can be integrated in the compressions algorithm, introducing only a small overhead on its execution

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 19 Problem Statement Background Compressing Trajectories Related work on Error Estimation Estimating the Effect of Compression in ST Querying Evaluating the Effect of Compression in ST Querying Experimental Results On the performance On the quality Conclusions and Future Work Talk Outline

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 20 Experimental Study: Settings Datasets One real trajectory dataset of a fleet of trucks (273 trajectories, 112K entries) A synthetic dataset of 2000 trajectories generated using network-based data generator and the San Joaquin road network Implementation We implemented the TD-TR algorithm and compressed the real and synthetic datasets varying its threshold Experiments Average overhead introduced in the TD-TR algorithm Average number of false positives and false negatives in randomly distributed timeslice queries

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 21 Experimental Study: On the performance Scaling the value of the TD-TR threshold The algorithms execution time reduces as the value of the TD-TR threshold increases The overhead introduced in the algorithms execution, is typically small (bellow 7%) In absolute times, the overhead introduced never exceeds 0.2 milliseconds per trajectory Trucks dataset Synthetic dataset

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 22 Experimental Study: On the quality (1) Scaling the value of the TD-TR threshold The average number of false hits (negatives and positives) is linear with the value of the TD-TR compression threshold The average error in the estimation for the synthetic dataset is around 6%, varying between 0.2% and 14% In the trucks dataset the average error increases around 10.6%, mainly due to the error introduced in small values of TD-TR threshold Trucks dataset Synthetic dataset

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 23 Experimental Study: On the quality (2) Scaling the query size The average number of false hits (negatives and positives) is sub-linear with the size of the query The average error in the estimation for the synthetic dataset is around 2.9%, varying between 0.2% and 8.7% In the trucks dataset the average error increases around 7.5% Trucks dataset Synthetic dataset

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 24 Summary and Future Work We provided a closed formula of the average number of false negatives and false positives covering the case of uniformly distributed query windows and arbitrarily distributed trajectory data Through an experimental study we demonstrated the efficiency of the proposed model We illustrated the applicability of our model under real-life requirements – it turns out that the estimation of the model parameters introduce only a small overhead in the trajectory compression algorithm We presented the accuracy of our estimations, with an average error being around 6%. Future work: Extension of our model in nearest neighbor and general range queries Applicability of our model in the case of spatiotemporal warehouses

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 25 Acknowledgements Research partially supported by: GEOPKDD (Geographic Privacy-aware Knowledge Discovery and Delivery) project funded by the European Community under FP contract

Frentzos and Theodoridis, ADBIS 2007 On the Effect of Trajectory Compression in Spatiotemporal Querying 26 Thank you! On the Effect of Trajectory Compression in Spatiotemporal Querying