Time Relaxed Spatiotemporal Trajectory Joins

Slides:



Advertisements
Similar presentations
On the Effect of Trajectory Compression in Spatio-temporal Querying Elias Frentzos, and Yannis Theodoridis Data Management Group, University of Piraeus.
Advertisements

Indexing DNA Sequences Using q-Grams
Md. Mahbub Hasan University of California, Riverside.
Spatio-temporal Databases
Longest Common Subsequence
Correlation Search in Graph Databases Yiping Ke James Cheng Wilfred Ng Presented By Phani Yarlagadda.
On-Line Discovery of Flock Patterns in Spatio-Temporal Data * Marcos Vieira Petko Bakalov Vassilis Tsotras UC-Riverside ESRI UC-Riverside
Patch to the Future: Unsupervised Visual Prediction
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
--Presented By Sudheer Chelluboina. Professor: Dr.Maggie Dunham.
Spatio-temporal Databases Time Parameterized Queries.
Themis Palpanas1 VLDB - Aug 2004 Fair Use Agreement This agreement covers the use of all slides on this CD-Rom, please read carefully. You may freely use.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Efficient Join Processing over Uncertain Data - By Reynold Cheng, et all. Presented By Lydia & Usha.
Nearest Neighbor Retrieval Using Distance-Based Hashing Michalis Potamias and Panagiotis Papapetrou supervised by Prof George Kollios A method is proposed.
6/29/20151 Efficient Algorithms for Motif Search Sudha Balla Sanguthevar Rajasekaran University of Connecticut.
1 Dot Plots For Time Series Analysis Dragomir Yankov, Eamonn Keogh, Stefano Lonardi Dept. of Computer Science & Eng. University of California Riverside.
Time Series Data Analysis - II
Exact Indexing of Dynamic Time Warping
October 8, 2013Computer Vision Lecture 11: The Hough Transform 1 Fitting Curve Models to Edges Most contours can be well described by combining several.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
MINING RELATED QUERIES FROM SEARCH ENGINE QUERY LOGS Xiaodong Shi and Christopher C. Yang Definitions: Query Record: A query record represents the submission.
Similarity join problem with Pass- Join-K using Hadoop ---BY Yu Haiyang.
Similarity based Retrieval from Sequence Databases using Automata as Queries 作者 : A. Prasad Sistla, Tao Hu, Vikas howdhry 出處 :CIKM 2002 ACM 指導教授 : 郭煌政老師.
A Query Adaptive Data Structure for Efficient Indexing of Time Series Databases Presented by Stavros Papadopoulos.
CEMiner – An Efficient Algorithm for Mining Closed Patterns from Time Interval-based Data Yi-Cheng Chen, Wen-Chih Peng and Suh-Yin Lee ICDM 2011.
Efficient Processing of Top-k Spatial Preference Queries
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
Early Profile Pruning on XML-aware Publish- Subscribe Systems Mirella M. Moro, Petko Bakalov, Vassilis J. Tsotras University of California VLDB 2007 Presented.
Exact indexing of Dynamic Time Warping
Mining Document Collections to Facilitate Accurate Approximate Entity Matching Presented By Harshda Vabale.
UNIT 5.  The related activities of sorting, searching and merging are central to many computer applications.  Sorting and merging provide us with a.
Accelerating Dynamic Time Warping Clustering with a Novel Admissible Pruning Strategy Nurjahan BegumLiudmila Ulanova Jun Wang 1 Eamonn Keogh University.
15 th ACM GIS: Seattle, WA: Nov 7—9, 2007 TS2-Tree: An Efficient Similarity Based Organization for Trajectory Data Petko Bakalov Eamonn Keogh Vassilis.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
Indexing Time Series. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases Time Series databases Text databases.
VizTree Huyen Dao and Chris Ackermann. Introducing example
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
On the R ange M aximum-Sum S egment Q uery Problem Kuan-Yu Chen and Kun-Mao Chao Department of Computer Science and Information Engineering, National Taiwan.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
October 3, 2013Computer Vision Lecture 10: Contour Fitting 1 Edge Relaxation Typically, this technique works on crack edges: pixelpixelpixel pixelpixelpixelebg.
Spatial Approximate String Search. Abstract This work deals with the approximate string search in large spatial databases. Specifically, we investigate.
Fast Subsequence Matching in Time-Series Databases.
Spatial Data Management
SIMILARITY SEARCH The Metric Space Approach
Spatio-Temporal Databases
Evaluation of Relational Operations
Web Data Extraction Based on Partial Tree Alignment
Time Series Filtering Time Series
Sameh Shohdy, Yu Su, and Gagan Agrawal
Spatio-temporal Pattern Queries
Fitting Curve Models to Edges
Spatio-temporal Databases
Spatio-Temporal Databases
Finding Fastest Paths on A Road Network with Speed Patterns
Fast Sequence Alignments
Searching Similar Segments over Textual Event Sequences
Searching: linear & binary
Early Profile Pruning on XML-aware Publish-Subscribe Systems
Efficient Record Linkage in Large Data Sets
Connected Word Recognition
Spatio-temporal Databases
CSC 380: Design and Analysis of Algorithms
Continuous Motion Pattern Query
External Sorting Sorting is used in implementing many relational operations Problem: Relations are typically large, do not fit in main memory So cannot.
Efficient Processing of Top-k Spatial Preference Queries
Donghui Zhang, Tian Xia Northeastern University
Efficient Aggregation over Objects with Extent
Presentation transcript:

Time Relaxed Spatiotemporal Trajectory Joins Petko Bakalov, Marios Hadjieleftheriou, Vassilis J. Tsotras University of California, Riverside 4/5/2019

Overview Motivation Problem description Using symbolic representation algorithm Multiple origins sliding window evaluation Symbolic class separation join Conclusion and future work 4/5/2019

Motivation (1) Spatiotemporal data is generated in many novel applications Typical queries involve “range” or “nearest neighbor” Example: X time 4/5/2019

Motivation (2) In the previous paper (“Efficient Trajectory Joins using Symbolic Representations”) we address a novel query, the Trajectory Join: “identify all pairs of similar trajectories between two datasets”. Here we examine another interesting query, the Time Relaxed Spatiotemporal Trajectory Join (TRSTJ) which effectively which finds groups of moving objects that have followed similar patterns in different times. 4/5/2019

Problem description (1) A moving object trajectory is defined as a sequence of location/time-instant pairs. Given: two sets of object trajectories R and S, distance function D(), a threshold ε and time interval δt Time relaxed trajectory join result: set V of pairs <Ri, Sj>, where Ri Є R and Sj Є S, such that there exist time-intervals δti for R and δtj for S such that | δti | = | δtj | = |δt| and D(Rδti, Sδtj) < ε 4/5/2019

Problem description (2) ε |δtR| = |δtS| δtS R δtR Y X 4/5/2019

Proposed join evaluation We can use the trajectory join algorithm described in MDM05. The key ideas of the algorithm are: Use appropriate trajectory approximation to reduce the size of the problem Define a lower bound distance function for this approximation Using this distance function, prune as many trajectory pair similarity evaluations as possible. 4/5/2019

Approximation Used (1) The algorithm takes as input a trajectory of size n and divides it into m equal-sized frames The trajectory value in each frame is substituted with the average of the values and is discretized A trajectory is represented by the “symbolic string”: CCBCDB Time A B C D X 4/5/2019

Approximation Used (2) X D C B A As index structure we keep an array of “symbolic strings” representing the trajectories CCBCDB DAABCC BDDCBB 4/5/2019

Lower bound distance function We proposed the use of the following distance function, defined over symbolic representations of trajectories d() is the distance between two symbols K is the number of symbols representing time interval |δt| 4/5/2019

Time relaxed join algorithm (1) Step 1. Generate all possible substrings from both sets S and R with length k Step 2. Define an order among these substring by using as a score the distance between these strings and some randomly picked string called origin O: Step 3. By using sliding window algorithm find all pairs between the two sets having score less then the threshold. Step 4. Verify the result with the actual tr. data 4/5/2019

Time relaxed join algorithm (2) CCBCDB DAABCC BDDCBB 1.1: CCBC 1.2: CBCD 1.3: BCDB 2.1: DAAB 2.2: AABC 2.3: ABCC 3.1: BDDC 3.2: DDCB 3.3: DCBB Step 1 Step 2 & 3 2.2 2.1 2.3 1.3 3.1 3.3 1.2 1.1 3.2 O 2.2 2.1 2.3 1.3 3.1 3.3 1.2 1.1 3.2 O 2.2 2.1 2.3 1.3 3.1 3.3 1.2 1.1 3.2 O Step 4 Trajectories 2 and 3 satisfy the criteria 4/5/2019

Experimental Results 4/5/2019

Problems There is a huge disproportion between the number of candidate pairs and the number of pairs in the join result caused by the very large number of trajectory segments (subsequences) We propose two heuristics to improve the performance Using Multiple origins Class separation Join 4/5/2019

Multiple Origins we suggest to use multiple origins O1 O2 O3 O4 … On instead of just one. In order for a pair of sequences to be reported as candidate pair by the sliding window algorithm, their difference in distances to the origins should be less than the threshold distance for every origin (drastic reduction on # of candidate pairs) 4/5/2019

Multiple Origins 4/5/2019

Multiple Origins - Experiments 4/5/2019

Class Separation Join Each class is associated with a spatial area. Sequences are assigned to the class in whose area the corresponding trajectory crossed during some (i.e. the first) frame of the sequence. We can decide whether it is possible to have a match between two trajectory sequences by looking at the classes to which they belong. 4/5/2019

Class Separation Join 4/5/2019

Class Separation Join The trajectory classes are scanned in a lexicographic order. For every class we create a sorted list combining the elements of all classes that lie on distance from the current one. Then we process this list with the already discussed algorithm. 4/5/2019

Class Separation Join 4/5/2019

Conclusion The basic symbolic join algorithm is not an efficient solution for this problem. Two simple heuristics can drastically reduce query time for the TRSTJ query. Future extension of these techniques can be the use of different distance metrics. 4/5/2019