Anooshiravan Sharabiani, Houshang Darabi

Slides:



Advertisements
Similar presentations
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Advertisements

Word Spotting DTW.
Yasuhiro Fujiwara (NTT Cyber Space Labs)
Exact algorithms for the minimum latency problem Exact algorithms for the minimum latency problem 吳邦一 黃正男 詹富傑 樹德科大 資工系.
Dual-domain Hierarchical Classification of Phonetic Time Series Hossein Hamooni, Abdullah Mueen University of New Mexico Department of Computer Science.
© 2009 IBM Corporation DUST: A Generalized Notion of Similarity between Uncertain Time Series Smruti R. Sarangi and Karin Murthy IBM Research Labs, Bangalore,
Similarity Search for Adaptive Ellipsoid Queries Using Spatial Transformation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa (Nara.
Fast Algorithm for Nearest Neighbor Search Based on a Lower Bound Tree Yong-Sheng Chen Yi-Ping Hung Chiou-Shann Fuh 8 th International Conference on Computer.
Energy Characterization and Optimization of Embedded Data Mining Algorithms: A Case Study of the DTW-kNN Framework Huazhong University of Science & Technology,
Fast Time Series Classification Using Numerosity Reduction DME Paper Presentation Jonathan Millin & Jonathan Sedar Fri 12 th Feb 2010.
Multimedia DBs.
Branch and Bound Similar to backtracking in generating a search tree and looking for one or more solutions Different in that the “objective” is constrained.
Themis Palpanas1 VLDB - Aug 2004 Fair Use Agreement This agreement covers the use of all slides on this CD-Rom, please read carefully. You may freely use.
Making Time-series Classification More Accurate Using Learned Constraints © Chotirat “Ann” Ratanamahatana Eamonn Keogh 2004 SIAM International Conference.
Probabilistic Similarity Search for Uncertain Time Series Presented by CAO Chen 21 st Feb, 2011.
Sequence Alignment Bioinformatics. Sequence Comparison Problem: Given two sequences S & T, are S and T similar? Need to establish some notion of similarity.
Distance Functions for Sequence Data and Time Series
1. 2 General problem Retrieval of time-series similar to a given pattern.
Dean H. Lorenz, Danny Raz Operations Research Letter, Vol. 28, No
By Makinen, Navarro and Ukkonen. Abstract Let A and B be two run-length encoded strings of encoded lengths m’ and n’, respectively. we will show an O(m’n+n’m)
Computing Sketches of Matrices Efficiently & (Privacy Preserving) Data Mining Petros Drineas Rensselaer Polytechnic Institute (joint.
E.G.M. PetrakisDimensionality Reduction1  Given N vectors in n dims, find the k most important axes to project them  k is user defined (k < n)  Applications:
A Multiresolution Symbolic Representation of Time Series
Dynamic Programming. Pairwise Alignment Needleman - Wunsch Global Alignment Smith - Waterman Local Alignment.
Hybrid Bounding Volumes for Distance Queries Distance Query returns the minimum distance between two geometric models Major application is path planning.
1 Theory I Algorithm Design and Analysis (11 - Edit distance and approximate string matching) Prof. Dr. Th. Ottmann.
Time Series I.
Exact Indexing of Dynamic Time Warping
Wei Cheng 1, Xiaoming Jin 1, and Jian-Tao Sun 2 Intelligent Data Engineering Group, School of Software, Tsinghua University 1 Microsoft Research Asia 2.
FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space
Space-Efficient Sequence Alignment Space-Efficient Sequence Alignment Bioinformatics 202 University of California, San Diego Lecture Notes No. 7 Dr. Pavel.
Edward Kent Jason Atkin Rong Qi 1. Contents Vehicle Routing Problem VRP in Forestry Commissioning Loading Bay Constraints Ant Colony Optimisation Handing.
Qualitative approximation to Dynamic Time Warping similarity between time series data Blaž Strle, Martin Možina, Ivan Bratko Faculty of Computer and Information.
Analysis of Constrained Time-Series Similarity Measures
Brandon Westover, Qiang Zhu, Jesin Zakaria, Eamonn Keogh
Dynamic Time Warping Algorithm for Gene Expression Time Series
Advanced Algorithm Design and Analysis (Lecture 13) SW5 fall 2004 Simonas Šaltenis E1-215b
Incorporating Dynamic Time Warping (DTW) in the SeqRec.m File Presented by: Clay McCreary, MSEE.
Mining Time Series.
Semi-Supervised Time Series Classification & DTW-D REPORTED BY WANG YAWEN.
Distributed Spatio-Temporal Similarity Search Demetrios Zeinalipour-Yazti University of Cyprus Song Lin
Identifying Patterns in Time Series Data Daniel Lewis 04/06/06.
Biological Sequence Comparison and Alignment Speaker: Yu-Hsiang Wang Advisor: Prof. Jian-Jung Ding Digital Image and Signal Processing Lab Graduate Institute.
k-Shape: Efficient and Accurate Clustering of Time Series
Exact indexing of Dynamic Time Warping
COMP 5331 Project Roadmap I will give a brief introduction (e.g. notation) on time series. Giving a notion of what we are playing with.
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
Accelerating Dynamic Time Warping Clustering with a Novel Admissible Pruning Strategy Nurjahan BegumLiudmila Ulanova Jun Wang 1 Eamonn Keogh University.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
D-skyline and T-skyline Methods for Similarity Search Query in Streaming Environment Ling Wang 1, Tie Hua Zhou 1, Kyung Ah Kim 2, Eun Jong Cha 2, and Keun.
NSF Career Award IIS University of California Riverside Eamonn Keogh Efficient Discovery of Previously Unknown Patterns and Relationships.
Sequence Alignment.
Xin Tong, Teng-Yok Lee, Han-Wei Shen The Ohio State University
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Xifeng Yan Philip S. Yu Jiawei Han SIGMOD 2005 Substructure Similarity Search in Graph Databases.
Naifan Zhuang, Jun Ye, Kien A. Hua
Keogh, E. , Chakrabarti, K. , Pazzani, M. & Mehrotra, S. (2001)
Singular Value Decomposition and its applications
MATCH A Music Alignment Tool Chest
A Fast Trust Region Newton Method for Logistic Regression
Seung-won Hwang, Kevin Chen-Chuan Chang
Fast Approximate Query Answering over Sensor Data with Deterministic Error Guarantees Chunbin Lin Joint with Etienne Boursier, Jacque Brito, Yannis Katsis,
Query in Streaming Environment
School of Computer Science & Engineering
Discovering Functional Communities in Social Media
Approximate Matching of Run-Length Compressed Strings
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
謝孫源 (Sun-Yuan Hsieh) 成功大學 電機資訊學院 資訊工程系
Time Series Filtering Time Series
Presentation transcript:

Asymptotic Dynamic Time Warping Calculation with Utilizing Values Repetition Anooshiravan Sharabiani, Houshang Darabi Samuel Harford, Elnaz Douzali, Fazle Karim Hereford Johnson, Shun Chen University of Illinois at Chicago

University of Illinois at Chicago The contributions Presenting BDTW, an algorithm that generalizes the utilization of the repetition of any values in time series to calculate exact DTW for any- two-valued time series, and to calculate a close approximation of DTW for more-than-two-valued time series with repetition of any values. Presenting a new DTW approximation method for all type of time series based on combination of Adaptive Piecewise Constant Approximation (APCA) OR S1S2 and BDTW method. We present BDTW Upper Bound as a new upper bound instead of the ED for pruning unhopeful warping alignments of time series with high value repetition rate. We present BDTW Lower Bound as a new lower bound for pruning unhopeful matches in similarity search of time series with high value repetition rate. University of Illinois at Chicago

Example of time series with values repetition University of Illinois at Chicago

Motivation: Warping Distance for Sparse Time Series AWarp algorithm approximates DTW on sparse time series: A. Mueen, N. Chavoshi, N. Abu-El-Rub, H. Hamooni, and A. Minnich, “Awarp: Fast Warping Distance for Sparse Time Series.” in Data Mining (ICDM) IEEE, 2016, pp. 350–359. The comparison of the application of AWarp and BDTW University of Illinois at Chicago

University of Illinois at Chicago Blocked DTW Contribution 1 University of Illinois at Chicago

Time Series Length-encoding TENC ={1(1),3(2),2(1),3(5),1(2),2(4),1(1)} T= {1,0,0,0,1,0,0,0,1,1,0,0,0, 1} TENC ={1(1),0(3), 1(1),0(3),1(2),0(3),1(1)} trr: time series repetition ratio where the original length is l and encoded length is l’   rr: the repetitiveness ratio of a dataset with K time series   rr: the repetitiveness ratio of a dataset with K time series of equal length of n University of Illinois at Chicago

Blocked Dynamic Time Warping Traditional DTW   Encoding D(0, 0) = 0, ∀ ijD(i, 0) = D(0, j) = ∞ X={0,1,1,1,1,0,1,0,0,0,1} XENC={0(1),1(4),0(1),1(1),0(3),1(1)} Y Y={1,0,0,1,0,1,1,1,0} YENC={1(1),0(2),1,0,1(3),0(1)} Blocked DTW YENC X XENC   Top Left Diagonal

Blocked Dynamic Time Warping Traditional DTW   Blocked DTW   Top Left Diagonal University of Illinois at Chicago   1 …

Blocked DTW Algorithm Required: series1 & series2 ←Encoded time series x & y Ensure: Output the distance d between series1 and series2 1: lx ← length(series1), ly ← length(series2) 2: D(1:lx, 1:ly) ← ∞ 3: D1,1 ← (a1-b1)2 4: for i ← 2 to lx 5: Di,1 ← Di-1,j + Ai(ai-b1)2 6: end for 7: for j ← 2 to ly 8: D1,j ← Di,j-1 + Bj(a1-bj)2 9: end for 10: for i ← 2 to lx 11: for j ← 2 to ly 12: top ← Di-1,j + Ai(ai-bj)2 13: diagonal ← Di-1,j-1 + max(Ai, Bj)(ai-bj)2 14: left ← Di,j-1 + Bj(ai-bj)2 15: Di,j ← min(top, diagonal, left) 16: end for 17: end for 18: return Dlx,ly   1 …

BDTW on More-Than-Two-Valued Time Series with Values Repetitions University of Illinois at Chicago

An example of comparison between DTW and BDTW UB alignments The exact DTW distance is 1.858176 The BDTW distance is 1.858309 University of Illinois at Chicago

Blocked DTW-Lower Bound Algorithm Required: series1 & series2 ←Encoded time series x & y Ensure: Output the distance d between series1 and series2 1: lx ← length(series1), ly ← length(series2) 2: D(1:lx, 1:ly) ← ∞ 3: D1,1 ← (a1-b1)2 4: for i ← 2 to lx 5: Di,1 ← Di-1,j + Ai(ai-b1)2 6: end for 7: for j ← 2 to ly 8: D1,j ← Di,j-1 + Bj(a1-bj)2 9: end for 10: for i ← 2 to lx 11: for j ← 2 to ly 12: top ← Di-1,j + Ai(ai-bj)2 13: diagonal ← Di-1,j-1 + max(Ai, Bj)(ai-bj)2 14: left ← Di,j-1 + Bj(ai-bj)2 15: Di,j ← min(top, diagonal, left) 16: end for 17: end for 18: return Dlx,ly   1 …

University of Illinois at Chicago Constrained Blocked DTW University of Illinois at Chicago

University of Illinois at Chicago Constrained Blocked DTW Algorithm   1 … University of Illinois at Chicago

Piecewise Constant Approximation Methods+ Blocked DTW Contribution 2 Piecewise Constant Approximation Methods+ Blocked DTW University of Illinois at Chicago

University of Illinois at Chicago   University of Illinois at Chicago

BDTW Upper Bound (BDTW UB) Contribution 3 BDTW Upper Bound (BDTW UB) A new upper bound which can be used instead of the Euclidean distance (and performs better than it) for pruning unhopeful warping alignments of time series with high repetition rate. University of Illinois at Chicago

University of Illinois at Chicago Upper triangle pruning   UB: ED UB:BDTW UB: Exact DTW University of Illinois at Chicago

BDTW Lower Bound (BDTW LB) Contribution 4 BDTW Lower Bound (BDTW LB) A new lower bound for pruning unhopeful matches in similarity search of time series with high repetition rate. We show that BDTW LB is competitive with famous lower bound techniques in terms of reduction of processing time in similarity search for time series with high level of value repetitiveness. University of Illinois at Chicago

Experiments and results University of Illinois at Chicago

University of Illinois at Chicago Case study 1 Method ED Time DTW Awarp BDTW CDTW CBDTW Accuracy 0.561 0.807 0.806 0.761 0.776 Processing time 51.96 8514.20 3237.53 877.57 936.56 307.50 University of Illinois at Chicago

University of Illinois at Chicago Case study 2 Dataset Repetition Rate Method ED DTW Awarp BDTW CDTW CBDTW HOUSE3,8 Aggergate 49.80% Accuracy 0.87 0.90 0.89 Processing time 5.67 143.85 174.98 57.41 71.93 33.20 HOUSE3,8 Freezer 91.13% 0.64 0.86 0.73 0.77 17.62 1394.83 117.23 28.76 697.42 14.95 University of Illinois at Chicago

BDTW UB as an Approximation Method of DTW University of Illinois at Chicago

University of Illinois at Chicago Using BDTW UB on Top of APCA as an Approximation Method for Any-Valued Time Series University of Illinois at Chicago

BDTW LB: a New Lower Bound to Increase the Efficiency of DTW University of Illinois at Chicago

University of Illinois at Chicago Using BDTW UB as a New Upper Bound to Speed Up All-Pairwise DTW Matrix Calculation University of Illinois at Chicago

CBDTW LB: Constrained Block DTW Computers Repetition Rate Accuracy Processing time (s) CBDTW Speed-up ratio DTW-full CDTW BDTW CBDTW SmallKitchenAppliances 86.4% 64% 61% 73% 715.4 770.7 20.3 16.0 44.7 48.2 1.3 LargeKitchenAppliances 81.6% 79% 71% 80% 72% 718.9 571.4 37.6 22.8 31.5 25.1 1.6 ScreenType 77.5% 40% 43% 39% 647.7 514.8 45.6 27.7 23.4 18.6 70.9% 70% 69% 65% 312.9 297.5 37.1 24.7 12.6 12.0 1.5 Earthquakes 68.1% 74% 124.7 118.5 16.7 11.1 11.2 10.6 RefrigerationDevices 63.9% 46% 51% 758.7 1,016.3 150.2 86.0 8.8 11.8 1.7 ElectricDevices 57.6% 60% 62% 6,172.2 4,568.5 2,111.5 1,087.6 5.7 4.2 1.9 wafer 48.8% 98% 100% 1,120.0 375.4 543.5 278.5 4.0 2.0 FaceFour 47.2% 83% 88% 2.7 3.0 1.4 0.5 5.3 5.9 Two_Patterns 34.4% 724.8 317.4 478.5 195.1 3.7 2.5 uWaveGestureLibrary_Y 31.8% 63% 66% 2,496.3 967.2 1,686.3 740.1 3.4 2.3 uWaveGestureLibraryAll 28.2% 89% 91% 95% 26,655.0 28,464.3 22,290.9 13,844.7 2.1 uWaveGestureLibrary_X 26.2% 2,688.6 1,064.9 1,932.7 850.1 3.2 uWaveGestureLibrary_Z 25.6% 67% 2,335.6 925.1 1,774.9 780.6 1.2 University of Illinois at Chicago

University of Illinois at Chicago Conclusion Blocked Dynamic Time Warping (BDTW) utilizes values repetition in time series to shrink the size of DTW matrix for reducing the processing time. BDTW algorithms obtains an upper bound and a lower bound of DTW. The Combination of BDTW upper bound and APCA provides a close approximation of DTW distance, even for time series without any value repetition. BDTW upper bound and lower bound can be used as pruning technique for time series with high value repetition. Constrained BDTW Algorithm is faster than Constrained DTW with similar or better accuracy for time series with high value repetition University of Illinois at Chicago