Doruk Sart, Abdullah Mueen, Walid Najjar, Eamonn Keogh, Vit Niennatrakul 1.

Slides:



Advertisements
Similar presentations
Indexing Time Series Based on original slides by Prof. Dimitrios Gunopulos and Prof. Christos Faloutsos with some slides from tutorials by Prof. Eamonn.
Advertisements

Choosing Distance Measures for Mining Time Series Data
Dual-domain Hierarchical Classification of Phonetic Time Series Hossein Hamooni, Abdullah Mueen University of New Mexico Department of Computer Science.
Efficient Anomaly Monitoring over Moving Object Trajectory Streams joint work with Lei Chen (HKUST) Ada Wai-Chee Fu (CUHK) Dawei Liu (CUHK) Yingyi Bu (Microsoft)
Instruction Set Extension for Dynamic Time Warping Joseph Tarango, Eammon Keogh, Philip Brisk
Multithreaded FPGA Acceleration of DNA Sequence Mapping Edward Fernandez, Walid Najjar, Stefano Lonardi, Jason Villarreal UC Riverside, Department of Computer.
Using Fingerprints in n-Gram Indices Digital Libraries: Advanced Methods and Technologies, Digital Collections Stefan Selbach
74 th EAGE Conference & Exhibition incorporating SPE EUROPEC 2012 Automated seismic-to-well ties? Roberto H. Herrera and Mirko van der Baan University.
Indexing Time Series Based on Slides by C. Faloutsos (CMU) and D. Gunopulos (UCR)
Please turn off cell phones - thanks! panda.unm.edu/courses/sanfratello/sp09/astro101 Remember - Read syllabus and schedule (contains textbook reading.
Themis Palpanas1 VLDB - Aug 2004 Fair Use Agreement This agreement covers the use of all slides on this CD-Rom, please read carefully. You may freely use.
User Manual of Mining Mouse Vocalizations Prepared by Jesin Zakaria and Eamonn Keogh.
CBF Dataset Two-Pat Dataset Euclidean DTW Increasingly Large Training.
Efficient Query Filtering for Streaming Time Series
Computing motion between images
Distance Functions for Sequence Data and Time Series
Based on Slides by D. Gunopulos (UCR)
Finding Time Series Motifs on Disk-Resident Data
Lecture 3a: Feature detection and matching CS6670: Computer Vision Noah Snavely.
1 Dot Plots For Time Series Analysis Dragomir Yankov, Eamonn Keogh, Stefano Lonardi Dept. of Computer Science & Eng. University of California Riverside.
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Pattern Matching with Acceleration Data Pramod Vemulapalli.
Exact Indexing of Dynamic Time Warping
Accelerating the Dynamic Time Warping Distance Measure Using Logarithmic Arithmetic Joseph Tarango, Eamonn Keogh, Philip Brisk
GPU Computing with CBI Laboratory. Overview GPU History & Hardware – GPU History – CPU vs. GPU Hardware – Parallelism Design Points GPU Software.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
S DTW: COMPUTING DTW DISTANCES USING LOCALLY RELEVANT CONSTRAINTS BASED ON SALIENT FEATURE ALIGNMENTS K. Selçuk Candan Arizona State University Maria Luisa.
Reporter: Fei-Fei Chen. Wide-baseline matching Object recognition Texture recognition Scene classification Robot wandering Motion tracking.
Fast and Exact Monitoring of Co-evolving Data Streams Yasuko Matsubara, Yasushi Sakurai (Kumamoto University) Naonori Ueda (NTT) Masatoshi Yoshikawa (Kyoto.
Brandon Westover, Qiang Zhu, Jesin Zakaria, Eamonn Keogh
Dynamic Time Warping Algorithm for Gene Expression Time Series
Implementing a Speech Recognition System on a GPU using CUDA
Approximating Hit Rate Curves using Streaming Algorithms Nick Harvey Joint work with Zachary Drudi, Stephen Ingram, Jake Wires, Andy Warfield TexPoint.
K. Selçuk Candan, Maria Luisa Sapino Xiaolan Wang, Rosaria Rossini
Incorporating Dynamic Time Warping (DTW) in the SeqRec.m File Presented by: Clay McCreary, MSEE.
1 CS 260 Winter 2014 Eamonn Keogh’s Presentation of Thanawin Rakthanmanon, Bilson Campana, Abdullah Mueen, Gustavo Batista, Brandon Westover, Qiang Zhu,
Abdullah Mueen Eamonn Keogh University of California, Riverside.
Discovering Deformable Motifs in Time Series Data Jin Chen CSE Fall 1.
Distributed Spatio-Temporal Similarity Search Demetrios Zeinalipour-Yazti University of Cyprus Song Lin
Sequence Comparison Algorithms Ellen Walker Bioinformatics Hiram College.
“Live” Tomographic Reconstructions Alun Ashton Mark Basham.
k-Shape: Efficient and Accurate Clustering of Time Series
Exact indexing of Dynamic Time Warping
Stream Monitoring under the Time Warping Distance Yasushi Sakurai (NTT Cyber Space Labs) Christos Faloutsos (Carnegie Mellon Univ.) Masashi Yamamuro (NTT.
COMP 5331 Project Roadmap I will give a brief introduction (e.g. notation) on time series. Giving a notion of what we are playing with.
Accelerating Dynamic Time Warping Clustering with a Novel Admissible Pruning Strategy Nurjahan BegumLiudmila Ulanova Jun Wang 1 Eamonn Keogh University.
Speed improvements to information retrieval-based dynamic time warping using hierarchical K-MEANS clustering Presenter: Kai-Wun Shih Gautam Mantena 1,2.
Visualization and Exploration of Temporal Trend Relationships in Multivariate Time-Varying Data Teng-Yok Lee & Han-Wei Shen.
CS 521 Data Mining Techniques Instructor: Abdullah Mueen LECTURE 8: TIME SERIES AND GRAPH MINING.
Time Series Sequence Matching Jiaqin Wang CMPS 565.
A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.
Memory Buffering Techniques Greg Stitt ECE Department University of Florida.
3/12/2013Computer Engg, IIT(BHU)1 CUDA-3. GPGPU ● General Purpose computation using GPU in applications other than 3D graphics – GPU accelerates critical.
Zaiben Chen et al. Presented by Lian Liu. You’re traveling from s to t. Which gas station would you choose?
Hierarchical Systolic Array Design for Full-Search Block Matching Motion Estimation Noam Gur Arie,August 2005.
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Buffering Techniques Greg Stitt ECE Department University of Florida.
Buffering Techniques Greg Stitt ECE Department University of Florida.
Heterogeneous Processing KYLE ADAMSKI. Overview What is heterogeneous processing? Why it is necessary Issues with heterogeneity CPU’s vs. GPU’s Heterogeneous.
NOVA University of Lisbon
Matrix Profile II: Exploiting a Novel Algorithm and GPUs to break the one Hundred Million Barrier for Time Series Motifs and Joins Yan Zhu, Zachary Zimmerman,
핵심어 검출을 위한 단일 끝점 DTW 알고리즘 Yong-Sun Choi and Soo-Young Lee
Supervised Time Series Pattern Discovery through Local Importance
Enumeration of Time Series Motifs of All Lengths
Parallelizing Dynamic Time Warping
Distance Functions for Sequence Data and Time Series
Massively Parallel Astronomic Event Detection using GPU’s
Xuehan Ye, Yongcai Wang, Wei Hu, Lei Song, Zhaoquan Gu, Deying Li
Time Series Filtering Time Series
Presentation transcript:

Doruk Sart, Abdullah Mueen, Walid Najjar, Eamonn Keogh, Vit Niennatrakul 1

Subsequence Search Given a query series : Q : Find the occurrences of Q in a time series S : Distance smaller than a threshold. Dynamic Time Warping 2 Time C Q C Q Warping path w Mapping

Time Warping Subsequence Search Previous Method: SPRING – Reuses computation for subsequence matching 3 S Q Match 1Match 2Match 3Match 4 Yasushi Sakurai, Christos Faloutsos, Masashi Yamamuro: Stream Monitoring under the Time Warping Distance. ICDE 2007: For every new subsequence only one column is added on the right Not Always Possible!!

Normalization IS Necessary Wandering Baseline Problem Z-Normalization: (x-µ)/σ – Shift and Scale Invariance Reuse of computation is no longer possible Query Distance to the query Threshold = Subsequence at time tSubsequence at time t+300 Value Reduced at = 600

Parallelizing DTW search  Slide a window of a fixed size.  Compute the distance between the query and the z-normalized sliding window.  Report those that result less than t.  Possibly try it for other lengths. Assign each distance computation to one GPU core or one systolic array in an FPGA. Q

Speedups 6 Upto 4500x speedup using FPGA. Upto 29x speedup using GPU. Capable of processing a very high speed stream of hundreds of hertz. Capable of processing several low speed streams simultaneously Length of the Query (Q) Time in Seconds Software SSE GPU FPGA

Case Study: Astronomy Rotation Invariant DTW -- O( n 3 ) – Try all possible rotations to find the minimum possible distance. 7 1-NN AccuracyTime FPGATime GPUTime CPU ED80.47%<1.0 seconds 2.5 seconds rED81.25%<1.0 seconds55.3 seconds43.6 minutes DTW86.72%<1.0 seconds43.6 seconds35.4 minutes rDTW91.41%9.54 minutes22.7 hours(42 days) OGLE OGLE DTW distance rDTW distance 0 Important for cyclic time series. e.g. Star Light Curves

Thank You 8