Detecting Time Series Motifs Under

Slides:



Advertisements
Similar presentations
SAX: a Novel Symbolic Representation of Time Series
Advertisements

Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
Mining Frequent Spatio-temporal Sequential Patterns
Fast Algorithms For Hierarchical Range Histogram Constructions
Relevance Feedback Retrieval of Time Series Data Eamonn J. Keogh & Michael J. Pazzani Prepared By/ Fahad Al-jutaily Supervisor/ Dr. Mourad Ykhlef IS531.
Avrilia Floratou, Sandeep Tata, and Jignesh M. Patel ICDE 2010 Efficient and Accurate Discovery of Patterns in Sequence Datasets.
PROBABILISTIC ASSESSMENT OF THE QSAR APPLICATION DOMAIN Nina Jeliazkova 1, Joanna Jaworska 2 (1) IPP, Bulgarian Academy of Sciences, Sofia, Bulgaria (2)
Mining Time Series.
Locally Constraint Support Vector Clustering
1 Manifold Clustering of Shapes Dragomir Yankov, Eamonn Keogh Dept. of Computer Science & Eng. University of California Riverside.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Disk Aware Discord Discovery:
Themis Palpanas1 VLDB - Aug 2004 Fair Use Agreement This agreement covers the use of all slides on this CD-Rom, please read carefully. You may freely use.
Drawing Graphs with Nonuniform Nodes Using Potential Fields Jen-Hui Chuang 1, Chun-Cheng Lin 2, Hsu-Chun Yen 2 1 Dept. of Computer and Information Science,
Efficient Query Filtering for Streaming Time Series
Probabilistic Similarity Search for Uncertain Time Series Presented by CAO Chen 21 st Feb, 2011.
Jessica Lin, Eamonn Keogh, Stefano Loardi
Detecting Image Region Duplication Using SIFT Features March 16, ICASSP 2010 Dallas, TX Xunyu Pan and Siwei Lyu Computer Science Department University.
Based on Slides by D. Gunopulos (UCR)
Finding Time Series Motifs on Disk-Resident Data
1 Ensembles of Nearest Neighbor Forecasts Dragomir Yankov, Eamonn Keogh Dept. of Computer Science & Eng. University of California Riverside Dennis DeCoste.
A Multiresolution Symbolic Representation of Time Series
Query Rewriting for Extracting Data Behind HTML Forms Xueqi Chen Department of Computer Science Brigham Young University March 31, 2004 Funded by National.
1 Dot Plots For Time Series Analysis Dragomir Yankov, Eamonn Keogh, Stefano Lonardi Dept. of Computer Science & Eng. University of California Riverside.
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.
Time Series Motifs Statistical Significance
Exact Indexing of Dynamic Time Warping
Computer Vision and Data Mining Research Projects Longin Jan Latecki Computer and Information Sciences Dept. Temple University
Autonomous Learning of Object Models on Mobile Robots Xiang Li Ph.D. student supervised by Dr. Mohan Sridharan Stochastic Estimation and Autonomous Robotics.
C OMPUTATIONAL BIOLOGY. O UTLINE Proteins DNA RNA Genetics and evolution The Sequence Matching Problem RNA Sequence Matching Complexity of the Algorithms.
Qualitative approximation to Dynamic Time Warping similarity between time series data Blaž Strle, Martin Možina, Ivan Bratko Faculty of Computer and Information.
Analysis of Constrained Time-Series Similarity Measures
Similarity based Retrieval from Sequence Databases using Automata as Queries 作者 : A. Prasad Sistla, Tao Hu, Vikas howdhry 出處 :CIKM 2002 ACM 指導教授 : 郭煌政老師.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
University of Toronto Aug. 11, 2004 Learning the “Epitome” of a Video Sequence Information Processing Workshop 2004 Vincent Cheung Probabilistic and Statistical.
Mining Time Series.
Stylization and Abstraction of Photographs Doug Decarlo and Anthony Santella.
CEMiner – An Efficient Algorithm for Mining Closed Patterns from Time Interval-based Data Yi-Cheng Chen, Wen-Chih Peng and Suh-Yin Lee ICDM 2011.
A Test Paradigm for Detecting Changes in Transactional Data Streams Willie Ng and Manoranjan Dash DASFAA 2008.
Semi-Supervised Time Series Classification & DTW-D REPORTED BY WANG YAWEN.
Discovering Deformable Motifs in Time Series Data Jin Chen CSE Fall 1.
Texture Detection & Texture related clustering C601 Project Jing Qin Fall 2003.
Identifying Patterns in Time Series Data Daniel Lewis 04/06/06.
ICDE, San Jose, CA, 2002 Discovering Similar Multidimensional Trajectories Michail VlachosGeorge KolliosDimitrios Gunopulos UC RiversideBoston UniversityUC.
Lei Li Computer Science Department Carnegie Mellon University Pre Proposal Time Series Learning completed work 11/27/2015.
Object Recognition Based on Shape Similarity Longin Jan Latecki Computer and Information Sciences Dept. Temple Univ.,
Multi-object Similarity Query Evaluation Michal Batko.
LogTree: A Framework for Generating System Events from Raw Textual Logs Liang Tang and Tao Li School of Computing and Information Sciences Florida International.
Visualization and Exploration of Temporal Trend Relationships in Multivariate Time-Varying Data Teng-Yok Lee & Han-Wei Shen.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
Calculating frequency moments of Data Stream
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Jessica K. Ting Michael K. Ng Hongqiang Rong Joshua Z. Huang 國立雲林科技大學.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology ACM SIGMOD1 Subsequence Matching on Structured Time Series.
VizTree Huyen Dao and Chris Ackermann. Introducing example
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Indexing and Mining Free Trees Yun Chi, Yirong Yang, Richard R. Muntz Department of Computer Science University of California, Los Angeles, CA {
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Feature learning for multivariate time series classification Mustafa Gokce Baydogan * George Runger * Eugene Tuv † * Arizona State University † Intel Corporation.
Supervised Time Series Pattern Discovery through Local Importance
Visually Mining and Monitoring Massive Time Series
Enumeration of Time Series Motifs of All Lengths
A Time Series Representation Framework Based on Learned Patterns
Time Series Filtering Time Series
Mining Time-Series Databases
Time Series Data and Moving Object Trajectory
Time Relaxed Spatiotemporal Trajectory Joins
Group 9 – Data Mining: Data
Topological Signatures For Fast Mobility Analysis
Presentation transcript:

Detecting Time Series Motifs Under Uniform Scaling D. Yankov, E. Keogh, J. Medina, B. Chiu, V. Zordan Dept. of Computer Science & Eng. University of California Riverside

Outline Problem definition Motivation Formalization and approach Experimental evaluation

Problem definition Given is a long time series or a data set of shorter sequences Goal: Detect similar patterns of various scaling 100 200 300 400 500 600 A C B 100 200 300 400 500 600

Motivation Object recognition with time series representation Animation

Motivation (cont) Time series sampled at different rate Physiological time series of different frequencies

Similarity under uniform scaling Formalization Similarity under uniform scaling Motifs under uniform scaling

Approach Observation: only a limited set of scaling factors need to be checked Algorithm. For every scaling factor do: rescale all query subsequences represent all time series as equal length words over the same alphabet (apply SAX)

Approach (cont) Using PROJECTION (a locality sensitive hashing approach), filter out all non-matching words. Compute the distance between the unfiltered time series pairs.

Experimental evaluation Brain activity time series Valuable in predicting epileptic seizure periods.

Experimental evaluation Effectiveness of the algorithm Efficiency

Experimental evaluation (cont) Projectile shapes Lampasas River Cornertang Castroville Cornertang The algorithm detects a rare cornertang segment – an object that has long intrigued anthropologists. 50 100 150 200 250 300 350

Experimental evaluation (cont) Motion-capture motifs On this sequence the method detects the same blocking movement performed by the actor. The Euclidean distance fails to detect this motif.

Conclusion Uniform scaling motifs appear in diverse areas as – animation, object recognition, medical sequence mining, etc. The presented probabilistic approach for mining such motifs is accurate and extremely effective. The method works in an entirely unsupervised way, requiring only a specified motif length. Possible extensions – multivariate time series, disk resident modifications.

Poster# 28 THANK YOU!