Using Relevance Feedback in Multimedia Databases

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Indexing Time Series Based on original slides by Prof. Dimitrios Gunopulos and Prof. Christos Faloutsos with some slides from tutorials by Prof. Eamonn.
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Word Spotting DTW.
Data Mining Classification: Alternative Techniques
Data Mining Classification: Alternative Techniques
Relevance Feedback Retrieval of Time Series Data Eamonn J. Keogh & Michael J. Pazzani Prepared By/ Fahad Al-jutaily Supervisor/ Dr. Mourad Ykhlef IS531.
Face Alignment with Part-Based Modeling
Lazy vs. Eager Learning Lazy vs. eager learning
Patch to the Future: Unsupervised Visual Prediction
Mining Time Series.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Classification Dr Eamonn Keogh Computer Science & Engineering Department University of California - Riverside Riverside,CA Who.
Themis Palpanas1 VLDB - Aug 2004 Fair Use Agreement This agreement covers the use of all slides on this CD-Rom, please read carefully. You may freely use.
ACM Multimedia th Annual Conference, October , 2004
CBF Dataset Two-Pat Dataset Euclidean DTW Increasingly Large Training.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Efficient Query Filtering for Streaming Time Series
Making Time-series Classification More Accurate Using Learned Constraints © Chotirat “Ann” Ratanamahatana Eamonn Keogh 2004 SIAM International Conference.
MANISHA VERMA, VASUDEVA VARMA PATENT SEARCH USING IPC CLASSIFICATION VECTORS.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Based on Slides by D. Gunopulos (UCR)
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
KNN, LVQ, SOM. Instance Based Learning K-Nearest Neighbor Algorithm (LVQ) Learning Vector Quantization (SOM) Self Organizing Maps.
Nearest Neighbor Retrieval Using Distance-Based Hashing Michalis Potamias and Panagiotis Papapetrou supervised by Prof George Kollios A method is proposed.
A Multiresolution Symbolic Representation of Time Series
Time Series I.
Exact Indexing of Dynamic Time Warping
 C. C. Hung, H. Ijaz, E. Jung, and B.-C. Kuo # School of Computing and Software Engineering Southern Polytechnic State University, Marietta, Georgia USA.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space
APPLICATIONS OF DATA MINING IN INFORMATION RETRIEVAL.
Qualitative approximation to Dynamic Time Warping similarity between time series data Blaž Strle, Martin Možina, Ivan Bratko Faculty of Computer and Information.
Analysis of Constrained Time-Series Similarity Measures
S DTW: COMPUTING DTW DISTANCES USING LOCALLY RELEVANT CONSTRAINTS BASED ON SALIENT FEATURE ALIGNMENTS K. Selçuk Candan Arizona State University Maria Luisa.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.
Dynamic Time Warping Algorithm for Gene Expression Time Series
Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Finding Better Answers in Video Using Pseudo Relevance Feedback Informedia Project Carnegie Mellon University Carnegie Mellon Question Answering from Errorful.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
K. Selçuk Candan, Maria Luisa Sapino Xiaolan Wang, Rosaria Rossini
Incorporating Dynamic Time Warping (DTW) in the SeqRec.m File Presented by: Clay McCreary, MSEE.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Mining Time Series.
Semi-Supervised Time Series Classification & DTW-D REPORTED BY WANG YAWEN.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
k-Shape: Efficient and Accurate Clustering of Time Series
Semi-Supervised Time Series Classification Li Wei Eamonn Keogh University of California, Riverside {wli,
Query Sensitive Embeddings Vassilis Athitsos, Marios Hadjieleftheriou, George Kollios, Stan Sclaroff.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
Exact indexing of Dynamic Time Warping
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.
Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,
Visualization and Exploration of Temporal Trend Relationships in Multivariate Time-Varying Data Teng-Yok Lee & Han-Wei Shen.
Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Eick: kNN kNN: A Non-parametric Classification and Prediction Technique Goals of this set of transparencies: 1.Introduce kNN---a popular non-parameric.
Relevant Document Distribution Estimation Method for Resource Selection Luo Si and Jamie Callan School of Computer Science Carnegie Mellon University
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
A Document-Level Sentiment Analysis Approach Using Artificial Neural Network and Sentiment Lexicons Yan Zhu.
Feature learning for multivariate time series classification Mustafa Gokce Baydogan * George Runger * Eugene Tuv † * Arizona State University † Intel Corporation.
Accelerometer-Based Character Recognition Pen
Supervised Time Series Pattern Discovery through Local Importance
Information Retrieval
Accelerometer-Based Character Recognition Pen
Time Series Filtering Time Series
Semi-Supervised Time Series Classification
Presentation transcript:

Using Relevance Feedback in Multimedia Databases Chotirat “Ann” Ratanamahatana Eamonn Keogh 7th International Conference on VISual Information Systems at 10th International Conference on Distributed Multimedia Systems September 9, 2004

Roadmap Time series in multimedia databases and their similarity measures Euclidean distance and its limitation Dynamic time warping (DTW) Global constraints and R-K Band Relevance Feedback and Query Refinement Experimental Evaluation Conclusions and future work

What are Time Series A collection of observations made sequentially in time. People measure things… and things…change over time… Their blood pressure George Bush's popularity rating The annual rainfall in San Francisco The value of their Google stock

Time Series in Multimedia Databases Image data may best be thought of as time series…

Image to Time Series

Video to Time Series Hand moving down to grasp gun Steady pointing Hand moving to shoulder level Hand moving down to grasp gun Hand moving above holster Hand at rest

Time Series in Multimedia Databases Video George Washington’s Manuscript

Classification in Time Series Class B Class A Which class does belong to? Pattern Recognition is a type of supervised classification where an input pattern is classified into one of the classes based on its similarity to these predefined classes.

Euclidean Distance Metric Given 2 time series Q = q1, …, qn and C = c1, …, cn their Euclidean distance is defined as Q C

Limitations of Euclidean Metric Very sensitive to some distortion in the data Training data consists of 10 instances from each of the 3 classes Perform a 1-nearest neighbor algorithm, with “leaving-one-out” evaluation, averaged over 100 runs. Euclidean distance Error rate: 29.77% DTW Error rate: 3.33 %

Dynamic Time Warping (DTW) Euclidean Distance One-to-one alignments Time Warping Distance Non-linear alignments are allowed

How Is DTW Calculated? (I) Q C Warping path w

How Is DTW Calculated? (II) Each warping path w can be found using dynamic programming to evaluate the following recurrence: where γ(i, j) is the cumulative distance of the distance d(i, j) and its minimum cumulative distance among the adjacent cells. (i-1, j) (i, j-1) (i, j) (i-1, j-1)

Global Constraints (I) Prevent any unreasonable warping Sakoe-Chiba Band Itakura Parallelogram

Global Constraints (II) A Global Constraint for a sequence of size m is defined by R, where Ri = d 0  d  m, 1  i  m. Ri defines a freedom of warping above and to the right of the diagonal at any given point i in the sequence. Ri Sakoe-Chiba Band Itakura Parallelogram

Ratanamahatana-Keogh Band (R-K Band) Solution: we create an arbitrary shape and size of the band that is appropriate for the data we want to classify.

How Do We Create an R-K Band? First Attempt: We could look at the data and manually create the shape of the bands. (then we need to adjust the width of each band as well until we get a good result) 100 % Accuracy!

Learning an R-K Band Automatically Our heuristic search algorithm automatically learns the bands from the data. (sometimes, we can even get an unintuitive shape that give a good result.) 100 % Accuracy as well!

R-K Band Learning With Heuristic Search

R-K Band Learning in Action!

Classification Examples with R-K Bands Error rate Euclidean 32.13% DTW 10% 4.52% R-K Bands 0.9%

Face Classification

Relevance Feedback A well-known and effective method in improving the query performance, especially in text-mining domains. Refining the query based on user’s reaction Only relatively little research has been done on relevance feedback in images or multimedia data.

Query Refinement Averaging a collection of time series using DTW, according to their weights and warping (DTW) alignments.

Experiment: Datasets Gun Problem Leaf Dataset Handwritten Word Spotting data

Experimental Design Given an initial query, we measure the precision and recall for each round of the relevance feedback retrieval. Show the 10 best matches (k-nearest neighbors). User ranks each result. Accumulatively build the training set. Learn an R-K band according to the current training data. Generate a new query (query refinement), and repeat.

Results: Gun

Results: Leaf

Results: Wordspotting

Conclusions Different shapes and widths of the band contributes to the classification accuracy / precision. We have shown that incorporating R-K Band into relevance feedback can reduce the error rate in classification, and improve the precision at all recall levels in video and image retrieval.

Future Work Investigate other choices that may make envelope learning more accurate. Heuristic functions Search algorithm (refining the search) Is there a way to always guarantee an optimal solution? Examine the best way to deal with multi-variate time series for more complex data. Explore other utilities of R-K Band and relevance feedback, specifically on real-world problems: music, bioinformatics, biomedical data, etc.

Thank You Questions? Contact: ratana@cs.ucr.edu eamonn@cs.ucr.edu Homepage: http://www.cs.ucr.edu/~ratana All datasets are publicly available at: UCR Time Series Data Mining Archive: http://www.cs.ucr.edu/~eamonn/TSDMA