1/16 Dynamic Programming Carmine Casciato MUMT 611 Thursday March 31 st 2005.

Slides:



Advertisements
Similar presentations
Dynamic Time Warping (DTW)
Advertisements

Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
ECE 8443 – Pattern Recognition Objectives: Elements of a Discrete Model Evaluation Decoding Dynamic Programming Resources: D.H.S.: Chapter 3 (Part 3) F.J.:
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
A System for Hybridizing Vocal Performance By Kim Hang Lau.
Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005.
Soundprism An Online System for Score-informed Source Separation of Music Audio Zhiyao Duan and Bryan Pardo EECS Dept., Northwestern Univ. Interactive.
Clustered alignments of gene- expression time series data Adam A. Smith, Aaron Vollrath, Cristopher A. Bradfield and Mark Craven Department of Biosatatistics.
Learning to Align Polyphonic Music. Slide 1 Learning to Align Polyphonic Music Shai Shalev-Shwartz Hebrew University, Jerusalem Joint work with Yoram.
Distance Functions for Sequence Data and Time Series
Based on Slides by D. Gunopulos (UCR)
Using Relevance Feedback in Multimedia Databases
Dynamic Time Warping Applications and Derivation
Hand Signals Recognition from Video Using 3D Motion Capture Archive Tai-Peng Tian Stan Sclaroff Computer Science Department B OSTON U NIVERSITY I. Introduction.
So far: Historical introduction Mathematical background (e.g., pattern classification, acoustics) Feature extraction for speech recognition (and some neural.
Educational Software using Audio to Score Alignment Antoine Gomas supervised by Dr. Tim Collins & Pr. Corinne Mailhes 7 th of September, 2007.
1 GAs and Feature Weighting Rebecca Fiebrink MUMT March 2005.
1 TEMPLATE MATCHING  The Goal: Given a set of reference patterns known as TEMPLATES, find to which one an unknown pattern matches best. That is, each.
A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.
Dynamic Time Warping Algorithm for Gene Expression Time Series
Hierarchical Dirichlet Process (HDP) A Dirichlet process (DP) is a discrete distribution that is composed of a weighted sum of impulse functions. Weights.
Fundamentals of Music Processing
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Rhythmic Transcription of MIDI Signals Carmine Casciato MUMT 611 Thursday, February 10, 2005.
Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.
Similarity Matrix Processing for Music Structure Analysis Yu Shiu, Hong Jeng C.-C. Jay Kuo ACM Multimedia 2006.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
8.0 Search Algorithms for Speech Recognition References: of Huang, or of Becchetti, or , of Jelinek 4. “ Progress.
Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall /14.
ICDE, San Jose, CA, 2002 Discovering Similar Multidimensional Trajectories Michail VlachosGeorge KolliosDimitrios Gunopulos UC RiversideBoston UniversityUC.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Evaluation Decoding Dynamic Programming.
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
LECTURE Course: “Design of Systems: Structural Approach” Dept. “Communication Networks &Systems”, Faculty of Radioengineering & Cybernetics Moscow.
Ch 5b: Discriminative Training (temporal model) Ilkka Aho.
QBSH Corpus The QBSH corpus provided by Roger Jang [1] consists of recordings of children’s songs from students taking the course “Audio Signal Processing.
Query by Singing and Humming System
1 Dynamic Time Warping and Minimum Distance Paths for Speech Recognition Isolated word recognition: Task : Want to build an isolated ‘word’ recogniser.
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
1 4.7 TIME ALIGNMENT AND NORMALIZATION Linear time normalization:
Audio Processing Mitch Parry. Resource! Sound Waves and Harmonic Motion.
Keogh, E. , Chakrabarti, K. , Pazzani, M. & Mehrotra, S. (2001)
David Sears MUMT November 2009
Machine Learning for the Quantified Self
Rhythmic Transcription of MIDI Signals
Carmine Casciato MUMT 611 Thursday, March 13, 2005
MATCH A Music Alignment Tool Chest
A review of audio fingerprinting (Cano et al. 2005)
4.7 TIME ALIGNMENT AND NORMALIZATION
LECTURE 15: HMMS – EVALUATION AND DECODING
8.0 Search Algorithms for Speech Recognition
Distance Functions for Sequence Data and Time Series
Carmine Casciato MUMT 611 Thursday, March 13, 2005
Digital Systems: Hardware Organization and Design
Distance Functions for Sequence Data and Time Series
School of Computer Science & Engineering
The Functional Space of an Activity Ashok Veeraraghavan , Rama Chellappa, Amit Roy-Chowdhury Avinash Ravichandran.
The connected word recognition problem Problem definition: Given a fluently spoken sequence of words, how can we determine the optimum match in terms.
Authors: Guanghan Ning, Zhi Zhang, Xiaobo Ren, Haohong Wang,
LECTURE 14: HMMS – EVALUATION AND DECODING
4.7 TIME ALIGNMENT AND NORMALIZATION
Dynamic Time Warping and training methods
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
Connected Word Recognition
Presentation on Timbre Similarity
The connected word recognition problem Problem definition: Given a fluently spoken sequence of words, how can we determine the optimum match in terms.
MAPO: Mining and Recommending API Usage Patterns
Measuring the Similarity of Rhythmic Patterns
Presentation transcript:

1/16 Dynamic Programming Carmine Casciato MUMT 611 Thursday March 31 st 2005

Overview Problem Space Origins Dynamic Time Warping Overview of Usage in MIR

Problem Space Multi-stage decision processes system S characterized as evolution of vector p N stages * M decisions/stage multi-dimensional maximization problems

Origins Dynamic Programming (DP) “…the optimal decision to be made at any state of the system. ” Bellman (1957) “Dynamic” refers to temporal nature of S Each decision is determined by max/min cost of previous state Allocation problem, x = y + x-y f N (x) = Max/Min [g(y) + h(x-y) + f N-1 (ay + b(x-y)) 0 <= y <= x

Rabiner and Huang 1993 Dynamic Time Warping (DTW) as solution for time-alignment and normalization of two utterances (Dis)similarity measurement of two vectors of short-time spectral features is equal to “best” path through feature grid

DTW Path Constraints endpoint monotonicity local path constraints global path constraints slope weighting locally and globally Dissimilarity metric, constraints, weightings, are all heuristically determined

Paulus and Klapuri 2002 Adopts Rabiner and Huang (1993) DTW to rhythmic similarity Depends on correct segmentation of rhythms from audio signal Finds optimal path between feature vectors of loudness and spectral centroid

Paulus and Klapuri 2002

Usage in MIR Query by humming Heo et al Adams et al Nishimura et. al 2001 Tempo tracking Raphael 2002 Feature selection Chang 1972

References Adams, N., M. Bartsch, J. Shifrin, and G.Wakefield Time series alignment for music information retrieval. In Proceedings of the International Conference on Music Information Retrieval: 303  10 Bellman, R Dynamic Programming. Princeton: Princeton University Press. Chang, C Dynamic programming as applied to feature subset selection in a pattern recognition system. In Proceedings of the ACM annual conference 1: 94  103. Guo, A., and H. Siegelman Time-warped longest common subsequence algorithm for music retrieval. In Proceedings of the International Conference on Music Information Retrieval: 258  61. Heo, S., M. Suzuki, A. Ito, and S. Makino Three dimensional continuous DP algorithm for multiple pitch candidates in music information retrieval system. In Proceedings of the International Conference on Music Information Retrieval. Nishimura, T., H. Hashiguchi, J. Takita, J. Zhang, M. Goto, and R. Oka Music signal spotting retrieval by a humming query using start frame feature dependent continuous dynamic programming. In Proceedings of the International Conference on Music Information Retrieval. Paulus, J., and A. Klapuri Measuring the similarity of rhythmic patterns. In Proceedings of the International Conference on Music Information Retrieval. Raphael, C A hybrid graphical model for rhythmic parsing. Artificial Intelligence 137: 217–38.