1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.

Slides:



Advertisements
Similar presentations
Pattern Finding and Pattern Discovery in Time Series
Advertisements

Lecture 16 Hidden Markov Models. HMM Until now we only considered IID data. Some data are of sequential nature, i.e. have correlations have time. Example:
Angelo Dalli Department of Intelligent Computing Systems
Automatic Speech Recognition II  Hidden Markov Models  Neural Network.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Introduction to Hidden Markov Models
Hidden Markov Models Bonnie Dorr Christof Monz CMSC 723: Introduction to Computational Linguistics Lecture 5 October 6, 2004.
2004/11/161 A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition LAWRENCE R. RABINER, FELLOW, IEEE Presented by: Chi-Chun.
Language Model based Information Retrieval: University of Saarland 1 A Hidden Markov Model Information Retrieval System Mahboob Alam Khalid.
Page 1 Hidden Markov Models for Automatic Speech Recognition Dr. Mike Johnson Marquette University, EECE Dept.
Hidden Markov Models Adapted from Dr Catherine Sweeney-Reed’s slides.
Ch-9: Markov Models Prepared by Qaiser Abbas ( )
Profiles for Sequences
Hidden Markov Models Theory By Johan Walters (SR 2003)
Foundations of Statistical NLP Chapter 9. Markov Models 한 기 덕한 기 덕.
Hidden Markov Models in NLP
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
SPEECH RECOGNITION Kunal Shalia and Dima Smirnov.
PatReco: Hidden Markov Models Alexandros Potamianos Dept of ECE, Tech. Univ. of Crete Fall
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
2 Personal Introduction previousnexthome end Academic Experience ( ) Bachelor and Master Degree on Electrical Engineering, Zhejiang University,
Jonah Shifrin, Bryan Pardo, Colin Meek, William Birmingham
Hidden Markov Models K 1 … 2. Outline Hidden Markov Models – Formalism The Three Basic Problems of HMMs Solutions Applications of HMMs for Automatic Speech.
1 Hidden Markov Model Instructor : Saeed Shiry  CHAPTER 13 ETHEM ALPAYDIN © The MIT Press, 2004.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
2001/03/29Chin-Kai Wu, CS, NTHU1 Speech and Language Technologies for Audio Indexing and Retrieval JOHN MAKHOUL, FELLOW, IEEE, FRANCIS KUBALA, TIMOTHY.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
1 AUTOMATIC TRANSCRIPTION OF PIANO MUSIC - SARA CORFINI LANGUAGE AND INTELLIGENCE U N I V E R S I T Y O F P I S A DEPARTMENT OF COMPUTER SCIENCE Automatic.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Polyphonic Queries A Review of Recent Research by Cory Mckay.
Online Chinese Character Handwriting Recognition for Linux
A Time Based Approach to Musical Pattern Discovery in Polyphonic Music Tamar Berman Graduate School of Library and Information Science University of Illinois.
Isolated-Word Speech Recognition Using Hidden Markov Models
Hidden Markov Models Applied to Information Extraction Part I: Concept Part I: Concept HMM Tutorial HMM Tutorial Part II: Sample Application Part II: Sample.
Graphical models for part of speech tagging
7-Speech Recognition Speech Recognition Concepts
Fundamentals of Hidden Markov Model Mehmet Yunus Dönmez.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Structure Discovery of Pop Music Using HHMM E6820 Project Jessie Hsu 03/09/05.
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
Modeling and Generation of Accentual Phrase F 0 Contours Based on Discrete HMMs Synchronized at Mora-Unit Transitions Atsuhiro Sakurai (Texas Instruments.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
HMMs for alignments & Sequence pattern discovery I519 Introduction to Bioinformatics.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
1 CONTEXT DEPENDENT CLASSIFICATION  Remember: Bayes rule  Here: The class to which a feature vector belongs depends on:  Its own value  The values.
Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.
Performance Comparison of Speaker and Emotion Recognition
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Elements of a Discrete Model Evaluation.
Chapter 8. Learning of Gestures by Imitation in a Humanoid Robot in Imitation and Social Learning in Robots, Calinon and Billard. Course: Robots Learning.
Query by Singing and Humming System
1 Hidden Markov Models Hsin-min Wang References: 1.L. R. Rabiner and B. H. Juang, (1993) Fundamentals of Speech Recognition, Chapter.
Automated Speach Recognotion Automated Speach Recognition By: Amichai Painsky.
Classification of melody by composer using hidden Markov models Greg Eustace MUMT 614: Music Information Acquisition, Preservation, and Retrieval.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Definition of the Hidden Markov Model A Seminar Speech Recognition presentation A Seminar Speech Recognition presentation October 24 th 2002 Pieter Bas.
Other Models for Time Series. The Hidden Markov Model (HMM)
A shallow description framework for musical style recognition Pedro J. Ponce de León, Carlos Pérez-Sancho and José Manuel Iñesta Departamento de Lenguajes.
1 Tempo Induction and Beat Tracking for Audio Signals MUMT 611, February 2005 Assignment 3 Paul Kolesnik.
Music Matching Speaker : 黃茂政 指導教授 : 陳嘉琳 博士.
Introduction to Music Information Retrieval (MIR)
An INTRODUCTION TO HIDDEN MARKOV MODEL
Hidden Markov Model LR Rabiner
CONTEXT DEPENDENT CLASSIFICATION
Handwritten Characters Recognition Based on an HMM Model
Music Signal Processing
Presentation transcript:

1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik

2 Contents  Introduction to HMM  Overview of Publications  Conclusion  Introduction to HMM  Overview of Publications  Conclusion

3 Introduction  Definition  A structure that is used to statistically characterize the behavior of sequences of event observations  Extension of a model known as Markov chains  “A double stochastic process with an underlying stochastic process which is not observable, but can only be observed through another set of stochastic process that produces the sequence of observed symbols” (Rabiner and Huang 1986)  Concepts  Any observable sequence can be represented as a succession of states, with each state representing a grouped portion of the observation values and containing its features in a statistical form  Definition  A structure that is used to statistically characterize the behavior of sequences of event observations  Extension of a model known as Markov chains  “A double stochastic process with an underlying stochastic process which is not observable, but can only be observed through another set of stochastic process that produces the sequence of observed symbols” (Rabiner and Huang 1986)  Concepts  Any observable sequence can be represented as a succession of states, with each state representing a grouped portion of the observation values and containing its features in a statistical form

4 Introduction  Concepts (ctd.)  HMM keeps track of  What state will the sequence start in  What state-to-state transitions are likely to take place  What values are likely to occur in each state  Corresponding parameters  Array of initial state probabilities  Matrix of state-to-state transitional probabilities  Matrix of state output probabilities  = ( , A, B)  Concepts (ctd.)  HMM keeps track of  What state will the sequence start in  What state-to-state transitions are likely to take place  What values are likely to occur in each state  Corresponding parameters  Array of initial state probabilities  Matrix of state-to-state transitional probabilities  Matrix of state output probabilities  = ( , A, B)

5 Introduction Markov Model Example. - x — States of the Markov model - a — Transition probabilities - b — Output probabilities - y — Observable outputs

6 Introduction  Three Main HMM Problems  Recognition  given an observation sequence and a Hidden Markov Model, calculate the probability that the model would produce this observation sequence.  Uncovering (Viterbi)  given an observation sequence and a Hidden Markov Model, calculate the optimal sequence of states that would maximize the likelihood of the HMM producing the observation.  Learning / Training  given an observation sequence (or a set of observation sequences and a Hidden Markov Model, adjust the model parameters, so that probability of the model is maximized.  Three Main HMM Problems  Recognition  given an observation sequence and a Hidden Markov Model, calculate the probability that the model would produce this observation sequence.  Uncovering (Viterbi)  given an observation sequence and a Hidden Markov Model, calculate the optimal sequence of states that would maximize the likelihood of the HMM producing the observation.  Learning / Training  given an observation sequence (or a set of observation sequences and a Hidden Markov Model, adjust the model parameters, so that probability of the model is maximized.

7 Introduction  HMM History  Basic concept developed by Markov  Theory for practical implementation summarized by Rabiner and Huang (1986)  Applied in different fields to data stream observation problems  Common in speech recognition  Has become increasingly popular in music information retrieval applications  HMM History  Basic concept developed by Markov  Theory for practical implementation summarized by Rabiner and Huang (1986)  Applied in different fields to data stream observation problems  Common in speech recognition  Has become increasingly popular in music information retrieval applications

8 Overview of Works Automatic Segmentation for Music Classification using Competitive Hidden Markov Models  (2000) Battle, Cano – University Pompeu Fabra  System classifies audio segments (automatic segmentation into abstract acoustic events)  can be applied to classify a database of audio sounds  allows fast indexing and retrieval of audio fragments  similar segment events are given the same label Automatic Segmentation for Music Classification using Competitive Hidden Markov Models  (2000) Battle, Cano – University Pompeu Fabra  System classifies audio segments (automatic segmentation into abstract acoustic events)  can be applied to classify a database of audio sounds  allows fast indexing and retrieval of audio fragments  similar segment events are given the same label

9 Overview of Works  (2000) Battle, Cano (ctd.)  First stage: parametrization, features obtained from audio signals  Mel-cepstrum analysis used to obtain feature vectors  Main classification engine: HMM-based  Traditional HMMs are not suited for blind learning  Competitive HMMs used instead  CoHMMs differ from HMMs only in training stage; recognition is exactly the same  (2000) Battle, Cano (ctd.)  First stage: parametrization, features obtained from audio signals  Mel-cepstrum analysis used to obtain feature vectors  Main classification engine: HMM-based  Traditional HMMs are not suited for blind learning  Competitive HMMs used instead  CoHMMs differ from HMMs only in training stage; recognition is exactly the same

10 Overview of Works Melody Spotting Using Hidden Markov Models  (2001) Durey, Clements–Georgia Institute of Technology  A melody-based database song retrieval system  Uses melody spotting procedure adopted from word spotting techniques in automatic speech recognition  Humming, whistling, keyboard as input  Main goal: develop a practical system for non-symbolic music representation (audio)  Word/melody-spotting: searching for a data segment in a data stream Melody Spotting Using Hidden Markov Models  (2001) Durey, Clements–Georgia Institute of Technology  A melody-based database song retrieval system  Uses melody spotting procedure adopted from word spotting techniques in automatic speech recognition  Humming, whistling, keyboard as input  Main goal: develop a practical system for non-symbolic music representation (audio)  Word/melody-spotting: searching for a data segment in a data stream

11 Overview of Works  (2001) Durey, Clements (ctd.)  Uses monophonic melodies, both audio and MIDI data  Left-to-right, 5-state HMM to represent each available note and a rest  Frequency and time-domain features for feature vectors  Constructs an HMM model from the input query  Runs all of the feature vectors from the songs in the database using Viterbi process  A ranked list of melody occurances in database songs is created  System presented as a proof-of-concept  (2001) Durey, Clements (ctd.)  Uses monophonic melodies, both audio and MIDI data  Left-to-right, 5-state HMM to represent each available note and a rest  Frequency and time-domain features for feature vectors  Constructs an HMM model from the input query  Runs all of the feature vectors from the songs in the database using Viterbi process  A ranked list of melody occurances in database songs is created  System presented as a proof-of-concept

12 Overview of Works Indexing Hidden Markov Models for Music Retrieval  (2002) Jin, Jagadish – University of Michigan  Music retrieval system  Paper describes traditional MIR HMM techniques as effective but not efficient  Efficient mechanism is suggested to index the HMMs  Each state is represented by an interval / inter onset interval ratio  Each transition is transformed into a 4-dimensional box Indexing Hidden Markov Models for Music Retrieval  (2002) Jin, Jagadish – University of Michigan  Music retrieval system  Paper describes traditional MIR HMM techniques as effective but not efficient  Efficient mechanism is suggested to index the HMMs  Each state is represented by an interval / inter onset interval ratio  Each transition is transformed into a 4-dimensional box

13 Overview of Works  (2002) Jin, Jagadish (ctd.)  All boxes are inserted into R-tree, an indexing structure for multidimensional data  HMMs are ranked by the number of boxes  Most likely candidates are selected for evaluation  The evaluation uses the traditional forward algorithm  (2002) Jin, Jagadish (ctd.)  All boxes are inserted into R-tree, an indexing structure for multidimensional data  HMMs are ranked by the number of boxes  Most likely candidates are selected for evaluation  The evaluation uses the traditional forward algorithm

14 Overview of Works Chord Segmentation and Recognition using EM-Trained Hidden Markov Models  (2003) Sheh, Ellis – Columbia University  Uses HMM for chord recognition, EM (Expectation- Maximization) Algorithm to train them  PCP (Pitch Class Profile) vectors used as features to train HMMs  HMM model for each chord type (ex. ‘A Minor’, etc.)  System able to successfully recognize chords in unstructured, polyphonic, multi-timbre audio Chord Segmentation and Recognition using EM-Trained Hidden Markov Models  (2003) Sheh, Ellis – Columbia University  Uses HMM for chord recognition, EM (Expectation- Maximization) Algorithm to train them  PCP (Pitch Class Profile) vectors used as features to train HMMs  HMM model for each chord type (ex. ‘A Minor’, etc.)  System able to successfully recognize chords in unstructured, polyphonic, multi-timbre audio

15 Overview of Works Effectiveness of HMM-Based Retrieval on Large Databases  (2003) Shifrin, Burmingham – University of Michigan  Investigates performance of an HMM-based QBH system on a large musical database  VocalSearch system, part of MusArt project  theme database (roughly songs)  Uses pair as feature vectors Effectiveness of HMM-Based Retrieval on Large Databases  (2003) Shifrin, Burmingham – University of Michigan  Investigates performance of an HMM-based QBH system on a large musical database  VocalSearch system, part of MusArt project  theme database (roughly songs)  Uses pair as feature vectors

16 Overview of Works  (2003) Shifrin, Burmingham (ctd)  Compared perfect and imperfect queries, simulated insertions and deletions  Discovered Trends:  Longer queries have a positive effect on evaluation performance  All experiments show an early saturation point  Performed well with imperfect queries on a large database  (2003) Shifrin, Burmingham (ctd)  Compared perfect and imperfect queries, simulated insertions and deletions  Discovered Trends:  Longer queries have a positive effect on evaluation performance  All experiments show an early saturation point  Performed well with imperfect queries on a large database

17 Overview of Works A HMM-Based Pitch Tracker for Audio Queries  (2003) Orio, Sette – University of Padova  HMM-based approach to transcription of musical queries  HMM used to model features related to singing voice  A sung query is considered as an observation of an unknown process – the melody the user has in mind  Two-level HMM: event-level (using pitches as labels), audio-level (attack-sustain-relst events)  A simple model is presented, low recognition percentages A HMM-Based Pitch Tracker for Audio Queries  (2003) Orio, Sette – University of Padova  HMM-based approach to transcription of musical queries  HMM used to model features related to singing voice  A sung query is considered as an observation of an unknown process – the melody the user has in mind  Two-level HMM: event-level (using pitches as labels), audio-level (attack-sustain-relst events)  A simple model is presented, low recognition percentages

18 Conclusion  HTML Bibliography  Questions  HTML Bibliography  Questions