An Iterative Approach to Discriminative Structure Learning

An Iterative Approach to Discriminative Structure Learning
Peng Xu, WS’2001

Discriminative Structure Learning Procedure
1. Baseline HMM training
2. Viterbi alignment of training data
3. Bivariate MI computation
4. Discriminative structure selection
5. Parameter re-estimation of the new model
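The MI computation step can be illustrated with a small sketch. It assumes the two feature streams have already been discretized and that each frame carries one hard label from the Viterbi alignment; the slides do not say how the features were modeled, so the discretization and the function names `mutual_information` and `class_conditional_mi` are hypothetical.

```python
import math
from collections import Counter


def mutual_information(pairs):
    """Mutual information in nats between two discrete variables,
    estimated from a list of (x, z) samples by relative frequencies."""
    n = len(pairs)
    joint = Counter(pairs)
    count_x = Counter(x for x, _ in pairs)
    count_z = Counter(z for _, z in pairs)
    mi = 0.0
    for (x, z), c in joint.items():
        # p(x,z) * log(p(x,z) / (p(x) p(z))), written in terms of counts
        mi += (c / n) * math.log(c * n / (count_x[x] * count_z[z]))
    return mi


def class_conditional_mi(xs, zs, labels):
    """I(X; Z | Q = q) for every label q, using only the frames that the
    hard (Viterbi-style) alignment assigned to q."""
    frames_by_label = {}
    for x, z, q in zip(xs, zs, labels):
        frames_by_label.setdefault(q, []).append((x, z))
    return {q: mutual_information(p) for q, p in frames_by_label.items()}


# Toy data (made up): under label "a" the two features always agree,
# under label "b" they are independent.
xs = [0, 0, 1, 1, 0, 1, 0, 1]
zs = [0, 0, 1, 1, 0, 0, 1, 1]
labels = ["a"] * 4 + ["b"] * 4
print(class_conditional_mi(xs, zs, labels))
```

On this toy data the MI under label "a" is log 2 (about 0.693 nats) and the MI under label "b" is zero, so a structure selection rule based on class-conditional MI would treat the dependency as informative for "a" only.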

Problems With the Procedure
- The Viterbi alignment may change after the new model is trained
- Parameter estimation uses MLE
- The Viterbi approximation restricts the MI computation

Discriminative Model Learning
Goal: minimize D(P(Q|O) || P'(Q|O)), the divergence between the desired posterior probability distribution and the posterior probability distribution according to the model.
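As a rough illustration of this objective, the sketch below computes the divergence for posteriors over a small discrete label set. The dictionaries and the function name `kl_divergence` are hypothetical, and treating the desired posterior as a point mass on a reference label is an assumption made only for the example.

```python
import math


def kl_divergence(desired, model):
    """D(desired || model) in nats for two distributions given as
    dicts mapping label -> probability."""
    total = 0.0
    for q, p in desired.items():
        if p == 0.0:
            continue  # terms with zero desired mass contribute nothing
        p_model = model.get(q, 0.0)
        if p_model == 0.0:
            return math.inf  # the model rules out a label the target allows
        total += p * math.log(p / p_model)
    return total


# Assumed example: the desired posterior puts all mass on label "a".
desired = {"a": 1.0, "b": 0.0}
model = {"a": 0.8, "b": 0.2}
print(kl_divergence(desired, model))
```

When the desired posterior is a point mass on the reference label, the divergence reduces to the negative log posterior that the model assigns to that label, so minimizing it is the same as maximizing the conditional likelihood of the reference labels.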

Geometric Illustration of the Iterative Approach
[Figure: the desired posterior distribution and the model family]

BMM Structures
[Figure: BMM structures at iteration n and at iteration n+1]

EM-Type Iterative Structure and Parameter Learning
- E-step: Viterbi alignment of training data, MI computation
- M-step: discriminative, conditional mutual information based BMM edge detection; model parameter learning (MMI)

Improving MI Computation
Viterbi alignment:
- The label for each frame is deterministic
- The posterior probability P(Qt=q|xt) is a δ function
Soft alignment:
- Compute P(Qt=q|xt) using the forward-backward algorithm
- Each data frame contributes to all labels
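The soft alignment can be sketched with a plain forward-backward pass. This is a minimal illustration, not the workshop implementation: the function name `state_posteriors` and the toy probabilities are made up, the emission likelihoods are assumed to be precomputed per frame, and the posterior returned is conditioned on the whole observation sequence, which is what forward-backward yields.

```python
def state_posteriors(init, trans, emit):
    """Per-frame state posteriors gamma[t][i] = P(Q_t = i | x_1..x_T).

    init[i]     initial state probability
    trans[i][j] transition probability from state i to state j
    emit[t][i]  likelihood p(x_t | Q_t = i), assumed precomputed
    """
    T, N = len(emit), len(init)

    # Forward pass, renormalized at every frame to avoid underflow.
    alpha = []
    for t in range(T):
        if t == 0:
            a = [init[i] * emit[0][i] for i in range(N)]
        else:
            a = [emit[t][j] * sum(alpha[t - 1][i] * trans[i][j] for i in range(N))
                 for j in range(N)]
        s = sum(a)
        alpha.append([v / s for v in a])

    # Backward pass, also renormalized at every frame.
    beta = [[1.0] * N for _ in range(T)]
    for t in range(T - 2, -1, -1):
        b = [sum(trans[i][j] * emit[t + 1][j] * beta[t + 1][j] for j in range(N))
             for i in range(N)]
        s = sum(b)
        beta[t] = [v / s for v in b]

    # Combine; the per-frame normalization cancels the scaling factors.
    gamma = []
    for t in range(T):
        g = [alpha[t][i] * beta[t][i] for i in range(N)]
        s = sum(g)
        gamma.append([v / s for v in g])
    return gamma


# Toy two-state model (made-up numbers).
init = [0.5, 0.5]
trans = [[0.5, 0.5], [0.5, 0.5]]
emit = [[0.9, 0.1], [0.2, 0.8]]
for row in state_posteriors(init, trans, emit):
    print(row)
```

A hard alignment corresponds to replacing each row of `gamma` with a one-hot vector. With soft alignment, each row can instead be used as a set of fractional weights, so every frame contributes to the MI statistics of every label in proportion to its posterior.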

Proposal for the Next Year
- Formal formulation of the iterative model structure and parameter learning
- Theoretical study of the EM-type learning procedure
- Implementation of the improved MI computation
- Application to different data sets: Aurora, audio-visual, etc.