Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor ： Dr. Hsu Graduate ： Chun Kai Chen Author: Aravind.

Presentation transcript:

Applications of Support Vector Machines to Speech Recognition
Authors: Aravind Ganapathiraju, Jonathan E. Hamaker, and Joseph Picone (IEEE, 2004)
Advisor: Dr. Hsu
Graduate: Chun Kai Chen
Intelligent Database Systems Lab, 國立雲林科技大學 National Yunlin University of Science and Technology

Outline
• Motivation
• Objective
• Introduction
• Speech Recognition
• Support Vector Machines
• Experimental Results
• Conclusions
• Personal Opinion

Motivation
• A maximum likelihood (ML) formulation has problems in applications such as speech recognition.
  ─ On a higher-dimensional problem, it will never achieve perfect classification.

Objective
• Apply SVMs to overcome higher-dimensional problems and achieve perfect classification.
• Apply SVMs to large-vocabulary speech recognition.
• Develop and optimize an SVM/HMM hybrid system.

Introduction
• Speech Recognition
  ─ Speech recognition process
  ─ Hidden Markov model
• Application of SVMs to speech recognition
  ─ Review the SVM approach
  ─ Discuss applications to speech recognition
  ─ Present experimental results

Speech Recognition

Speech Recognition Process (MFCC)

Hidden Markov Model (1/2)
• An HMM is a doubly stochastic process with an underlying stochastic process that is not observable (it is hidden).
• It is described as a state-transition process.
• For speech modeling applications, the HMM is a generator of vector sequences.

Hidden Markov Model (2/2)
Finite-State Machine + Probability Process
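The "finite-state machine plus probability process" view can be illustrated with a small sketch. It is a simplification under stated assumptions: observations are discrete symbols rather than the feature vectors used in speech, and the two-state model and all of its probabilities are hypothetical. The forward algorithm sums over every hidden state path to give the likelihood of an observation sequence.

```python
def forward_likelihood(initial, transition, emission, observations):
    """Return P(observations) under a discrete HMM via the forward algorithm."""
    n_states = len(initial)
    # alpha[s] = P(o_1..o_t, state at time t is s)
    alpha = [initial[s] * emission[s][observations[0]] for s in range(n_states)]
    for obs in observations[1:]:
        alpha = [
            sum(alpha[prev] * transition[prev][s] for prev in range(n_states))
            * emission[s][obs]
            for s in range(n_states)
        ]
    return sum(alpha)


# Hypothetical two-state model with two observation symbols (0 and 1).
initial = [0.6, 0.4]                    # P(first state)
transition = [[0.7, 0.3], [0.4, 0.6]]   # P(next state | current state)
emission = [[0.9, 0.1], [0.2, 0.8]]     # P(symbol | state)

likelihood = forward_likelihood(initial, transition, emission, [0, 1, 0])
```

The hidden states never appear in the result: only the observation sequence and the model parameters are needed, which is what makes the underlying process "hidden".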

Problems with HMMs
• Maximum likelihood (ML)
  ─ Estimate the parameters that guarantee convergence.
• Expectation-maximization (EM)
  ─ Estimation with good convergence properties, although it does not guarantee finding the global maximum.
• Problems with an ML formulation
  ─ It will never achieve perfect classification.

Global maximum problem

Support Vector Machines

SVM
• The goal of support vector classification is to find a separating hyperplane in a high-dimensional feature space, one that gives the best boundary between the classes.
• ERM and SRM can be used to find a good hyperplane.
  ─ ERM (empirical risk minimization): can be used to find a good hyperplane, although this does not guarantee a unique solution.
  ─ SRM (structural risk minimization): can help choose the best hyperplane by ordering the hyperplanes based on the margin.
• Real-world classification problems
• ANNs
  ─ Attempt to overcome many of these problems.
  ─ Suffer from slow convergence during training and a tendency to overfit the data.

A hyperplane classifier

Kernels
• Allow a dot product to be computed in a higher-dimensional space.
  ─ Linear
  ─ Polynomial
  ─ Radial basis function (RBF): slower than polynomial kernels but gives better performance
  ─ Sigmoid
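The four kernel types listed above can be sketched in a few lines. The parameter names and default values (`degree`, `coef0`, `gamma`, `scale`, `offset`) are illustrative assumptions, not values from the presentation. The helper `explicit_quadratic_map` is also hypothetical: it shows, for 2-D inputs only, that a degree-2 polynomial kernel equals an ordinary dot product in a 3-D feature space.

```python
import math


def dot(x, y):
    return sum(a * b for a, b in zip(x, y))


def linear_kernel(x, y):
    return dot(x, y)


def polynomial_kernel(x, y, degree=2, coef0=1.0):
    return (dot(x, y) + coef0) ** degree


def rbf_kernel(x, y, gamma=0.5):
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


def sigmoid_kernel(x, y, scale=1.0, offset=0.0):
    return math.tanh(scale * dot(x, y) + offset)


def explicit_quadratic_map(x):
    """Feature map whose dot product equals (x . y)**2 for 2-D inputs."""
    return [x[0] ** 2, math.sqrt(2) * x[0] * x[1], x[1] ** 2]


x, y = [1.0, 2.0], [3.0, 4.0]
via_kernel = polynomial_kernel(x, y, degree=2, coef0=0.0)
via_feature_map = dot(explicit_quadratic_map(x), explicit_quadratic_map(y))
```

Here `via_kernel` and `via_feature_map` agree up to floating-point rounding, but the kernel never builds the higher-dimensional vectors explicitly.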

One-against-all method
• y_i: the class assignments
• w: the weight vector defining the classifier
• b: a bias term
• ε_i: the slack variables
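The equation these symbols belong to is not preserved in the transcript. A common soft-margin formulation is given here as an assumption, not as the slide's exact equation. The training vectors $x_i$, the sample count $N$, and the penalty constant $C$ do not appear in the transcript and are introduced only for illustration.

```latex
\min_{w,\,b,\,\varepsilon} \;\; \frac{1}{2}\,\lVert w \rVert^{2} + C \sum_{i=1}^{N} \varepsilon_i
\qquad \text{subject to} \qquad
y_i \,( w \cdot x_i + b ) \;\ge\; 1 - \varepsilon_i ,
\quad \varepsilon_i \ge 0 , \quad i = 1, \dots, N
```

In a one-against-all setup, one such classifier is trained per class, with $y_i = +1$ for members of that class and $y_i = -1$ for all other samples.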

Applications to speech recognition
• Hybrid approaches
• SVMs cannot model the temporal structure of speech effectively.
• So we still need to use the HMM structure to model temporal evolution.
• Use the NN only to estimate posterior probabilities.

Several issues arise
• Posterior estimation
• Segmental modeling
• N-best list rescoring

Posterior estimation
• There is significant overlap in the feature space.
• SVMs provide a distance, or discriminant, that can be used to compare classifiers.
• Main concern in using SVMs
  ─ The lack of a clear relationship between the distance from the margin and the posterior class probability.
• We used a sigmoid distribution to map the output distances to posteriors.
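The distance-to-posterior mapping can be sketched as a two-parameter sigmoid. The parameter names `slope` and `offset` and their default values are hypothetical; in practice they would be fitted on held-out data, and the transcript does not give the values or the fitting procedure that were used.

```python
import math


def distance_to_posterior(distance, slope=-2.0, offset=0.0):
    """Map a signed SVM distance to a value in (0, 1).

    With a negative slope, larger positive distances give posteriors
    closer to 1. Both parameters are illustrative assumptions.
    """
    return 1.0 / (1.0 + math.exp(slope * distance + offset))


on_boundary = distance_to_posterior(0.0)
far_positive = distance_to_posterior(2.0)
far_negative = distance_to_posterior(-2.0)
```

A sample on the decision boundary maps to 0.5 when `offset` is zero, and the mapping is monotonic, so ranking by posterior matches ranking by distance.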

Sigmoid

Segmental Modeling (1/2)
• At the frame level, it is still not computationally feasible to train on all the data available in large corpora.
• In our work, we have chosen to use a segment-based approach to avoid these issues.
• Segmental data takes better advantage of the correlation between adjacent frames of speech data.
• A related problem is the variable length, or duration, problem.

Segmental Modeling (2/2)
• A simple but effective approach, motivated by three-state HMMs, is to assume that the segments are composed of a fixed number of sections.
• The first and third sections model the transitions into and out of the segment.
• The second section models the stable portion of the segment.
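One way to turn a variable-length segment into a fixed-length vector, following the three-section idea above, is to average the frames in each section and concatenate the averages. This is a sketch under assumptions: the 3:4:3 split and the use of plain averaging are illustrative choices, and the function name is hypothetical.

```python
def segment_features(frames, proportions=(3, 4, 3)):
    """Average the frames in each section and concatenate the section means.

    `frames` is a non-empty list of equal-length feature vectors.
    The default 3:4:3 split is an illustrative assumption.
    """
    n_frames = len(frames)
    total = sum(proportions)
    # Cumulative section boundaries, rounded to frame indices.
    bounds = [0]
    running = 0
    for share in proportions:
        running += share
        bounds.append(round(n_frames * running / total))
    features = []
    for start, end in zip(bounds, bounds[1:]):
        # Very short segments can leave a section empty; reuse the nearest frame.
        section = frames[start:end] or [frames[min(start, n_frames - 1)]]
        dim = len(section[0])
        features.extend(sum(f[d] for f in section) / len(section) for d in range(dim))
    return features


ten_frames = [[float(i), 2.0 * i] for i in range(10)]
fixed_vector = segment_features(ten_frames)
```

The output length is the number of sections times the frame dimension, regardless of how many frames the segment contains, which addresses the variable-duration problem.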

The concept of segmental probability model (SPM)

N-best List Rescoring
• Generate N-best lists using the HMM system.
• Generate an alignment for each hypothesis in the N-best list using the HMM system.
• Generate segment-level feature vectors from these alignments.
• Reorder the N-best list based on the likelihood; the top hypothesis is used to calibrate the performance of the system.
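The reordering step can be sketched as follows. The data layout (a list of dictionaries with `words` and `segment_posteriors` keys), the example values, and the scoring rule (sum of log posteriors over segments) are all assumptions made for illustration. The transcript only states that the list is reordered based on the likelihood.

```python
import math


def rescore_nbest(hypotheses):
    """Return the hypotheses sorted from most to least likely.

    Each hypothesis carries one posterior per aligned segment; the score
    is the sum of their logarithms (an assumed scoring rule).
    """
    def score(hypothesis):
        return sum(math.log(p) for p in hypothesis["segment_posteriors"])

    return sorted(hypotheses, key=score, reverse=True)


# Hypothetical 3-best list for a two-segment utterance.
nbest = [
    {"words": "b d", "segment_posteriors": [0.6, 0.5]},
    {"words": "p d", "segment_posteriors": [0.7, 0.8]},
    {"words": "b t", "segment_posteriors": [0.6, 0.2]},
]
reordered = rescore_nbest(nbest)
best = reordered[0]
```

Working in the log domain avoids numerical underflow when a hypothesis spans many segments.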

Overview of a hybrid HMM/SVM system

Experimental Results
• The Deterding vowel data
  ─ A simple but popular static classification task.
  ─ Used to benchmark nonlinear classifiers.
• Spoken Letters and Numbers
  ─ Spoken letters and numbers over long-distance telephone lines.
  ─ OGI Alphadigits (AD)
  ─ Confusable for telephone-quality speech (e.g., "p" vs. "b").

Conclusions
• A support vector machine was used as a classifier in a continuous speech recognition system.
• A hybrid SVM/HMM system has been developed.
• The results obtained in the experiments clearly indicate the classification power of SVMs and affirm the use of SVMs for acoustic modeling.
• Further research into the segmentation issue is needed.

Personal Opinion
• I need to study more and more, and I wish God could give me more time.