Machine Learning and its Application to Speech Impediment Therapy.

Slides:



Advertisements
Similar presentations
This algorithm is used for dimension reduction. Input: a set of vectors {Xn є }, and dimension d,d
Advertisements

ThemeGallery PowerTemplate
Alexander Statnikov1, Douglas Hardin1,2, Constantin Aliferis1,3
Applications of one-class classification
ESPON 2013 DATABASE Malmö Seminar, 2-3 December 2009 Thematic structuring of the ESPON 2013 DB Geoffrey Caruso and Nuno Madeira.
Machine Learning Approaches to the Analysis of Large Corpora : A Survey Xunlei Rose Hu and Eric Atwell University of Leeds.
Introduction to Support Vector Machines (SVM)
Adjusting Active Basis Model by Regularized Logistic Regression
A. Hatzis, P.D. Green, S. Howard (1) Optical Logo-Therapy (OLT) Introduction What is OLT ? OLT is a Computer Based Speech Training system (CBST)
Automatic Virtual Colonoscopy Based Diagnostic Decision Support System for Colorectal Cancer Screening Project IKTA5-058/2002 Infocommunications Technologies.
Vogler and Metaxas University of Toronto Computer Science CSC 2528: Handshapes and Movements: Multiple- channel ASL recognition Christian Vogler and Dimitris.
Making Numbers Two-digit numbers Three-digit numbers Click on the HOME button to return to this page at any time.
What can Actavis contribute for the sustainability in the health care system?
SVM—Support Vector Machines
ISCA Goal "to promote Speech Communication Science and Technology, both in the industry and Academic areas“ Coverage: all.
An Overview of Machine Learning
E-Learning Center Deanship of Academic Development.
Learning similarity measure for natural image retrieval with relevance feedback Reporter: Francis 2005/3/3.
Page 0 of 8 Time Series Classification – phoneme recognition in reconstructed phase space Sanjay Patil Intelligent Electronics Systems Human and Systems.
Professor Eastman EDUC111 A Mimio is a new teaching instrument that is becoming more and more popular in schools today. It is the newer version of.
Speaker Clustering using MDL Principles Kofi Boakye Stat212A Project December 3, 2003.
Pattern Recognition Topic 1: Principle Component Analysis Shapiro chap
Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-based Interactive Toy Jacky CHAU Department of Computer Science and Engineering.
Optimal Adaptation for Statistical Classifiers Xiao Li.
Lecture #1COMP 527 Pattern Recognition1 Pattern Recognition Why? To provide machines with perception & cognition capabilities so that they could interact.
1 New Technique for Improving Speech Intelligibility for the Hearing Impaired Miriam Furst-Yust School of Electrical Engineering Tel Aviv University.
Learning in Feature Space (Could Simplify the Classification Task)  Learning in a high dimensional space could degrade generalization performance  This.
SOMTIME: AN ARTIFICIAL NEURAL NETWORK FOR TOPOLOGICAL AND TEMPORAL CORRELATION FOR SPATIOTEMPORAL PATTERN LEARNING.
Introduction to Data Mining Engineering Group in ACL.
Acoustic and Linguistic Characterization of Spontaneous Speech Masanobu Nakamura, Koji Iwano, and Sadaoki Furui Department of Computer Science Tokyo Institute.
Adaptation Techniques in Automatic Speech Recognition Tor André Myrvoll Telektronikk 99(2), Issue on Spoken Language Technology in Telecommunications,
Age and Gender Classification using Modulation Cepstrum Jitendra Ajmera (presented by Christian Müller) Speaker Odyssey 2008.
BUSINESS COMMUNICATION AND NEGOTIATION Radu RĂDUCAN.
VBS Documentation and Implementation The full standard initiative is located at Quick description Standard manual.
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
COSC 3461: Module 1 S04 Introduction to Interaction & Principles of Design I.
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Enhancing Interactive Visual Data Analysis by Statistical Functionality Jürgen Platzer VRVis Research Center Vienna, Austria.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Filter + Support Vector Machine for NIPS 2003 Challenge Jiwen Li University of Zurich Department of Informatics The NIPS 2003 challenge was organized to.
ICASSP Speech Discrimination Based on Multiscale Spectro–Temporal Modulations Nima Mesgarani, Shihab Shamma, University of Maryland Malcolm Slaney.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
A Confidence-Based Approach to Multi-Robot Demonstration Learning Sonia Chernova Manuela Veloso Carnegie Mellon University Computer Science Department.
Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.
S IMULATIONS AND D ATA A NALYSIS Chapter 12. D ATA A NALYSIS What is data analysis? “…the process of transforming data into information…in the process.
Assistive Technology Meeka Bland-Grassaree. Assistive Technology Assistive technology refers to "any item, piece of equipment, or product system, whether.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Speech Lab, ECE, State University of New York at Binghamton  Classification accuracies of neural network (left) and MXL (right) classifiers with various.
Final Exam Review CS479/679 Pattern Recognition Dr. George Bebis 1.
A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, , 1998.
0 / 27 John-Paul Hosom 1 Alexander Kain Brian O. Bush Towards the Recovery of Targets from Coarticulated Speech for Automatic Speech Recognition Center.
Emergent Literacy Chapter 4 Cohen and Cowan. What is Emergent Literacy?
Support-Vector Networks C Cortes and V Vapnik (Tue) Computational Models of Intelligence Joon Shik Kim.
Machine Learning in CSC 196K
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 10: PRINCIPAL COMPONENTS ANALYSIS Objectives:
Basics of Natural Language Processing Introduction to Computational Linguistics.
On the relevance of facial expressions for biometric recognition Marcos Faundez-Zanuy, Joan Fabregas Escola Universitària Politècnica de Mataró (Barcelona.
Cross-Dialectal Data Transferring for Gaussian Mixture Model Training in Arabic Speech Recognition Po-Sen Huang Mark Hasegawa-Johnson University of Illinois.
Madhulika Pannuri Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Correlation Dimension.
CSE 4705 Artificial Intelligence
Classification of models
Landmark-Based Speech Recognition: Spectrogram Reading, Support Vector Machines, Dynamic Bayesian Networks, and Phonology Mark Hasegawa-Johnson
Online Multiscale Dynamic Topic Models
Title: LEARNING FOR THE BLIND
The Effects of Intensive Sand Writing on Phonemic Awareness in Kindergarten By Coby Long.
Blind Signal Separation using Principal Components Analysis
<Project Title>
Presentation transcript:

Machine Learning and its Application to Speech Impediment Therapy

Project Data Title: Machine Learning and its Application to Speech Impediment Therapy Duration: February February 2004 www: Grant No.: IKTA4-055/2001 Keywords: artificial intelligence, machine learning, real-time phoneme recognition, teaching of reading, speech impediment therapy

Project members: Co-ordinator: University of Szeged, Department of Informatics Address: H-6720, Szeged, Árpád tér 2. Project/team leader: András Kocsor Consortium members: University of Szeged, Training Teaching School, Team leader: János Bácsi, Kindergarten, Primary school and Boarding school, (the school for the deaf) Team leader: Jenő Mihalovics,

Summary The project consists of two main parts: a research part and an application part. The research part is devoted to investigating modern machine learning techniques which can form the basis of the development of several info- communication systems. The machine learning algorithms developed have been made available as open-source software. One example of these applications is speech recognition, which is the heart of the “SpeechMaster” software, developed in the second, application part of our project.

Machine Learning Algorithms Basic assumptions: a) objects are characterized by features b) each feature constitutes one dimension in the feature space Questions: - concept making - classification - feature selection - feature space transformation - dimension reduction 99%

Linear feature space transformations: - P rincipal C omponent A nalysis - I ndependent C omponent A nalysis - L inear D iscriminant A nalysis - S pringy D iscriminant A nalysis Machine Learning Algorithms 99%

Machine Learning Algorithms Nonlinear feature space transformations: - K ernel P rincipal C omponent A nalysis - K ernel I ndependent C omponent A nalysis - K ernel L inear D iscriminant A nalysis - K ernel S pringy D iscriminant A nalysis 99%

Machine Learning Algorithms Machine learning algorithms: - A rtificial N eural N ets - Gaussian Mixture Modeling - Support Vector Machines - Projection Pursuit Learner

a) real-time phoneme recognition b) word recognition Speech Recognition 70% Methods: - machine learning - speech corpora

Speech Corpora 200 speakers of different ages 500 speakers (male/female 50-50%) age: 6-7 Speech impediment therapy Teaching of reading 90%

The “SpeechMaster” 60% Phonological Awareness Teaching

The “SpeechMaster” 60% Phoneme-Grapheme association

The “SpeechMaster” 60% Visual feedback for the hearing impaired