24-09-1997 A. Hatzis, P.D. Green, S. Howard (1) Optical Logo-Therapy (OLT) Introduction What is OLT ? OLT is a Computer Based Speech Training system (CBST)

Slides:



Advertisements
Similar presentations
1 Verification by Model Checking. 2 Part 1 : Motivation.
Advertisements

Testing Relational Database
FUNCTION FITTING Student’s name: Ruba Eyal Salman Supervisor:
The 4 T’s of Test Automation:
Speech Case Study Spring 2002 By. Introduction Audience Deaf Education Teachers Goal To present information regarding speech education techniques used.
Decision Support and Artificial Intelligence Jack G. Zheng May 21 st 2008 MIS Chapter 4.
Decision Support and Artificial Intelligence Jack G. Zheng July 11 th 2005 MIS Chapter 4.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
Addition Facts
Database Design: ER Modelling (Continued)
Treatment Principles Contrasted Phonological Disorder Childhood Apraxia of Speech Principles of Motor Learning Copyright © 2011 Caroline Bowen.
Reductions Complexity ©D.Moshkovitz.
1 Ganesh Iyer Perceptual Mapping XMBA Session 3 Summer 2008.
Configuration management
A. Hatzis, P.D. Green, S. Howard (1) Optical Logo-Therapy (OLT) : Visual displays in practical auditory phonetics teaching. Introduction What.
Chapter 18 Methodology – Monitoring and Tuning the Operational System Transparencies © Pearson Education Limited 1995, 2005.
QA practitioners viewpoint
Emergency and Overtime Fan-Out. 2 9/4/2012 About Us In business since 1992 Core strength: Integrating event-driven systems with communications networks.
An Integrated Toolkit Deploying Speech Technology for Computer Based Speech Training with Application to Dysarthric Speakers Athanassios Hatzis, Phil Green,
Mathematics and Special Education Leadership Protocols
Text Categorization.
Analysis of High-Throughput Screening Data C371 Fall 2004.
Slide 1 Shall Lists. Slide 2 Shall List Statement Categories  Functional Requirements  Non-Functional Requirements.
This, that, these, those Number your paper from 1-10.
Addition 1’s to 20.
25 seconds left…...
Phonetics as a scientific study of speech
Week 1.
Specific Language Impairment in the Regular Classroom
Chapter 13 The Data Warehouse
Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),
A two dimensional kinematic mapping between speech acoustics and vocal tract configurations : WISP A.Hatzis, P.D.Green1 History of Vowel.
Coarticulation Analysis of Dysarthric Speech Xiaochuan Niu, advised by Jan van Santen.
Unsupervised learning. Summary from last week We explained what local minima are, and described ways of escaping them. We investigated how the backpropagation.
Speech Group INRIA Lorraine
Machine Learning: Connectionist McCulloch-Pitts Neuron Perceptrons Multilayer Networks Support Vector Machines Feedback Networks Hopfield Networks.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Neural Net Algorithms for SC Vowel Recognition Presentation for EE645 Neural Networks and Learning Algorithms Spring 2003 Diana Stojanovic.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Speaker Adaptation for Vowel Classification
Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-based Interactive Toy Jacky CHAU Department of Computer Science and Engineering.
Machine Learning Motivation for machine learning How to set up a problem How to design a learner Introduce one class of learners (ANN) –Perceptrons –Feed-forward.
Optimal Adaptation for Statistical Classifiers Xiao Li.
CSD 5400 REHABILITATION PROCEDURES FOR THE HARD OF HEARING Auditory Training.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Database Construction for Speech to Lip-readable Animation Conversion Gyorgy Takacs, Attila Tihanyi, Tamas Bardi, Gergo Feldhoffer, Balint Srancsik Peter.
New technologies supporting people with severe speech disorders Mark Hawley Barnsley District General Hospital and University of Sheffield.
Age and Gender Classification using Modulation Cepstrum Jitendra Ajmera (presented by Christian Müller) Speaker Odyssey 2008.
Graphite 2004 Statistical Synthesis of Facial Expressions for the Portrayal of Emotion Lisa Gralewski Bristol University United Kingdom
NEURAL NETWORKS FOR DATA MINING
Jacob Zurasky ECE5526 – Spring 2011
Sh s Children with CIs produce ‘s’ with a lower spectral peak than their peers with NH, but both groups of children produce ‘sh’ similarly [1]. This effect.
Korean Phoneme Discrimination Ben Lickly Motivation Certain Korean phonemes are very difficult for English speakers to distinguish, such as ㅅ and ㅆ.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Ch 5b: Discriminative Training (temporal model) Ilkka Aho.
ELIS-DSSP Sint-Pietersnieuwstraat 41 B-9000 Gent SPACE Symposium - 05/02/091 Objective intelligibility assessment of pathological speakers Catherine Middag,
Performance Comparison of Speaker and Emotion Recognition
Current Approaches to Management of DAS Michelle D. White.
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
Research Methodology Proposal Prepared by: Norhasmizawati Ibrahim (813750)
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Recognition of bumblebee species by their buzzing sound
Dean Luo, Wentao Gu, Ruxin Luo and Lixin Wang
Elise A. Piazza, Marius Cătălin Iordan, Casey Lew-Williams 
Speech Case Study Spring 2002
John H.L. Hansen & Taufiq Al Babba Hasan
Attentive Tracking of Sound Sources
Presentation transcript:

A. Hatzis, P.D. Green, S. Howard (1) Optical Logo-Therapy (OLT) Introduction What is OLT ? OLT is a Computer Based Speech Training system (CBST) for the visualisation of articulation using both connectionist and speech technology techniques.

A. Hatzis, P.D. Green, S. Howard (2) Optical Logo-Therapy (OLT) State of the Art in (CBST) Research on (CBST) systems started 15 years ago. The majority of the systems provided feedback on a single acoustic dimension of speech such as, pitch, volume(amplitude,duration,onset), rhythm, intonation, articulation, or on a single acoustic property, such as s-sh frication, vowels articulation, e.t.c. Physiological methods vs Acoustic analysis Electropalatography (EPG)Video Voice Electrolaryngography (ELG)Indiana Speech Training Aid (ISTRA) GlossometryThe Visual Speech Apparatus (VSA) The Dynamic OrometerIBM Speech Viewer HARP Visual Speech Trajectories (Kohonen SOM - Visual Ear - VAHISOM - OLT)

A. Hatzis, P.D. Green, S. Howard (3) Optical Logo-Therapy (OLT) Computer Based Speech Training systems (CBST) Critical aspects The kind of feedback Evaluation of speech production Guidelines for error correction Training curriculum (Phone - Utterance - Continuous speech) Adjustable training targets Specialised to particular speech disorders and clients physiology Motivation. Enjoy the speech drills. OLT design aspects Real time visual animated feedback Qualitative and quantitative results Speaker comparison and trial error correction Simultaneous phone and utterance training Build maps based on best user training performance Specialised phonetic maps Simple games, e.g. a moving object that follows the target trajectories on the map.

A. Hatzis, P.D. Green, S. Howard (4) Optical Logo-Therapy (OLT) Preparation of Speech Training Data Segmentation and labelling –Manual procedures have been used for the current experiments. –Mel Frequency Cepstral Coefficients (MFCC) together with overall energy are taken every 10 msec on an analysis window of 20 msec. –Appropriate phone categories are selected for the special case of the speech disordered person. –Equal number of samples is taken for each one of the phone categories.

A. Hatzis, P.D. Green, S. Howard (5) Optical Logo-Therapy (OLT) Creating an OLT Phonetic Map Three Stages Learning Vector Quantisation (LVQ) –LVQ algorithms [Kohonen et. all] are used to model the subset of phonetic space with a sufficient number of 9D reference vectors. Sammon mapping –A non-linear projection is then applied to reduce the 9D reference vectors to points in a 2D space. Multi Layer Perceptron mapping (MLP) –Finally an MLP neural network is trained using the backpropagation algorithm to learn the nonlinear relationship between 9D space patterns and 2D ones.

A. Hatzis, P.D. Green, S. Howard (6) Optical Logo-Therapy (OLT) OLT - User Interface Three Main Windows OLT - Control panel –Control path display attributes. –Control map projection. –Control speech recording and playback. –Control the creation, selection and loading of maps and utterances. OLT - Samples pool –On line selection of pre-recorded utterances for comparison with user OLT - Main map area (Slide 4) –Recording and playback –Speech evaluation tools

A. Hatzis, P.D. Green, S. Howard (7) Optical Logo-Therapy (OLT) Adults Sibilant Fricatives Experiment Map Construction Training data from 4 and testing data from 2 normal speakers, all male and English adults. 44 utterances of fixed context of the form /ee X u/ where X is /s,sh,z,zh/. Map Comparison A speech impaired subject, also male and English adult was selected for comparison. He articulates all of the target sibilant fricatives laterally, rather than centrally. The picture on the right shows normal (blue dashed line) and abnormal (black solid line) trajectories for the utterance /ee s u/.

A. Hatzis, P.D. Green, S. Howard (8) Optical Logo-Therapy (OLT) MLP-Mapping vs Kohonens-Mapping Normal trajectories of the utterance, /ee s u/ for MLP mapping, pink line with crosses, and Kohonens mapping, blue line with circles. Normal trajectory, blue dashed line, vs abnormal trajectory, pink solid line, for the utterance /ee s u/ with MLP mapping.

A. Hatzis, P.D. Green, S. Howard (9) Optical Logo-Therapy (OLT) Kohonens-Mapping Trajectories Abnormal trajectory of /ee s u/ with normal rate of speech. Normal trajectory of /ee s u/ with normal rate of speech. Abnormal trajectory of /ee s u/ with slow rate of speech.

A. Hatzis, P.D. Green, S. Howard (10) Optical Logo-Therapy (OLT) MLP-Mapping Trajectories Abnormal trajectory of /ee s u/ with normal rate of speech. Normal trajectory of /ee s u/ with normal rate of speech. Abnormal trajectory of /ee s u/ with slow rate of speech.

A. Hatzis, P.D. Green, S. Howard (11) Optical Logo-Therapy (OLT) Time-Frequency Domain Comparison Abnormal /ee s u/ with normal rate of speech. Normal /ee s u/ with normal rate of speech. Abnormal /ee s u/ with slow rate of speech.

A. Hatzis, P.D. Green, S. Howard (12) Optical Logo-Therapy (OLT) OLT Distances Comparison Abnormal /ee s u/ with normal rate of speech. Normal /ee s u/ with normal rate of speech. Abnormal /ee s u/ with slow rate of speech.

A. Hatzis, P.D. Green, S. Howard (13) Optical Logo-Therapy (OLT) Fricative Quality Comparison Abnormal /ee s u/ with normal rate of speech. Normal /ee s u/ with normal rate of speech. Abnormal /ee s u/ with slow rate of speech.

A. Hatzis, P.D. Green, S. Howard (14) Optical Logo-Therapy (OLT) Children Sibilant Fricatives Experiment Speech Data Collection A special program was built to help with the recordings of the children. We recorded in total 18 normal children and each subject repeated a list of words (a sea, a zee, a sheep, a saw, a shore, a zorr, a zoo, a shoe, a suit) 5 times. In addition the subjects recorded isolated sounds. Map Comparison A seven years old girl with misarticulated sibilant fricatives, was compared with one of the normal female speakers. She produced target alveolar and post-alveolar fricatives with lateral rather than central friction.

A. Hatzis, P.D. Green, S. Howard (15) Optical Logo-Therapy (OLT) Normal Child vs Abnormal Child Abnormal speech trajectory of the utterance a zoo Normal speech trajectory of the utterance a zoo Normal (black) vs Abnormal (orange) trajectory of the isolated sound of phoneme /z/

A. Hatzis, P.D. Green, S. Howard (16) Optical Logo-Therapy (OLT) Conclusions We have demonstrated the creation of a real time visual feedback in the form of a trajectory in a 2D phonetic space The speech-impaired subjects show clear abnormalities in speech-impaired vowel-fricative-vowel trajectories which are consistent with their lateralising problem With OLT, the child or adult can be asked not only to produce sounds which are problematic for them, but also sounds which are easy for them. Thus it guarantees them some positive visual feedback, even in the initial stages of therapy. Clients were clearly motivated by OLT to experiment according to the clinicians instructions and were able to relate the results of different articulatory configurations to the visual feedback they received.

A. Hatzis, P.D. Green, S. Howard (17) Optical Logo-Therapy (OLT) Future Plans Implementation of various techniques using Neural Networks for dimensionality reduction and classification for the creation of better phonetic maps. Real time simple animated games based on the phonetic map. Development of a library of pre-prepared maps for common problems in speech therapy and for different subject groups.. Easy building of a map by selecting phone categories from an appropriate phone database. Build personalised phonetic maps based on the speech disorder and potential improvement of the subject.