English Pronunciation Learning System for Japanese Students Based on Diagnosis of Critical Pronunciation Errors Yasushi Tsubota, Tatsuya Kawahara, Masatake.

Slides:



Advertisements
Similar presentations
Tuning Jenny Burr August Discussion Topics What is tuning? What is the process of tuning?
Advertisements

The Proficiency Orientation of Existing Methodologies
Anna-Katharina Elstermann UNESP/ Assis – Rub/ Bochum
EE3P BEng Final Year Project – 1 st meeting SLaTE – Speech and Language Technology in Education Martin Russell
Spoken Vs Written Language. Introduction Languages are first spoken, then written, and then an understanding.
Project MORE Mentoring in Ohio for Reading Excellence Images were found using Google image search Mentor Training.
Learning About Our Mentoring Program (Add your mentoring program’s name)
Academic Communication Lesson 2 Pick up two different handouts per person from the desk at the front of the room: –“Choose a result” homework –“Strategy.
Chapter 3: The Direct Method
Primary Stress and Intelligibility: Research to Motivate the Teaching of Suprasegmentals By Laura D. Hahn Afra MA Carolyn MA Josh MA
Topic: Learning and teaching activities
Agenda What is TOEFL PBT? Sections of the TOEFL PBT Test of Written English (TWE) Listening Comprehension Structure and Written Expression Reading Comprehension.
English Mastery A “How To” guide…
BBI 2420 ORAL INTERACTION SKILLS 1 ST FACE TO FACE SESSION 15 FEBRUARY 2015 SEM 2, 2014/2015.
12.0 Computer-Assisted Language Learning (CALL) References: 1.“An Overview of Spoken Language Technology for Education”, Speech Communications, 51, pp ,
CSD 2230 HUMAN COMMUNICATION DISORDERS
Learning About the M4RA Mentoring Program
Learning About the M4RA Mentoring Program
Language Assessment 4 Listening Comprehension Testing Language Assessment Lecture 4 Listening Comprehension Testing Instructor Tung-hsien He, Ph.D. 何東憲老師.
Word-subword based keyword spotting with implications in OOV detection Jan “Honza” Černocký, Igor Szöke, Mirko Hannemann, Stefan Kombrink Brno University.
Design of a Speech Recognition System to Assist Hearing Impaired Students Richard Kheir 2 and Thomas P. Way Department of Computing Sciences, Villanova.
FROM LANGUAGE NEEDS ANALYSES TO CLASSROOM INSTRUCTION Nana Toradze Language Training School National Defense Academy, Georgia
By Noriko and Luisa.  Language proficiency level: advanced beginners  Previous Computer Knowledge: basic word- processing skills  Class size: 12 students.
Speech Recognition ECE5526 Wilson Burgos. Outline Introduction Objective Existing Solutions Implementation Test and Result Conclusion.
Reading aloud as a literacy learning strategy John Munro
Improves Student Performance through Reading Proficiency My Reading Coach.
1 BILC SEMINAR 2009 Speech Recognition: Is It for Real? Tony Mirabito Defense Language Institute English Language Center (DLIELC) DLIELC.
ACADEMIC CONVERSATIONS
A Multimedia English Learning System Using HMMs to Improve Phonemic Awareness for English Learning Yen-Shou Lai, Hung-Hsu Tsai and Pao-Ta Yu Chun-Yu Chen.
LREC 2008, Marrakech, Morocco1 Automatic phone segmentation of expressive speech L. Charonnat, G. Vidal, O. Boëffard IRISA/Cordial, Université de Rennes.
Acoustic Properties of Taiwanese High School Students ’ Stress in English Intonation Advisor: Dr. Raung-Fu Chung Student: Hong-Yao Chen.
ESL Teacher Networking Meeting Session - 2 Raynel Shepard, Ed.D.
DynEd English for Success – For academics New Dynamic English – For everyday life CSC Learning, Chicago 1.
Developing an automated assessment tool for children’s oral reading Leen Cleuren March
4th grade Expository, biography Social Studies- Native Americans
ELIS-DSSP Sint-Pietersnieuwstraat 41 B-9000 Gent Recognition of foreign names spoken by native speakers Frederik Stouten & Jean-Pierre Martens Ghent University.
Learning About the M4RA Mentoring Program
 There must be a coherent set of links between techniques and principles.  The actions are the techniques and the thoughts are the principles.
1 Broadcast News Segmentation using Metadata and Speech-To-Text Information to Improve Speech Recognition Sebastien Coquoz, Swiss Federal Institute of.
How can we teach English? some advice. Grammar - Vocabulary First step: motivation –introduction: pupils guess the topic: rebus, riddle, crossword puzzle.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
The Direct Method 1. Background It became popular since the Grammar Translation Method was not very effective in preparing students to use the target.
Qualitative Reading Inventory
Introduction to Language Teaching The Grammar-Translation Method.
Lesson 4 Grammar - Chapter 13.
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
Outline  I. Introduction  II. Reading fluency components  III. Experimental study  1) Method and participants  2) Testing materials  IV. Interpretation.
TEACHING GRAMMAR Outline: You have to inductively introduce and illustrate the PRESENT TENSES. Plan the needed activities, the objectives, the level of.
Objectives of session By the end of today’s session you should be able to: Define and explain pragmatics and prosody Draw links between teaching strategies.
T H E D I R E C T M E T H O D DM. Background DM An outcome of a reaction against the Grammar- Translation Method. It was based on the assumption that.
Welcome to College of Foreign Languages, Hainan University.
GGGE6533 LANGUAGE LEARNING STRATEGY INSTRUCTION SUCCESSFUL ENGLISH LANGUAGE LEARNING INVENTORY (SELL-IN) FINDINGS & IMPLICATIONS PREPARED BY: ZULAIKHA.
Teaching Listening Why teach listening?
EXCUSE ME, BUT I’M AFRAID I CAN’T AGREE WITH YOU
Techniques and Principles in Language Teaching
Community language learning
LEARNER MISTAKES Гайнуллин Гусман Салихжанович,
The Audio-Lingual Method
Listening and speaking skills BY Ali, Signe and Emil
Why MPF?.
Paul Deane National Conference on Student Assessment, June 27-29, 2016
THE DIRECT METHOD.
THE AUDIO-LINGUAL METHOD
Academic Communication Lesson 2
National Curriculum Requirements of Language at Key Stage 2 only
Teaching prominence through kazoos
A study guide for students.
The Roles of Teachers and Learners in the Classroom
Chinese.
English Writing I Course Description and Syllabus
Presentation transcript:

English Pronunciation Learning System for Japanese Students Based on Diagnosis of Critical Pronunciation Errors Yasushi Tsubota, Tatsuya Kawahara, Masatake Dantsuji Kyoto University, Japan

HUGO (Pronunciation Learning System) Goal: Pinpointing the pronunciation errors which diminish intelligibility and providing effective feedback for improving a student’s pronunciation Pronunciation practice consists of 2 phases –Dialogue-based skit (for natural conversation) –Practice using individual phrases or words (for correcting specific errors)

Flow of Pronunciation Learning System Speech dialogue ( Role- play ) Pronunciation Error Diagnosis Training on Specific Errors Practice conversation with interesting topics –Original contents developed at Kyoto University –Foster ability to explain Japanese history/culture in English to foreign visitors Speech Recognition Program in background –Error detection optimized for English pronunciation by Japanese students –Error Profile for the student Intelligibility Estimation –Estimated from the error rates for the different type of errors Error Priority –Indicates the student’s performance for a given pronunciation –Expresses how far behind the students is on one pattern compared to students in the same level Training on Specific Errors –Practice of individual pronunciation skills –Error feedback providing both stress and segmental instruction

Introduction to the Beauties of Kyoto

Pronunciation Error Prediction 64 rules for pronunciation errors No equivalent syllable in L1 language –e.g. sea → she No equivalent phoneme in L1 language l vs r, v etc Vowel insertion b-r → b-uh-r “breath” Pronunciation Error Prediction Pronunciation Dictionary Rules for error beh r th l uh s S E Error ↑

2. Sentence Stress Error Detection First Stage ST/NS classification Put it on the desk CVsC CVx VsC CVs CVsC CVs ST NS ST NS ST NS ST NS ST NSST NS Second Stage PS/SS classification NS PS SS PS SS NS PS SS Added syllable By vowel insertion Pause Recognition Result SSNS PSNS H T H M M T Stress HMM Stress HMM Best weight For ST/NS Best weight For PS/SS Two-stage stress error detection

Pronunciation Errors V/B substitution (problem) Final vowel insertion (let) CCV-cluster insertion (active) VCC-cluster insertion (study) H/F substitution (fire) W/Y deletion (would) SH/CH substitution (choose) R/L substitution (road) ER/A substitution (paper) Non-reduction (student) Built from literature in ESL Errors not accurately detected were removed Compute error rates of each subject

Average Error Rates per Intelligibility Level SHERRLVRVBFICCVVCCHF WY

Practice in a university classroom Implementation –JAVA for Windows –HTK Classroom user –48 students –60 min. of pronunciation practice Machine –Windows2000 –Pentium4 1.5G –Memory 512M CALL room at Kyoto University

Introduction to Jidai Festival Introduction to Jidai Festival Introduction to Jidai Festival Introduction to Jidai Festival Grammar, Vocabulary BuildingPronunciation Learning 5/125/195/266/1 English II Syllabus Jidai Festival -Edo period- Jidai Festival -Edo period- Jidai Festival -Edo period- Jidai Festival -Edo period- Grammar, Vocabulary BuildingPronunciation Learning 6/86/156/226/29 Jidai Festival -Edo period- Pronunciation Learning 10/27 Jidai Festival -Edo period- 11/11 Pronunciation Learning 2 nd Semester 16-hours of speech data in total 2 nd session 1 st Semester 1 st session 1 st Semester

Questionnaire Good practice for pronunciation learning This practice is effective because Japanese students are not good at pronunciation. I hope to see further improvement in the performance of this system. I am for this kind of English learning. This practice is good for self-study. Positive comments Negative comments Sometimes the diagnosis results were not understandable. Not enough speech recognition accuracy. Sometimes it seems to the machine improperly recognized my utterance. This practice would be better if there were fewer recognition errors. Satisfied with the concept of the system But, too many errors in speech recognition Score< #Students Evaluation by the class

Examples of recorded speech Yes,that’s right. (noise addition) But, do you know what the festival of ages is like ? (noise addition) Ah, well, the festival of ages is a series of processions. (noise addition) Each representing a different period in Japanese history and its relation to Kyoto. (noise addition) The Edo period which dates from 1603 to 1867, ( Speech Error ) I’d like to stop now under Good Examples Bad Examples

Analysis of logged data Categorize the causes of misrecognition –To measure system performance –If automatically detected, a prompt for re-recording is possible. Analysis of logged data –Listen to the logged speech data –Verify the correctness of speech recognizer’s alignment with spectrogram (Wavesurfer)

Analysis of logged data (1929 utterances) Errors in automatic detection of the end of a recording session[6.0%,116] Addition of noise[13.1%,252] Hesitation[4.2%,81] Speech errors[1.8%,34] Misalignment by the speech recognition system[12.8%,246] Recognition errors[1.5%,29] Instructions on volume settings Provide explanation, prompt for re-recording Make uttereance longer e.g. make into a sentence SolutionCause Improper configuration of recording volume Directed microphone did not work well Unfamiliarity with English sentence Unit of utterance is too short(Phrase)

Analysis of Logged data #UtteranceError Rate (Recording) Error Rate (Recognition) 1 st trial52.1 (Avg.) 1929(Total) 20.4(Avg.) 755(Total) 1.24(Avg.) 46 ( Total ) 2 nd trial 111(Avg.) 3982(Total) 4.9 (Avg.) 176(Total) 0(Avg.) 0(Total)

Conclusions Practical Use of Autonomous English Pronunciation Learning System for Japanese Students –Contents designed to teach students how to explain Japanese tradition and culture –Phoneme, stress error detection, intelligibility estimation –Practical use in an English II class ay Kyoto University Practical use and analysis of logged data –Satisfied with the concept of the system –Analysis of improper operation Errors in automatic detection of the end of a recording session Addition of noise Hesitation Speech errors Misalignment by the speech recognition system Recognition errors