1 Teaching computers to teach people to read and speak updates: (Stanford Open Source Lab ’08) see also:

Presentation transcript:

1 Teaching computers to teach people to read and speak
updates: (Stanford Open Source Lab ’08)
see also: (online demo)
James Salsman

2 speech recognition for pronunciation evaluation
- can help most learners acquire language faster
- typically three to five times more useful per unit of time spent practicing than self-study with recordings
- details: Jack Mostow’s Project LISTEN at CMU
- commercial example: Rosetta Stone’s English study packs retail for ~$300, up from $30
- billions of people want to learn more language

3 Julius open source speech recognition
- from Cambridge Hidden Markov Model Toolkit
- free as in speech and beer
- running on XO
- C, flat files, a few sh scripts
- several megabyte memory footprint for triphones
- expect under 3 MB footprint for diphones (to do!)
- feasible on low-end cell phone equipment

4 microphone upload
- Adobe Flash 10 using the open Speex codec has been the best solution for two years now
- W3C rejected Device Upload as “device dependent” in 1999
- Mozilla and Google Chrome made promises several months ago, but nothing yet

5 phoneme alignment and pronunciation scoring
- acoustic scores: fit to models from 5000 speakers
- durations: cadence
- pitch: important for tonal languages, but not English except for punctuation-like information
- amplitude: less important for stress and punctuation, very important for weighting parts of speech when converting word to phrase scores
- can adapt to accent and dialect by comparing phoneme scores to a set of exemplar pronunciations to derive word and phrase scores
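The scoring idea on this slide (compare a learner's phoneme scores to a set of exemplar pronunciations, then derive word and phrase scores) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the method used in the presentation: the function names, the z-score comparison, the three-standard-deviation cutoff, and the per-word weights are all hypothetical, and the input numbers stand in for acoustic log-likelihoods that a recognizer would normally produce.

```python
from statistics import mean, stdev

def phoneme_score(learner, exemplars, floor_sd=3.0):
    """Score one phoneme in [0, 1] relative to exemplar acoustic scores.

    Assumption: at or above the exemplar mean counts as 1.0, and
    floor_sd standard deviations below the mean counts as 0.0.
    """
    mu = mean(exemplars)
    sigma = max(stdev(exemplars), 1e-6)  # avoid division by zero
    z = (learner - mu) / sigma
    return min(1.0, max(0.0, 1.0 + z / floor_sd))

def word_score(learner_phonemes, exemplar_phonemes):
    """Average the phoneme scores of one word (hypothetical aggregation)."""
    scores = [phoneme_score(l, e)
              for l, e in zip(learner_phonemes, exemplar_phonemes)]
    return mean(scores)

def phrase_score(word_scores, weights):
    """Weighted mean of word scores; weights might reflect part of speech."""
    return sum(s * w for s, w in zip(word_scores, weights)) / sum(weights)

# Made-up exemplar scores for a two-phoneme word, three exemplars each.
exemplars = [[-5.0, -4.0, -6.0], [-5.0, -4.0, -6.0]]
w1 = word_score([-5.0, -6.5], exemplars)   # one good, one weak phoneme
w2 = word_score([-8.0, -8.0], exemplars)   # both far below the exemplars
print(w1, w2, phrase_score([w1, w2], [3, 1]))
```

Using the exemplars' own spread as the yardstick is one way to get the accent and dialect adaptation the slide mentions: a phoneme that varies widely among acceptable exemplars is penalized less for the same deviation.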

6 agreement with human pronunciation judges
- 65-70% is really easy: about 5-10 recorded exemplars of each phrase from diverse speakers speaking with ordinary pronunciation
- 80% takes 20+ exemplar pronunciations
- 85%+ is impossible even for humans
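The slide does not say how agreement with human judges was measured. One simple reading, assumed here purely for illustration, is the percentage of utterances on which the automatic accept/reject decision matches the human judge's decision. The function name and sample data below are hypothetical.

```python
def percent_agreement(machine, human):
    """Percentage of items where the machine and human decisions match."""
    if len(machine) != len(human) or not machine:
        raise ValueError("need two non-empty lists of equal length")
    matches = sum(m == h for m, h in zip(machine, human))
    return 100.0 * matches / len(machine)

# Made-up accept (True) / reject (False) decisions for four utterances.
machine = [True, True, False, False]
human = [True, False, False, False]
print(percent_agreement(machine, human))  # 75.0
```

Raw percent agreement does not correct for chance; a chance-corrected statistic would be a different measure and is not implied by the slide.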

7 patent encumbrance
- “Speech Training Aid” by R. Series et al. (1991) at U.K. Defence Research Agency, sold to private QinetiQ, then 20/20 Speech, then Aurix, then NXT plc., maker of high-fidelity stereo equipment
- doesn’t cover reading tutoring, which in many cases is exactly the same task and algorithms, and completely indistinguishable in all other details
- can be licensed, but it has been very difficult
- patent holders more interested in suing abundant infringers than licensing

8 crowdsourced accuracy review systems
- voxforge.org and librivox.org collect exemplars
- vetting exemplar pronunciations can be done with
  - volunteers, including learners and anonymous
  - paid workers, including mostly poor and non-native speakers from e.g. Mechanical Turk or Craigslist
- Wikimedia Strategic Proposal (accuracy review)

9 Questions and Answers
Thank you!
these slides: