Research & Development ICASSP'2006 - Analysis of Model Adaptation on Non-Native Speech for Multiple Accent Speech Recognition D. Jouvet & K. Bartkova France.

Slides:



Advertisements
Similar presentations
Talking about origins.
Advertisements

Countries and Nationalities.
WHERE ARE YOU FROM? Where do you come from?
NOVA Comprehensive Perspectives on Child Speech Development and Disorders Chapter 10 Acquiring French Andrea MacLeod 1.
Acoustic Model Adaptation Based On Pronunciation Variability Analysis For Non-Native Speech Recognition Yoo Rhee Oh, Jae Sam Yoon, and Hong Kook Kim Dept.
Frederico Rodrigues and Isabel Trancoso INESC/IST, 2000 Robust Recognition of Digits and Natural Numbers.
History of the English Language
The student will describe the cultural characteristics of Europe.
: Recognition Speech Segmentation Speech activity detection Vowel detection Duration parameters extraction Intonation parameters extraction German Italian.
J. Kunzmann, K. Choukri, E. Janke, A. Kießling, K. Knill, L. Lamel, T. Schultz, and S. Yamamoto Automatic Speech Recognition and Understanding ASRU, December.
Development of Automatic Speech Recognition and Synthesis Technologies to Support Chinese Learners of English: The CUHK Experience Helen Meng, Wai-Kit.
by the way we 1.Introduction: Cultural Norms and Values - Stereotyping American culture Chinese culture.
Construction of phoneme-to-phoneme converters
Non-native Speech Languages have different pronunciation spaces
Research on teaching and learning pronunciation
GABRIELLA RUIZ LING 620 OHIO UNIVERSITY Cross-language perceptual assimilation of French and German front rounded vowels by novice American listeners and.
English Phonetics arifsuryopriyatmojo.com. Questions to consider? what is a language? how many languages are there? why do people need a language? how.
Current Languages of Europe
The French Language: identity, diversity and changing environments. Lecture 2 Michaelmas term 2012 Personal and group identity: the use of French by immigrants.
1 Survey Data in ECA : Frequency, Coverage, Consistency and Access By Victor Sulla ECS-PE.
Phonology, phonotactics, and suprasegmentals
Why do we study English? Form 9, unit 6.
May 20, 2006SRIV2006, Toulouse, France1 Acoustic Modeling of Accented English Speech for Large-Vocabulary Speech Recognition ATR Spoken Language Communication.
The Sounds of Language. Phonology, Phonetics & Phonemics… Phonology, Phonetics & Phonemics… Producing and writing speech sounds... Producing and writing.
As a conclusion, our system can perform good performance on a read speech corpus, but we will have to develop more accurate tools in order to model the.
(R14) (R14) European Culture: Language(s). Today’s Standard describecultural characteristicsEurope SS6G11 The student will describe the cultural characteristics.
LREC 2008, Marrakech, Morocco1 Automatic phone segmentation of expressive speech L. Charonnat, G. Vidal, O. Boëffard IRISA/Cordial, Université de Rennes.
Countries and nationalities (Europe) Markéta Zakouřilová VY_32_INOVACE_104 ZŠ Jenišovice.
2A. She’s British. Countries & nationalities Point out what countries through their flags & which nationalities?
Improving Speech Modelling Viktoria Maier Supervised by Prof. Hynek Hermansky.
The vowel detection algorithm provides an estimation of the actual number of vowel present in the waveform. It thus provides an estimate of SR(u) : François.
Algoritmi e Programmazione Avanzata
Automatic Speech Recognition: Conditional Random Fields for ASR Jeremy Morris Eric Fosler-Lussier Ray Slyh 9/19/2008.
CMU Robust Vocabulary-Independent Speech Recognition System Hsiao-Wuen Hon and Kai-Fu Lee ICASSP 1991 Presenter: Fang-Hui CHU.
Unit 2: September 8, 2015 TURNER MS: THE BEST MIDDLE SCHOOL IN GA.
4.2.6The effects of an additional eight years of English learning experience * An additional eight years of English learning experience are not effective.
ELIS-DSSP Sint-Pietersnieuwstraat 41 B-9000 Gent Recognition of foreign names spoken by native speakers Frederik Stouten & Jean-Pierre Martens Ghent University.
Political Map of Europe Western European Countries.
Christmas in France By: Lily Cassida. Map of France France’s bordering countries are Switzerland, Italy, Belgium, Spain, and Germany.
English Phonetics 许德华 许德华. Objectives of the Course This course is intended to help the students to improve their English pronunciation, including such.
SURVEY RESULTS. NUMBERS OF QUESTIONNAIRES COMPLETED  ITALY 186  AUSTRIA 132  POLAND 163  ENGLAND 151  SPAIN 139  TURKEY 182  ROMANIA 52.
Tom Ko and Brian Mak The Hong Kong University of Science and Technology.
ACOUSTIC-PHONETIC UNIT SIMILARITIES FOR CONTEXT DEPENDENT ACOUSTIC MODEL PORTABILITY Viet Bac Le*, Laurent Besacier*, Tanja Schultz** * CLIPS-IMAG Laboratory,
Capitalization DAY 1 COMPLETE SLIDES 1-8
Learning about languages. You could Find out how to say ‘hello’ in lots of languages.
COUNTRIES AND NATIONALITIES (COUNTRY – NATIONALITY)
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
Immigration. CountryPeak Year Germany1882 Ireland1851 Britain1888 Italy1907 Russia1913 Austria-Hungary1907 Denmark1882 Finland1882 Norway1882 Sweden1882.
Cross-Dialectal Data Transferring for Gaussian Mixture Model Training in Arabic Speech Recognition Po-Sen Huang Mark Hasegawa-Johnson University of Illinois.
Disney World in Orlando Florida. Planning a Visit from the UK to Disney world Florida.
A Study on Speaker Adaptation of Continuous Density HMM Parameters By Chin-Hui Lee, Chih-Heng Lin, and Biing-Hwang Juang Presented by: 陳亮宇 1990 ICASSP/IEEE.
Europe SS6G11a. Explain the diversity of languages as seen in a comparison of German, English, Russian, French, and Italian.
Language Project Human Geography.
History of the English Language
History of the English Language
Communication Across Cultures
Standard and Element SS6G11 The student will describe the cultural characteristics of Europe. a. Explain the diversity of European languages as seen in.
COUNTRIES; NATIONALITIES; LANGUAGE
Countries and nationalities
Unit 1: Vocabulary Section (pp 9-10)
Let‘s travel! Guess the country..
Rohit Kumar *, Amit Kataria, Sanjeev Sofat
COUNTRIES NATIONALITIES LANGUAGES.
Let‘s travel! Guess the country..
Copy all of the information that you see on each slide.
NATIONALITIES LANGUAGES
Countries and nationalities
Groundwater bodies - overview
SS6G11 A Diversity of European Languages
Language Transfer of Audio Word2Vec:
Presentation transcript:

Research & Development ICASSP' Analysis of Model Adaptation on Non-Native Speech for Multiple Accent Speech Recognition D. Jouvet & K. Bartkova France Télécom - R&D

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Overview  Multiple foreign accent speech corpus  Baseline native speech modeling and results  Modeling non-native speech variants Phonological rules Units trained on foreign data Selection of variants  Adaptation on non-native speech On all types of foreign accents Only on subsets of foreign accents  Conclusion

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Multiple Foreign Accent Speech Corpus  83 French words and expressions collected over telephone Cluster Language groupTest set Originating countries (24 in total) French 94 speakers France, Belgium, Switzerland, … EsEnDe Spanish35 speakers Spain English96 speakers USA, UK, Ireland, … German113 speakers Germany, Austria Other Italian56 speakers Italy Portuguese17 speakers Portugal African50 speakers Senegal, Congo, Mali, … Arabic53 speakers Algeria, Tunisia, Marocco Turkish53 speakers Turkey Cambodian48 speakers Cambodia Asian69 speakersChina, Vietnam

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Baseline Modeling and Results Using Native Speech Models  Modeling : MFCC, HMM, Gaussian mixtures, Context-dependent models  Baseline M1.A1: native French acoustic units only (model M1) trained on large French data speech corpus (acoustic parameters A1)  Large dispersion of recognition performances across speaker language groups (error rates: 6% for German speakers … 12% for English & Spanish speakers)

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Modeling Non-Native Speech Variants Variants Derived through Phonological Rules  Vowels apertures  open / close allowed:e ⇨ (e + ɛ )  Possible denasalization of nasal sounds: ɛ ̃ ⇨ ( ɛ ̃ + ɛ N), where N = n, m or ŋ  Difficulty to pronounce front rounded vowel /y/ ( ⇨ /u/) & semi-vowel /Y/ ( ⇨ /w/)  Application of rules  Model M2  Significant improvement for many language groups (not all), but overall better

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Modeling Non-Native Speech Variants Adding Units Trained on Foreign Data Foreign standard units  Standard training e.g. German units trained from German words uttered by German speakers: φ_de_DE  For each French units, corresponding foreign units are added for recognition French units adapted on foreign data  Mapping between French and foreign units for training, for example Paris_uk  p_uk. a_uk. r_uk. i_uk. s_uk  p_fr. a_fr. r_fr. i_fr. s_fr  Hence, here, French units adapted on English speech material: φ_fr_UK e_sp_SP e_fr_FR e_uk_UK e_de_DE e_fr_SP e_fr_FR e_fr_UK e_fr_DE  Model M3  Model M4

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Modeling Non-Native Speech Variants Adding Units Trained on Foreign Data  Adding "standard foreign units" vs "French units adapted on foreign data"  Better results are obtained when adding French units adapted on foreign data Improvement on non-native speech Even for languages that do not correspond to added units

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Modeling Non-Native Speech Variants Adding a Selection of Foreign Adapted Units  Instead of keeping all variants (units) added for each phoneme, only the most frequently ones are kept (model M5) (statistics using force alignments on adaptation set)  Degradation performances (due to added units) on French speakers smaller  Improvement on language groups associated to added units smaller  Better results on other language groups

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Adaptation on Non-native Speech  Adaptation set: about same size as test set Exhibits similar non-native accents (same countries) Generic models M1.A1 & M2.A1 French native units without / with phonological rules Generic model M3.A1 French native units & standard foreign units Generic models M4.A1 & M5.A1 French native units & French units adapted on foreign data Accent adapted models M1.A5 & M2.A5 Accent adapted model M3.A5 Accent adapted models M4.A5 & M5.A5 Non-native speech adaptation corpus French words pronunced by foreign speakers, …

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Adaptation on Non-native Speech Adaptation using all Types of Accents  Behavior of various modeling variants after all accents adaptation is similar to the behavior obtained with generic models

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Adaptation on Non-native Speech Impact of Types of Accents (1)  Experiments using the best model (model M5)  Reference results with generic parameters (model M5.A1)  Adaptation using data from French speakers only ( model M5.A2 ) corresponds task and context adaptation  Adaptation using data from limited set of accents: Spanish, English and German speakers only (model M5.A3)  Adaptation using data from other types of accents: Italian, Portuguese, … and Asian speakers only (model M5.A4)  And results after adaptation using all types of accents (model M5.A5)

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Adaptation on Non-native Speech Impact of Types of Accents (2)  Adaptation on French speakers only ( M5.A2 ) improves on almost all accented data  Best results obtained with adaptation on all types of accents ( M5.A5 )

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Adaptation on Non-native Speech Impact of Types of Accents (3)  After adaptation on only a few types of accents: Es, En, De ( i.e. model M5.A3 ) Large improvement achieved on all accented data including on accents that are not present in adaptation set

Research & Development Multilingual Units for Modeling Pronunciation Variants – ICASSP' Conclusion  Non-native speech recognition takes benefit of variants Application of phonological rules and introduction of units trained on foreign data Selection of variants is beneficial  Adaptation on non-native speech provides important improvement for each type of modeling, and variants are still useful  Adaptation on speech data representing a limited set of foreign accents is also beneficial for other types of accents