Pronouncing Words in TTS Systems

Pronouncing Words in TTS Systems Julia Hirschberg CS 4706 9/21/2018

Today
- Motivation
  - Improve TTS intelligibility and naturalness
  - An application: language learning
- Challenges for automatic word pronunciation
- Standard methods
  - Pronunciation by rule
  - Pronouncing dictionaries
- Innovative solutions
  - Pronunciation by language origin
  - Pronunciation by rhyming analogy
  - Expanding the lexicon using Active Learning techniques

Motivation
- Intelligibility
- Naturalness
- Applications to language learning
  - Unlimited vocabulary
  - Type a word or phrase and hear it spoken in your target language
    - To imitate
    - To learn to recognize

Converting Text to Phonemes
- Pronouncing numbers in different contexts
- Identifying proper names
- Expanding abbreviations and acronyms correctly

Numbers
- Pronouncing numbers in different contexts
  - In 1996 she sold 1995 shares and deposited $42 in her 401(k).
  - The number is 212-555-1210.
  - That cc # is Visa 4444-3607-5959, expiration 2/07.
- Conventions: years, phone numbers, money amounts
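The contrast between "1996" as a year and "1995" as a share count can be sketched in a few lines of Python. This is only an illustration: the function names `cardinal`, `year`, and `digits` are made up for this example, the range is limited to 0..9999, and deciding which reading applies (the hard part) is left to the caller.

```python
ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def cardinal(n):
    """Spell out 0..9999 as a cardinal number (hypothetical helper)."""
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[ones] if ones else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        return ONES[hundreds] + " hundred" + (" " + cardinal(rest) if rest else "")
    thousands, rest = divmod(n, 1000)
    return ONES[thousands] + " thousand" + (" " + cardinal(rest) if rest else "")


def year(n):
    """Read a four-digit year as two halves, e.g. 1996 as nineteen ninety-six."""
    if 2000 <= n <= 2009:
        return cardinal(n)            # "two thousand seven"
    first, second = divmod(n, 100)
    if second == 0:
        return cardinal(first) + " hundred"
    if second < 10:
        return cardinal(first) + " oh " + ONES[second]
    return cardinal(first) + " " + cardinal(second)


def digits(text):
    """Read a phone or card number digit by digit, skipping punctuation."""
    return " ".join(ONES[int(ch)] for ch in text if ch.isdigit())


print(year(1996))              # year reading
print(cardinal(1995))          # quantity reading
print(digits("212-555-1210"))  # phone-number reading
```

The same digit string gets three different spoken forms, which is why a TTS front end has to classify each number before it verbalizes it.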

Cultural Dependence
- Russia: Article 3 of the rules attached to the Moscow Telephone Network Subscribers Directory, 1916:
  “Numbers over a hundred are to be pronounced as follows: 1.23: one twenty three, 9.72: nine seventy two, 70.09: seventy zero nine. In numbers over 10,000 every figure of a hundred should be pronounced separately, for example, 1.20.48: one twenty forty eight, 2.08.35: two zero eight thirty five, 3.35.29: three thirty five twenty nine, 4.49.52: four forty nine fifty two, 5.15.86: five fifteen eighty six etc., not one hundred and twenty forty eight, two hundred and eight thirty five etc.”

In France
- A French phone number is 10 digits given in series of two: 01-43-48-12-85 is read "Zéro un, quarante-trois, quarante-huit, douze, quatre-vingt-cinq" (zero one, forty-three, forty-eight, twelve, eighty-five)
- Numbers in addresses are always pronounced as a full number: Chambre 823, 240 rue de Rivoli is read "Chambre huit-cent-vingt-trois. Deux-cent-quarante, rue de Rivoli" (room eight hundred twenty-three; two hundred forty, rue de Rivoli)
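A small sketch of locale-dependent digit grouping, assuming a hypothetical `GROUPINGS` table with just two entries (French pairs, and the 3-3-4 pattern of the US example 212-555-1210). Each chunk would then be passed to a number verbalizer for that language.

```python
# Hypothetical table: how many digits are read together, per locale.
GROUPINGS = {
    "fr": [2, 2, 2, 2, 2],   # 01-43-48-12-85
    "us": [3, 3, 4],         # 212-555-1210
}


def chunk_digits(number, locale):
    """Split a phone number into the digit groups that are spoken together."""
    digits = "".join(ch for ch in number if ch.isdigit())
    sizes = GROUPINGS[locale]
    if len(digits) != sum(sizes):
        raise ValueError("unexpected number of digits for locale " + locale)
    chunks, start = [], 0
    for size in sizes:
        chunks.append(digits[start:start + size])
        start += size
    return chunks


print(chunk_digits("01-43-48-12-85", "fr"))
print(chunk_digits("212-555-1210", "us"))
```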

Pronouncing Words
- Part-of-speech: use, close, dove, multiply, coax
- Origin: shoe (ME shoo), phoenix (Gr), mole, attaches, resume
- Morphological analysis: ferryboat, ferryboats, Popemobile
- Letter-to-sound correspondences: oo, th, qu, e (beet, bet, bite, weigh, …)

- Conventions for numbers and symbols: &c, evalu8, cu, tsp
- Genre: email, chat, recipe, want ad, software license, …

Word Sense Ambiguity and Pronunciation
- Homographs: bass/bass, Nice/nice, desert/desert
- Homograph disambiguation
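For homographs whose readings differ by part of speech (use, close, dove), disambiguation can be sketched as a table lookup keyed on word and tag. The table, the tag names, and the ARPAbet-like phone strings below are assumptions for illustration, and the tag is assumed to come from a separate tagger. Pairs like bass/bass are both nouns, so they need word-sense cues that this sketch does not cover.

```python
# Hypothetical homograph table keyed by (word, part-of-speech tag).
HOMOGRAPHS = {
    ("use", "NOUN"): "y uw s",
    ("use", "VERB"): "y uw z",
    ("close", "ADJ"): "k l ow s",
    ("close", "VERB"): "k l ow z",
    ("dove", "NOUN"): "d ah v",
    ("dove", "VERB"): "d ow v",
}


def pronounce_homograph(word, pos):
    """Return the reading for this part of speech, or None if not listed."""
    return HOMOGRAPHS.get((word.lower(), pos))


print(pronounce_homograph("use", "NOUN"))
print(pronounce_homograph("use", "VERB"))
```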

Letter-to-Sound Rules
- E.g.
  - I _{C}e$ -> /ai/ (rise)
  - Else I -> /ih/ (rip)
- Advantages
  - Pronounces anything, seen or not
- Disadvantages
  - Must be built by hand
  - How to encode all the exceptions? E.g. ripen/risen/riser/river
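The two-line rule for the letter I can be written as a regular expression. This is a minimal sketch of that one rule only, with a made-up function name, and it treats any non-vowel character as the consonant {C}.

```python
import re

# "i" followed by exactly one consonant and a word-final "e".
LONG_I = re.compile(r"i(?=[^aeiou]e$)")


def first_i_sound(word):
    """Return 'ai' or 'ih' for the first letter i in word, or None if absent."""
    word = word.lower()
    pos = word.find("i")
    if pos == -1:
        return None
    return "ai" if LONG_I.match(word, pos) else "ih"


for w in ["rise", "rip", "ripen", "risen", "riser", "river"]:
    print(w, first_i_sound(w))
```

Running it shows the problem with exceptions: the rule assigns /ih/ to all of ripen, risen, riser, and river, which is right for risen and river but wrong for ripen and riser.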

Exceptions dictionary
- Proper names: Nice, Ramirez, Ribeiro, Rise, Infiniti
- Solutions
  - More complex rules
  - Exceptions dictionary
    - Consulted first
    - But how to handle morphological analysis? E.g. Rise’s hat

Dictionary-based Approaches
- Rely on very large dictionary
- Disadvantages
  - Hand labor to create entries
  - Redundancy: cat, cats, cat’s, cats’
  - Out-of-vocabulary items
    - Proper names: covering all U.K. surnames would require >5,000,000 entries
    - New words: fax, email, mudd, …
    - Technical terms: liposuction, anova, bernaise
    - Foreign borrowings: frappe, ciao, louche

Solutions
- Morphological preprocessing before dictionary look-up
- Fall back to letter-to-sound (L2S) rules if there is no dictionary ‘hit’
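A rough sketch of that look-up order, assuming a three-word toy lexicon and handling only the -s family of suffixes (cats, cat's, cats', buses). The names `LEXICON`, `add_s`, `naive_l2s`, and `pronounce` are invented for this example, and the fallback is a placeholder, not a real rule set.

```python
# Toy lexicon with ARPAbet-like phones (illustrative entries only).
LEXICON = {
    "cat": ["k", "ae", "t"],
    "dog": ["d", "ao", "g"],
    "bus": ["b", "ah", "s"],
}
VOICELESS = {"p", "t", "k", "f", "th"}
SIBILANTS = {"s", "z", "sh", "zh", "ch", "jh"}


def add_s(stem_phones):
    """Pronounce the -s suffix according to the stem's final phone."""
    last = stem_phones[-1]
    if last in SIBILANTS:
        return stem_phones + ["ih", "z"]
    if last in VOICELESS:
        return stem_phones + ["s"]
    return stem_phones + ["z"]


def naive_l2s(word):
    """Placeholder fallback: one symbol per letter, standing in for L2S rules."""
    return list(word)


def pronounce(word):
    word = word.lower()
    if word in LEXICON:                       # 1. direct dictionary hit
        return LEXICON[word]
    for suffix in ("'s", "s'", "es", "s"):    # 2. strip a suffix, retry
        if word.endswith(suffix):
            stem = word[: -len(suffix)]
            if stem in LEXICON:
                return add_s(LEXICON[stem])
    return naive_l2s(word)                    # 3. no hit: fall back to rules


for w in ["cats", "cat's", "dogs", "buses", "fax"]:
    print(w, pronounce(w))
```

With this ordering the lexicon needs only the stem cat, which removes the cat/cats/cat's/cats' redundancy noted for dictionary-based approaches.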

More Innovative Approaches
- Pronouncing OOV words
- Handling proper names
  - Inferring country of origin: Takashita, Leroy, Kirov, Lima, Infiniti
- Pronunciation by analogy
  - Analog/dialog
  - Risible/visible
  - Proper names: Alifano/Califano
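Pronunciation by rhyming analogy can be sketched as "reuse the phones of the known word with the longest matching ending". The sketch below makes a strong simplifying assumption that each letter aligns with exactly one phone, and the two lexicon entries and their phone strings are illustrative, not taken from a real dictionary.

```python
# Toy lexicon: one phone per letter, so letters and phones align by position.
ALIGNED = {
    "califano": ["k", "ae", "l", "ih", "f", "aa", "n", "ow"],
    "dialog": ["d", "ay", "ax", "l", "ao", "g"],
}


def common_suffix_len(a, b):
    """Number of letters the two spellings share at the end."""
    n = 0
    while n < min(len(a), len(b)) and a[-1 - n] == b[-1 - n]:
        n += 1
    return n


def by_analogy(word):
    """Return (prefix still unpronounced, phones borrowed for the ending)."""
    word = word.lower()
    best_n, best_phones = 0, []
    for known, phones in ALIGNED.items():
        n = common_suffix_len(word, known)
        if n > best_n:
            best_n, best_phones = n, phones[-n:]
    return word[: len(word) - best_n], best_phones


print(by_analogy("Alifano"))
print(by_analogy("analog"))
```

Alifano is covered entirely by the ending of Califano. For analog only the ending -alog is borrowed from dialog, and the remaining prefix would still need rules or another analogy.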

Bootstrapping Phonetic Lexicons (Maskey et al. ’04)
- For some languages, online pronouncing lexicons exist, but for others (e.g. Nepali) they do not
- How to minimize effort in creating lexicons?
- Idea
  - Given a native speaker and a large amount of online text in the language…
  - Native speaker builds a small lexicon by hand for a seed set of the N most common words in the text, e.g.
    - is: /ihzh/
    - the: /dhax/

- Derive L2S rules from the lexicon automatically, e.g.
  - is -> ih{zh}
  - the -> {dh}ax
  - …
- Loop:
  - Choose the next N most common words from the text and use the lexicon + L2S rules to predict pronunciations, e.g.
    - telephone -> /telaxfown/
    - He -> /hax/?
    - Rise -> /rihzhax/?
  - Assign a confidence score to each prediction by comparing each word to all words in the lexicon
    - If is -> /ihzh/ is in the lexicon and no other orthographically similar words are pronounced differently, the new rule his -> /hihzh/ scores high
  - For low-confidence pronunciations, Active Learning step:
    - Inspect and calculate error rate
    - Hand-correct errors and add all to the lexicon

- Build a new set of L2S rules from the augmented lexicon
- Iterate from Loop until performance stabilizes
- Results
  - English: 94% success on test set after 23 iterations, 16K-entry lexicon
    - Performance comparable to CMUDict and 1/7 the size
  - German: 90% accuracy after 13 iterations, 28K lexicon
  - Nepali: 94.6% accuracy after 16 iterations, 5K lexicon
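The shape of the bootstrapping loop can be sketched with a deliberately tiny stand-in for each component. Nothing here reproduces the Maskey et al. system: the "rules" are a per-letter majority vote, the confidence score is just the fraction of letters already seen, the toy language has one phone per letter, and `ask_speaker` simulates the native speaker with a lookup table.

```python
from collections import Counter, defaultdict


def learn_letter_rules(lexicon):
    """Toy L2S rules: map each letter to the phone it most often aligns with."""
    counts = defaultdict(Counter)
    for word, phones in lexicon.items():
        for letter, phone in zip(word, phones):
            counts[letter][phone] += 1
    return {letter: c.most_common(1)[0][0] for letter, c in counts.items()}


def confidence(word, rules):
    """Fraction of the word's letters that the current rules cover."""
    return sum(letter in rules for letter in word) / len(word)


def bootstrap(words_by_frequency, ask_speaker, batch_size=2, threshold=1.0):
    """Seed by hand, then predict each batch; ask the speaker when unsure."""
    lexicon = {w: ask_speaker(w) for w in words_by_frequency[:batch_size]}
    questions = batch_size
    for start in range(batch_size, len(words_by_frequency), batch_size):
        rules = learn_letter_rules(lexicon)        # rebuild rules each round
        for word in words_by_frequency[start:start + batch_size]:
            if confidence(word, rules) >= threshold:
                lexicon[word] = [rules[letter] for letter in word]
            else:                                   # active learning step
                lexicon[word] = ask_speaker(word)
                questions += 1
    return lexicon, questions


# Made-up toy language used only to exercise the loop.
TRUTH = {"ma": ["m", "a"], "ti": ["t", "i"], "mi": ["m", "i"],
         "ta": ["t", "a"], "ko": ["k", "o"], "ka": ["k", "a"]}
lexicon, questions = bootstrap(list(TRUTH), TRUTH.__getitem__)
print(questions, "of", len(TRUTH), "words needed the speaker")
```

The point of the sketch is the control flow: hand labor is spent only on the seed set and on low-confidence predictions, and the rules are rebuilt from the augmented lexicon before each new batch.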

Improving Pronunciation Dictionary Coverage (Fackrell and Skut ’04)
- Idea: many proper names have more than one spelling (e.g. More, Moore; Smith, Smythe)
- Find a mapping between OOV (out-of-vocabulary) spellings and alternate spellings: a ‘fuzzy’ match
- Identify spelling alternations that are ‘pronunciation-neutral’ in an existing lexicon to produce rewrite rules for OOV words
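The 'fuzzy' match can be sketched as a small search over rewritten spellings. The three rewrite rules below are hand-written for the More/Moore and Smith/Smythe examples, whereas the slide describes deriving such rules from an existing lexicon. The lexicon entries, phone strings, and function names are assumptions for illustration.

```python
# Toy lexicon and hand-written, hypothetical pronunciation-neutral rewrites.
LEXICON = {"more": "m ao r", "smith": "s m ih th"}
REWRITES = [("oo", "o"), ("y", "i"), ("the", "th")]


def variants(spelling, rules):
    """Yield every spelling reachable by applying one rule at one position."""
    for old, new in rules:
        start = spelling.find(old)
        while start != -1:
            yield spelling[:start] + new + spelling[start + len(old):]
            start = spelling.find(old, start + 1)


def lookup_with_variation(word, lexicon, rules, max_steps=2):
    """Return (matched spelling, pronunciation), or None if nothing matches."""
    frontier = [word.lower()]
    seen = set(frontier)
    for _ in range(max_steps + 1):
        for spelling in frontier:
            if spelling in lexicon:
                return spelling, lexicon[spelling]
        next_frontier = []
        for spelling in frontier:
            for v in variants(spelling, rules):
                if v not in seen:
                    seen.add(v)
                    next_frontier.append(v)
        frontier = next_frontier
    return None


print(lookup_with_variation("Moore", LEXICON, REWRITES))
print(lookup_with_variation("Smythe", LEXICON, REWRITES))
```

Moore needs one rewrite and Smythe needs two, which is why the search allows a small number of steps rather than a single substitution.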

How do current systems do on pronunciation?
- Loquendo (temporarily unavailable)
- CEPSTRAL
- AT&T
- Naturally Speaking

Next Class
- Accenting and information status