Making & marking text for synthesis Caroline Henton 10 August 2006.

The IPA has it all
The International Phonetic Alphabet (IPA) was created by the International Phonetic Association in the 1880s to transcribe the sounds of all spoken languages. It is based (mostly) on Latin letters and uses a large number of diacritics.
Full Unicode support.

The IPA chart: consonants

The IPA chart: vowels

The IPA chart: diacritics

IPA in Unicode SIL IPA93 Doulos Font
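
As a small illustration of what Unicode support for the IPA means in practice, the sketch below looks up the code point and official Unicode name of a few IPA symbols. The choice of symbols and the helper name `describe` are assumptions made for this example; they are not from the slides.

```python
import unicodedata

# A few IPA symbols, written as escapes so the file stays ASCII-safe.
# The selection is illustrative only.
IPA_SYMBOLS = [
    "\u0283",  # esh, as in "ship"
    "\u014b",  # eng, as in "sing"
    "\u0259",  # schwa
    "\u02c8",  # primary stress mark
]

def describe(symbol: str) -> str:
    """Return 'U+XXXX NAME' for a single Unicode character."""
    return f"U+{ord(symbol):04X} {unicodedata.name(symbol)}"

for symbol in IPA_SYMBOLS:
    print(symbol, describe(symbol))
```

Any font that covers these code points (the slide mentions SIL's Doulos) can then render the transcription.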

IPA review and beyond
Consonants, clicks, vowels, diacritics, suprasegmental marks, tones, syllable and word boundaries, and much more.
BUT user customization is always needed...
– Customize pronunciation (with punctuation and phonetic spelling).
– Customize intonation and affect (using embedded commands).

Computer phonemes ≠ IPA phonemes (US English)
– Vowels: IY IX IH EH AE AA AH AO UH AX EY AY OY AW OW UW
– r-coloured vowels: AR ER IR OR UR
– Nasals, glides, syllabic Cs: m n NG l LX r w y EL EM EN RX
– Fricatives etc.: h f v TH DH s z SH ZH CH JH SIL
– Stops: p b t d k g DX KX PX TX QX DD P- T- K-
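
To make the mismatch concrete, here is a minimal sketch that converts a sequence of computer phoneme codes to IPA. The slide lists only the codes; the IPA values below are the conventional ARPAbet-style equivalents and are an assumption, as are the names `COMPUTER_TO_IPA` and `to_ipa`. Codes whose IPA value is less certain (LX, RX, DX, and so on) are deliberately left out.

```python
# Hypothetical, partial mapping from computer phoneme codes to IPA.
# The IPA values are conventional ARPAbet-style equivalents, not slide data.
COMPUTER_TO_IPA = {
    "IY": "i", "IH": "\u026a", "EH": "\u025b", "AE": "\u00e6",
    "AA": "\u0251", "AH": "\u028c", "AO": "\u0254", "UH": "\u028a",
    "UW": "u", "AX": "\u0259",
    "EY": "e\u026a", "AY": "a\u026a", "OY": "\u0254\u026a",
    "AW": "a\u028a", "OW": "o\u028a",
    "NG": "\u014b", "TH": "\u03b8", "DH": "\u00f0",
    "SH": "\u0283", "ZH": "\u0292", "CH": "t\u0283", "JH": "d\u0292",
    "y": "j", "r": "\u0279",
}

def to_ipa(codes):
    """Convert a list of computer phoneme codes to one IPA string.

    Single lowercase letters not in the table (m, p, s, ...) are assumed
    to coincide with their IPA symbol; anything else is rejected rather
    than guessed.
    """
    out = []
    for code in codes:
        if code in COMPUTER_TO_IPA:
            out.append(COMPUTER_TO_IPA[code])
        elif len(code) == 1 and code.islower():
            out.append(code)
        else:
            raise ValueError(f"no IPA mapping known for {code!r}")
    return "".join(out)

print(to_ipa(["SH", "IY", "p"]))   # "sheep"
print(to_ipa(["TH", "IH", "NG"]))  # "thing"
```

Rejecting unknown codes instead of passing them through keeps a typo in the phoneme string from silently producing a wrong transcription.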

The IPA Suprasegmentals

The IPA Tones and Word Boundaries

Fine-tuning in Mac OS X
Embedded commands
– [[emph + | -]]
– [[rate wpm]]
– [[volm n.n]]
– [[emph +; rate 230; volm 0.6]]
– Input modes: TEXT | PHON | TUNE
Further tuning: pbas; pmod; slnc; rset
Lexical stress; syllable breaks; normal and destressed words
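
The combined form shown on the slide can be assembled programmatically. The sketch below only builds the marked-up string; the helper name `embed` and its parameters are hypothetical, and handing the result to a synthesizer (for instance the macOS `say` command) is assumed rather than shown.

```python
def embed(text, emph=None, rate=None, volm=None):
    """Prefix text with one [[...]] embedded-command block.

    emph: "+" or "-"; rate: words per minute; volm: volume as n.n.
    Only the commands that are given are emitted, in the order used
    on the slide (emph, rate, volm).
    """
    parts = []
    if emph is not None:
        parts.append(f"emph {emph}")
    if rate is not None:
        parts.append(f"rate {rate}")
    if volm is not None:
        parts.append(f"volm {volm}")
    if not parts:
        return text
    return "[[" + "; ".join(parts) + "]] " + text

print(embed("Hello there.", emph="+", rate=230, volm=0.6))
# [[emph +; rate 230; volm 0.6]] Hello there.
```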

ToBI: Tones and Break Indices
ToBI = framework for developing conventions for transcribing the intonation and prosodic structure of spoken utterances in a language variety.
ToBI transcriptions have two important tiers:
1. a tone tier
2. a break-index tier
Note: ToBI is not an International Phonetic Alphabet for prosody. Because intonation and prosodic organization differ from language to language, and often from dialect to dialect within a language, there are many different ToBI systems, each one specific to a language variety.

ToBI Phrase Accents & Final Boundary Tones
L-, H-: For non-final intermediate phrases.
Intonational phrase boundaries, often at punctuation:
– L-L%: Low final endpoint, as at most periods.
– L-H%: Final rise from a low value, often at a comma.
– L-%: Continuation-like L-H% that's missing the rise.
– H-H%: Rise from a mid to high value, often used in questions.
– H-L%: High level, often used in lists.
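
The punctuation tendencies above suggest a crude first-pass labeller. The following is a sketch only: the table `DEFAULT_BOUNDARY` and the function `mark_boundaries` are hypothetical names, and the mapping encodes just the "often at" defaults from the slide (period, comma, question mark), which a real system would need to override in context.

```python
import re

# Default boundary tone per punctuation mark, following the slide's
# "often at" tendencies. These are defaults, not rules.
DEFAULT_BOUNDARY = {".": "L-L%", ",": "L-H%", "?": "H-H%"}

def _label(match):
    """Put the default boundary tone in front of the punctuation mark."""
    mark = match.group(1)
    return f" {DEFAULT_BOUNDARY[mark]}{mark}"

def mark_boundaries(text):
    """Insert a boundary-tone label before each '.', ',' or '?'."""
    return re.sub(r"\s*([.,?])", _label, text)

print(mark_boundaries("I need food, shelter, and comfort."))
# I need food L-H%, shelter L-H%, and comfort L-L%.
```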

ToBI intonation phrases
4 typical intonation phrases:
– L-L%: The default DECLARATIVE phrase.
– L-H%: The LIST ITEM intonation (nonfinal items only). E.g. "I need food L-H%, shelter L-H%, and comfort L-L%." "You said you would run home this afternoon L-H%, grab your golf clubs L-H%, jump in the car L-H% and race to the club L-L%."
– H-H%: YES-NO QUESTION. E.g. "Are you going L* today H-H%?" "So then are you going L* to the store this afternoon H-H%?" (where pitch rises right after the L* and stays high until the end).
– H-L%: The PLATEAU. A previous H* or complex accent 'upsteps' the final L% to an intermediate level. E.g. "I just TOLD you why" L+H* !H-L%
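
Going the other way, an annotated sentence can be split back into intonation phrases and each phrase named by its final tones. This is a minimal sketch under the assumption that every phrase ends in one of the four labels above; `PHRASE_TYPES` and `split_phrases` are hypothetical names, and downstepped labels such as !H-L% are not handled.

```python
import re

# The four typical intonation phrases from the slide.
PHRASE_TYPES = {
    "L-L%": "declarative",
    "L-H%": "list item",
    "H-H%": "yes-no question",
    "H-L%": "plateau",
}

# Words, then whitespace, then a label, then optional punctuation.
_PHRASE = re.compile(r"\s*(.*?)\s+([LH]-[LH]%)[.,?!]?")

def split_phrases(annotated):
    """Return (words, label, phrase type) for each labelled phrase."""
    return [
        (m.group(1), m.group(2), PHRASE_TYPES[m.group(2)])
        for m in _PHRASE.finditer(annotated)
    ]

for phrase in split_phrases("I need food L-H%, shelter L-H%, and comfort L-L%."):
    print(phrase)
```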

ToBI Pitch Accents
Associated only with accented (prominent) syllables:
– L+H*: Low immediately preceding a steep rise.
– H*: Local maximum or relatively high.
– L*, L*+H, H+L*, H+!H*: Less common pitch accents.
– <: A starred tone to the right of (after) the accent-bearing vowel.
– >: A starred tone to the left of (before) the accent-bearing vowel.
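
When checking a tone tier by machine, a first step is to sort each label into its class. The sketch below uses only the label inventory that appears in these slides (pitch accents here, phrase accents and boundary tones from the earlier slide); the set and function names are hypothetical, and downstepped variants beyond H+!H* are reported as unknown rather than guessed at.

```python
# Label inventory as it appears in the slides; not a complete ToBI grammar.
PITCH_ACCENTS = {"H*", "L*", "L+H*", "L*+H", "H+L*", "H+!H*"}
PHRASE_ACCENTS = {"L-", "H-"}
BOUNDARY_TONES = {"L-L%", "L-H%", "L-%", "H-H%", "H-L%"}

def classify(label):
    """Name the class of a single tone-tier label."""
    if label in PITCH_ACCENTS:
        return "pitch accent"
    if label in PHRASE_ACCENTS:
        return "phrase accent"
    if label in BOUNDARY_TONES:
        return "boundary tone"
    return "unknown"

def classify_tier(tier):
    """Classify every whitespace-separated label in a tone tier."""
    return [(label, classify(label)) for label in tier.split()]

print(classify_tier("L+H* L- H* L-L%"))
```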

ToBI Transcription example

Issues in Prosodic Transcription
1. Difference between L- and L-%.
2. When is a word accented?
3. Where is the accent's F0 peak when there is a steep segmental perturbation from a preceding voiceless obstruent?
4. Missing L- phrase accents.
5. The difference between H* and L+H*. Look for a linear interpolation to an H* peak from the preceding material.
6. Confusing low pitch due to glottalization with a low tone (L*).
7. Noun compounds without accents in the tail should not get accents in the tail.
8. Difference between L-% and L-H%.
9. Some phrase accents end in H-, not L-.
10. Is a sentence-initial function word ever accented?
11. Polysyllabic words can contain two accents, e.g. "Tennessee".

Continuing challenges for TTS
– Duration rules
– Homophones, homographs
– Diphone glue
– Constrained function words
– Noun compounds
– Optimal weighting
– Pause insertions
– Intonation choice improvements: e.g. H* instead of L+H* in verbs following the subject in a sentence
– Non-Wh-questions
– Exclamation intonation