Representing Intonational Variation

Presentation transcript:

Representing Intonational Variation Julia Hirschberg CS 4706 11/24/2018

Today
How can we represent meaningful speech variation so that we can communicate it to others?
- Expanded vs. compressed pitch range?
- Louder vs. softer speech?
- Faster vs. slower speech?
- Differences in intonational prominence?
- Differences in intonational phrasing?
- Differences in pitch contours?

Schemes for Representing Intonational Variation
- An early proposal: Joshua Steele
- Language learning approaches:
  / IS it INteresting /
  / d’you feel ANGry? /
  / WHAT’S the PROBlem? /
  (McCarthy, 1991:106)
How can we capture all and only the meaningful intonational variation for a given language unambiguously?

Intonation Models
- No commonly agreed-upon model for one language, let alone all
- Researchers work in different traditions and focus on different aspects of intonation
- Different models may arise from different types of data:
  - Auditory
  - Acoustic
  - Perceptual
  - …

Intonation Models
- Auditory: ESL-orientated; empirical data scarce; even trained listeners do not always agree on what they hear
- Acoustic: need to distinguish linguistically relevant from irrelevant details in the acoustic signal
- Perceptual:
  - Experimental data, often with manipulated f0
  - Hard to design experiments with naïve listeners that give adequate control over the parameters used in making decisions

Intonation models
Basic division into linear and superpositional models:
- Linear models: intonation involves a succession of individual choices from an intonation lexicon
- Superpositional models: the intonation of an utterance involves a combination of local and utterance-sized components
Speakers may combine aspects of linear and superpositional models in the production of intonation

Intonation Models
- Linear or tone sequence models
  - British school (Kingdon ’58, O’Connor & Arnold ’73, Cruttenden ’97): based on auditory analysis
  - American school (Pierrehumbert ’80, ToBI): mainly acoustic analysis
  - Dutch school (’t Hart, Collier and Cohen 1990): perceptual data
- Superpositional models (Fujisaki 1983, Möbius et al. 1993): acoustic/physiological

Superpositional models
- Pitch pattern of intonation is modeled with two components: a phrase component and an accent component
- The phrase has a basic shape, and the pitch movements for individual accents are superimposed on that shape
[Figure: phrase component plus accent component = resulting contour, for "Apples, oranges and tomatoes"]
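
The superposition idea can be sketched numerically. The example below assumes the commonly cited formulation associated with Fujisaki's model, in which a baseline, phrase components and accent components are summed in the log-f0 domain. The function names, the parameter defaults (alpha, beta, gamma) and the timing and amplitude values are illustrative assumptions, not values taken from the slides.

```python
import math

def phrase_response(t, alpha=2.0):
    """Slowly rising and decaying phrase shape; zero before the phrase onset."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)

def accent_response(t, beta=20.0, gamma=0.9):
    """Fast step-like accent shape, capped at gamma; zero before onset."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)

def f0_contour(times, base_hz, phrases, accents):
    """Sum baseline, phrase and accent components in the log domain.

    phrases: list of (onset_s, amplitude)
    accents: list of (onset_s, offset_s, amplitude)
    Returns f0 in Hz for each time point.
    """
    contour = []
    for t in times:
        log_f0 = math.log(base_hz)
        for onset, amp in phrases:
            log_f0 += amp * phrase_response(t - onset)
        for onset, offset, amp in accents:
            log_f0 += amp * (accent_response(t - onset) - accent_response(t - offset))
        contour.append(math.exp(log_f0))
    return contour

# Hypothetical settings for a three-accent utterance
# such as "Apples, oranges and tomatoes".
times = [i * 0.05 for i in range(41)]          # 0.0 to 2.0 seconds
phrases = [(0.0, 0.5)]                         # one phrase command
accents = [(0.1, 0.4, 0.4), (0.7, 1.0, 0.4), (1.4, 1.7, 0.4)]

phrase_only = f0_contour(times, 100.0, phrases, [])
full = f0_contour(times, 100.0, phrases, accents)
```

Comparing `phrase_only` with `full` shows the accent humps riding on top of the phrase shape, which is the "plus / equals" picture on the slide.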

Good for modeling declination
- Declination: downtrend in f0 over the course of an utterance
- Best seen as a statistical abstraction: if one takes f0 measurements from enough utterances, a downtrend in f0 over time will emerge
Example: Lily and Rosa thought this was divine. Prince William was gorgeous and he was looking for a bride. They dreamed of wedding bells.
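
One way to read "statistical abstraction" is as a trend line fitted to f0 samples pooled across utterances. The sketch below assumes an ordinary least-squares fit; the f0 values are invented for illustration and are not measurements from any corpus.

```python
def least_squares_slope(points):
    """Slope of the ordinary least-squares line through (x, y) points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in points)
    variance = sum((x - mean_x) ** 2 for x, _ in points)
    return covariance / variance

# Invented (time in seconds, f0 in Hz) samples for three utterances.
# Each utterance contains local rises, so no single one falls steadily.
utterances = [
    [(0.0, 210), (0.5, 230), (1.0, 195), (1.5, 205), (2.0, 180)],
    [(0.0, 200), (0.5, 190), (1.0, 215), (1.5, 185), (2.0, 175)],
    [(0.0, 220), (0.5, 205), (1.0, 210), (1.5, 190), (2.0, 195)],
]

# Pool all samples and fit one line: the slope is the overall trend in Hz/s.
pooled = [point for utterance in utterances for point in utterance]
slope_hz_per_s = least_squares_slope(pooled)
```

With these made-up numbers the pooled slope is negative even though every utterance has at least one local rise, which is the sense in which declination emerges only over enough data.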

Superpositional models
Advantages
- Good at modeling declination in intonation languages
- Successful in speech synthesis for languages like Japanese (e.g. little variation in accent type)
- Capture prosodic structure in languages which have both tone and intonation (e.g. Mandarin)
Disadvantages
- All contours must be modeled with an accent and a phrase component
- Many SAE contours cannot be captured easily

Superpositional models: disadvantages (continued)
- Intonation contours cannot be modeled as sequences of prosodic events
- No account of different accent types, or variations in phrase endings
- No notation system which allows users to share observations from large speech corpora or to compare contours
- A method primarily for synthesis, analysis of speech production

Tone sequence models
General assumption: intonation is generated from sequences of (possibly) categorically different and phonologically distinctive accents
Two types of models within the group of tone sequence models:
- Type 1: Intonation made up of sequences of pitch movements
- Type 2: Intonation made up of sequences of pitch levels or targets

Two types of tone-sequence model
- Type 1: based on pitch movements (the British School, the Dutch School)
- Type 2: based on pitch levels, with H and L targets (the American School)
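
The difference between the two types is largely one of representation: the same contour can be written as a string of targets or as the movements between them. The sketch below uses a toy two-level inventory (H and L) to convert between the two descriptions. It is an illustration only, not the ToBI annotation system, and the function names are hypothetical.

```python
LEVEL_HEIGHT = {"L": 0, "H": 1}  # toy two-level inventory (assumption)

def levels_to_movements(levels):
    """Describe a sequence of pitch targets as the movements between them."""
    movements = []
    for previous, current in zip(levels, levels[1:]):
        change = LEVEL_HEIGHT[current] - LEVEL_HEIGHT[previous]
        if change > 0:
            movements.append("rise")
        elif change < 0:
            movements.append("fall")
        else:
            movements.append("level")
    return movements

def movements_to_levels(start, movements):
    """Rebuild the target sequence from a starting level and movements."""
    levels = [start]
    for movement in movements:
        here = levels[-1]
        if movement == "level":
            levels.append(here)
        elif movement == "rise" and here == "L":
            levels.append("H")
        elif movement == "fall" and here == "H":
            levels.append("L")
        else:
            # With only two levels, a rise from H or a fall from L
            # has nowhere to go.
            raise ValueError(f"cannot {movement} from {here} with only two levels")
    return levels

rise_fall_as_levels = ["L", "H", "L"]
rise_fall_as_movements = levels_to_movements(rise_fall_as_levels)
```

The round trip works for this toy case, but the `ValueError` branch shows where the two descriptions stop being interchangeable: a movement inventory can chain two rises, while a two-level target inventory cannot express that directly.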

Tone Sequence Models
- Overall shape of the intonation phrase is not a component of the models
- Model is a succession of independent accent and boundary tone choices from an intonation lexicon
- Do not model phrase-level phenomena (e.g. declination, pitch range, nuclear accent)

The British School
- Tone sequence model and pitch movement analysis (e.g. falling vs. rising intonation)
- Auditory model: teaching English as a second language
- O’Connor and Arnold 1973: earliest textbook for English instruction that tells the user which contour is appropriate in which context
- No empirical evidence
- British school analyses applied to English, German, Dutch, French, …

Concepts in the British School
- Basic unit of intonational description: intonation phrase (tone unit)
- Delimited by pauses, phrase-final lengthening, pitch movement
- Syllables within a tone unit can be stressed or accented (example word: telephone)
- Accented syllables are stressed and pitch prominent

Accent
- Stressed syllable: has a full vowel and is perceived as involving a rhythmic beat
- Pitch prominence:
  - syllable produced with moving pitch, or
  - syllable part of a pitch jump from a preceding syllable or onto a following syllable, or
  - syllable at a point in the utterance where the direction of pitch movement changes (e.g. from rising to falling)

Pitch Prominence
- Syllable produced with moving pitch
- Syllable part of a pitch jump from a preceding syllable or onto a following syllable
- Syllable at a point in the utterance where the direction of pitch movement changes
[Figure: pitch traces illustrating each case on the words "the girl" and "in the garden"]
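
The three cues can be checked mechanically once each syllable has a starting and ending f0 value. The sketch below applies the cues literally and then treats a syllable as accented only if it is both stressed and pitch prominent. The `Syllable` class, the 15 Hz thresholds and the f0 values are all assumptions made for illustration; the slides give no numeric criteria.

```python
from dataclasses import dataclass

@dataclass
class Syllable:
    text: str
    start_hz: float
    end_hz: float
    stressed: bool = False

def _midpoint(syllable):
    return (syllable.start_hz + syllable.end_hz) / 2.0

def _sign(value):
    return (value > 0) - (value < 0)

def pitch_prominence_cues(syllables, move_hz=15.0, jump_hz=15.0):
    """Return, for each syllable, the set of pitch prominence cues it shows."""
    all_cues = []
    for i, syl in enumerate(syllables):
        cues = set()
        prev = syllables[i - 1] if i > 0 else None
        nxt = syllables[i + 1] if i + 1 < len(syllables) else None
        # Cue 1: pitch moves within the syllable itself.
        if abs(syl.end_hz - syl.start_hz) >= move_hz:
            cues.add("moving pitch")
        # Cue 2: a jump from the preceding or onto the following syllable.
        if prev is not None and abs(syl.start_hz - prev.end_hz) >= jump_hz:
            cues.add("pitch jump")
        if nxt is not None and abs(nxt.start_hz - syl.end_hz) >= jump_hz:
            cues.add("pitch jump")
        # Cue 3: the direction of pitch movement reverses at this syllable.
        if prev is not None and nxt is not None:
            before = _sign(_midpoint(syl) - _midpoint(prev))
            after = _sign(_midpoint(nxt) - _midpoint(syl))
            if before * after < 0:
                cues.add("turning point")
        all_cues.append(cues)
    return all_cues

def accented_syllables(syllables, **thresholds):
    """Accented = stressed AND showing at least one pitch prominence cue."""
    cues = pitch_prominence_cues(syllables, **thresholds)
    return [s.text for s, c in zip(syllables, cues) if s.stressed and c]

# Invented f0 values (Hz) for "in the garden".
phrase = [
    Syllable("in", 115, 115),
    Syllable("the", 118, 118),
    Syllable("gar", 150, 150, stressed=True),
    Syllable("den", 125, 122),
]
cues = pitch_prominence_cues(phrase)
accents = accented_syllables(phrase)
```

Applied literally, the jump cue also fires on the unstressed neighbours of "gar", since they sit on the other side of the same jump. Requiring stress as well is what narrows the result to a single accented syllable.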

An example
and I think it’s HOrrible
...a POINT where you have to CLEAN it
There’s a point where you have to clean it and I think it’s horrible...

Intonation Phrase Structure
- Intonational phrases have an internal structure
- Structure determined by location of accents in an IP
- Each accent defines the beginning of a prosodic constituent

Intonation phrase structure
Two types of accent unit in the British School:
- Prenuclear accent units; also called the Head
- Nuclear accent units; also called the Nucleus
The nuclear accent unit is the last accent unit in the IP
The head comprises all prenuclear accent units

Intonation phrase structure
But JOHN’s never BEEN to Jamaica
[Figure labels: Prehead; ‘Head’ (prenuclear accent unit); ‘Nucleus’ (nuclear accent unit); stressed syllable]
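
The structural rules stated in this part of the deck (each accent opens a unit, the last accent unit is the nucleus, the head is everything prenuclear) are enough to segment a phrase once its accents are marked. The sketch below works on whole words rather than syllables, which is a simplification, and treats material before the first accent as the prehead. The function name and data layout are hypothetical.

```python
def parse_intonation_phrase(words):
    """Split (word, is_accented) pairs into prehead, head and nucleus.

    Returns a dict with:
      prehead: words before the first accent
      head:    list of prenuclear accent units (each a list of words)
      nucleus: the last accent unit (a list of words)
    """
    accent_positions = [i for i, (_, accented) in enumerate(words) if accented]
    if not accent_positions:
        # This sketch assumes every intonation phrase has a nuclear accent.
        raise ValueError("no accent found, so no nucleus can be identified")
    # Each accent opens a unit that runs up to the next accent or the end.
    boundaries = accent_positions + [len(words)]
    units = [
        [word for word, _ in words[start:end]]
        for start, end in zip(boundaries, boundaries[1:])
    ]
    return {
        "prehead": [word for word, _ in words[:accent_positions[0]]],
        "head": units[:-1],
        "nucleus": units[-1],
    }

# Accent marking follows the capitals in "But JOHN's never BEEN to Jamaica".
phrase = [
    ("But", False),
    ("JOHN's", True),
    ("never", False),
    ("BEEN", True),
    ("to", False),
    ("Jamaica", False),
]
structure = parse_intonation_phrase(phrase)
```

For this phrase the sketch yields a one-word prehead, a head with a single prenuclear accent unit, and a nucleus that runs from "BEEN" to the end of the phrase.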

Six nuclear choices in English
- falling
- rising
- rising-falling
- falling-rising
- rising-falling-rising
- level
[Figure: each nuclear tone drawn as a pitch trace on the word "Jamaica"]

Strengths and Weaknesses
- How are accents, prominence defined? How are they related to segments?
- Too many options…
- Are prenuclear accents qualitatively different from nuclear accents? What is the evidence?
- Does each pitch accent begin a new ‘prosodic unit’ in the phrase? What is the evidence?

Next Class
- The American School and Laboratory Phonology
- ToBI
- Read the ToBI conventions
- Listen to the ToBI training data or cardinal examples
- Bring your laptop and headphones to class