Pitch Tracking + Prosody January 19, 2012 Homework! For Tuesday: introductory course project report Background information on your consultant and the.


Pitch Tracking + Prosody January 19, 2012

Homework!
For Tuesday: introductory course project report (background information on your consultant and the language they speak).
For Thursday: Digital Signal Processing exercises!

A Typology
F0 is generally used in three different ways in language:
1. Tone languages (Chinese, Navajo, Igbo): lexically determined tone on every syllable. “Syllable-based” tone languages.
2. Accentual languages (Japanese, Swedish): the location of an accent in a particular word is lexically marked. “Word-based” tone languages.
3. Stress languages (English, Russian): it’s complicated.

Mandarin Tone
ma1: mother
ma2: hemp
ma3: horse
ma4: to scold
Mandarin (Chinese) is a classic example of a tone language.

How to Transcribe Tone
Tones are defined by the pattern they make through a speaker’s frequency range.
The frequency range is usually assumed to encompass five levels (1-5), from lowest F0 (1) to highest F0 (5), although this can vary, depending on the language.

In Mandarin, tones span a frequency range of 1-5.
Each tone is denoted by its (numerical) path through the frequency range.
Each syllable can also be labeled with a tone number (e.g., ma1, ma2, ma3, ma4).

How to Transcribe Tone
Tone is relative, i.e., not absolute.
Each speaker has a unique frequency range.
For example: (figure comparing the highest and lowest F0 of a female and a male speaker; axis labels: 100 Hz, 200 Hz, 350 Hz, 150 Hz)

General Relativity
In ordinary conversation, for European languages (Fant, 1956):
Men have an average F0 of 120 Hz. A range of Hz
Women have an average F0 of 220 Hz. A range of Hz
Children have an average F0 of 330 Hz.
In a normal utterance, the F0 range is usually one octave, i.e., highest F0 = 2 * lowest F0.

Relativity, in Reality
The same tones may be denoted by completely different frequencies, depending on the speaker.
Tone is an abstract linguistic unit.
(Figure: ma, tone 1 (55), produced by a female speaker and a male speaker.)
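The idea that a tone is a path through a speaker-specific range can be sketched in a few lines of Python. This is only an illustration: the two speaker ranges are hypothetical one-octave ranges, the even spacing of the five levels on a logarithmic scale is an assumption, and the tone paths 35, 214 and 51 are the usual textbook values rather than anything given in the slides (only 55 for tone 1 appears above).

```python
# Tone paths for the four Mandarin tones on the 1-5 scale.
# Tone 1 = 55 comes from the slides; the other three are standard textbook values.
MANDARIN_TONES = {1: "55", 2: "35", 3: "214", 4: "51"}


def level_to_hz(level, lowest_hz, highest_hz):
    """Map a tone level (1-5) onto one speaker's F0 range.

    Assumption: the five levels are spaced evenly on a logarithmic scale.
    """
    if not 1 <= level <= 5:
        raise ValueError("tone level must be between 1 and 5")
    fraction = (level - 1) / 4
    return lowest_hz * (highest_hz / lowest_hz) ** fraction


def tone_targets(tone_number, lowest_hz, highest_hz):
    """Return the F0 targets (Hz) that a tone path implies for one speaker."""
    path = MANDARIN_TONES[tone_number]
    return [round(level_to_hz(int(digit), lowest_hz, highest_hz), 1) for digit in path]


# Two hypothetical speakers, each with a one-octave range.
print(tone_targets(1, 100, 200))  # same tone, lower range
print(tone_targets(1, 200, 400))  # same tone, higher range
```

The same label (tone 1, path 55) yields different frequencies for the two speakers, which is the sense in which tone is an abstract unit.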

Accent Languages
In accent languages, there is only one pitch accent associated with each word.
The pitch accent is realized on only one syllable in the word.
The other syllables in the word can have no accent.
Accent is lexically determined, so there can be minimal pairs.
Japanese is a pitch accent language… for some, but not all, words, and for some, but not all, dialects.

Japanese
Japanese words have one High accent, which attaches to one “mora” in the word.
A mora = a vowel, or a consonant following a vowel, within a syllable.
For example: [ni] ‘two’ has one mora; [san] ‘three’ has two morae.
The first mora, if not accented, has a Low F0.
Morae following the accent have Low F0.
It’s actually slightly more complicated than this; for more info, see:

Japanese Examples
asa ‘morning’ H-L
asa ‘hemp’ L-H

“chopsticks” H-L-L
“bridge” L-H-L
“edge” L-H-H
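The two rules stated for Japanese (an unaccented first mora is Low; morae after the accent are Low) are enough to reproduce the patterns in the examples. The sketch below is a simplification under stated assumptions: the function name and its arguments are hypothetical, the three-element patterns are treated as plain three-mora sequences, and `accent=0` is used here to stand for a sequence with no accented mora, which is how the L-H-H pattern is modeled.

```python
def pitch_pattern(n_morae, accent):
    """Return an H/L pattern for a sequence of morae.

    n_morae: number of morae in the sequence.
    accent:  1-based position of the accented mora, or 0 for no accent
             (an assumption used to model the L-H-H example).
    """
    if n_morae < 1 or not 0 <= accent <= n_morae:
        raise ValueError("accent must be 0 or a position within the sequence")
    pattern = []
    for position in range(1, n_morae + 1):
        if accent and position > accent:
            pattern.append("L")  # morae following the accent are Low
        elif position == 1 and accent != 1:
            pattern.append("L")  # the first mora, if not accented, is Low
        else:
            pattern.append("H")
    return "-".join(pattern)


print(pitch_pattern(2, 1))  # asa 'morning'
print(pitch_pattern(2, 2))  # asa 'hemp'
print(pitch_pattern(3, 1))  # "chopsticks"
print(pitch_pattern(3, 2))  # "bridge"
print(pitch_pattern(3, 0))  # "edge"
```

As the slide notes, the real system is more complicated than this; the sketch only covers the two rules given above.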

Stress Languages
Stress is a suprasegmental property that applies to whole syllables. It is defined by more than just differences in F0.
Stressed syllables are higher in pitch (usually).
Stressed syllables are longer (usually).
Stressed syllables are louder (usually).
Stressed syllables reflect more phonetic effort: more aspiration and less coarticulation in stressed syllables.
Vowels often reduce to schwa in unstressed syllables.
The combination of these factors gives stressed syllables more prominence than unstressed syllables.

Stress: Pitch (N) (V)
Complicating factor: pitch tends to drift downwards at the end of utterances.

Intonation
Languages superimpose pitch contours on top of word-based stress or tone distinctions. This is called intonation.
It turns out that English has word-based stress and phrase-based pitch accents (intonation).
The pitch accents are pragmatically specified, rather than lexically specified; that is, they change according to discourse context.

English Intonation
We’ll analyze English intonation with a framework called TOBI: Tones and Break Indices.
Note: intonational patterns vary across dialects. The patterns and examples presented today might not match up with your own intonational system.
Also: this framework has only been applied to a few (primarily western) languages.
Check out the following: Course in Phonetics, pp; Mary Beckman’s notes

Levels of Prominence
In English, pitch accents align with stressed syllables.
Example: “exploitation”

              ex   ploi   ta   tion
vowel         X    X      X    X
full vowel    X    X      X
stress        X           X
pitch accent              X

Normally, the accent falls on the last stressed syllable.

Pitch Accent Types
In English, pitch accents can be either high or low: H* or L*.
Examples:

High (H*)          Low (L*)
Yes.               Yes?
Magnification.     Magnification?

As with tones in tone languages, “high” and “low” pitch accents are defined relative to a speaker’s pitch range.
My pitch range: H* = 155 Hz, L* = 100 Hz
Mary Beckman: H* = 260 Hz, L* = 130 Hz

Whole Utterances
The same pitch pattern can apply to an entire sentence:
H* H*: Manny came with Anna.
L* L*: Manny came with Anna?
H* H*: Marianna made the marmalade.
L* L*: Marianna made the marmalade?

Information
Note that there’s a tendency to accent new information in the discourse.
4 different patterns for 4 different contexts:
H* H*: Manny came with Anna.
H* H*: Manny came with Anna.
L* L*: Manny came with Anna?
L* L*: Manny came with Anna?

Pitch Tracking
H* is usually associated with a peak in F0; L* is usually associated with a valley (trough) in F0.
Pitch tracking can help with the identification of pitch peaks and valleys.
Note: it’s easier to analyze utterances with lots of sonorants.
Check out both productions of “Manny came with Anna” in Praat.
Note that there is more to the intonation contour than just pitch peaks and valleys:
The H* is followed by a falling pitch pattern.
The L* is followed by a rising pitch pattern.
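Once a pitch track has been exported as a list of F0 values, candidate peaks and valleys can be located with a simple neighbor comparison. The sketch below is a hypothetical helper, not a Praat feature: it assumes one F0 value per analysis frame, uses `None` for unvoiced frames, compares each voiced frame with its nearest voiced neighbors, and ignores flat plateaus. The contour values are made up for illustration.

```python
def find_turning_points(f0):
    """Return (peaks, valleys) as lists of frame indices.

    f0: list of F0 values in Hz, with None for unvoiced frames.
    A frame is a peak if it is strictly higher than both of its voiced
    neighbors, and a valley if it is strictly lower than both.
    """
    voiced = [(index, value) for index, value in enumerate(f0) if value is not None]
    peaks, valleys = [], []
    for k in range(1, len(voiced) - 1):
        index, current = voiced[k]
        previous = voiced[k - 1][1]
        following = voiced[k + 1][1]
        if current > previous and current > following:
            peaks.append(index)    # candidate H* location
        elif current < previous and current < following:
            valleys.append(index)  # candidate L* location
    return peaks, valleys


# Invented contour: a rise to a peak, an unvoiced gap, a fall to a valley, then a rise.
contour = [120, 140, 160, 150, None, 130, 110, 100, 115, 130]
peaks, valleys = find_turning_points(contour)
print("peaks at frames:", peaks)
print("valleys at frames:", valleys)
```

Real pitch tracks are noisier than this, so some smoothing is usually needed before turning points line up with perceived accents.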

Tone Types
There are two types of tones at play:
1. Pitch Accents: associated with a stressed syllable; may be either High (H) or Low (L); marked with a *
2. Boundary Tones: appear at the end of a phrase; not associated with a particular syllable; may be either High (H) or Low (L); marked with a %

Tone Transcription L* H%

Phrases
Intonation organizes utterances into phrases (“chunks”).
Boundary tones mark the end of intonational phrases.
Intonational phrases are the largest phrases.
In the transcription of intonation, phrase boundaries are marked with Break Indices. Hence, TOBI: Tones and Break Indices.
Break Indices are denoted by numbers:
1 = break between words
4 = break between intonational phrases

Break Index Transcription
Tones: L* H%
Breaks:

Question Formation
Note that not all questions end in L* H%. What’s the intonational difference between these two?
Did you see Bob? L* H%
Where did you go? H* L%
The upsloping intonation only applies to yes/no questions.
Also note: “Uptalk” = application of the L* H% pattern to declarative sentences.

0 Level Boundaries
0 level boundaries are marked wherever there is clear coarticulation across a word boundary.
Also for flaps across word boundaries, as in “got it”.

More Tones
Note that there can be more than one pitch accent within an intonational phrase. Examples:
Anna gave Manny a mango. L* H* L%
Anna gave Manny a mango. H* H* H* L%
The last accent in a phrase is somehow more prominent than the others. This accent is called the nuclear accent.

Downstepping
Successive H* accents tend to drift downward in F0 within an intonational phrase (= downdrift, or downstepping).
This provides further evidence for phrasal organization.
Downstepping essentially reduces the pitch range.
Downstepped H* accents are denoted with a !H*
Anna gave Manny a mango. H* !H* !H* L%
There’s a lovely, yellowish, old one. H* !H* !H* L%

Downstepping Pitch Track
H* = 271 Hz, !H* = 238 Hz, !H* = 200 Hz, L%
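The size of each downstep in this pitch track can be expressed in semitones, which makes the steps comparable across speakers with different pitch ranges. The following is a small worked sketch using the three peak values above; the function name is hypothetical, and the formula is the standard conversion of a frequency ratio to semitones, 12 * log2(f_to / f_from).

```python
import math


def semitones(f_from, f_to):
    """Interval from f_from to f_to in semitones (negative means a fall)."""
    return 12 * math.log2(f_to / f_from)


# F0 peaks of the H*, !H*, !H* accents from the pitch track above.
peaks_hz = [271, 238, 200]

steps = [semitones(a, b) for a, b in zip(peaks_hz, peaks_hz[1:])]
for (a, b), step in zip(zip(peaks_hz, peaks_hz[1:]), steps):
    print(f"{a} Hz -> {b} Hz: {step:.2f} semitones")

print(f"total fall: {semitones(peaks_hz[0], peaks_hz[-1]):.2f} semitones")
```

Each step comes out at roughly two to three semitones, so the three accents together fall by a little over five semitones within the phrase.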

Intermediate Phrases
A downstepping pattern can be reset by the presence of an intermediate phrase boundary. Example:
It’s lovely, and yellowish, and it’s an old one. H* !H* L- H* L-L%
Intermediate phrase boundaries are marked with a break index of 3.
At the end of each intermediate phrase is a phrase accent, either Low (L-) or High (H-).

Intermediate Phrase Transcription H* !H* L- H* L-L%

One Phrase vs. Two Phrases
No intermediate phrase boundary:
“I” means insert.
H* H* L-L%
An intermediate phrase boundary, with a L- phrase accent:
“I” means insert.
H* L- H* L-L%
Break indices: 3 1 4

One Phrase vs. Two Phrases
No intermediate phrase boundary:
Marianna made the marmalade.
L* L* H-H%
An intermediate phrase boundary, with a H- phrase accent:
Marianna made the marmalade.
L* H- L* H-H%

Filling the Gap Another feature of phrase accents is that they fill in the gap between the nuclear accent and the boundary of the intermediate phrase. L* + H L- H%

Combinations
Different combinations of phrase accents and boundary tones have different connotations.
1. L-L%: Declarative sentences
2. H-H%: Yes/No questions (usually)
3. L-H%: Continuations
4. H-L%: A “plateau” pattern
Upstep: boundary tones after H- are higher than normal.

Upstepping H-H% H-L% “My name is Marianna.”

A Chunking Review
utterance
intonational phrase (intonational phrase) ...
intermediate phrase (intermediate phrase) ...
(pitch accent) nuclear accent
(stressed syllable) stressed syllable

Break Indices
4 marks boundaries between intonational phrases: associated with a boundary tone (H% or L%); sense of complete disjuncture.
3 marks boundaries between intermediate phrases: associated with a phrase accent (H- or L-); lesser sense of disjuncture.
1 marks boundaries between words.
0 marks non-boundaries between words.
(2 marks uncertainties or apparent mismatches; rarely used.)
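The break indices define a nested grouping: a 4 closes an intonational phrase, and a 3 (or a 4) closes an intermediate phrase. The sketch below shows one way to recover that nesting from a word list and the break index after each word. The function and its input format are hypothetical, and treating indices 0, 1 and 2 alike as phrase-internal is a simplifying assumption.

```python
def chunk(words, breaks):
    """Group words into intonational phrases, each a list of intermediate phrases.

    words:  list of words in the utterance.
    breaks: break index following each word (same length as words).
    Indices below 3 are treated as phrase-internal (an assumption).
    """
    if len(words) != len(breaks):
        raise ValueError("need exactly one break index per word")
    utterance, intonational, intermediate = [], [], []
    for word, index in zip(words, breaks):
        intermediate.append(word)
        if index >= 3:              # 3 or 4 closes an intermediate phrase
            intonational.append(intermediate)
            intermediate = []
        if index == 4:              # 4 also closes the intonational phrase
            utterance.append(intonational)
            intonational = []
    # Keep any trailing words that were not closed by a 3 or 4.
    if intermediate:
        intonational.append(intermediate)
    if intonational:
        utterance.append(intonational)
    return utterance


words = ["I", "means", "insert"]
print(chunk(words, [3, 1, 4]))  # breaks from the two-phrase example
print(chunk(words, [1, 1, 4]))  # assumed breaks for a one-phrase reading
```

With breaks 3 1 4 the result is one intonational phrase containing two intermediate phrases; with 1 1 4 all three words fall into a single intermediate phrase.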

Bitonal Pitch Accents
In addition to H* and L*, there are two bitonal pitch accents: L + H* and L* + H.
The starred element denotes the tone which is associated with the stressed syllable.
L + H* = high peak on stressed syllable, preceded by a sharp rise in pitch
L* + H = low pitch target on stressed syllable, followed by a sharp rise in pitch

H* vs. L + H* Marianna won it. H* L + H*

L* vs. L* + H
Only a millionaire. H* L* + H L-H%
Marianna made the marmalade. L* H-H%

L + H* vs. L* + H There’s a lovely one in Bloomingdale’s. L* + H L + H*

More Downstepping
Bitonal pitch accents can also undergo downstepping.
L + H* L + !H* L + !H* L-L%

Pitch-Accents Round-up
There are four pitch accents: H*, L*, L + H*, L* + H.
They attach to stressed syllables.
The final pitch accent in an intonational phrase is the nuclear accent. It is generally perceived as more prominent.
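The label inventory used in these slides is small enough to check mechanically. The sketch below classifies a label by its diacritic: * for pitch accents, - for phrase accents, % for boundary tones, and ! for downstep. It is a hypothetical helper covering only the labels that appear above, not a full implementation of the TOBI conventions.

```python
PITCH_ACCENTS = {"H*", "L*", "L+H*", "L*+H"}
DOWNSTEPPED_ACCENTS = {"!H*", "L+!H*"}
PHRASE_ACCENTS = {"H-", "L-"}
BOUNDARY_TONES = {"H%", "L%"}


def classify(label):
    """Return a list of (tone, type) pairs for one transcription label.

    Spaces are ignored, so "L + H*" and "L+H*" are treated alike.
    A combined label such as "L-L%" is split into its two parts.
    """
    tone = label.replace(" ", "")
    if len(tone) == 4 and tone[:2] in PHRASE_ACCENTS and tone[2:] in BOUNDARY_TONES:
        return [(tone[:2], "phrase accent"), (tone[2:], "boundary tone")]
    if tone in PITCH_ACCENTS:
        return [(tone, "pitch accent")]
    if tone in DOWNSTEPPED_ACCENTS:
        return [(tone, "downstepped pitch accent")]
    if tone in PHRASE_ACCENTS:
        return [(tone, "phrase accent")]
    if tone in BOUNDARY_TONES:
        return [(tone, "boundary tone")]
    raise ValueError(f"unrecognized label: {label!r}")


for label in ["H*", "L + H*", "L* + H", "!H*", "L-", "L-L%", "H-H%"]:
    print(label, "->", classify(label))
```

A check like this is mostly useful for catching typos in a hand-made transcription, for example a stray `H*%`.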

Practice Time! Marianna made the marmalade.

Practice Time That’s a cat. (H* vs. L*) Noodle Eileen? Stalin. Five versions of Amelia.