Sounds that “move” Diphthongs, glides and liquids.

Slides:



Advertisements
Similar presentations
Tom Lentz (slides Ivana Brasileiro)
Advertisements

Normal Aspects of Articulation. Definitions Phonetics Phonology Articulatory phonetics Acoustic phonetics Speech perception Phonemic transcription Phonetic.
Rhotic Vowels 5. The Special Case of Vocalic /r/ This is the vowel in words like “bird,” “learn,” “nerd,” “sir” Symbol: /Ô/ (schwar) or /ÎÕ/ MacKay prefers.
Acoustic and Physiological Phonetics
CS 551/651: Structure of Spoken Language Spectrogram Reading: Approximants John-Paul Hosom Fall 2010.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Basic Spectrogram & Clinical Application: Consonants
Acoustic Characteristics of Consonants
Plasticity, exemplars, and the perceptual equivalence of ‘defective’ and non-defective /r/ realisations Rachael-Anne Knight & Mark J. Jones.
Speech Perception Dynamics of Speech
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels – P oral slightly greater than P atmos Less than fricatives – P oral.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
1 CS 551/651: Structure of Spoken Language Lecture 4: Characteristics of Manner of Articulation John-Paul Hosom Fall 2008.
“Connecting the dots” How do articulatory processes “map” onto acoustic processes?
1 CS 551/651: Structure of Spoken Language Spectrogram Reading: Stops John-Paul Hosom Fall 2010.
Acoustic Characteristics of Vowels
Phonetics.
Hello, Everyone! Review questions  Give examples to show the following features that make human language different from animal communication system:
Nasal Stops.
Speech Science XII Speech Perception (acoustic cues) Version
“Speech and the Hearing-Impaired Child: Theory and Practice” Ch. 13 Vowels and Diphthongs –Vowels are formed when sound produced at the glottal source.
Digital Systems: Hardware Organization and Design
ACOUSTICAL THEORY OF SPEECH PRODUCTION
Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.
Speech sounds Articulation.
Unit 4 Articulation I.The Stops II.The Fricatives III.The Affricates IV.The Nasals.
Static Vowels ► Diphthongs ► Semivowels ► Stops Extending SF Theory to Accommodate Articulatory Movement.
Spectrogram & its reading
Chapter 6 Features PHONOLOGY (Lane 335).
SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?
Nasal Stops. Nasals Distinct vocal tract configuration Pharyngeal cavity Oral cavity (closed) Nasal cavity (open)
Consonants and vowel January Review where we’ve been We’ve listened to the sounds of “our” English, and assigned a set of symbols to them. We.
Vowel Transcription.
Conclusions  Constriction Type does influence AV speech perception when it is visibly distinct Constriction is more effective than Articulator in this.
Speech Sounds of American English and Some Iranian Languages
The sounds of language Phonetics Chapter 4.
1 CS 551/651: Structure of Spoken Language Lecture 4: Characteristics of Manner of Articulation John-Paul Hosom Fall 2010.
Phonetics HSSP Week 5.
Laterals + Nasals December 2, 2009 Administrative Stuff Friday: some notes on audition (hearing) Also: in-class USRIs Production Exercise #4 is due on.
CSD 5400 REHABILITATION PROCEDURES FOR THE HARD OF HEARING Auditory Perception of Speech and the Consequences of Hearing Loss.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Speech Perception1 Fricatives and Affricates We will be looking at acoustic cues in terms of … –Manner –Place –voicing.
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Speech Science VII Acoustic Structure of Speech Sounds WS
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Phonetics: Dimensions of Articulation October 13, 2010.
Sh s Children with CIs produce ‘s’ with a lower spectral peak than their peers with NH, but both groups of children produce ‘sh’ similarly [1]. This effect.
Transitions + Perception March 27, 2012 Tidbits First: Guidelines for the final project report So far, I have two people who want to present their projects.
Stops Stops include / p, b, t, d, k, g/ (and glottal stop)
IIT Bombay 14 th National Conference on Communications, 1-3 Feb. 2008, IIT Bombay, Mumbai, India 1/27 Intro.Intro.
Introduction to Language Phonetics 1. Explore the relationship between sound and spelling Become familiar with International Phonetic Alphabet (IPA )
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Speech Perception.
Stop + Approximant Acoustics
Phonetics: consonants
Liquid + Nasal Acoustics
IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.
1 Acoustic Phonetics 3/28/00. 2 Nasal Consonants Produced with nasal radiation of acoustic energy Sound energy is transmitted through the nasal cavity.
Acoustic Phonetics 3/14/00.
Rhotic Vowels 5. The Special Case of Vocalic R This is the vowel in words like “bird,” “learn,” “nerd,” “sir” Symbol: [ ɚ ] (schwar) or [ ɝ ] MacKay.
Stop/Plosives.
PHONETICS AND PHONOLOGY
Static Vowels ► Diphthongs ► Semivowels ► Stops
Structure of Spoken Language
Speech Perception.
Speech Perception (acoustic cues)
Motor theory.
/r/ Place: palatal Articulatory phonetics Acoustics
Presentation transcript:

Sounds that “move” Diphthongs, glides and liquids

The important role of movement Articulatory movement = spectral change Articulatory movement = spectral change Spectral change occurs as speakers transition within and between sound sequences Spectral change occurs as speakers transition within and between sound sequences Spectral change plays a significant role in Spectral change plays a significant role in Perception of certain speech sounds Perception of certain speech sounds Overall speech intelligibility Overall speech intelligibility

Diphthongs Slow gliding (~ 350 msec) between two vowel qualities Slow gliding (~ 350 msec) between two vowel qualitiesComponents Onglide- starting point of articulation Onglide- starting point of articulation Offglide- end point of articulation Offglide- end point of articulation Articulatory Transition = formant transition Articulatory Transition = formant transition Diphthongization: articulatory movement within the vowel Diphthongization: articulatory movement within the vowel Varies by geographic region Varies by geographic region

American English Diphthongs /  / - “bye” /  / - “bye” /  / - “bough” /  / - “bough” /  / - “boy” /  / - “boy” /  / - “bay” /  / - “bay” /  / - “bow” /  / - “bow”

/  /

/  /

What cues listeners? (Nábĕlek et al., 1993) Question: What spectral temporal variables contribute to distinction of /  / vs. /  /? What spectral temporal variables contribute to distinction of /  / vs. /  /? Does the quality of listening environment or hearing ability of the listener influence those factors? Does the quality of listening environment or hearing ability of the listener influence those factors? Control variables Rate (Hz/msec) and Duration (msec) of formant transition Rate (Hz/msec) and Duration (msec) of formant transitionResults /  / is perceived for /  / is perceived for ↓ transition rates of ↑ duration ↓ transition rates of ↑ duration ↑ transition rate of ↓ duration ↑ transition rate of ↓ duration

What cues listeners? (Nábĕlek et al., 1993) Results continued, Ambient noise had a larger effect on Ambient noise had a larger effect on ↑ transition rate, ↓ duration condition ↑ transition rate, ↓ duration condition Hearing Impairment differentially affected Hearing Impairment differentially affected ↑ transition rate, ↓ duration condition ↑ transition rate, ↓ duration conditionConclusion Fast, short transitions and slow long transitions can both serve to cue listeners to diphthongs Fast, short transitions and slow long transitions can both serve to cue listeners to diphthongs Slow, long transitions are more resistant to Slow, long transitions are more resistant to Poor listening environments Poor listening environments HI listeners HI listeners

From Tasko & Greilick (2010) diphthong duration vs. speech clarity rating

Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels Greater than vowels P oral slightly greater than P atmos P oral slightly greater than P atmos Less than fricatives Less than fricatives P oral for glides/liquids < P oral for fricatives P oral for glides/liquids < P oral for fricatives Constriction lasts ~ 100 msec Constriction lasts ~ 100 msec Constriction results in a loss in energy Constriction results in a loss in energy weaker formants weaker formants Transition rate faster than the diphthongs faster than the diphthongs slower than the stops slower than the stops lasts ~ msec lasts ~ msec Associated with 1. high degree of vocal tract constriction 2. articulatory transition

/w/ Place: labial Place: labial Acoustics Acoustics /u/-like formant frequencies /u/-like formant frequencies Constriction  formant values Constriction  formant values F1 ~ 330 Hz F1 ~ 330 Hz F2 ~ 730 Hz F2 ~ 730 Hz weak F3 weak F3 (~ 2300 Hz) V w V F1 F2 F Freq (Hz)

/j/ Place: palatal Place: palatal Acoustics Acoustics /i/-like formant frequencies /i/-like formant frequencies F1 ~ 300 Hz F1 ~ 300 Hz F2 ~ 2200 Hz F2 ~ 2200 Hz F3 ~ 3000 Hz F3 ~ 3000 Hz VjV F1 F2 F Freq (Hz)

/j/ VjV

Liquids (/l/, /r/) lateral /l/ lateral /l/ Rhotic /r/ Rhotic /r/ Pickett (1999) considers these consonants glides as well Pickett (1999) considers these consonants glides as well

/r/ Place: palatal Place: palatal Articulatory phonetics Articulatory phonetics Variable tongue positions Variable tongue positions “bunched” “bunched” “retroflexed” “retroflexed” Allophonic Variations Some suggest “dark” (CV) –very low F3 “dark” (CV) –very low F3 “light” (VC) –F3 not as low “light” (VC) –F3 not as low Acoustics Acoustics Hallmark of /r/ is a low F3 F1 ~ 350 Hz F2 ~ 1050 Hz F3 ~ 1550 Hz F3 ~ 1550 Hz Vowels have F3 above 2200 Hz colored Vowels around /r/ are colored or F3 values lower than usual

/r/ VrV F1 F2 F Freq (Hz)

Role of F3 transition in /w/ vs. /r/ perception

/r/ “coloring” of vowels /  / /  /

Articulatory Variability and /r/

Point parameterized representation Bunched

Retroflexed

Between-speaker variation “row” JW39 tp004 “row” JW45 tp004 Very common

Within-speaker variation: different context “row” JW37 tp009 “dorm” JW37 tp099 Common

Within-speaker variation: same context “right” JW37 tp009 “right” JW37 tp099 Not common, but possible!

N=53 normal speakers N=53 normal speakers Not just two different configurations, but a whole family of possible configuration Not just two different configurations, but a whole family of possible configuration From Westbury et al. (1998)

How can these vastly different tongue configurations lead to similar acoustic/perceptual consequences?

Summary There is a wide distribution of articulatory configurations for /r/ There is a wide distribution of articulatory configurations for /r/ Different articulatory configurations of /r/ are indistinguishable acoustically and perceptually Different articulatory configurations of /r/ are indistinguishable acoustically and perceptually Different tongue configurations can produce equivalent area functions Different tongue configurations can produce equivalent area functions Some parts of the area function are more critical than others for determining key acoustic/perceptual effects Some parts of the area function are more critical than others for determining key acoustic/perceptual effects

Clinical Digression Clinically, /r/ is a difficult sound for children to learn. Clinically, /r/ is a difficult sound for children to learn. Is there anything from our discussion that might suggest why this might be the case? Is there anything from our discussion that might suggest why this might be the case?

/l/ Place: alveolar Place: alveolar Articulatory phonetics: Articulatory phonetics: tongue tip contacts alveolar ridge, splitting the vocal tract tongue tip contacts alveolar ridge, splitting the vocal tract Introduces antiformants Introduces antiformants Acoustics Acoustics F1 ~ 360 Hz F2 ~ 1300 Hz F3 ~ 2700 Hz F2 is variable and affected by vowel environment Transition often looks more abrupt than other sounds discussed Allophonic variations Light /l/: CV environment Dark /l/: VC environment

/l/ VlV F1 F2 F Freq (Hz)

/l/ VlV