1 CS 551/651: Structure of Spoken Language Spectrogram Reading: Stops John-Paul Hosom Fall 2010.

Slides:



Advertisements
Similar presentations
Tom Lentz (slides Ivana Brasileiro)
Advertisements

CS 551/651: Structure of Spoken Language Spectrogram Reading: Approximants John-Paul Hosom Fall 2010.
Sounds that “move” Diphthongs, glides and liquids.
Basic Spectrogram & Clinical Application: Consonants
Acoustic Characteristics of Consonants
Speech Perception Dynamics of Speech
Articulation and the Vocal Tract Chapter 3 Perry C. Hanavan, Au.D. Audiologist.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
1 CS 551/651: Structure of Spoken Language Lecture 4: Characteristics of Manner of Articulation John-Paul Hosom Fall 2008.
CS 551/651: Structure of Spoken Language Lecture 12: Tests of Human Speech Perception John-Paul Hosom Fall 2008.
Acoustic Characteristics of Vowels
The sound patterns of language
Nasal Stops.
Fry: acoustic cues for consonants o. Fry p 135 Remind yourself of the fortis-lenis distinction in the slide for week 6 – as far as English is concerned,
Recap: Vowels & Consonants V – central “sound” of the syllable C – outer “shell” of the syllable (C) V (C) (C)(C)(C)V(C)(C)(C)
Speech Science XII Speech Perception (acoustic cues) Version
Digital Systems: Hardware Organization and Design
Clinical Phonetics.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
Unit 4 Articulation I.The Stops II.The Fricatives III.The Affricates IV.The Nasals.
Spectrogram & its reading
Recognition of Voice Onset Time for Use in Detecting Pronunciation Variation ● Project Description ● What is Voice Onset Time (VOT)? – Physical Realization.
The following spectrogram is: A. burst B. spectrogram C. speech D. formant.
Recap: Vowels & Consonants V – central “sound” of the syllable C – outer “shell” of the syllable (C) V (C) (C)(C)(C)V(C)(C)(C)
Investigating physical properties of speech sounds
Nasal Stops. Nasals Distinct vocal tract configuration Pharyngeal cavity Oral cavity (closed) Nasal cavity (open)
Stop Place Contrasts before Liquids Edward Flemming MIT.
Fricatives + VOT April 6, 2010 For Starters… A note on perceptual verbiage. Also note: I gave you the wrong CP data!
An important point… When discussing source-filter theory, the sound source was the glottal spectrum When discussing stops (and fricatives and affricates),
Fricatives + Voice Onset Time March 31, 2014 In the Year 2000 Today: we’ll wrap up fricatives… and then move on to stops. This Friday, there will be.
Speech Sounds of American English and Some Iranian Languages
1 CS 551/651: Structure of Spoken Language Lecture 4: Characteristics of Manner of Articulation John-Paul Hosom Fall 2010.
English Pronunciation Practice A Practical Course for Students of English By Wang Guizhen Faculty of English Language & Culture Guangdong University of.
Structure of Spoken Language
Laterals + Nasals December 2, 2009 Administrative Stuff Friday: some notes on audition (hearing) Also: in-class USRIs Production Exercise #4 is due on.
Speech Perception. Phoneme - a basic unit of a speech sound that distinguishes one word from another Phonemes do not have meaning on their own but they.
1 4. Consonants  Consonants are produced ‘ by a closure in the vocal tract, or by a narrowing which is so marked that air cannot escape without producing.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Structure of Spoken Language
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Speech Science VII Acoustic Structure of Speech Sounds WS
ECE 598: The Speech Chain Lecture 7: Fourier Transform; Speech Sources and Filters.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Acoustic phonetics Jan. 27.
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Voice Quality + Stop Acoustics
Say “blink” For each segment (phoneme) write a script using terms of the basic articulators that will say “blink.” Consider breathing, voicing, and controlling.
1 CS 551/651: Structure of Spoken Language Lecture 6: Phonological Processes John-Paul Hosom Fall 2008.
Glides, Place and Perception March 18, 2010 News The hard drive on the computer has been fixed! A couple of new readings have been posted to the course.
Transitions + Perception March 27, 2012 Tidbits First: Guidelines for the final project report So far, I have two people who want to present their projects.
Stops Stops include / p, b, t, d, k, g/ (and glottal stop)
Sonorant Acoustics + Place Transitions
1 CS 551/651: Structure of Spoken Language Spectrogram Reading: Nasals John-Paul Hosom Fall 2010.
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Stop + Approximant Acoustics
Phonetics: consonants
Transitions + Perception March 25, 2010 Tidbits Mystery spectrogram #3 is now up and ready for review! Final project ideas.
1 Acoustic Phonetics 3/28/00. 2 Nasal Consonants Produced with nasal radiation of acoustic energy Sound energy is transmitted through the nasal cavity.
Acoustic Phonetics 3/14/00.
Stop/Plosives.
Stop Acoustics + Glides December 2, 2015 Down The Stretch They Come Today: Stop and Glide Acoustics Friday: Sonorant Acoustics + USRI evaluations We’ll.
Fricatives + Voice Onset Time November 25, 2015 In the Year 2000 Today: we’ll wrap up fricatives… and then move on to stops. This Friday, there will.
Articulation and the Vocal Tract Chapter 3
Structure of Spoken Language
Structure of Spoken Language
Structure of Spoken Language
Manner of Articulation
Structure of Spoken Language
Structure of Spoken Language
Speech Perception (acoustic cues)
Presentation transcript:

1 CS 551/651: Structure of Spoken Language Spectrogram Reading: Stops John-Paul Hosom Fall 2010

2 Stops/Plosives There are six plosives (oral stops) in American English:.bilabialalveolarvelar unvoiced | /p/ /t/ /k/ voiced | /b/ /d/ /g/ plus the flap /dx/ which is a very short /t/ or /d/ Plosives can be difficult to identify and discriminate; contextual cues can be varied Cue (1) is the formant transitions of neighboring vowels: for bilabials, F2 drops at CV boundary for alveolars, F2 goes toward 1800 Hz at CV boundary for velars, F2 may meet F3 (velar pinch) or be fairly flat Cue (2) is that voiced plosives may have pre-voicing; more likely when plosive is between two vowels

3 Stops/Plosives Cue (3) is that voiced plosives usually have VOT of < 30 msec, but unvoiced plosives usually have VOT of > 50 msec Cue (4) is that the VOT is shortest for bilabials, longer for alveolars, and longest for velars. (VOT /p/ < /t/ < /k/ and /b/ < /d/ < /g/) Cue (5) is that aspirated (unvoiced) plosives show evidence of F2 and F3 during aspiration; voiced plosives usually don’t Cue (6) is the spectral shape; in theory, the shape of the spectrum at burst release can be used to distinguish plosives: /p/ and /b/ have energy low in frequency or weakly spread throughout spectrum, /t/ and /d/ have more energy above 4KHz (related to alveolar fricatives /s/ and /z/), /k/ and /g/ tend to have more well-defined peaks in the spectrum (near formant locations).

4 Stops/Plosives Other cues related to spectral shape: Cue (7a): In the context of front vowels, /k/ and /g/ have spectral peak just above F2 of adjacent vowel, making them confusable with /t/ and /d/; but front vowels show more “velar pinch” Cue (7b): In the context of back vowels, /k/ and /g/ have one spectral peak between 1000 and 1500 Hz, a second peak between 3000 and 4500 Hz. Cue (8): Velar bursts also sometimes display “double burst”, or a second burst during the frication Cue (9): Post-vocalic consonants are often unreleased; they can be identified by (a) glottalization, (b) sudden drop in vowel energy, or (c) formant movement at end of vowel

5 Stops/Plosives Cue (10): When the plosive is unreleased, the voicing distinction is based more on length of preceding vowel; voiced plosives are associated with longer vowels, unvoiced plosives with shorter vowels Cue (11): In V 1 C 1 C 2 V 2 patterns, where both C are plosives, the existence of two plosives is in the different formant transitions in V 1 and V 2, the longer duration of closure, and sometimes in a brief “click” in spectrum indicating a change in place of articulation Cue (12): Plosives have different characteristics in stressed vs. unstressed environments. VOT for unvoiced plosives before unstressed vowels is shorter than VOT for unvoiced plosives before stressed vowels; plosives in an unstressed-vowel environment are less spectrally clear; in unstressed syllables, /t/ and /d/ may be realized as a flap /dx/.

6 Stops/Plosives Cue (13): Flaps have short duration (< 30 msec), dip in energy levels between two vowels, weak F2 and F3, and F2 tends toward 1800 Hz Cue (14): Consonant clusters can provide restrictions; for 3-consonant clusters (beginning with /s/-plosive), the only valid combinations are: /s p l/, /s p r/, /s p y/ /s t r/, /s t y/ /s k l/, /s k r/, /s k y/, /s k w/ Cue (15): In /s/−plosive−vowel combinations, VOT tends to be shorter and duration of /s/ shorter than normal

7 Plosives: Unvoiced Initial in Front-Vowel Context /p iy t iy k iy/

8 Plosives: Voiced Initial in Front-Vowel Context /b iy d iy g iy/

9 Plosives: Unvoiced Initial in Mid-Vowel Context /p ah t ah k ah/

10 Plosives: Voiced Initial in Mid-Vowel Context /b ah d ah g ah/

11 Plosives: Unvoiced Initial in Back-Vowel Context /p aa t aa k aa/

12 Plosives: Voiced Initial in Back-Vowel Context /b aa d aa g aa/

13 Plosives: Unvoiced Final in Front-Vowel Context /iy p iy t iy k/

14 Plosives: Voiced Final in Front-Vowel Context /iy b iy d iy g/

15 Plosives: Unvoiced Final in Mid-Vowel Context /ah p ah t ah k/

16 Plosives: Voiced Final in Mid-Vowel Context /ah b ah d ah g/

17 Plosives: Unvoiced Final in Back-Vowel Context /aa p aa t aa k/

18 Plosives: Voiced Final in Back-Vowel Context /aa b aa d aa g/