Laryngeal correlates of the English tense/lax vowel contrast

Slides:



Advertisements
Similar presentations
Non-normative preaspiration of voiceless fricatives in Scottish English a comparison with Swedish preaspiration Olga Gordeeva and James M.Scobbie, Queen.
Advertisements

Tom Lentz (slides Ivana Brasileiro)
Sounds that “move” Diphthongs, glides and liquids.
Basic Spectrogram & Clinical Application: Consonants
Acoustic Characteristics of Consonants
Linguistic Voice Quality Patricia Keating University of California, Los Angeles Christina Esposito Macalester College, St. Paul.
Acoustic Characteristics of Vowels
Spectral Analysis Feburary 24, 2009 Sorting Things Out 1.TOBI transcription homework rehash. And some structural reminders. 2.On Thursday: back in the.
Phonetic variability of the Greek rhotic sound Mary Baltazani University of Ioannina, Greece  Rhotics exhibit considerable phonetic variety cross-linguistically.
Voice Quality October 14, 2014 Practicalities Course Project report #2 is due! Also: I have new guidelines to hand out. The mid-term is on Tuesday after.
Speech Science XII Speech Perception (acoustic cues) Version
Fundamental Frequency & Jitter Lab 2. Fundamental Frequency Pitch is the perceptual correlate of F 0 Perception is not equivalent to measurement: –Pitch=
Spoken Language Analysis Dept. of General & Comparative Linguistics Christian-Albrechts-Universität zu Kiel Oliver Niebuhr 1 At the Segment-Prosody.
Prosodics, Part 1 LIN Prosodics, or Suprasegmentals Remember, from our first discussions in class, that speech is really a continuous flow of initiation,
Niebuhr, D‘Imperio, Gili Fivela, Cangemi 1 Are there “Shapers” and “Aligners” ? Individual differences in signalling pitch accent category.
PHONETICS AND PHONOLOGY
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
Anatomy of the vocal mechanism
ACOUSTICAL THEORY OF SPEECH PRODUCTION
Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.
Introduction to Acoustics Words contain sequences of sounds Each sound (phone) is produced by sending signals from the brain to the vocal articulators.
Eva Björkner Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing HUT, Helsinki, Finland KTH – Royal Institute of Technology.
Complete Discrete Time Model Complete model covers periodic, noise and impulsive inputs. For periodic input 1) R(z): Radiation impedance. It has been shown.
Emotions and Voice Quality: Experiments with Sinusoidal Modeling Authors: Carlo Drioli, Graziano Tisato, Piero Cosi, Fabio Tesser Institute of Cognitive.
Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University.
Anatomic Aspects Larynx: Sytem of muscles, cartileges and ligaments.
1 Lab Preparation Initial focus on Speaker Verification –Tools –Expertise –Good example “Biometric technologies are automated methods of verifying or recognising.
Learning Objectives Describe how speakers control frequency and amplitude of vocal fold vibration Describe psychophysical attributes of pitch, loudness.
Voice Transformations Challenges: Signal processing techniques have advanced faster than our understanding of the physics Examples: – Rate of articulation.
Pitch Prediction for Glottal Spectrum Estimation with Applications in Speaker Recognition Nengheng Zheng Supervised under Professor P.C. Ching Nov. 26,
Source/Filter Theory and Vowels February 4, 2010.
Hoarse meeting in Liverpool April 22, 2005 Subglottal pressure and NAQ variation in Classically Trained Baritone Singers Eva Björkner*†, Johan Sundberg†,
Instrumental Assessment SPPA 6400 Voice Disorders: Tasko.
Voice Quality Feburary 11, 2013 Practicalities Course project reports to hand in! And the next set of guidelines to hand out… Also: the mid-term is on.
Perceived prominence and nuclear accent shape Rachael-Anne Knight LAGB 5 th September 2003.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
VOT trumps other measures in predicting Korean children’s early mastery of tense stops Eun Jong Kong Mary E. Beckman Jan Edwards LSA2010 January 7 th.
Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Acoustic phonetics Jan. 27.
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Male Cheerleaders and their Voices. Background Information: What Vocal Folds Look Like.
Voice Quality + Spectral Analysis Feburary 15, 2011.
Acoustic Cues to Laryngeal Contrasts in Hindi Susan Jackson and Stephen Winters University of Calgary Acoustics Week in Canada October 14,
Evaluating prosody prediction in synthesis with respect to Modern Greek prenuclear accents Elisabeth Chorianopoulou MSc in Speech and Language Processing.
Voice Onset Time + Voice Quality
Stops Stops include / p, b, t, d, k, g/ (and glottal stop)
Voice Quality + Korean Stops October 16, 2014 Don’t Forget! The mid-term is on Tuesday! So I have a review sheet for you. For the mid-term, we will just.
Tone, Accent and Quantity October 19, 2015 Thanks to Chilin Shih for making some of these lecture materials available.
Phonation + Voice Quality Feburary 11, 2014 Weekday Update Course project report #2 is due right now! I have guidelines for course project report #3,
0 / 27 John-Paul Hosom 1 Alexander Kain Brian O. Bush Towards the Recovery of Targets from Coarticulated Speech for Automatic Speech Recognition Center.
Nuclear Accent Shape and the Perception of Syllable Pitch Rachael-Anne Knight LAGB 16 April 2003.
SPPA 6010 Advanced Speech Science
Witsuwit’en final glottalization and voice quality Sharon Hargus University of Washington SSILA, Oakland CA, January 8, 2005.
IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.
Voice Quality January 19, 2010 Vocal Tract Anatomy Our vocal tracts are shaped in a way that makes it easier to speak… But more dangerous to eat!
Voice Quality Feburary 13, 2014 Practicalities The mid-term is on the Thursday after the break! So I have a review sheet for you. For the mid-term, we.
Behrman Chapter 5, 6 Place less emphasis on… Minor anatomical landmarks and features Extrinsic muscles of the larynx Blood supply to the larynx Central.
1 Introduction1 Introduction 2 Spectral subtraction 3 QBNE 4 Results 5 Conclusion, & future work2 Spectral subtraction 3 QBNE4 Results5 Conclusion, & future.
PHONETICS AND PHONOLOGY
The Human Voice. 1. The vocal organs
Structure of Spoken Language
Olga Gordeeva James Scobbie
The Human Voice. 1. The vocal organs
Voice source characterisation
Graduate School of Language Technology
†Department of Speech Music Hearing, KTH, Stockholm, Sweden
Norm-Based Coding of Voice Identity in Human Auditory Cortex
Speech Perception (acoustic cues)
Presentation transcript:

Laryngeal correlates of the English tense/lax vowel contrast Olga Gordeeva 13 December, 2005 CNTS University of Antwerp

Question and Background Systemic differences involving tense/lax vowel contrast are part of my PhD: on the acquisition of vowel quality, duration, vocal effort by Russian/SSE bilingual pre-schoolers. One of the conclusions: tense/lax vowel contrast is more difficult to acquire than the realisational differences such as between /¬ u/ (Panasyuk et al., 1995; Markus & Bond, 1999; Escudero, 2000; Guion, 2003; Piske et al., 2002) Does the English tense/lax vowel contrast involve only differences vowel quality and length, or do they vary along other phonetic dimensions? This presentation: based on the results from my English-speaking adult controls, a partly on my postdoctoral work involving EGG.

Introduction (2) [i] [I] The CHEAP/CHIP vowel contrast is often described in terms more extreme values for the tense vowels due to higher muscular tension and more extreme movements of the articulators, longer duration (discussion in Trask, 1996). Stevens (1998, p.297) hypothesised laryngeal involvement on top of segmental differences: a more breathy phonation for tense vowels, and a less breathy one for the lax vowels resulting from the greater vocal tract constriction for the high vowels. Acoustic consequence of this laryngeal adjustment  lax vowels have enhanced intensity in midfrequencies compared to the tense ones Jessen (2002) provided supportive evidence for Steven’s statement for German tense/lax contrast on both acoustic and laryngeal EGG level.

Adopted from Gauffin & Sundberg (1989) study Introduction (3) Glottal pulse Radiated Spectrum Transglottal airflow (arbitrary units) Time (arbitrary units) Sound Pressure Level Frequency Adopted from Gauffin & Sundberg (1989) study

This presentation Review the acoustic evidence of spectral tilt differences between SSE/SBE tense/lax close(-mid) vowels /i I/ and /u U/ Try to link the acoustic results for SBE to the more direct EGG analysis of the glottal pulse Introduce the EGG method used Present preliminary EGG results based on the MOCHA corpus

Acoustic evidence Gordeeva (2005) (1) 5 SSE adults and 4 SBE adults (all female) Materials 12 CVC words with the /i I/ and /u U/ Embedded in utterances in four prosodic positions E.g. “It’s a sheep” “A sheep is a sheep, and nothing but a sheep.” Spectral tilt was measured as an RMS-power (dB) difference of F1 and F2 in fixed bands (Sluijter, 1995; Sluijter et al. 1996) Data was normalised for differences in formant structure following Hanson (1997) and Jessen (2002) Statistical analysis: mixed design ANOVA with fixed factors “Language” (SSE or SSBE), “Vowel” (tense or lax), and normalised SPL level in F2 as a dependent variable

Acoustic evidence Gordeeva (2005) (2) If /i/ has a more spectral tilt than /I/ in both SSE and SBE

Gordeeva (2005) SBE /u U/ and SSE short and long /¬/ (3)

Summary acoustic results Our acoustic data for SSE/SBE supports Jessen’s (2002) acoustic results for German, and Steven’s (1998) suggestion. In SSE/SBE, more spectral tilt is associated with the tense high vowels than with the lax ones. There is more acoustic energy loss in the vocal tract for the tense vowels. Gradual difference? speakers spend more laryngeal effort to produce the lax vowels (adopting a less breathy phonation for the lax vowels) Categorical voice quality difference? The steepness of the spectral tilt may also serve as an indicator of the presence of creaky voice (Laver, 1980; Klatt & Klatt, 1990)

Current EGG study Confirm the acoustic results for SSE / SBE based on more direct EGG evidence SBE pilot based on the MOCHA corpus 2 SBE speakers: 1 female and 1 male EGG data, phonetic labelling, and acoustic waveform Tokens of /i/ versus /I/ and /u/ versus /U/ with vowel duration > 60 ms All cycles within the vowel boundaries (except for the first and the last one)

EGG signal model Marasek (1997)

EGG parametrisation Derived using Marasek’s (1997) model of EGG signal implemented as a Praat script the EGG signal and phonetic segmentation of duration as input Outputs relevant cycle-by-cycle parameters: Closing quotient: duration of the closing phase relative to T Speed Quotient: skewness of the glottal pulse (duration of the closing phase relative to the opening phase) Closing Slope: skewness / steepness of the closing phase Pitch: indication of vowel-intrinsic pitch and subglottal pressure Peak-to-peak amplitude: indication of subglottal pressure Plus long-term parameters Jitter (F0 fluctutation) and Shimmer (amplitude fluctuation) Skewness parameters

EGG signal model Marasek (1997)

Results: EGG Cslope

Shimmer (amplitude fluctuation)

Jitter (F0 fluctuation)

Analysis and Results Multivariate ANOVA with tense/lax vowel and speaker as fixed factors And OQ, CSlope, CQ, SQ, Jitter, Shimmer and OI as dependent variables  Effects & Interactions OQ Cslope CQ SQ Jitter Shimmer OI uU   ** speaker uU*speaker iI * iI*speaker highly significant Significant

Summary EGG results There were no systematic differences in between the tense/lax vowels in OQ, CQ, SQ and intensity As suggested by Marashek (1997), the Cslope parameter might be a better predictor of the skewness in the EGG domain The lax vowels are produced with a less breathy laryngeal configuration than the tense ones Significant shimmer/jitter differences indicate that there might be a categorical voice quality difference involved (e.g. it is compatible with a more creaky phonation type in the lax vowels)

Summary [i] [I]

Implications Whether this difference in spectral tilt is a consequence of a gradual or categorical voice quality adjustment: the tense/lax vowel distinction in English seems to involve a triple phonetic distinction: segmental, durational and laryngeal. In L2 acquisition context: the lax vowel is difficult to acquire. Studies involving perceptual cues to the contrast may need to look at the laryngeal level as well as segmental.

Further Plans Record more controlled EGG materials for SSE Control utterance position Consider prominence issues (stressed unaccented versus stressed accented) Control for segmental context (minimal pairs) Consider vowel height Correlate spectral tilt and EGG skewness measures