Vowels and speech production: gender differences Presentation from Lina Hecker Speaker Characteristics Venice International University Prof. Dr. Jonathan.

Slides:



Advertisements
Similar presentations
Motion in One Dimension
Advertisements

Tom Lentz (slides Ivana Brasileiro)
Acoustic and Physiological Phonetics
Tone perception and production by Cantonese-speaking and English- speaking L2 learners of Mandarin Chinese Yen-Chen Hao Indiana University.
Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Acoustic Characteristics of Consonants
Plasticity, exemplars, and the perceptual equivalence of ‘defective’ and non-defective /r/ realisations Rachael-Anne Knight & Mark J. Jones.
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels – P oral slightly greater than P atmos Less than fricatives – P oral.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et.
From Resonance to Vowels March 8, 2013 Friday Frivolity Some project reports to hand back… Mystery spectrogram reading exercise: solved! We need to plan.
Basic Spectrogram & Clinical Application Lab 9. Spectrographic Features of Vowels n 1st formant carries much information about manner of articulation.
Effects of Competence, Exposure, and Linguistic Backgrounds on Accurate Production of English Pure Vowels by Native Japanese and Mandarin Speakers Malcolm.
Acoustic Characteristics of Vowels
Coarticulation Analysis of Dysarthric Speech Xiaochuan Niu, advised by Jan van Santen.
The perception of dialect Julia Fischer-Weppler HS Speaker Characteristics Venice International University
DOES THE LINEAR SYNERGY HYPOTHESIS GENERALIZE BEYOUND THE SHOULDER AND ELBOW IN MULTI-JOINT REACHING MOVEMENTS? James S. Thomas*, Daniel M Corcos†,, and.
Phonetic variability of the Greek rhotic sound Mary Baltazani University of Ioannina, Greece  Rhotics exhibit considerable phonetic variety cross-linguistically.
Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Interlanguage Production of English Stop Consonants: A VOT Analysis Author: Liao Shu-jong Presenter: Shu-ling Hung (Sherry) Advisor: Raung-fu Chung Date:
Speech Science XII Speech Perception (acoustic cues) Version
Vowel Acoustics, part 2 March 12, 2014 The Master Plan Today: How resonance relates to vowels (= formants) On Friday: In-class transcription exercise.
“Speech and the Hearing-Impaired Child: Theory and Practice” Ch. 13 Vowels and Diphthongs –Vowels are formed when sound produced at the glottal source.
Speech and speaker normalization (in vowel normalization)
Niebuhr, D‘Imperio, Gili Fivela, Cangemi 1 Are there “Shapers” and “Aligners” ? Individual differences in signalling pitch accent category.
ACOUSTICAL THEORY OF SPEECH PRODUCTION
Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.
Speech Group INRIA Lorraine
Vowel Acoustics, part 2 November 14, 2012 The Master Plan Acoustics Homeworks are due! Today: Source/Filter Theory On Friday: Transcription of Quantity/More.
Experiment with the Slinky
Development of coarticulatory patterns in spontaneous speech Melinda Fricke Keith Johnson University of California, Berkeley.
Characterisation of individuals’ formant dynamics using polynomial equations Kirsty McDougall Department of Linguistics University of Cambridge
1 Sounds: the building blocks of language CA461 Speech Processing 1 Lecture 2.
Motion in One Dimension
Chapter 2 Motion in One Dimension. Quantities in Motion Any motion involves three concepts Displacement Velocity Acceleration These concepts can be used.
Chapter 2 Motion in One Dimension. Dynamics The branch of physics involving the motion of an object and the relationship between that motion and other.
Motion in One Dimension. The branch of physics involving the motion of an object and the relationship between that motion and other physics concepts Kinematics.
Interarticulator programming in VCV sequences: Effects of closure duration on lip and tongue coordination Anders Löfqvist Haskins Laboratories New Haven,
Present Experiment Introduction Coarticulatory Timing and Lexical Effects on Vowel Nasalization in English: an Aerodynamic Study Jason Bishop University.
Phonetics and Phonology
Abstract Research Questions The present study compared articulatory patterns in production of dental stop [t] with conventional dentures to productions.
Phonological Constraints on the Acquisition of Mid Vowels in English for Students in Taiwan author: 黃俐雯 presented by Lisa Liu 報告人: 劉莉莎.
Diphthongs Five most frequent diphthongs in Māori are /ai ae au ou ao/. Mergers between /ai~ae/ and /au~ou/ [3] (Figure 2). Only one of these mergers is.
Segmental factors in language proficiency: Velarization degree as a signature of pronunciation talent Henrike Baumotte and Grzegorz Dogil {henrike.baumotte,
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Whither Linguistic Interpretation of Acoustic Pronunciation Variation Annika Hämäläinen, Yan Han, Lou Boves & Louis ten Bosch.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
COORDINATION AND TIMING OF SPINE AND HIP MOTION DURING FULL BODY REACHING TASKS Gary E. Gibson, and James S. Thomas Ph.D, P.T. School of Physical Therapy,
Björkner, Eva Researcher, Doctoral Student Address Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing P.O. Box 3000.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Introduction to Vibrations and Waves
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Chapter 11 Motion.
The Effect of Initial Posture on The Performance of Multi-Joint Reaching Tasks: A Comparison of Joint Excursions Between Individuals With and Without Chronic.
Chapter 3 Acceleration Lecture 1
♥♥♥♥ 1. Intro. 2. VTS Var.. 3. Method 4. Results 5. Concl. ♠♠ ◄◄ ►► 1/181. Intro.2. VTS Var..3. Method4. Results5. Concl ♠♠◄◄►► IIT Bombay NCC 2011 : 17.
Tongue movement kinematics in speech: Task specific control of movement speed Anders Löfqvist Haskins Laboratories New Haven, CT.
Brandon Magliocco & Dr. David Schaffer  Economics  Univ. of Wisconsin-Eau Claire Changing Wage Rates Among Men and Women in the U.S. by Age Cohort and.
0 / 27 John-Paul Hosom 1 Alexander Kain Brian O. Bush Towards the Recovery of Targets from Coarticulated Speech for Automatic Speech Recognition Center.
Acoustic Phonetics 3/14/00.
The effect of speech timing on velopharyngeal function
English Phonetics and Phonology
The Acoustics and Perception of American English Vowels
Speech Perception (acoustic cues)
/r/ Place: palatal Articulatory phonetics Acoustics
A Japanese trilogy: Segment duration, articulatory kinematics, and interarticulator programming Anders Löfqvist Haskins Laboratories New Haven, CT.
Analyzing F0 and vowel formants of Persian based on long-term features
Presentation transcript:

Vowels and speech production: gender differences Presentation from Lina Hecker Speaker Characteristics Venice International University Prof. Dr. Jonathan Harrington 17. October 2007

Introduction: there have been some analyses of female speech in the past → focal point has been the male voice female voice has a higher frequency range men are more studied and they are regarded as the standard to which everything else is compared in this lecture you can hear some differences in the speech of females and males based on adults focus on dynamic articulatory and acoustic consequences of differences in male and female vocal tract dimensions and the relationship between formant change and tongue movement

1. What are the dynamic articulatory and acoustic consequences of differences in male and female vocal tract dimensions? (Simpson 2001) Simply illustrated in Goldstein (1980) using the mid-sagittal vocal tract dimensions models vocal tract growth from infant to adult and its acoustic products Goldstein draws together available anatomical dimension data from a number of qualitative and quantitative different sources

Conclusion of Figure 1: In the figure female stricture sizes are calculated as 80% of the male values. It shows superimposed tongue positions for female (solid) and male (dashed) [i] and [a]. The distance from male [a] to [i] is ~11% greater than the analogous female distance. If you assume the same nominal articulatory speed and neglect inertia and acceleration, then the male V–V movement will also take 11% longer.

2. The relationship between the size of oral structures and its implications for articulatory displacement and articulatory velocity.(Kuehn & Moll 1976) They showed that the subjects with larger oral structures, had larger articulatory displacement and employed greater articulatory velocity to traverse larger articulatory spaces. → focused on the general consequences of differences in oral structure size →did not discuss the more wide-ranging implications of their findings for gender specific consequences in articulatory behavior and its acoustic products

Explanation of Figure 2: In the next figure you can see a hypothetical male and female F1 paths for open–close vowel movement, assuming the same nominal tongue body movement speed of 200mm/s. →the male acoustic trajectory lasts longer than the female one →the linear acoustic rate of change of F1 for females is ~35% greater than the male value. ==>female tongue covers a shorter distance to achieve analogous targets, and corresponds to a greater acoustic distance.

Conclusion of Figure 2: males and females aim for analogous phonetic vowel targets in CVC sequences if they move their articulators at the same speed, and if they are operating within the same durational framework → females reach their target earlier female degree of openness is greater than the male one females exhibit less undershoot than males → undershoot increases from close to open vowel categories ==>despite dimensional differences, targets are attained at approximately the same time with a difference in articulatory speed

3. The Relationship between formant change and tongue movement main articulatory-acoustic patterns found in diphthongs (Simpson 2001) average male and female pellet and formant tracks are similar in form female speakers cover a greater acoustic space both in linear (Hz) and nonlinear (Bark) terms The articulatory distance covered by the two posterior lingual pellets during the vocalic stretch is greater for male speakers the dorso-tectal stricture size defined by the two posterior lingual pellets is smaller for female speakers throughout the vocalic stretch mean pellet speeds are greater for male than female speakers

3.1. Data: UW-XRMBDB (Westbury, 1994) data set for examining gender-specific differences in the relationship between articulation and its acoustic products contains acoustic and articulatory records from 26 female and 22 male speakers (age 18-37), speaking Upper Midwest dialect of Am. English linguistic (e.g. reading text) and non-linguistic (e.g. swallowing) tasks articulatory data consists of 8 gold pellets 4 lingual pellets are placed along the midline of the tongue

3.2. Method use stretches of utterance to investigate the dynamic relationship between acoustic and articulatory activity which fulfill 3 criteria 1.large amounts of articulatory and acoustic movement; 2.continuous voicing throughout the stretch to facilitate reliable automatic formant tracking; 3.repetition by the same speaker of the same expression containing a suitable stretch. “The coat has a blend of both light and dark fibers.” ‘‘They all know what I said’’

A: Formant analysis analysis of the vocalic stretch of “they all” made with the ESPS program formant nominal default value of F1 was increased by 10% to 550Hz for female speakers analysis times were extended by 25ms beyond the segment start and end times formant tracks of the 239 tokens were visually checked for tracking errors each set of formant tracks was resampled to provide 11 temporally equidistant formant records 11 points provide a good definition of formant movement throughout the vocalic stretch

B: Pellet position pellet position of the UW-XRMBDB are stated in a coordinate system The normalization method redefines the position of the pellets on the tongue surface, with respect to their distance from the tip of the upper incisors. normalization allows to compare values from speakers with different palate outline lengths raw pellet positions were averaged separately for males and females male and female average palate outlines were created using individual palate outlines, resampled at 0,5mm intervals

3.3. Results Duration: a one-tailed t-test for the V-V stretches shows that the mean female duration is greater than the male one → no significant difference was found between the male and female durations of the utterances in other studies there were also found longer female durations for diphthongs (Simpson 2001) and monophthongs (Hillenbrand, Getty, Clark & Wheeler 1995)

Formant tracks at the 11 equidistant measurement points means and standard deviations of F1-F3 were calculated for males (right) and females (left) tokens. (next fig) formant values for the V-V stretch for “they all” can only cautiously compared with the results found in the literature 1.speakers in the UW-XRMBDB speak an Upper Midwest American English 2.vowels are from the initial part of the utterance, particularly “they” being utterance-initial, unstressed and preceding a stressed back open vowel → expect a more centralized vowel than you would find in isolation or utterance finally

In the next figure you can see a graphical comparison of the mean male and female formant tracks, converted to the Bark scale. In linear (Hz) terms, female acoustic excursion within the vocalic stretch is greater for both F1 and F2 In non-linear (Bark) terms, situation is different. The mean tracks for F2 and F3 run parallel with little change and a distance between them throughout the vocalic stretch. difference in mean F1 is 0,74 Bark at the beginning and is 1,58 Bark (more than twice) by the end of the stretch → suggesting a closer male vowel or a more open female quality → more open the vowel quality, the larger the difference becomes between female and male F1

Explanation of figure 5: during vocalic stretch tongue body makes a small upward moving before moving backwards and downwards F1 is determined by the apico-dental stricture of “they” over the initial part of the stretch (t1-t4) at the final part of the stretch (t5-t11) the tongue body is lowered, resulting in an increase in the size of the dorso-palatal stricture defined by T2–T4 F2 rises (t2-t4) to reach a plateau at (t3-t4) for the closing phase of the diphthong F2 falls continuously as dorso-palatal stricture size increases and the tongue moves back rise in F3 can be related to the lowering and backing of the tongue body causing pharyngeal narrowing

Pellet position and speeds Fig. 6 shows the pellet position of the 4 lingual pellets T1-T4 at each of the 11 measurement points for female and male speakers transformed and normalized values are shown in (a) in (b) raw values are plotted together with average palate outlines and pharynx line segments can be seen arrows indicate the direction of movement over time (b) shows the mean size, shape and location of the male and female pellet trajectories in the transformed data (a), the palate has been ‘flattened’ →must be interpreted more carefully

Explanation of Figure 6: both transformed and raw data bring out the larger male dorso-palatal strictures defined by T3 and T4 laminal and apical strictures are not different for males and females transformed data encode the distance between the palate and the pellets higher location of the female trajectories shows the different stricture size (T3-T4) T-test proves that for females the palate-pellet distance for T3-T4 is smaller average lengths of the pellet trajectories during the vocalic stretch are shorter for females

posterior male lingual pellets T3-T4 travel a greater distance than the female pellets and they stay in contrast to the smaller acoustic space traversed by the male speakers these gender differences stand in contrast to findings in (Hashi et al. 1998) where no gender influence on isolated vowel tokens was found male dorsum travels a greater distance in a shorter time period (see 1.Duration) than the female one because the mean speed of the male posterior pellets (T3-T4) is higher

Explanation of Figure 7: the next figure summarizes the average pellet speeds at each of the 11 measurement points for the anterior pellets T1-T2 the male and female speed is not significantly different over the whole vocalic stretch for T3-T4 the initial and final portions are similar as well whereas the mean speeds of T3-T4 are at the highest point you can see significantly higher male speeds → compensation by both males and females is necessary to achieve the same targets, despite differences in articulatory space

Conclusion of Figure 7: gender-specific stricture differences are restricted to posterior region of the oral cavity → degree of male palatal doming is higher and creates a greater articulatory space to cross there are nonuniform differences in the relation of oral to pharyngeal cavity length and nonuniform differences in palate shape → this has nonuniform dynamic consequences for tongue movement

4. Discussion for the same V-V sequences male and female tongue movements and their acoustic and perceptual products are similar in shape and structure difference between male and female F1 increased acoustically with the degree of vowel openness male speakers had a shorter stretch duration → the speed of tongue dorsum displacement was higher size of male and female articulatory spaces is different and stands in an inverse relationship to the size of their acoustic products for the V-V sequences male and female pellet tracks have a similar form and differ only in size and position

male and female speakers a operating with similar speeds of tongue movements assume that the slower (female) articulatory movements require more time and faster (male) ones less larger vowel space for women → women speak more clearly and articulate more because it is the prestige form for female women produce longer vowels than men possibly speakers adopt different articulatory strategies to arrive at tokens of the same phonological categories → many of the hypothetical consequences are speculation

→ no proof whether 2 speakers aim for similar targets when they produce tokens of the same phonological categories in a language → no classification that tokens of the same phonological categories are equivalent in articulatory, acoustic and perceptual terms several experiments draw conclusions based on a few informants → tendencies might be individual rather than gender based many reasons for difference between male and female speech → women tend to have a greater variation in their speech → female speech has been seen more difficult to analyse

5. References Simpson, A. P. (2002). Gender-specific articulatory- acoustic relations in vowel sequences. Journal of Phonetics, 30(3): Simpson, A. P. (2001). Dynamic consequences of differences in male and female vocal tract dimensions. Journal of the Acoustical Society of America, 109(5): Samuelsson, Y. (2006) Gender effects on phonetic variation and speaking styles: A literature study. GSLT Speech Technology Term Paper, autumn 2006.

Goldstein, U. (1980) An articulatory model for the vocal tracts of growing children. Ph. D. Thesis, MA: M.I.T. Hashi, M., Westbury, J. R. & Honda, K. (1998) Vowel posture normalization, Journal of the Acoustical Society of America, 104, 2426–2437. Johnson, K., Ladefoged, P. & Lindau, M. (1993) Individual differences in vowel production, Journal of the Acoustical Society of America, 94, 701–714. Kuehn, D. P. & Moll, K. L. (1976) A cineradiographic study of VC and CV articulatory velocities, Journal of Phonetics, 4, 303–320.

Thank you for listening!