The Effect of Incongruent Visual Cues on the Heard Quality of Front Vowels
Hartmut Traunmüller, Niklas Öhrström
Dept. of Linguistics, University of Stockholm

Background We previously carried out an AV perception experiment in which congruent and incongruent AV stimuli were presented to subjects. The AV stimuli consisted of different front vowels presented within a [g_g] frame. In the incongruent stimuli, the auditory and visual vowels differed in openness (height), in roundedness, or in both. The subjects had to report which vowel they had heard. The response alternatives consisted of the nine letters that represent the long vowel phonemes of Swedish.

Background Typical results: Visual roundedness combined with auditory openness.

A      V      Percept
ɡyɡ    ɡeɡ    ɡiɡ
ɡeɡ    ɡyɡ    ɡøɡ
ɡiɡ    ɡyɡ    ɡyɡ
ɡeɡ    ɡiɡ    ɡeɡ

Background Explanation: Acoustic cues to openness (F1 etc.) are prominent and reliable. Acoustic cues to roundedness (higher formants) are less reliable. Optic cues to roundedness are prominent and reliable; rounded lips are easy to distinguish from unrounded. Optic cues to openness are less reliable because of variation due to individual habits, attitude and emotion.
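
The cue weighting described above can be illustrated with a small sketch. It assumes a deliberately simplified, all-or-nothing rule (openness is taken entirely from the auditory vowel, roundedness entirely from the visual vowel) and a two-feature description of the four front vowels involved. The names FEATURES and predicted_percept are hypothetical and are not taken from the study.

```python
# Simplified feature table (an assumption for illustration):
# vowel -> (openness, rounded)
FEATURES = {
    "i": ("close", False),
    "y": ("close", True),
    "e": ("close-mid", False),
    "ø": ("close-mid", True),
}

# Reverse lookup: (openness, rounded) -> vowel
BY_FEATURES = {features: vowel for vowel, features in FEATURES.items()}


def predicted_percept(auditory: str, visual: str) -> str:
    """Combine auditory openness with visual roundedness.

    This is an idealised all-or-nothing rule, not a model fitted to data.
    """
    openness, _ = FEATURES[auditory]
    _, rounded = FEATURES[visual]
    return BY_FEATURES[(openness, rounded)]


if __name__ == "__main__":
    for a, v in [("y", "e"), ("e", "y"), ("i", "y"), ("e", "i")]:
        print(f"A=g{a}g  V=g{v}g  ->  g{predicted_percept(a, v)}g")
```

Under these assumptions the rule reproduces the typical results listed earlier; real responses are of course graded rather than categorical.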

Background That experiment was designed to investigate categorical phonemic perception. However, subjects informally reported having heard vowels whose quality differed from that of ordinary Swedish vowels. Auditory rounding together with visual unrounding appeared to affect the heard backness quality of the vowel.

The present study The present experiment aims at exploring the effect of the optic signal on the finer phonetic, sub-categorical auditory perception of vowels.

The present study We reused a subset of the stimuli from the previous experiment ("--" marks an absent channel).

A      V
ɡyɡ    ɡiɡ
ɡyɡ    ɡeɡ
ɡyɡ    --
--     ɡyɡ

A      V
ɡeɡ    ɡiɡ
ɡeɡ    ɡyɡ
ɡeɡ    --
--     ɡeɡ

A      V
ɡiɡ    ɡyɡ
ɡiɡ    ɡeɡ
ɡiɡ    --
--     ɡiɡ

The present study There were 4 speakers: 2 male, 2 female.
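
The stimulus design can be sketched as a short enumeration. The sketch assumes that every speaker contributed every stimulus type, which the slides do not state explicitly; the function names and the speaker labels are hypothetical.

```python
from typing import List, Optional, Tuple

VOWELS = ["y", "e", "i"]
# Hypothetical labels; the slides only say 2 male and 2 female speakers.
SPEAKERS = ["M1", "M2", "F1", "F2"]

# (auditory vowel, visual vowel); None marks an absent channel
Stimulus = Tuple[Optional[str], Optional[str]]


def stimulus_types() -> List[Stimulus]:
    """List incongruent AV, audio-only and video-only stimulus types."""
    types: List[Stimulus] = []
    for vowel in VOWELS:
        # Incongruent AV: the visual vowel differs from the auditory one
        for other in VOWELS:
            if other != vowel:
                types.append((vowel, other))
        types.append((vowel, None))  # audio only
        types.append((None, vowel))  # video only
    return types


def all_tokens() -> List[Tuple[str, Stimulus]]:
    """Cross stimulus types with speakers (assumed to be a full crossing)."""
    return [(speaker, stim) for speaker in SPEAKERS for stim in stimulus_types()]


if __name__ == "__main__":
    print(len(stimulus_types()), "stimulus types")
    print(len(all_tokens()), "tokens if fully crossed with speakers")
```

Congruent AV stimuli are left out on purpose, since the subset shown on the slide contains none.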

The present study There were 8 perceivers, selected from a previous experiment in which they had shown sensitivity to the optic signal in incongruent audiovisual stimuli. All 8 subjects were phonetically skilled and familiar with the IPA chart for vowels.

The present study The subjects perceived the stimuli by way of headphones and a computer screen. The stimuli were presented in quasi-random order. Responses were given on electronic response sheets.

The present study The subjects were instructed to rate the dimensions of the heard vowel (or of those seen in purely optical stimuli):
Lip rounding (6 degrees); 1st: unrounded, 5th: rounded
Lip spreading (3 degrees)
Openness (18 degrees); 2nd: close vowels, 6th: close-mid vowels
Backness (11 degrees auditorily, 7 degrees visually); 2nd: front vowels, 6th (auditorily): central vowels

Results [Figure: openness (opn) vs. roundedness (rnd); acoustic stimuli (listening only)]

Results [Figure: openness (opn) vs. roundedness (rnd); optic stimuli (lipreading only)]

Results Openness opn of incongruent AV-stimuli vs. opn of A-stimuli: opn = opn_A (r² = 0.97)
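
For readers unfamiliar with the r² values reported in these results, the following is a minimal sketch of how a least-squares line and its coefficient of determination can be computed from paired ratings. The slides do not describe the authors' procedure, so this is only one plausible reading; the function names are hypothetical and the numbers in the usage example are toy values, not data from the study.

```python
from typing import Sequence, Tuple


def fit_line(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(x: Sequence[float], y: Sequence[float]) -> float:
    """Share of the variance in y explained by the fitted line."""
    slope, intercept = fit_line(x, y)
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Toy ratings only (NOT from the study): A-only vs. AV openness ratings
    opn_a = [1.0, 2.0, 3.0, 4.0]
    opn_av = [1.0, 3.0, 2.0, 4.0]
    print(fit_line(opn_a, opn_av), r_squared(opn_a, opn_av))
```

A value close to 1, as reported for openness, means that the ratings of the incongruent AV stimuli are almost fully predictable from the ratings of the corresponding acoustic stimuli.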

Results Roundedness rnd of incongruent AV-stimuli vs. rnd of A-stimuli: (no significant correlation)

Results Backness bac of incongruent AV-stimuli vs. rnd of A-stimuli:
bac = rnd_A (r² = 0.66)
bac = rnd_A - 0.20 rnd_AV (r² = 0.74)

Results Openness opn of incongruent AV-stimuli vs. opn of V-stimuli: (no significant correlation)

Results Roundedness rnd of incongruent AV-stimuli vs. rnd of V-stimuli:
rnd = rnd_V (r² = 0.92)
rnd as a function of rnd_V and bac_V (r² = 0.95)

Results Backness bac of incongruent AV-stimuli vs. rnd of V-stimuli: (significant negative correlation)

Discussion Rated backness in incongruent stimuli is correlated with roundedness in the stimuli. There are two hypothetical explanations for this:
1. The distance from the lips to the dorso-palatal 'place of articulation' is increased by lip rounding as well as by tongue retraction. This would provide an articulatory (gestural) explanation.
2. F2′ is lowered by lip rounding as well as by tongue retraction. This would provide an auditory explanation.
Both explanations would be consistent with the placement of the rounded vowels to the right of their unrounded counterparts in IPA charts.

Discussion Analysis of perceived backness

Stimulus                      Prediction                                Observation
A (acoustic)   V (optic)      Expl. 1 (gestural)   Expl. 2 (auditory)
rounded        unrounded      fronted              retracted
unrounded      rounded        retracted            fronted

Conclusion: The effect is due to auditory (F2′) rather than articulatory (gestural) associations.
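
The opposing predictions of the two explanations can be written out as a small sketch. The reasoning in the comments is one reading of the two hypotheses, the "none" outcome for congruent rounding is an added assumption, and the function name predicted_backness_shift is hypothetical.

```python
from typing import Dict


def predicted_backness_shift(a_rounded: bool, v_rounded: bool) -> Dict[str, str]:
    """Predicted shift in heard backness under each explanation.

    Returns "fronted", "retracted" or "none" for the keys
    "gestural" (Expl. 1) and "auditory" (Expl. 2).
    """
    if a_rounded == v_rounded:
        # Assumption: no conflict between channels, so no shift is predicted
        return {"gestural": "none", "auditory": "none"}

    # Expl. 1 (gestural): lip rounding lengthens the distance from the lips
    # to the constriction, as retraction does, so the shift follows the
    # rounding that is SEEN.
    gestural = "retracted" if v_rounded else "fronted"

    # Expl. 2 (auditory): acoustic rounding lowers F2'. If the vowel is seen
    # as unrounded, the low F2' is attributed to retraction; if an acoustically
    # unrounded vowel is seen as rounded, its high F2' implies fronting.
    auditory = "retracted" if a_rounded else "fronted"

    return {"gestural": gestural, "auditory": auditory}


if __name__ == "__main__":
    for a, v in [(True, False), (False, True)]:
        print(f"A rounded={a}, V rounded={v}:", predicted_backness_shift(a, v))
```

Because the two explanations predict opposite shifts for every incongruent combination, the direction of the observed shift is enough to decide between them.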

Discussion We have seen that the perceived retractedness of auditorily rounded but visually unrounded vowels can be understood as due to a continuous auditory variable (F2′). The variation in perceived retractedness cannot be explained on the basis of a late-integration hypothesis, since Swedish lacks non-front unrounded vowel phonemes, whose existence would be required in order to apply such a hypothesis. This is clear and direct evidence for early, sub-categorical integration.

Thank you for your attention!