Danielle Werle, Undergraduate Thesis: Intelligibility and the Carrier Phrase Effect in Sinewave Speech



Sinewave Speech: Three sinusoids that follow the center frequencies of the lowest three formants in a speech sample. The sinusoids are not harmonically related, so the signal is aperiodic. Formant peaks are narrow. The result sounds weird.
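The construction described on this slide can be sketched in a few lines. This is only an illustration under stated assumptions, not the synthesis procedure used in any of the studies cited here: the function name `sinewave_speech`, the 10 ms frame length, the 16 kHz sample rate, and the example formant values are all hypothetical, and the formant frequencies and amplitudes are assumed to have been measured already by some separate analysis step.

```python
import math

def sinewave_speech(frames, frame_dur=0.01, sample_rate=16000):
    """Sum one sinusoid per formant track.

    frames: list of analysis frames; each frame is a list of
            (frequency_hz, amplitude) pairs, one pair per formant.
    Returns a list of float samples.
    """
    samples_per_frame = int(round(frame_dur * sample_rate))
    n_tones = len(frames[0])
    # One running phase per tone, so each tone stays continuous
    # when its frequency changes from one frame to the next.
    phases = [0.0] * n_tones
    signal = []
    for frame in frames:
        for _ in range(samples_per_frame):
            value = 0.0
            for k, (freq, amp) in enumerate(frame):
                phases[k] += 2.0 * math.pi * freq / sample_rate
                value += amp * math.sin(phases[k])
            # Divide by the number of tones to keep the sum within [-1, 1]
            # when every amplitude is at most 1.
            signal.append(value / n_tones)
    return signal

# Hypothetical formant values (Hz) for a short steady vowel-like sound.
example_frames = [[(300.0, 1.0), (2300.0, 0.5), (3000.0, 0.25)]] * 5
example_signal = sinewave_speech(example_frames)
```

Because the three frequencies are taken from formant tracks rather than from multiples of a common fundamental, nothing in the sketch forces them to be harmonically related, which is the property the slide points to.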

Remez, Rubin, Pisoni, and Carrell (1981), Condition A: Presented 18 listeners with the sinewave (SW) sentence “Where were you a year ago?” Two transcribed the utterance; about half chose speech-like answers.

Remez et al. (1981) cont’d, Condition B: Presented the same sentence to 18 new listeners with the instruction that they were about to hear speech. Nine listeners transcribed the sentence completely. Several others transcribed part or most of the sentence.

Remez et al. (1981) Authors concluded that even in the absence of traditional speech cues, listeners could phonetically perceive SWS. The instruction that they were about to hear speech allowed listeners to direct their attention to the phonetic properties of the sinusoidal signals.

Problem? Remez et al. (1981) did not measure SWS perception at the phonetic level. With complete sentences, listeners may use their existing higher-level knowledge of language to reconstruct unfamiliar utterances.

Hillenbrand, Clark, and Baer (2011): Comprehensive study of SW vowels. Listeners must rely exclusively on perceptual pattern-matching mechanisms operating at the phonetic level. Focused on 3 training procedures.

Hillenbrand et al. (2011) cont’d: 71 phonetically trained listeners. Initial vowel intelligibility test: 300 sinewave /hVd/ syllables. Database from Hillenbrand, Getty, Clark, and Wheeler (1995). Spoken by 48 men, 45 women, and yr. old children. Same geographical region as listeners.

Hillenbrand et al. (2011) cont’d: Feedback: 180 sinewave /hVd/ syllables from Hillenbrand et al. (1995). Sentence Transcription: 50 sinewave sentences from HINT. Triad: SW, NS, SW. Control: male/female identification task.

Hillenbrand et al. (2011) cont’d: Results of vowel intelligibility post-test. Note: transcription accuracy for sentences was 89.6%.

Hillenbrand et al. (2011) cont’d: Second experiment. 12 phonetically trained listeners. Sinewave /hVd/ with feedback. If correct, the listener moved on to the next trial. If incorrect, the listener heard three signals in sequence: sinewave /hVd/, natural /hVd/, sinewave /hVd/.
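The feedback rule on this slide amounts to a small decision step after each response. The sketch below is one possible reading of it, with hypothetical names (`feedback_signals`, the `"sinewave"` and `"natural"` labels); it assumes that the same /hVd/ token is replayed in all three positions, which the slide does not state explicitly.

```python
def feedback_signals(response, target):
    """Return the signals to play after a listener's response.

    response, target: vowel labels such as "heed" or "hid".
    A correct response gets no replay; an incorrect response gets
    the sinewave token, its natural counterpart, then the sinewave
    token again (assumed here to be the same word each time).
    """
    if response == target:
        return []  # correct: move on to the next trial
    return [
        ("sinewave", target),
        ("natural", target),
        ("sinewave", target),
    ]
```

For example, `feedback_signals("hid", "heed")` returns the three-item replay list, while `feedback_signals("heed", "heed")` returns an empty list.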

Hillenbrand et al. (2011) cont’d: Intelligibility rates in experiment 2 increased 19.8 percentage points. While phonetic information may be conveyed through sinewave speech, results are still low compared to vowel intelligibility rates for natural speech: 95.4% (Hillenbrand and Nearey, 1999).

Hillenbrand et al. (2011) cont’d: Results of vowel intelligibility post-test. Note: transcription accuracy for sentences was 89.6%.

Hillenbrand, Clark, Houde, K. Hillenbrand, and M. Hillenbrand (2012): Why are sentences more intelligible than syllables? Two possible explanations: higher-level linguistic knowledge, or sentence length, which allows for accommodation to the weird acoustic characteristics.

Hillenbrand et al. (2012) cont’d: 103 phonetically trained listeners. Test signals recorded by 10 men and 10 women of the same geographic region. Carrier phrase: “The next word on the list is /hVd/.” 16 vowels.

Hillenbrand et al. (2012) cont’d
Condition: Signals presented
SWIS: Sinewave /hVd/ syllables in isolation.
ISNS: Naturally spoken /hVd/ syllables in isolation.
SWCP-SWIS-WT: Sinewave /hVd/ syllables preceded by a sinewave carrier phrase replicated from the same speaker.
NSCP-WT: Naturally spoken /hVd/ syllables preceded by a naturally spoken carrier phrase from the same speaker.
SWCP-SWIS-XT: Sinewave /hVd/ syllables preceded by a sinewave carrier phrase of a randomly paired speaker.
NSCP-XT: Naturally spoken /hVd/ syllables preceded by a naturally spoken carrier phrase of a randomly paired speaker.
NSCP-SWIS-WT: Sinewave /hVd/ syllables preceded by a naturally spoken carrier phrase of the same speaker.

Hillenbrand et al. (2012) cont’d: Results. ISNS: 95.1%. NSCP-WT: 96.5%. NSCP-XT: 95.2%.

Hillenbrand et al. (2012) cont’d

Experiment 2: New set of 23 phonetically trained listeners. 10 blocks of 32 trials. Listeners were presented with alternating utterances from SWIS and SWCP.
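One way to picture the design on this slide is as a trial schedule. The sketch below is an assumption-laden illustration, not the procedure actually used: it assumes strict trial-by-trial alternation starting with SWIS and random selection of /hVd/ tokens, neither of which the slide specifies. The names `build_schedule` and `tokens` are hypothetical.

```python
import random

def build_schedule(tokens, n_blocks=10, trials_per_block=32, seed=0):
    """Build blocks of trials that alternate between SWIS and SWCP.

    tokens: list of /hVd/ word labels to draw from (drawn at random here,
            which is an assumption for illustration).
    Returns a list of blocks; each block is a list of (condition, token).
    """
    rng = random.Random(seed)  # fixed seed so the schedule is reproducible
    conditions = ("SWIS", "SWCP")
    schedule = []
    for _ in range(n_blocks):
        block = []
        for i in range(trials_per_block):
            # Even-numbered trials are isolated syllables,
            # odd-numbered trials carry the sinewave carrier phrase.
            block.append((conditions[i % 2], rng.choice(tokens)))
        schedule.append(block)
    return schedule

schedule = build_schedule(["heed", "hid", "hayed", "head"])
```

With 10 blocks of 32 trials, this yields 320 trials in total, 160 in each condition.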

Hillenbrand et al. (2012) cont’d

Carrier Phrase Effect: Does the carrier phrase need to be intelligible in order to increase sinewave vowel intelligibility? Or does the listener only need time to accommodate to the unusual acoustical characteristics of the sinewave signal?

The Study: 43 phonetically trained participants. Pre-trial: naturally spoken /hVd/ vowels (‘heed’, ‘hawd’, ‘hayed’, etc.). 16 vowel choices. Six conditions.

Conditions: ISNS (control): natural speech recordings of /hVd/ vowels. ISSS: sinewave-synthesized /hVd/ vowels presented in isolation.

Conditions, cont’d: CP-E: sinewave-synthesized /hVd/ vowels preceded by a consistent sinewave-synthesized carrier phrase, “the next word on the list is…”. CP-J: sinewave-synthesized (English) vowels preceded by the same consistent carrier phrase, but spoken in Japanese.

Conditions, cont’d: HINT-E: vowels preceded by one of 240 sinewave-synthesized sentences in the Hearing in Noise Test database. HINT-J: vowels preceded by one of 240 sinewave-synthesized sentences in the Hearing in Noise Test database, but in Japanese.
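Scores for the six conditions can be tallied as percent correct per condition. The sketch below shows one straightforward way to do that; the function name `percent_correct_by_condition` and the trial tuple layout are hypothetical, and the chance level assumes that a guessing listener picks among the 16 vowel choices with equal probability.

```python
from collections import defaultdict

CONDITIONS = ("ISNS", "ISSS", "CP-E", "CP-J", "HINT-E", "HINT-J")
N_VOWEL_CHOICES = 16
# Chance performance if all 16 response options are guessed equally often.
CHANCE_PERCENT = 100.0 / N_VOWEL_CHOICES

def percent_correct_by_condition(trials):
    """Compute percent correct vowel identification per condition.

    trials: iterable of (condition, target_vowel, response) tuples.
    Returns a dict mapping each condition seen to its percent correct.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for condition, target, response in trials:
        total[condition] += 1
        if response == target:
            correct[condition] += 1
    return {c: 100.0 * correct[c] / total[c] for c in total}
```

Any condition score can then be compared against `CHANCE_PERCENT` as well as against the natural-speech control.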

[Figure: vowel intelligibility by condition. Values shown: 99%, 77%, 61%, 52%, 50%, 48.6%.]

Japanese v. English: Simply having time to accommodate to a weird sound with unique acoustical characteristics was not enough to increase intelligibility of the following vowels.

HINT v. Fixed CP: HINT-E was more intelligible than the Japanese conditions. The fixed CP was the most intelligible (1 CP v. 240 CPs). A more intelligible carrier goes with greater /hVd/ scores.

Why is Intelligibility Important? Results show that an intelligible CP enhances vowel intelligibility scores and aids in unconscious signaling to speech processors. 99% v. 77%: what else is helping? Not acoustical accommodation.

References
Assmann, P., Nearey, T., & Hogan, J. (1982). Vowel identification: Orthographic, perceptual, and acoustic aspects. Journal of the Acoustical Society of America, 71.
Hillenbrand, J., Getty, L. A., Clark, M. J., & Wheeler, K. (1995). Acoustic characteristics of American English vowels. Journal of the Acoustical Society of America, 97(5 I).
Hillenbrand, J. M., Clark, M. J., & Baer, C. A. (2011). Perception of sinewave vowels. Journal of the Acoustical Society of America, 129(6).
Hillenbrand, J. M., Clark, M. J., & Houde, R. A. (2000). Some effects of duration on vowel recognition. Journal of the Acoustical Society of America, 108(6).
Hillenbrand, J. M., Clark, M. J., Hillenbrand, M. W., Hillenbrand, K. S., & Houde, R. A. (2012). Perceptual accommodation to sinewave speech. In preparation.
Hillenbrand, J. M., Houde, R. A., & Gayvert, R. T. (2006). Speech perception based on spectral peaks versus spectral shape. Journal of the Acoustical Society of America, 119(6).
Hillenbrand, J. M., & Nearey, T. M. (1999). Identification of resynthesized /hVd/ utterances: Effects of formant contour. Journal of the Acoustical Society of America, 105(6).
Macleod, A., & Summerfield, Q. (1987). Quantifying the contribution of vision to speech perception in noise. British Journal of Audiology, 21(2).
Nilsson, M., Soli, S., & Sullivan, J. A. (1994). Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and noise. Journal of the Acoustical Society of America, 95(2).

References cont’d
Remez, R. E., Rubin, P. E., Nygaard, L. C., & Howell, W. A. (1987). Perceptual normalization of vowels produced by sinusoidal voices. Journal of Experimental Psychology: Human Perception and Performance, 13(1).
Remez, R. E., Rubin, P. E., Pisoni, D. B., & Carrell, T. D. (1981). Speech perception without traditional speech cues. Science, 212(4497).