Calibration of Consonant Perception in Room Reverberation K. Ueno (Institute of Industrial Science, Univ. of Tokyo) N. Kopčo and B. G. Shinn-Cunningham.


Calibration of Consonant Perception in Room Reverberation K. Ueno (Institute of Industrial Science, Univ. of Tokyo) N. Kopčo and B. G. Shinn-Cunningham (Hearing Research Center, Boston Univ.)

Introduction
- Our auditory process is usually assumed to be static and fixed, dependent only on the input signals rather than on the state of the listener.
- We naturally and fluidly compensate for many interfering effects in everyday environments.
- How do listeners calibrate auditory perception to acoustic interference?

Outline of the Study
- PURPOSE: To explore how listeners calibrate auditory perception to room reverberation.
- STRATEGY: Measure the effect of sudden changes of reverberation on speech perception.
  [Diagram: a carrier phrase (Rev-C) followed by a target (Rev-T). Non-matching reverberation: lower performance. Matching reverberation: higher performance.]
- HYPOTHESIS: Consonant identification performance should be better when listeners have consistent room experience just prior to a test sound.

Stimuli
[Figure: a carrier phrase of VC syllables convolved with Rev-C, followed by a target VC convolved with Rev-T.]
- Speech source: VC (vowel-consonant) syllables with 16 consonants preceded by 'o' (/a/): ok, ot, op, of, od, og, ob, ov, oth(v), om, on, ong, oz, oth(uv), os, osh
  - Two male talkers and one female talker
  - Recordings from a corpus and a past study
- Binaural room impulse response (BRIR): R1, R2, Anechoic
- Test sound: VC * BRIR (the syllable convolved with the BRIR)
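The "VC * BRIR" step can be sketched as a plain linear convolution of the dry syllable with each ear's impulse response. This is a minimal illustration, not the authors' processing chain: the function names `convolve` and `render_binaural` and the toy sample values are assumptions made up for the example.

```python
from typing import List, Sequence, Tuple


def convolve(signal: Sequence[float], impulse_response: Sequence[float]) -> List[float]:
    """Full linear convolution; output length is len(signal) + len(ir) - 1."""
    if not signal or not impulse_response:
        return []
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def render_binaural(
    vc: Sequence[float],
    brir_left: Sequence[float],
    brir_right: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Convolve one dry VC recording with the left and right BRIR channels."""
    return convolve(vc, brir_left), convolve(vc, brir_right)


# Toy data (hypothetical): a 3-sample "syllable" and 3-tap impulse responses.
vc = [1.0, 0.5, 0.25]
brir_left = [1.0, 0.0, 0.3]   # direct sound plus one reflection
brir_right = [0.8, 0.0, 0.0]  # direct sound only
left, right = render_binaural(vc, brir_left, brir_right)
```

For real recordings an FFT-based convolution would be much faster; the direct form is used here only because it makes the operation explicit.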

Binaural room impulse responses
- R1: measured relatively close to the sound source (12 m) in a very reverberant church (reverberant).
- R2: measured at the second balcony of a large concert hall (33 m) (reverberant).
- Pseudo-anechoic: BRIRs processed from the R1 BRIR with a 5-ms time window (dry, clear).
[Figure: left-channel (Lch) impulse responses for R1 and R2, with the portion processed for the pseudo-anechoic HRTF marked.]
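One way to picture the 5-ms windowing is the sketch below. The slides do not say where the window starts or what shape it has, so this assumes a rectangular window that begins at the strongest sample (taken as the direct sound) and zeroes everything after it. The function name `pseudo_anechoic`, the sample rate, and the toy impulse response are all hypothetical.

```python
from typing import List, Sequence


def pseudo_anechoic(brir: Sequence[float], fs: float, window_ms: float = 5.0) -> List[float]:
    """Keep the BRIR up to window_ms after its peak and zero the rest.

    Assumptions (not stated in the slides): the peak marks the direct sound,
    and the window is rectangular.
    """
    if not brir:
        return []
    onset = max(range(len(brir)), key=lambda i: abs(brir[i]))
    n_keep = int(round(fs * window_ms / 1000.0))
    end = min(len(brir), onset + n_keep)
    return [x if i < end else 0.0 for i, x in enumerate(brir)]


# Toy example at a hypothetical 1 kHz sample rate, so 5 ms = 5 samples.
toy_brir = [0.0, 1.0, 0.2, 0.1, 0.05, 0.02, 0.3, 0.2, 0.1]
dry = pseudo_anechoic(toy_brir, fs=1000.0)
```

In practice a short fade-out at the window edge would avoid a hard truncation; it is left out here to keep the sketch short.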

Experimental Design and Procedure
- Test signals were presented through insert headphones.
- Subjects' responses for the final VC were collected through a GUI with 16 buttons labeled with the VCs.
- The number of VCs in the carrier (2 or 4) was fixed throughout each block of trials.
- The stimulus set (10 VCs x 3 talkers x 3 conditions = 90 trials in total) was presented in random order in each block, and each block was repeated twice for each subject.
- Subjects: 14 native English speakers
- Percent-correct target identification scores were calculated for each condition and subject.
[Figure: trial timing with time interval t = 0.8 s. 2-VC carrier: VC1 VC2, then the target VC. 4-VC carrier: VC1 VC2 VC3 VC4, then the target VC. The carrier phrase is presented in Rev-C and the target in Rev-T.]

Conditions (carrier of 2 VCs or 4 VCs), by carrier reverberation (Rev-C) and target reverberation (Rev-T):

Rev-C \ Rev-T | R1   | R2   | AE
AE            | R_AE | R_AE | -
R1            | R_m  | R_nm | -
R2            | R_nm | R_m  | -

(R_m: matching carrier, R_nm: non-matching carrier, R_AE: anechoic carrier)
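The per-condition, per-subject scoring described on this slide could look roughly like the following. The trial records, subject ID, and the function name `percent_correct` are invented for illustration; only the idea of scoring the target VC against the response comes from the slide.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

Trial = Tuple[str, str, str, str]  # (subject, condition, target VC, response VC)


def percent_correct(trials: Iterable[Trial]) -> Dict[Tuple[str, str], float]:
    """Percent of trials with response == target, per (subject, condition)."""
    counts = defaultdict(lambda: [0, 0])  # key -> [n_correct, n_trials]
    for subject, condition, target, response in trials:
        tally = counts[(subject, condition)]
        tally[0] += int(response == target)
        tally[1] += 1
    return {key: 100.0 * correct / total for key, (correct, total) in counts.items()}


# Made-up trials for one hypothetical subject.
trials = [
    ("S01", "R_m", "ok", "ok"),
    ("S01", "R_m", "ot", "op"),
    ("S01", "R_nm", "os", "os"),
    ("S01", "R_AE", "om", "on"),
]
scores = percent_correct(trials)
```

Keying the result on (subject, condition) keeps the scores in the shape needed for the across-subject means reported in the results.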

Experimental Results
[Figure: percent-correct target identification as a function of carrier reverberation (R_m, R_nm, R_AE), shown separately for 2-VC and 4-VC carriers. ○: Rev-T = R1, ●: Rev-T = R2.]
- The effect of Rev-C is significant only with Rev-T = R2 (p < .0001): with Rev-T = R2, performance with matching reverberation is significantly higher than with non-matching reverberation.
- The effect of the carrier length is not significant.
- Plotted values are condition means of the percent correct (PC) across 14 subjects and two repetitions.
- Error bars show 95% confidence intervals for the within-subject mean (14 data points).
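The slides do not say how the 95% intervals were computed. As a rough illustration only, the sketch below uses an ordinary Student-t interval on 14 per-subject scores; this is an assumption and may differ from the authors' within-subject method. The name `mean_ci95` and the scores are hypothetical.

```python
import math
import statistics
from typing import Sequence, Tuple

# Two-sided 95% critical value of Student's t for 13 degrees of freedom (n = 14).
T_CRIT_DF13 = 2.160


def mean_ci95(scores: Sequence[float], t_crit: float = T_CRIT_DF13) -> Tuple[float, float, float]:
    """Return (mean, lower, upper) of a t-based 95% confidence interval.

    The default t_crit is only valid for 14 scores; pass another value otherwise.
    """
    n = len(scores)
    if n < 2:
        raise ValueError("need at least two scores")
    mean = statistics.mean(scores)
    half_width = t_crit * statistics.stdev(scores) / math.sqrt(n)
    return mean, mean - half_width, mean + half_width


# Made-up percent-correct scores for 14 hypothetical subjects.
example_scores = [40.0] * 7 + [60.0] * 7
mean, low, high = mean_ci95(example_scores)
```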

Analysis of BRIR - reverberation
[Figure panels for R1 and R2: reverberation energy (Rev(50 ms-)/Dir(0-50 ms)) vs. frequency [Hz]; reverberation time (T60) vs. frequency [Hz]; FFT of the early 100 ms, relative level [dB] vs. frequency [Hz]; SNR and STI.]
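The reverberation-energy measure on this slide, Rev(50 ms-)/Dir(0-50 ms), can be sketched as the ratio of energy after 50 ms to energy in the first 50 ms of an impulse response. The slide plots it per frequency band; the broadband version below, its dB scaling, the choice of index 0 as time zero, and the name `late_to_early_ratio_db` are assumptions for illustration.

```python
import math
from typing import Sequence


def late_to_early_ratio_db(ir: Sequence[float], fs: float, split_ms: float = 50.0) -> float:
    """Energy after split_ms divided by energy before it, in dB.

    Assumes the impulse response starts at the direct-sound arrival (index 0).
    """
    split = int(round(fs * split_ms / 1000.0))
    early = sum(x * x for x in ir[:split])
    late = sum(x * x for x in ir[split:])
    if early <= 0.0 or late <= 0.0:
        raise ValueError("both segments must contain energy")
    return 10.0 * math.log10(late / early)


# Toy impulse response at a hypothetical 1 kHz sample rate:
# a unit direct sound, silence up to 50 ms, then a 100-sample tail.
toy_ir = [1.0] + [0.0] * 49 + [0.1] * 100
ratio_db = late_to_early_ratio_db(toy_ir, fs=1000.0)
```

In this toy case the tail carries the same energy as the direct sound, so the ratio comes out at about 0 dB. A per-band version would filter the impulse response into frequency bands first and apply the same ratio to each band.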

Summary
- Calibration to room reverberation improved consonant perception in one (but not in the other) room explored in this study.
- The two rooms differ in several acoustic characteristics, which might be the cause of this effect.
- The calibration occurs quickly, after just a few words.

Thank you for your attention!