Psychoacoustics of Dynamic ‘Center-of-Gravity’ Signals Larry Feth Ashok Krishnamurthy Ohio State University.

Slides:



Advertisements
Similar presentations
Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.
Advertisements

SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew
Hearing relative phases for two harmonic components D. Timothy Ives 1, H. Martin Reimann 2, Ralph van Dinther 1 and Roy D. Patterson 1 1. Introduction.
Sound source segregation Development of the ability to separate concurrent sounds into auditory objects.
Periodicity and Pitch Importance of fine structure representation in hearing.
Purpose The aim of this project was to investigate receptive fields on a neural network to compare a computational model to the actual cortical-level auditory.
Hearing and Deafness 2. Ear as a frequency analyzer Chris Darwin.
CS 551/651: Structure of Spoken Language Lecture 11: Overview of Sound Perception, Part II John-Paul Hosom Fall 2010.
Speech Science XII Speech Perception (acoustic cues) Version
Pitch Perception.
A.Diederich– International University Bremen – Sensation and Perception – Fall Frequency Analysis in the Cochlea and Auditory Nerve cont'd The Perception.
Temporal processing Representation of the temporal characteristics of sound.
Rob van der Willigen Auditory Perception.
Effect of roving on spatial release from masking for amplitude-modulated noise stimuli Norbert Kopčo *, Jaclyn J. Jacobson, and Barbara Shinn-Cunningham.
Two- tone unmasking and suppression in a forward-masking situation Robert V. Shannon 1976 Spring 2009 HST.723 Theme 1: Psychophysics.
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
4aPP17. Effect of signal frequency uncertainty for random multi-burst maskers Rong Huang and Virginia M. Richards Department of Psychology, University.
Sound source segregation (determination)
Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.
CAPD: ”Behavioral assessment”
Methods Neural network Neural networks mimic biological processing by joining layers of artificial neurons in a meaningful way. The neural network employed.
SIGNAL DETECTION IN FIXED PATTERN CHROMATIC NOISE 1 A. J. Ahumada, Jr., 2 W. K. Krebs 1 NASA Ames Research Center; 2 Naval Postgraduate School, Monterey,
Lecturer: Rob van der Willigen
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Applied Psychoacoustics Lecture 3: Masking Jonas Braasch.
Hearing Research Center
Temporal masking of spectrally reduced speech: psychoacoustical experiments and links with ASR Frédéric Berthommier and Angélique Grosgeorges ICP 46 av.
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Psychophysics and Psychoacoustics
Additivity of auditory masking using Gaussian-shaped tones a Laback, B., a Balazs, P., a Toupin, G., b Necciari, T., b Savel, S., b Meunier, S., b Ystad,
When the Brain is attending a cocktail party When the Brain is attending a cocktail party Rossitza Draganova.
Applied Psychoacoustics Lecture 3: Masking
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
Speech Perception.
Acoustic Phonetics 3/14/00.
Motor Theory of Perception March 29, 2012 Tidbits First: Guidelines for the final project report So far, I have two people who want to present their.
SOUND PRESSURE, POWER AND LOUDNESS
Auditory Perception 1 Streaming 400 vs. 504 Hz 400 vs. 566 Hz 400 vs. 635 Hz 400 vs. 713 Hz A 400-Hz tone (tone A) is alternated with a tone of a higher.
PSYCHOACOUSTICS A branch of psychophysics
Thomas Andrillon, Sid Kouider, Trevor Agus, Daniel Pressnitzer 
Ana Alves-Pinto, Joseph Sollini, Toby Wells, and Christian J. Sumner
The “Flash-Lag” Effect Occurs in Audition and Cross-Modally
Temporal Processing and Adaptation in the Songbird Auditory Forebrain
Volume 82, Issue 1, Pages (April 2014)
Mark Sayles, Ian M. Winter  Neuron 
CHAPTER 10 Auditory Sensitivity.
Volume 77, Issue 5, Pages (March 2013)
Contrast Gain Control in Auditory Cortex
Results from offline processing
Perceptual Echoes at 10 Hz in the Human Brain
Josh H. McDermott, Eero P. Simoncelli  Neuron 
Thomas Andrillon, Sid Kouider, Trevor Agus, Daniel Pressnitzer 
Volume 61, Issue 2, Pages (January 2009)
The Generation of Direction Selectivity in the Auditory System
Speech Perception (acoustic cues)
Hearing Illusory Sounds in Noise: The Timing of Sensory-Perceptual Transformations in Auditory Cortex  Lars Riecke, Fabrizio Esposito, Milene Bonte, Elia.
Jason Samaha, Bradley R. Postle  Current Biology 
Responses of Collicular Fixation Neurons to Gaze Shift Perturbations in Head- Unrestrained Monkey Reveal Gaze Feedback Control  Woo Young Choi, Daniel.
Volume 49, Issue 3, Pages (February 2006)
Multiple Timescales of Memory in Lateral Habenula and Dopamine Neurons
Liu D. Liu, Christopher C. Pack  Neuron 
Consequences of the Oculomotor Cycle for the Dynamics of Perception
Attentive Tracking of Sound Sources
Temporal Processing and Adaptation in the Songbird Auditory Forebrain
Consequences of the Oculomotor Cycle for the Dynamics of Perception
Attention Reorients Periodically
Auditory Demonstrations
Tuning to Natural Stimulus Dynamics in Primary Auditory Cortex
Volume 58, Issue 1, Pages (April 2008)
Presentation transcript:

Psychoacoustics of Dynamic ‘Center-of-Gravity’ Signals Larry Feth Ashok Krishnamurthy Ohio State University

Spectral Center-of-Gravity Chistovitch and Lublinskaja (1976,1979) Perceptual Formant at ‘Center-of-Gravity’ Two-formant synthetic vowel Matched by adjustable single-formant signal Center frequency of match depends on relative amplitudes of the two formants

Experimental Paradigm

Chistovitch and Lublinskaja Results

Voelcker Two-tone Signals

Initially, led to the EWAIF model Envelope-Weighted Average of Instantaneous Frequency (time domain) Point by point multiply E x F values Sum over N periods Divide by sum of weights Indicates pitch change in periodic signals Helmholtz (1954, 2 nd English edition) Jeffress (1964)

EWAIF Model

IWAIF Model Predictions

Two-tone resolution task Feth and O’Malley (1977) Two-tone resolution  I = 1 dB;  f independent variable ‘Voelcker-tone pair’ pitch discrimination inverted “u-shaped” psychometric functions Components resolved beyond –75% point ~3.5 Bark separation = jnnd

Voelcker Signal: Discrimination Task

Discrimination Results Jnnd – ‘Just not noticeable difference’ Filled circles Breakpoint estimates Open circles CR – critical ratio CBW CB – ‘empirical’ CBW Solid line TW envelope

IWAIF Model Intensity Weighted Average of Instantaneous Frequency = Centroid of signal’s positive power spectrum (Anantharaman, et al., 1993)

Dynamic Center-of-Gravity Effect Lublinskaja (1996) Three-formant synthetic Russian vowels Listeners identified vowels with : ‘conventional’ formant transitions co-modulated formant pairs that exhibit the same dynamic spectral center-of-gravity ID functions were very similar with formant pairs separated by 4.3 Bark or less

Psychophysics Anantharaman (1998) Two-tone signals with dynamic c-o-g effect We called them ‘Virtual Frequency’ Glides Listeners matched transition rates in VF glides to those in FM glides IWAIF model predicts results for transitions from 2 to ~5 ERB

Dynamic Center-of-Gravity Signals Waveform Long-term Spectrum Spectrogram

Rate-matching results

Model Results

Short-term running IWAIF Model

IWAIF Model Results

Application of ST-IWAIF Model

More Psychophysics Research Question(s) What is being ‘integrated’ in spectral integration? OR Where in the auditory system is the processing located?

Psychophysics Iyer, et al., (2001) Temporal acuity for FM and VF glides Step vs. linear ramp discrimination Similar  T values may mean common process Masking patterns for FM and VF glides Peripheral process i.e., ‘Energy Masking’ Different results – VF not peripheral process

Temporal Acuity Paradigm Step (red) versus Glide (blue) transitions for FM tone (left panel) and Virtual Frequency (right panel)

Temporal Acuity Results Just discriminable step duration for FM (solid lines; filled symbols) and VF (dashed lines; unfilled symbols) signals. Frequency separations are 2, 5 and 8 ERBu. The results for 1000 Hz are represented by circles and those for 4000 Hz by triangles. Average for 4 listeners.

Dynamic Center-of-Gravity Maskers Masking of brief probe by FM glide (left panel) and by VF glide (right panel). Probe is in the spectro-temporal center of each masker. Five auditory filter bands are illustrated. Time Fl Fc Fh Time Fl Fc Fh

Masking Results Masking of a 20 ms probe by FM (light blue) and VF (darker blue) maskers. The probe is placed at the beginning, middle, and end of the masker. Significant differences are seen at 5 and 8 ERB for the middle position and the initial position at 8 ERB. Average for 4 listeners.

Glide Direction Asymmetry Gordon and Poeppel 3 Frequency ranges: (for F 1,F 2 & F 3 ) ~ 30 unpracticed listeners 20 trials / signal One interval Direction Identification: Up vs. Dn Best results at high frequency (F 3 ) range 10- through 160 ms ‘Up’ is easier to ID than ‘Dn’ Less clear-cut results at low or mid-freq. ranges

Glide Direction Asymmetry Gordon and Poeppel – ARLO (2002) Identification of FM Sweep direction is easier for rising than for falling tones.

Glide Direction Asymmetry Dawson, (2002) Tested only high frequency range (F 3 ) Practiced listeners; ~ 100% all conditions! Modified procedure Rove each frequency sweep over 1 octave Practice to ~ asymptote

Glide ID Results Average for 4 listeners One-interval ID task 250 trials / datum point Well-practiced Subj’s Starting frequency roved over 1-octave range Summary FM ‘easier’ than VF Up ‘easier’ than Down

CV Identification Experiment [da] – [ga] continuum: varying F 3 transition Duration: 50 ms transition into 200 ms base F 3 onset: 2018 to 2658 Hz in 80 Hz steps F 3 base: 2527 Hz (constant) Formant transition ‘type’: Klatt synthesizer Frequency Modulated tone glide Virtual Frequency glide

CV Identification: Stimuli Spectrogram 1. Step 1 of Klatt Monaural Continuum—/ga/ endpoint

CV Identification: Stimuli Spectrogram 2. Step 1 of FM Monaural Continuum—/ga/ endpoint

CV Identification: Stimuli Spectrogram 3. Step 1 of VF Monaural Continuum—/ga/ endpoint

CV Identification: Stimuli Spectrogram 4. Step 1 of Dichotic FM Continuum—/ga/ endpoint

CV Identification: Stimuli Spectrogram 5. Step 1 of Dichotic VF Continuum—/ga/ endpoint

CV Identification Experiment Listeners: 8 adults with normal hearing Procedure: One interval, 2-AFC 3 transition types: Klatt, FM or VF 6 of 8 tokens tested 20 repetitions / token Results are averaged for the 8 listeners

CV Identification: Results

Psychoacoustics of Dynamic ‘Center-of-Gravity’ Signals Conclusions ‘Excitation’ is integrated not signal energy The processing is central not peripheral Masking Patterns are very different Temporal Acuity results are similar for FM & VF glides Direction ID Asymmetry is similar for FM & VF glides

Psychoacoustics of Dynamic ‘Center-of-Gravity’ Signals Conclusions CV identification functions are similar for: Klatt synthesized sounds FM formant sounds VF formant sounds Thus, it doesn’t matter how ‘excitation’ is moved from A to B, the brain will interpret it as the same sound. The effect is evident under dichotic listening; further support for central processing.

Collaborators Rob Fox Nandini Iyer Jayanth Anantharaman Ewa Jacewicz Robin Dawson

Psychoacoustics of Dynamic ‘Center-of-Gravity’ Signals Thank You Questions?

Up vs. Down FM Glide

Up vs. Down VF Glide

Effect of Masker Direction Masking produced by VF (above) and FM (below) maskers with  F = 5 ERB. Purple bars are “up” glides; yellow bars are “down” glides. Centered probe.

Effect of Masker Position Masking produced by VF (above) and FM (below) maskers with  F = 5 ERB. Purple bars are “up” glides; yellow bars are “down” glides.

Klatt & FM Parameters

Virtual Frequency Parameters