Visual calibration, uncertainty, and attention in spatial auditory processing Norbert Kopčo Dept. of Cybernetics and AI, Technical University of Košice,

Slides:



Advertisements
Similar presentations
Attention and neglect.
Advertisements

Hearing relative phases for two harmonic components D. Timothy Ives 1, H. Martin Reimann 2, Ralph van Dinther 1 and Roy D. Patterson 1 1. Introduction.
Visual Attention Attention is the ability to select objects of interest from the surrounding environment A reliable measure of attention is eye movement.
Chapter 6: Visual Attention. Overview of Questions Why do we pay attention to some parts of a scene but not to others? Do we have to pay attention to.
3-D Sound and Spatial Audio MUS_TECH 348. Wightman & Kistler (1989) Headphone simulation of free-field listening I. Stimulus synthesis II. Psychophysical.
Source Localization in Complex Listening Situations: Selection of Binaural Cues Based on Interaural Coherence Christof Faller Mobile Terminals Division,
Chapter 6: Visual Attention. Scanning a Scene Visual scanning – looking from place to place –Fixation –Saccadic eye movement Overt attention involves.
Chapter 6: Visual Attention. Scanning a Scene Visual scanning – looking from place to place –Fixation –Saccadic eye movement Overt attention involves.
Visually-induced auditory spatial adaptation in monkeys and humans Norbert Kopčo, I-Fan Lin, Barbara Shinn-Cunningham, Jennifer Groh Center for Cognitive.
Visually-induced auditory spatial adaptation in monkeys and humans Norbert Kopčo Center for Cognitive Neuroscience, Duke University Hearing Research Center,
Post-test review session Tuesday Nov in TH241.
Masker-First Advantage in Cued Informational Masking Studies Virginia M. Richards a, Rong Huang a, and Gerald Kidd Jr b. (a) Department of Psychology,
Organizational Notes no study guide no review session not sufficient to just read book and glance at lecture material midterm/final is considered hard.
Development of sound localization
Visually-induced auditory spatial adaptation in monkeys and humans Norbert Kopčo, I-Fan Lin, Barbara Shinn-Cunningham, Jennifer Groh Center for Cognitive.
ICA Madrid 9/7/ Simulating distance cues in virtual reverberant environments Norbert Kopčo 1, Scott Santarelli, Virginia Best, and Barbara Shinn-Cunningham.
December 1, 2009Introduction to Cognitive Science Lecture 22: Neural Models of Mental Processes 1 Some YouTube movies: The Neocognitron Part I:
Theoretical Models of Attention. Broadbent (1958) conceptualized attention as information processing Used a cuing paradigm to show that attentional selection.
Effect of roving on spatial release from masking for amplitude-modulated noise stimuli Norbert Kopčo *, Jaclyn J. Jacobson, and Barbara Shinn-Cunningham.
Vienna 4/28/ Click vs. click-click vs. blink-click: Factors influencing human sound localization in the horizontal plane Norbert Kopčo TU Košice.
Visually-induced auditory spatial adaptation in monkeys and humans Norbert Kopčo, I-Fan Lin, Barbara Shinn-Cunningham, Jennifer Groh Center for Cognitive.
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
4aPP17. Effect of signal frequency uncertainty for random multi-burst maskers Rong Huang and Virginia M. Richards Department of Psychology, University.
Sound source segregation (determination)
Acoustical Society of America, Chicago 7 June 2001 Effect of Reverberation on Spatial Unmasking for Nearby Speech Sources Barbara Shinn-Cunningham, Lisa.
Studying Visual Attention with the Visual Search Paradigm Marc Pomplun Department of Computer Science University of Massachusetts at Boston
Visual Adaptation And Spatial Auditory Processing Peter Lokša, Norbert Kopčo Collaborators: Jenni Groh, Barb Shinn-Cunningham, I-Fan Lin.
Attention Modulates Responses in the Human Lateral Geniculate Nucleus Nature Neuroscience, 2002, 5(11): Presented by Juan Mo.
Hearing in Time Slides used for talk to accompany Roger go to yellow three … Sarah Hawkins* and Antje Heinrich** *Centre for Music and Science, University.
Applied Psychoacoustics Lecture: Binaural Hearing Jonas Braasch Jens Blauert.
Chapter 12: Auditory Localization and Organization
Sounds in a reverberant room can interfere with the direct sound source. The normal hearing (NH) auditory system has a mechanism by which the echoes, or.
Calibration of Consonant Perception in Room Reverberation K. Ueno (Institute of Industrial Science, Univ. of Tokyo) N. Kopčo and B. G. Shinn-Cunningham.
Localization of Auditory Stimulus in the Presence of an Auditory Cue By Albert Ler.
Sound Localization Strategies in Simple and Complex Environments Norbert Kopčo Department of Cybernetics and AI, TU Košice, Slovakia Hearing Research Center.
Chapter 12: Sound Localization and the Auditory Scene.
Week 6: (March 15, 2011) Auditory Maps and orienting: need for Coordinate Transformations.
Dynamic Aspects of the Cocktail Party Listening Problem Douglas S. Brungart Air Force Research Laboratory.
‘Missing Data’ speech recognition in reverberant conditions using binaural interaction Sue Harding, Jon Barker and Guy J. Brown Speech and Hearing Research.
Hearing Research Center
Role of Working Memory in Visual Selective Attention de Fockert, Rees, Frith, Lavie (2001)
Auditory Neuroscience 1 Spatial Hearing Systems Biology Doctoral Training Program Physiology course Prof. Jan Schnupp HowYourBrainWorks.net.
Binding problems and feature integration theory. Feature detectors Neurons that fire to specific features of a stimulus Pathway away from retina shows.
When the Brain is attending a cocktail party When the Brain is attending a cocktail party Rossitza Draganova.
Fundamentals of Sensation and Perception THE AUDITORY BRAIN AND PERCEIVING AUDITORY SCENE ERIK CHEVRIER OCTOBER 13 TH, 2015.
The Relation Between Speech Intelligibility and The Complex Modulation Spectrum Steven Greenberg International Computer Science Institute 1947 Center Street,
Visually-induced auditory spatial adaptation in monkeys and humans Norbert Kopčo, I-Fan Lin, Barbara Shinn-Cunningham, Jennifer Groh Center for Cognitive.
Nuclear Accent Shape and the Perception of Syllable Pitch Rachael-Anne Knight LAGB 16 April 2003.
RIGHT PARIETAL CORTEX PLAYS A CRITICAL ROLE IN CHANGE BLINDNESS by Naser Aljundi.
Orienting Attention to Semantic Categories T Cristescu, JT Devlin, AC Nobre Dept. Experimental Psychology and FMRIB Centre, University of Oxford, Oxford,
Cortical Representations of Attention and Salience James A. Mazer Department of Neurobiology Yale School of Medicine Theoretical and Experimental Approaches.
$ studying barn owls in the laboratory $ sound intensity cues $ sound timing cues $ neural pathways for sound location $ auditory space $ interaural time.
ANT Z=52 R ACUE - PASSIVE VCUE - PASSIVE 1300 msVoltageCSD.31uV.03uV/cm 2 AIM We investigate the mechanisms of this hypothesized switch-ERP.
Fletcher’s band-widening experiment (1940)
The role of reverberation in release from masking due to spatial separation of sources for speech identification Gerald Kidd, Jr. et al. Acta Acustica.
SPATIAL HEARING Ability to locate the direction of a sound. Ability to locate the direction of a sound. Localization: In free field Localization: In free.
Fundamentals of Sensation and Perception
Sound Localization and Binaural Hearing
Oregon Health & Science University
Auditory Localization in Rooms: Acoustic Analysis and Behavior
Precedence-based speech segregation in a virtual auditory environment
The barn owl (Tyto alba)
Localizing a speech target in a multitalker mixture
Cognitive Brain Dynamics Lab
Volume 62, Issue 1, Pages (April 2009)
Visually-induced auditory spatial adaptation in monkeys and humans
Volume 62, Issue 1, Pages (April 2009)
Alteration of Visual Perception prior to Microsaccades
Attentional Modulations Related to Spatial Gating but Not to Allocation of Limited Resources in Primate V1  Yuzhi Chen, Eyal Seidemann  Neuron  Volume.
Jennifer K. Bizley, Ross K. Maddox, Adrian K.C. Lee 
Presentation transcript:

Visual calibration, uncertainty, and attention in spatial auditory processing Norbert Kopčo Dept. of Cybernetics and AI, Technical University of Košice, Košice, Slovakia Dept. of Cognitive and Neural Systems, Boston University

Auditory Processing in Complex Scenes Sound source determination, segregation, grouping (Yost, 1992) Auditory scene analysis (Bregman, 1990) Factors (Yost, 1994): - Spectral separation - Spectral profile - Harmonicity - Spatial separation - Temporal separation - Temporal onsets and offsets - Temporal modulations - Visual information

Auditory Pathway and Spatial Hearing Kendal, Schwartz & Jessell (2000) Cochlea – peripheral filtering and neural coding Olivary complex – extraction of binaural spatial information Thalamus (Inferior Colliculus) – integration, modulation detection Auditory Cortex – auditory object formation, figure/ground separation, ASA, attentional factors

OUTLINE Three behavioral experiments (and computational considerations): 1. Spatial specificity and coordinate frame of visual recalibration (ventriloquism aftereffect) in humans and monkeys 2. How a priori information / uncertainty about masker locations influences localizability of target talker in a multi- talker environment 3. „What and where“ model of cortical processing of auditory distance information

Visually-induced auditory spatial adaptation in monkeys and humans Norbert Kopčo, I-Fan Lin, Barbara Shinn-Cunningham, Jennifer Groh Center for Cognitive Neuroscience, Duke University Hearing Research Center, Boston University Technical University, Košice, Slovakia

6 Introduction Visual stimuli can affect the perception of sound location e.g. the Ventriloquism Effect Way to go Red Sox! Way to go Red Sox! But does effect persist?

7 Introduction Visual stimuli can affect the perception of sound location e.g. the Ventriloquism Effect Way to go Red Sox! But does effect persist? - barn owls: prism adaptation (Knudsen et al.) - monkeys: “ventriloquism aftereffect” (Woods and Recanzone, Curr. Biol. 2004)

8 GOALS 1. Ventriloquism “aftereffect” in saccade task, in monkeys and humans? 2. Can “spatially specific” aftereffect be induced? - Zwiers et al. (2003): prism adaptation in humans generalizes 3. Reference frame of plasticity? - Visual, auditory, or oculomotor reference frame?

9 Methods Basic idea: 1. Pre-adaptation baseline: Measure auditory saccade accuracy 2. Adaptation phase: Present combined visual-auditory stimuli, with visual location shifted 3. Compare auditory saccade accuracy pre- and post-adaptation

10 Q1: Does it work? Initial experiment Design: Monkey Pre-adaptation baseline – ~100 Auditory-only trials Adaptation phase – 80% V-A stimuli, visual stimulus shifted 6 deg. Left or Right 20% Auditory-only Compare Auditory-only trials from adaptation phase to pre- adaptation phase Sounds: Loudspeakers Visual stimuli: LEDs

11 RESULTS

12 RESULTS

13 RESULTS

14 RESULTS

15 Q2: specificity

16 Q3: reference frame Eye-centered? Head (ear) -centered? Oculomotor? ? ? 3a. What is the reference frame? 3b. Is the reference frame the same for humans and monkeys?

17 Method Audiovisual display Expected behavior Stimulus Location (°) Magnitude (°) Fix head to face 0° Induce shift: - in only one region of space - from a single fixation point Test to see if shift generalizes to the same sub-region in: - head-centered space - eye-centered space FP LEDs Speakers

18 Predictions for reference frame

19 Results: Humans Audiovisual display Expected Responses FP LEDs Speakers Head-centered or Eye-centered Trained FP A-only responses: - Shift induced in trained sub-region - Some generalization to untrained regions Spatial specificity Shifted FP A-only responses: - Shift reduced and moved to left Mixture of Head-centered and eye-centered representation

20 Results: Monkeys Audiovisual display Expected Responses FP LEDs Speakers Head-centered or Eye-centered Trained FP A-only responses: - Shift induced in trained sub-region Spatial specificity - Generalization to untrained regions (asymmetrical) Shifted FP A-only responses: - Shift reduced and moved to left Representation more mixed than in humans

21 Summary The main results are consistent across species: Locally induced ventriloquist effect results in short-term adaptation, causing 30-to-50% shifts in responses to A-only stimuli from trained sub-region. The induced shift generalizes outside the trained sub-region, with gradually decreasing strength -> spatially specific The pattern of induced shift changes as the eyes move. Representation appears to be mixed, in a frame that is not purely head-centered or eye-centered.

22 Discussion Posterior Parietal Cortex Neural adaptation could have been induced at several stages along the pathway. Future work Examine temporal and spatial factors influencing the eye- centered modulation. Look at other trained sub-regions. Midbrain Pons Cerebrum Thalamus Midbrain Pons Thalamus (Kandel, Schwartz, Jessel) and (Purves)

Localizing a speech target in a multitalker mixture Norbert Kopčo 1, Virginia Best 2, and Simon Carlile 2 1 Technical University of Košice, Košice, Slovakia Hearing Research Center, Boston University 2 School of Medical Sciences, University of Sydney, Sydney, Australia

24 Introduction Spatial separation of sources enhances speech perception In complex environments (e.g., with multiple talkers), spatial perception also important for “sorting” acoustic scene into objects and focusing attention on sources of interest (Brungart et al 2001; Freyman et al 1999; Kidd et al 2005; Best et al 2007; Shinn-Cunningham 2008) Relatively few studies actually measured localization of speech in a multitalker environment (Yost et al., 1996; Hawley et al.1999; Drullman and Bronkhorst 2000; Brungart et al. 2006)

25 Experiment and Goals Study horizontal localization of speech in a multitalker environment Question 1: How does presence of maskers influence localization performance? Evaluate the effect of maskers on biases/variability in responses. Question 2: Is performance affected by a priori knowledge / uncertainty about distribution of masker locations? Compare performance when masker distribution fixed vs. varied from trial to trial. Hypothesis: masker location uncertainty will hurt performance.

26 Setup and masker patterns

27 Methods Stimuli: Target: word “two” spoken by a female talker Maskers: 4 different monosyllabic words, spoken by 4 male talkers (all longer than target) Target-to-Masker energy ratios: 0 dB or -5 dB Task:Subjects pointed head to perceived target location Subjects asked to indicate location only if target heard (5 catch trials with no target per block to monitor obedience) Conditions (separate blocks): - Control: No masker - Fixed: Masker pattern fixed across block of trials - Mixed: Masker pattern randomly chosen for each trial

28 Detection Detection worse at lower TMR, similar in both uncertainty conditions

29 Effect of uncertainty Averaged across patterns and target locations, a priori knowledge does not help Masker Uncertainty Helps   Hurts

30 Effect of uncertainty When looking only at off-masker locations, a priori knowledge does help Masker Uncertainty Helps   Hurts

31 Effect of uncertainty When looking only at on-masker locations, a priori knowledge hurts performance Masker Uncertainty Helps   Hurts

32 Interim summary A priori knowledge of masker locations influences target talker localizability: - Improving performance at locations from which (the subject knows) no masker can come - Degrading performance at locations from which (the subject knows) maskers will come Why more information causes worse performance? Possible mechanism: - Redistribution of processing resources - “incorrect” strategy: focusing only on off-masker locations Analyze means and st.devs. in responses to gain more insight into behavior

33 Mean Responses: Pattern 1 Compression of responses to peripheral targets Bias Leftward   Rightward

34 Bias due to Maskers Compression strongest for targets near peripheral maskers Leftward   Rightward

35 Masker Uncertainty and Bias When masker pattern fixed throughout a block, responses biased away from maskers Bias due to Masker Uncertainty Left   Right

36 Response Variability Complex effect of target location, masking pattern, uncertainty and TMR

37 Uncertainty and Response Variability Patts 1-3 (grouped maskers): Averaged x-location, no Mixed-Fixed diff. If not averaged, fixing pattern - helps for off-masker targets - hurts for on-masker targets Masker Uncertainty Helps   Hurts

38 Uncertainty and Response Variability Patts 1-3 (grouped maskers): Averaged x-location, no Mixed-Fixed diff. If not averaged, fixing pattern - helps for off-masker targets - hurts for on-masker targets Masker Uncertainty Helps   Hurts Patts 4-5 (distributed maskers): Effect of uncertainty independent of loc. or TMR - patt 4: uncertainty hurts performance - patt 5: uncertainty helps performance

39 Summary 1. Mixture has complex effects on localization bias and variability - depending on masker pattern, location of target re. maskers, and TMR - compression of mean localization responses near peripheral maskers - increases in standard deviations, in particular when maskers distributed 2. Trial-to-trial randomization in the distribution of speech maskers (i.e., masker location uncertainty) modulates the effect of masking: - sometimes exaggerating it (as expected) - but sometimes reducing it (unexpected) These modulatory effects are likely to be due to - change in strategy / assignment of resources: focusing on off-masker locations in fixed condition - alternative: adaptation

40 Computational approach - Modeling Faller and Merimaa (2004) Source localization in complex listening situations: Selectionof binaural cues based on interaural coherence. Journal of the Acoustical Society of America. - computational “effective signal processing” model - computes cross-correlation between the ear inputs - only considers moments when cross-correlation peaks = when only one source is present and remaining sources are silent - creates probability distributions of ITD/ILD at these “clean look” moments - “decision maker” selects which source to process

41 Computational approach - Modeling Faller and Merimaa (2004) Source localization in complex listening situations: Selectionof binaural cues based on interaural coherence. Journal of the Acoustical Society of America. Example of distribution of representations with and without the criterion.

42 Computational approach - Modeling Faller and Merimaa (2004) Source localization in complex listening situations: Selectionof binaural cues based on interaural coherence. Journal of the Acoustical Society of America. Implications for our data: - This model can be a good starting point, but - Model would fail because no “good looks” available (concurrent mono-syllabic words) - No mechanism in the model to describe how listeners use attention in this task: - how attention is directed towards target - if/how “a priori” knowledge of masker locations is used - how uncertainty about masker locations influences behavior

Cortical representation of auditory distance Norbert Kopčo 1, Jyrki Ahveninen 2 1 TU Košice, Slovakia 2 Martinos Center for Biomedical Imaging Harvard Medical School

Hypotéza o sluch spracovaní v kôre Existujú dve paralelné dráhy, ktoré spracúvajú informáciu o obsahu (what) a priestorovej polohe (where) zvukov. Preto, organizácia sluchovej informácie môže byť podobná ako vo vizuálnom kortexe, kde je what/where organizácia v celku akceptovaná Problémy: - tieto závery sú hlavne na základe štúdií na zvieratách - u človeka, niektoré foneticky citlivé oblasti na zlej strane A1 Rauschecker (2005)

fMRI aktivácie kôrových oblastí Aktivácia sluchovej kôry pri prezentovaní stimulov s meniacou sa vzdialenosťou Aktvovaná primárna sluchová kôra Aktivovaná „parietálna“ vetva – „What & Where“ model sedí Aktivovaná aj vizuálna oblasť MT, zodpovedná za vizuálnu reprezentáciu pohybu Ďalšie kroky: Nástroj pre testovanie modelov spracovania vzdialenostnej informácie Kopčo et al. (ARO, 2011)

Overall Summary Three behavioral experiments examined sound localization in the horizontal (left-right) dimension: Exp. 1. In both humans and monkeys: - it is possible to induce a spatially specific ventriloquism aftereffect - visually guided recalibration occurs in a mixed coordinate frame Exp. 2. A priori information affects talker localization in a multi-talker mixture - Listeners redistribute resources from masked to non-masked locations Exp. 3. Auditory distance representation consistent with „what and where“ model. Exp. 1 & 2 published in J Neurosci and J Acoust Soc Am, Exp. 3 in preparation

Current Projects Continuing in the projects described above Spatial auditory attention and speech perception Contextual Plasticity in Sound Localization (US NIH) Perceptual and cross-modal learning in auditory distance perception (Marie Curie Project, 7FP EU)

Collaborators and Support Beata Tomoriová, Rudolf Andoga, Luboš Hládek – TU Košice Virginia Best, Barbara Shinn-Cunningham – Boston University Jennifer Groh – Duke University Simon Carlile – University of Sydney Aaron Seitz – University of California, Riverside Jyrki Ahveninen – Harvard Medical School Support by: US National Institutes of Health Human Frontiers Science Program Slovak Science Grant Agency US National Academies of Science Twinning Program

49 Introduction Two sounds can perceptually interact even when they do not overlap in time or frequency. In a previous study of auditory “cuing” (Kopco et al., 2001, 2003), we found that a preceding sound had a strong effect on the perceived location of a target. The current study was designed to examine these interactions in detail, and to determine if these interactions were due to - acoustic effects of the room - low-level neural processing effects (adaptation) - higher level processing (perceptual organization, strategy)

50 Methods We measured azimuthal localization performance for a click target stimulus when preceded by another identical click (coming from a known location): - presented with a short onset asynchrony - from a different azimuthal location - in either an anechoic chamber or an ordinary classroom Performance was compared to that in a control in which there was no preceding distractor click.

Stimuli and setup

Contextual task-dependent plasticity Responses biases away from distractor even on trials on which the target is not preceded by a distractor

Click vs Click-Click:Effect of the Preceding Distractor Perceived target location of frontal sources is “attracted” by lateral distractors Interactions for SOAs up to 200 ms Possible mechanisms: - Precedence effect - Short-term adaptation - Grouping of identical target and distractor

Click vs Click-Click vs. Click-Click-Click-Click-Click: Effect of Streaming on Perceived Target Azimuth Replacing the single-click distractor by a 5-Hz click-train such that the target does not fall into the temporal pattern of the distractor eliminates the “attractive” bias.