Efficient Coding of Natural Sounds Grace Wang HST 722 Topic Proposal.

Slides:



Advertisements
Similar presentations
Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.
Advertisements

Acoustic/Prosodic Features
Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.
V1 Physiology. Questions Hierarchies of RFs and visual areas Is prediction equal to understanding? Is predicting the mean responses enough? General versus.
Hearing over time Using the neural representation of the time waveform of sound.
Fast Readout of Object Identity from Macaque Inferior Tempora Cortex Chou P. Hung, Gabriel Kreiman, Tomaso Poggio, James J.DiCarlo McGovern Institute for.
Edward L. Bartlett Ph.D., University of Wisconsin –Auditory thalamus in vitro synaptic physiology and anatomy Postdoctoral Research at Johns Hopkins University.
Purpose The aim of this project was to investigate receptive fields on a neural network to compare a computational model to the actual cortical-level auditory.
Whale Songs. Humpback whale song, wavelet graph These images are visual representations of songs sung by whales and dolphins.
Cortical Encoding of Natural Auditory Scenes Brigid Thurgood.
1 Testing the Efficiency of Sensory Coding with Optimal Stimulus Ensembles C. K. Machens, T. Gollisch, O. Kolesnikova, and A.V.M. Herz Presented by Tomoki.
Nonstationary Signal Processing Hilbert-Huang Transform Joseph DePasquale 22 Mar 07.
Outline Transmitters (Chapters 3 and 4, Source Coding and Modulation) (week 1 and 2) Receivers (Chapter 5) (week 3 and 4) Received Signal Synchronization.
Language. Using Language What is language for? Using Language What is language for? – Rapid, efficient communication To accomplish this goal, what needs.
Read this article for Friday [1]Chelazzi L, Miller EK, Duncan J, Desimone R. A neural basis for visual search in inferior temporal cortex. Nature 1993;
Application of Statistical Techniques to Neural Data Analysis Aniket Kaloti 03/07/2006.
Intro to Fourier Analysis Definition Analysis of periodic waves Analysis of aperiodic waves Digitization Time-frequency uncertainty.
Spike-triggering stimulus features stimulus X(t) multidimensional decision function spike output Y(t) x1x1 x2x2 x3x3 f1f1 f2f2 f3f3 Functional models of.
Basic Concepts and Definitions Vector and Function Space. A finite or an infinite dimensional linear vector/function space described with set of non-unique.
Spike Train decoding Summary Decoding of stimulus from response –Two choice case Discrimination ROC curves –Population decoding MAP and ML estimators.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
HELSINKI UNIVERSITY OF TECHNOLOGY LABORATORY OF COMPUTER AND INFORMATION SCIENCE NEURAL NETWORKS RESEACH CENTRE Variability of Independent Components.
-Gaurav Mishra -Pulkit Agrawal. How do neurons work Stimuli  Neurons respond (Excite/Inhibit) ‘Electrical Signals’ called Spikes Spikes encode information.
The biological basis of bird song production. Bird song facts: 1.There are about 4000 species of song birds each of which usually produce 1 to many bird.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
Michael P. Kilgard Sensory Experience and Cortical Plasticity University of Texas at Dallas.
The search for organizing principles of brain function Needed at multiple levels: synapse => cell => brain area (cortical maps) => hierarchy of areas.
Michael P. Kilgard Sensory Experience and Cortical Plasticity University of Texas at Dallas.
Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.
Low Level Visual Processing. Information Maximization in the Retina Hypothesis: ganglion cells try to transmit as much information as possible about the.
From Auditory Masking to Supervised Separation: A Tale of Improving Intelligibility of Noisy Speech for Hearing- impaired Listeners DeLiang Wang Perception.
2 2  Background  Vision in Human Brain  Efficient Coding Theory  Motivation  Natural Pictures  Methodology  Statistical Characteristics  Models.
Neural coding (1) LECTURE 8. I.Introduction − Topographic Maps in Cortex − Synesthesia − Firing rates and tuning curves.
Methods Neural network Neural networks mimic biological processing by joining layers of artificial neurons in a meaningful way. The neural network employed.
Projects: 1.Predictive coding in balanced spiking networks (Erwan Ledoux). 2.Using Canonical Correlation Analysis (CCA) to analyse neural data (David Schulz).
Studies of Information Coding in the Auditory Nerve Laurel H. Carney Syracuse University Institute for Sensory Research Departments of Biomedical & Chemical.
Visual Object Undrestanding. Verdical Perception Perception that is consistent with actual state of afairs in the environment.
Gammachirp Auditory Filter
On Natural Scenes Analysis, Sparsity and Coding Efficiency Redwood Center for Theoretical Neuroscience University of California, Berkeley Mind, Brain.
CCN COMPLEX COMPUTING NETWORKS1 This research has been supported in part by European Commission FP6 IYTE-Wireless Project (Contract No: )
Communication and Cortex The computational neuroethology of mouse vocalizations Robert Liu Sloan-Swartz Center for Theoretical Neurobiology University.
BCS547 Neural Decoding. Population Code Tuning CurvesPattern of activity (r) Direction (deg) Activity
BCS547 Neural Decoding.
Are worms more complex than humans? Rodrigo Quian Quiroga Sloan-Swartz Center for Theoretical Neurobiology. Caltech.
S.Klimenko, August 2003, Hannover LIGO-G Z How optimal are wavelet TF methods? S.Klimenko l Introduction l Time-Frequency analysis l Comparison.
The Fractional Fourier Transform and Its Applications Presenter: Pao-Yen Lin Research Advisor: Jian-Jiun Ding, Ph. D. Assistant professor Digital Image.
Computational Neuroscience Lecture 7 Conor Houghton.
LIGO-G Z The Q Pipeline search for gravitational-wave bursts with LIGO Shourov K. Chatterji for the LIGO Scientific Collaboration APS Meeting.
SPHSC 462 HEARING DEVELOPMENT Overview Review of Hearing Science Introduction.
Efficient Encoding of Vocalizations in the Auditory Midbrain Lars Holmstrom Systems Science PhD Program Portland State University.
Biological Modeling of Neural Networks: Week 15 – Fast Transients and Rate models Wulfram Gerstner EPFL, Lausanne, Switzerland 15.1 Review Populations.
Non-Linear Transformations Michael J. Watts
Object and face recognition
Confirmatory analysis for multiple spike trains Kenneth D. Harris 29/7/15.
>>ITD.m running… IC 800Hz 40 sp/sec 34 O azim Neuron April 16, 2009 Bo Zhu HST.723 Spring 2009 Theme 3 Paper Presentation April 1, 2009.
LIGO-G Z The Q Pipeline search for gravitational-wave bursts with LIGO Shourov K. Chatterji for the LIGO Scientific Collaboration APS Meeting.
1 Nonlinear models for Natural Image Statistics Urs Köster & Aapo Hyvärinen University of Helsinki.
Sound localization in the owl: representation and control
Information-Theoretic Listening
Kocaeli University Introduction to Engineering Applications
Nonlinear processing in LGN neurons
Continuous Time Representations of Song in Zebra Finches
Sound shadow effect Depends on the size of the obstructing object and the wavelength of the sound. If comparable: Then sound shadow occurs. I:\users\mnshriv\3032.
Reduction of Information Redundancy in the Ascending Auditory Pathway
Origin and Function of Tuning Diversity in Macaque Visual Cortex
Corticothalamic feedback connections
Feature-based attention
Tuning to Natural Stimulus Dynamics in Primary Auditory Cortex
Neurons spike at a consistent phase of the oscillations in the LFP
Decomposing the motor system
Presentation transcript:

Efficient Coding of Natural Sounds Grace Wang HST 722 Topic Proposal

Background Our natural environment consists of multiple sound sources and background noises with complex harmonic and temporal structures  Stationary harmonic structure (vocalizations)  Nonstationary structure (crunchy) Have our brains developed to be optimized for processing natural sounds efficiently?

Efficient coding hypothesis Shannon’s model for transferring data in communication systems (info theory) Barlow applied info theory to model neural behavior  Proposed spikes of neural populations was optimized to efficiently represent naturally occurring images and sounds  Efficiency = reduce redundancy to maximize independence  Largely consistent with early stages of visual processing  Is the same true for the auditory system?

Information in adjacent filters is highly redundant Nearly identical statistics across filters and across sound types Suggested natural sounds may be associated with bandwidth and translation invariance from Attias and Schreiner (1997), “Temporal low-order statistics of natural sounds”

How do we remove redundancy to achieve statistically independent neural responses? from Schwartz and Simoncelli (2000), “Natural sound statistics and divisive normalization in the auditory system”

Sound pressure waveform decomposition Differs from spectrograms by retaining the phase information from Smith and Lewicki (2005), “Efficient coding of time-relative structure using spikes”

Time-frequency distribution of optimal filter shapes Fourier transform may be sufficient for animal vocalizations Need wavelet analysis for environmental sounds and speech from Lewicki (2002), “Efficient coding of natural sounds”

Decompose into modulation spectra from Singh and Theunissen (2003), “Modulation spectra of natural sounds and ethological theories of auditory processing”

Discarding a lot of energy in white noise

from Singh and Theunissen (2003), “Modulation spectra of natural sounds and ethological theories of auditory processing”

Neural responses Increasing selectivity along ascending pathway for songs in zebra finches Optimal stimulus set in grasshoppers does not coincide completely with their natural sounds  Suggested neurons are optimized for behaviorally relevant sounds (Machens et al 2005)

Papers Background  Smith and Lewicki 2005: Efficient coding Discussion  Attias and Schreiner 1997: Demonstrate redundant representation in peripheral auditory system  Lewicki 2002: Fourier vs wavelet representation  Singh and Theunissen 2003: Modulation spectra Further reading  Machens et al 2005: grasshoppers, behaviorally relevant sounds  Hsu et al 2004: zebra finches, increasing selectivity