A Model of Binaural Processing Based on Tree-Structure Filter-Bank

Slides:



Advertisements
Similar presentations
Audio Workgroup Neuro-inspired Speech Recognition.
Advertisements

Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.
An AER Analog Silicon Cochlea Model using Pseudo Floating Gate Transconductors Master Thesis in Electronics and Computer Science, Microelectronics Programme,
Sound Localization Superior Olivary Complex. Localization: Limits of Performance Absolute localization: localization of sound without a reference. Humans:
The case of the missing pitch templates: How harmonic templates emerge in the early auditory system Shihab Shamma and David Klein, 2000.
Advanced Speech Enhancement in Noisy Environments
Purpose The aim of this project was to investigate receptive fields on a neural network to compare a computational model to the actual cortical-level auditory.
HEARING Sound How the Ears Work How the Cochlea Works Auditory Pathway
Hearing and Deafness 2. Ear as a frequency analyzer Chris Darwin.
Hearing and Deafness Outer, middle and inner ear.
MIMICKING THE HUMAN EAR Philipos Loizou (author) Oliver Johnson (me)
The peripheral auditory system David Meredith Aalborg University.
USING COMPUTATIONAL MODELS OF BINAURAL HEARING TO IMPROVE AUTOMATIC SPEECH RECOGNITION: Promise, Progress, and Problems Richard Stern Department of Electrical.
Alaphangmagassag (virtual pitch) Terhardt ( ): megkulombozendo “virtual pitch” es “spectral pitch” dimenziok Virtual pitch: valoszinuleg (=biztos)
1 Auditory Sensitivity, Masking and Binaural Hearing.
The Auditory Nervous System Classical Ascending Pathway.
Chapter 6: Masking. Masking Masking: a process in which the threshold of one sound (signal) is raised by the presentation of another sound (masker). Masking.
Source Localization in Complex Listening Situations: Selection of Binaural Cues Based on Interaural Coherence Christof Faller Mobile Terminals Division,
cells in cochlear nucleus
Neural mechanisms of sound localization How the brain calculates interaural time and intensity differences.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
The Auditory System. Audition (Hearing)  Transduction of physical sound waves into brain activity via the ear. Sound is perceptual and subjective. 
Structure and function
Frequency representation Part 2 Development of mechanisms involved in frequency representation.
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
1 New Technique for Improving Speech Intelligibility for the Hearing Impaired Miriam Furst-Yust School of Electrical Engineering Tel Aviv University.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Plasticity in sensory systems Jan Schnupp on the monocycle.
Welcome To The Odditory System! Harry I. Haircell: Official Cochlea Mascot K+K+ AIR FLUID amplification.
The Auditory System Sound is created by pressure waves in air; these waves are induced by vibrating membranes such as vocal cords. Because the membranes.
Hearing Part 2. Tuning Curve Sensitivity of a single sensory neuron to a particular frequency of sound Two mechanisms for fine tuning of sensory neurons,
Speech Segregation Based on Sound Localization DeLiang Wang & Nicoleta Roman The Ohio State University, U.S.A. Guy J. Brown University of Sheffield, U.K.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Audio and Music Representations (Part 2) 1.
Hearing.
Abstract We report comparisons between a model incorporating a bank of dual-resonance nonlinear (DRNL) filters and one incorporating a bank of linear gammatone.
EE Audio Signals and Systems Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Cell Types and Physiology in the CANS. Major Components of the Central Auditory Nervous System (CANS) VIIIth cranial nerve Cochlear Nucleus Superior Olivary.
The auditory system Romain Brette Romain Brette Ecole Normale Supérieure.
Minimum Mean Squared Error Time Series Classification Using an Echo State Network Prediction Model Mark Skowronski and John Harris Computational Neuro-Engineering.
METHODOLOGY INTRODUCTION ACKNOWLEDGEMENTS LITERATURE Low frequency information via a hearing aid has been shown to increase speech intelligibility in noise.
Senior Design Fall 06 and Spring 07 Speech Strategy for the Cochlear Implant.
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 1) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Monaural Speech Segregation: Representation, Pitch, and Amplitude Modulation DeLiang Wang The Ohio State University.
Applied Psychoacoustics Lecture: Binaural Hearing Jonas Braasch Jens Blauert.
Filtering. What Is Filtering? n Filtering is spectral shaping. n A filter changes the spectrum of a signal by emphasizing or de-emphasizing certain frequency.
Methods Neural network Neural networks mimic biological processing by joining layers of artificial neurons in a meaningful way. The neural network employed.
ICASSP Speech Discrimination Based on Multiscale Spectro–Temporal Modulations Nima Mesgarani, Shihab Shamma, University of Maryland Malcolm Slaney.
Hearing Physiology.
Gammachirp Auditory Filter
Hearing Research Center
GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO: , M.E(MBCBS),COM SYSTEM,VI.
IIT Bombay {pcpandey,   Intro. Proc. Schemes Evaluation Results Conclusion Intro. Proc. Schemes Evaluation Results Conclusion.
Hearing Sound and the limits to hearing Structure of the ear: Outer, middle, inner Outer ear and middle ear functions Inner ear: the cochlea - Frequency.
Humans can hear sounds at frequencies from about 20Hz to 20,000Hz.
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 3) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Lecture 8 Neuromorphic Hearing 1. Outline The ear and the cochlea Silicon Cochlea 2.
Speech Segregation Based on Oscillatory Correlation DeLiang Wang The Ohio State University.
Development of Sound Localization 2 How do the neural mechanisms subserving sound localization develop?
January 2001RESPITE workshop - Martigny Multiband With Contaminated Training Data Results on AURORA 2 TCTS Faculté Polytechnique de Mons Belgium.
대학원 생체신호처리 - 4 이상민.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Petr Marsalek Prague, CZ, Charles University,
Perceptual Constancies
Fundamentals of Sensation and Perception
Central auditory processing
The Human Ear.
Tuning in the basilar membrane
Bell Work According to the Gestalt principle of proximity,
Coincidence Detection in the Auditory System
Presentation transcript:

A Model of Binaural Processing Based on Tree-Structure Filter-Bank 길이만, 김영익, 김화길, 구임회 한국과학기술원 응용수학전공

Motivation Design of auditory preprocessors motivated from the characteristics of biological auditory systems. - robustness to noise - capturing the minute differences between signals (2 Hz difference) - wide dynamic range (140 dB) - selective attention - source localization using two ears

Design of Basilar Membrane (BM) Types of BM Models Lyon and Mead - R. F. Lyon and C. Mead, An Analog Electronic Cochlea, IEEE Transactions on Acoustics, Speech and Signal Processing, 37(7), 1988. Liu - W. Liu, A. G. Andreou, and Jr. M. H. Goldstein, Voiced-Speech Representation by an Analog Silicon Model of the Auditory Periphery, IEEE Transactions on Neural Network, 3(3), 1992. Kates - J. M. Kates, A Time-Domain Digital Cochlear Model, IEEE Transaction on Signal Processing, 39(12), 1991. Hamming BPF - O. Ghitza, Robustness against Noise: the Role of Timing-Synchrony Measurement. IEEE International Conference on Acoustics, Speech and Audio Processing, 6.8, 1987.

Design of Filter Bank (2) Fully Cascaded BPF (3) TSFB (1) Lyon & Mead H L H L (1) Lyon & Mead L Cascaded LPFs Number of Filters: Cascaded LPFs & HPFs Higher bandpass capability Equal delay time Number of Filters: Tree sructure Cascaded LPFs & HPF Higher bandpass capability Equal delay time Versatile Q control Number of Filters:

Binaural Processing Models EE (Excitation-Excitation) cells in medial superior olive (MSO) - interaural cross-correlation models EI (Excitation-Inhibition) cells in lateral superior olive (LSO) - equalization-cancellation (EC) theory

Interaural Cross-correlation Model (EE-type cells) Running interaural cross-correlation (Jeffress, 1948) Delay weighting (Colburn, 1977) Frequency weighting (Stern and Shear, 1996)

Lindemann’s Model (EI-type cells) Contralateral inhibition mechanism Stationary-inhibition component Dynamic-inhibition component

Breebaart Model (EI-type cells) Combined EI-type cell Temporal windowing Nonlinear saturation

Shamma’s Model The Stereausis Network

Stereausis Processor

Network output for time shifted 600Hz tone a) zero shift b) shift c) shift d) shift

Binaural Processing with TSFB

Simulation for Binaural Processing - Signal : TI46 (‘zero’ ~ ‘nine’) male speech samples - Noise : Noisex samples

Simulation for Binaural Processing Feature ZCPA 45 90 White Gaussian Noise 10 94.3 95.0 95.3 94.1 94.5 5 85.9 93.0 94.2 92.2 92.5 49.7 63.7 74.3 63.9 68.2 75.9 Op Room 94.8 94.9 95.6 93.3 93.7 94.4 74.2 87.5 89.6 88.7 91.3 88.8 F16 94.7 95.2 90.7 93.5 93.6 92.1 51.0 67.9 71.2 66.4 64.7

Simulation for Monaural Processing without HRTF Feature ZCPA White Gaussian Noise 10 97.4 95.8 5 96.8 95.1 82.4 86.2 Op Room 95.7 96.0 94.9 76.8 91.2 F16 97.3 96.9 83.5 87.5

Conclusion A model of binaural processing with TSFB has been suggested. Simulation results showed that the binaural processing could be advantageous in noisy environment. The HRTF could degrade the performance of speech recognition. A new feature combining binaural data will be investigated in the sense of noise robustness.