Meena Ramani 04/12/06 EEL 6586 Automatic Speech Processing.

Slides:



Advertisements
Similar presentations
Hearing and Deafness 1. Anatomy & physiology Chris Darwin Web site for lectures, lecture notes and filtering lab:
Advertisements

Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.
Perception Chapter 11: Hearing and Listening
Mr. McCormick A.P. Psychology
Periodicity and Pitch Importance of fine structure representation in hearing.
HEARING Sound How the Ears Work How the Cochlea Works Auditory Pathway
Hearing and Deafness 2. Ear as a frequency analyzer Chris Darwin.
CS 551/651: Structure of Spoken Language Lecture 11: Overview of Sound Perception, Part II John-Paul Hosom Fall 2010.
Hearing and Deafness Outer, middle and inner ear.
Auditory Neuroscience - Lecture 7 Hearing Aids and Cochlear Implants aauditoryneuroscience.com/lectures.
MIMICKING THE HUMAN EAR Philipos Loizou (author) Oliver Johnson (me)
More From Music music through a cochlear implant Dr Rachel van Besouw Hearing & Balance Centre, ISVR.
Pitch organisation in Western tonal music. Pitch in two dimensions Pitch perception in music is often thought of in two dimensions, pitch height and pitch.
INTRODUCTION TO HEARING. WHAT IS SOUND? amplitude Intensity measured in decibels.
Sensation and Perception - audition.ppt © 2001 Laura Snodgrass, Ph.D.1 Audition Anatomy –outer ear –middle ear –inner ear Ascending auditory pathway –tonotopic.
Meena Ramani 04/10/06 EEL 6586 Automatic Speech Processing.
M.Sc. in Medical Engineering
A.Diederich– International University Bremen – Sensation and Perception – Fall Frequency Analysis in the Cochlea and Auditory Nerve cont'd The Perception.
PH 105 Dr. Cecilia Vogel Lecture 10. OUTLINE  Subjective loudness  Masking  Pitch  logarithmic  critical bands  Timbre  waveforms.
Interrupted speech perception Su-Hyun Jin, Ph.D. University of Texas & Peggy B. Nelson, Ph.D. University of Minnesota.
Hearing and Deafness 1. Anatomy & physiology Chris Darwin Web site for lectures, lecture notes and filtering lab:
Fitting Formulas Estimate amplification requirements of individual patients Maximize intelligibility of speech Provide good overall sound quality Keep.
Structure and function
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
TOPIC 4 BEHAVIORAL ASSESSMENT MEASURES. The Audiometer Types Clinical Screening.
Chapter 6: The Human Ear and Voice
Hearing Aids and Hearing Impairments
EE Audio Signals and Systems Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Psychoacoustics: Sound Perception Physics of Music, Spring 2014.
Hearing Impairment Hair cells are responsible for translating mechanical information into neural information. Thus, with damaged hair cells, the auditory.
Humans can hear sounds at frequencies from about 20Hz to 20,000Hz.
KTH ROYAL INSTITUTE OF TECHNOLOGY Sound, the Auditory System, and Pitch Perception Roberto Bresin DT2350 Human Perception for Information Technology Copyright.
Audiograms How to read them and what they are.. Terms to know: Audiogram An audiogram is a means of recording the results of a hearing test. It will include.
Cochlear Implants American Sign Language Children & Cochlear Implants Psychological Evaluation of Implant Candidates James H. Johnson, Ph.D., ABPP Department.
Test of Hearing And Pure tone Audiometry
Hearing Chapter 5. Range of Hearing Sound intensity (pressure) range runs from watts to 50 watts. Frequency range is 20 Hz to 20,000 Hz, or a ratio.
Hearing Subtitle. The Physics of Sound  Frequency: The number of cycles a sound wave completes in a given period of time  Amplitude: the Strength of.
Department of Electrical & Computer Engineering Auditory Perception Meena Ramani 04/09/2004.
BY: SAM SPINK COCHLEAR IMPLANT. WHAT IS IT? System of external and internal devices that aim to restore hearing to sensorineural hearing loss patients.
Chapter 5: Normal Hearing. Objectives (1) Define threshold and minimum auditory sensitivity The normal hearing range for humans Define minimum audible.
Md.Kausher ahmed Electrical department. Biomedical engineering Code:6875.
IB Assessment Statements Option I-1, The Ear and Hearing: I.1.1.Describe the basic structure of the human ear. I.1.2.State and explain how sound pressure.
COCHLEAR IMPLANTS Brittany M. Alphonse Biomedical Engineering BME 181.
Chapter 11: Hearing.
Chapter 4 Sensation What Do Sensory Illusions Demonstrate? Streams of information coming from different senses can interact. Experience can change the.
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Otoacoustic Emissions
Fundamentals of Sensation and Perception
Humans can hear sounds at frequencies from about 20Hz to 20,000Hz.
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
Pitch What is pitch? Pitch (as well as loudness) is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether.
SOUND PRESSURE, POWER AND LOUDNESS
HEARING- 3. LEARNING OBJECTIVES LEARNING OBJECTIVES Discuss the principles used in performing tests of hearing Discuss the principles used in performing.
SPATIAL HEARING Ability to locate the direction of a sound. Ability to locate the direction of a sound. Localization: In free field Localization: In free.
Design a New Speech Encoder for Cochlear Implants Using DRNL model. Prepared By: Koka Gopi 3 rd sem, M.tech Communication Systems.
Hearing or audition.
Sound, the Auditory System, and Pitch Perception
Oregon Health & Science University
Hearing tests.
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
PSYCHOACOUSTICS A branch of psychophysics
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Chapter 6 (D): Hearing.
Your Ear…. Your Ear…..
Hearing, not trying out for a play
Psychoacoustics: Sound Perception
Sound in the Ear Presentation
Ms. Saint-Paul A.P. Psychology
Hearing Loss, Hearing Aids and Cochlear Implants
Presentation transcript:

Meena Ramani 04/12/06 EEL 6586 Automatic Speech Processing

Topics to be covered The incredible sense of hearing 1 Lecture 1: The incredible sense of hearing 1 Anatomy Perception of Sound The incredible sense of hearing 2 Lecture 2: The incredible sense of hearing 2 Psychoacoustics Hearing aids and cochlear implants

The incredible sense of hearing-2 “ Behind these unprepossessing flaps lie structures of such delicacy that they shame the most skillful craftsman" -Stevens, S.S. [Professor of Psychophysics, Harvard University]

How do we hear?

Threshold of Hearing

Equal loudness curves

The Bass Loss Problem Rock music Too low  no bass Too high  too much bass

Threshold variation with age

The Audiogram

The Audiogram (contd.) Pure tone audiogram [ K 2K 4K 6k] Hz <20 dB HL is Normal Hearing

Loudness Growth Curve

Otoacoustic emissions The ear produces some sounds! –OHC-outer hair cell Used to test hearing for infants & check if patient is feigning a loss

Monoaural beats If two tones are presented monaurally with a small frequency difference, a beating pattern can be heard 500 & 502 Hz500 & 520 Hz Interaction of the two tones in the same auditory filter Waveform: 150 Hz Hz

Beating can also be heard when the tones are presented to different ears! Beating arises from neural interaction Only perceived if the tones are sufficiently close in frequency 500 Hz - left520 Hz - rightbinaural Binaural beats

The case of the missing fundamental Telephone BW: Hz How do we know the pitch? Primary Auditory cortex Pitch sensitive neurons [Bendor and Wang, Nature 2005] Neuron responds to fundamental and harmonics What are the I/Ps to these neuron? How do spikes represent periodic, temporal and spectral information?

Matlab code available Feed it a wav file Spits out PSTH Auditory-periphery model (Zhang et al. ~2001)

Critical bands Equally loud, close in frequency Same IHCs Slightly louder Equally loud, separated in freq. Different IHCs Twice as loud Psychoacoustic experiments

Critical Band (cont.) Proposed by Fletcher How to measure? –S/N ratio vs noise BW CB ~= 1.5mm spacing on BM 24 such band pass filters BW of the filters increases with f c Logarithmic relationship –Weber’s law example Bark scale Center FreqCritical BW

Critical bands for HI

“You know I can't hear you when the water is running!” MASKINGMASKING

Frequency Masking Masking occurs because two frequencies lie within a critical band and the higher amplitude one masks the lower amplitude signal Masking can be because of broad band, narrowband noise, pure and complex tones Low frequency broad band sounds mask the most –Eg. Truck on road, water flowing Masking threshold –Amount of dB for test tone to be just audible in presence of noise

Temporal Aspects of Masking Simultaneous Masking Pre-Stimulus/Backward/Premasking –1 st test tone 2 nd Masker Poststimulus/Forward/Postmasking –1 st Masker 2 nd test tone

Simultaneous masking –Duration >200ms constant test tone threshold –Assume hearing system integrates over a period of 200ms Postmasking –Decay in effect of masker for 100ms –More dominant Premasking –Takes place 20ms before masker is on!! –Each sensation is not instantaneous, requires build-up time Quick build up for loud maskers Slower build up for softer maskers –Less dominant effect Temporal Aspects of Masking (contd.)

Temporal masking for HI

Meena Ramani 04/14/06 EEL 6586 Automatic Speech Processing

Normal Hearing Sensorineural Hearing Loss Mild to Severe Loss [ ] dB HL What do the hearing impaired hear?

Facts on Hearing Loss in Adults One in every ten (28 million) Americans has hearing loss. The vast majority of Americans (95% or 26 million) with hearing loss can have their hearing loss treated with hearing aids. Only 6 million use HAs Millions of Americans with hearing loss could benefit from hearing aids but avoid them because of the stigma.

Types of Hearing aids Behind The ear In the Ear In the Canal Completely in the canal

Anatomy of a Hearing Aid Microphone Tone hook Volume control On/off switch Battery compartment

Ear Mold Measurements Hearing Aid Fitting

Acclimatization effect Auditory cortex  brain plasticity Time for the HI to reuse the HF information: Acclimatization effect How does this affect HA fitting? –Multiple fitting sessions –Initial fitting should be optimum one

So doc, what is the fitting methodology employed by the hearing aid company to compensate for my hearing loss? Not-so-average Joe (PhD EE/Speech person) CONFIDENTIAL ?

So, do you want your HA to: 1)Always be comfortably loud 2)Equalize loudness across frequencies 3)Normalize loudness …? ? best Which fitting methodology is the best ?

Existing HL compensation algorithms Rationale  Adhoc: Half Gain, POGO  Make speech comfortable: NAL-R  Loudness normalization: IHAFF, Fig 6  Loudness equalization: DSL

Sensorineural hearing loss [ ] dB HL Speech level= 65 dBA Spectrograms and sound files Normal hearing Hearing impairedHI with Linear gain HI with DSL gain HI with RBC gain Section Two

Speech Intelligibility Objective Measures AI, STI Speech Quality Objective Measures PESQ Subjective Measures MOS Speech Intelligibility (SI): The degree to which speech can be understood Performance metrics Subjective Measures HINT Speech Quality: “Does the speech match your expectations?”

Performance metrics (contd.) Objective speech quality measure –Perceptual Evaluation of Subjective Quality (PESQ) Subjective speech quality measure –Mean Opinion Score (MOS) Subjective speech intelligibility measure –Hearing In Noise Test (HINT) Reference signal Comparison signal Score

Hearing In Noise Test (HINT)

Subjective listening experiments Audiograms of the HI patients Location: Shands speech & hearing clinic (sound proof booth) Subjects: 15 HI people –PTA: dB HL 15 normal hearing people Tools used: Matlab HINT and MOS GUIs

Subjective HINT and MOS scores for RBC: hearing impaired, cell phone speech RBC has a 7 dB improvement in SI when compared to DSL MOS scores reveal that RBC has a quality rating of ‘Good’

Subjective HINT and MOS scores for RBC: normal hearing, cell phone speech RBC has a 12 dB improvement in SI when compared to DSL MOS scores reveal that RBC has a quality rating of ‘Good’

Cochlear Implants The first fully functional Brain Machine Interface (BMI) Definition: A device that electrically stimulates the auditory nerve of patients with severe-to- profound hearing loss to provide them with sound and speech information

Who is a candidate? Severe-to profound sensorineural hearing loss Hearing loss did not reach severe-to- profound level until after acquiring oral speech and language skills Limited benefit from hearing aids

Worldwide: –Over 100,000 multi-channel implants At Univ of Florida: –Implanted first patient in 1985 –Currently follow over 400 cochlear patients CI statistics

Technical and Safety Issues Magnetic Resonance Imaging Surgical issues

How does the Cochlea encode frequencies?

Example: New Freedom

CI characteristics 1.Electrode design –Number of electrodes, electrode configuration 2. Type of stimulation –Analog or pulsatile 3. Transmission link –Transcutaneous or percutaneous 4. Signal processing –Waveform representation or feature extraction

Signal processing Compressed Analog (CA) Continuous Interleaved Sampling (CIS) Multiple Peak (MPEAK ) Spectral Maxima Sound Processor (SMSP) Spectral Peak (SPEAK)

Compressed Analog (CA) approach

CA activation signals

Continuous Interleaved Sampling (CIS)

CIS activation signals

Multiple Peak (MPEAK)

MPEAK activated electrodes

Spectral Maxima Sound Processor (SMSP)

SMSP activated electrodes

Spectral Peak (SPEAK)

SPEAK activated electrodes

Outcomes for Post-lingual Adults Wide range of success Most score % on AV sentence materials Majority score > 80% on high context Performance more varied on single word tests

Auditory Brainstem Implant Approved October 20, 2000 Uses the Nucleus 24 system processors Plate array with 21 electrodes

Review-1 Pinna: ITDs,IIDs: Horizontal localization Reflections: Vertical localization Ear canal: ¼ wave resonance 1-3 kHz Middle ear: Amplification by lever action and by area Stapedius reflex Cochlea: IHCs/OHCs: convert mechanical to electrical Place theory: frequency analysis Missing fundamental

Review-2 Adaptation: AN firing sensitive to changes Otoacoustic emissions: Produced by movement of OHCs Beats: Monaural & binaural Measurement of hearing: Audiogram: threshold of hearing Threshold variation with age Equal loudness curves Bass loss problem: discrimination against LFs

Review-3 Critical bands: used for efficient encoding Bark scale Masking: Frequency: LFs mask more Temporal: simultaneous, pre and post Hearing impairment: Hearing aids: external to cochlea Cochlear implants: inside cochlea