Human Auditory Sensation

Slides:



Advertisements
Similar presentations
ANATOMY AND PHYSIOLOGY OF THE EAR
Advertisements

Hearing and Deafness 1. Anatomy & physiology Chris Darwin Web site for lectures, lecture notes and filtering lab:
Perception Chapter 11: Hearing and Listening
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Psychoacoustics Riana Walsh Relevant texts Acoustics and Psychoacoustics, D. M. Howard and J. Angus, 2 nd edition, Focal Press 2001.
Periodicity and Pitch Importance of fine structure representation in hearing.
HEARING Sound How the Ears Work How the Cochlea Works Auditory Pathway
Hearing and Deafness 2. Ear as a frequency analyzer Chris Darwin.
CS 551/651: Structure of Spoken Language Lecture 11: Overview of Sound Perception, Part II John-Paul Hosom Fall 2010.
Pitch organisation in Western tonal music. Pitch in two dimensions Pitch perception in music is often thought of in two dimensions, pitch height and pitch.
HEARING.
The peripheral auditory system David Meredith Aalborg University.
Pitch Perception.
INTRODUCTION TO HEARING. WHAT IS SOUND? amplitude Intensity measured in decibels.
Sensation and Perception - audition.ppt © 2001 Laura Snodgrass, Ph.D.1 Audition Anatomy –outer ear –middle ear –inner ear Ascending auditory pathway –tonotopic.
Structure of human ear Understanding the processes of human auditory system are key for posting requirements for architectural acoustics. This gives us.
Meena Ramani 04/10/06 EEL 6586 Automatic Speech Processing.
Audition. Sound Any vibrating material which can be heard.
A.Diederich– International University Bremen – Sensation and Perception – Fall Frequency Analysis in the Cochlea and Auditory Nerve cont'd The Perception.
Sensory Systems: Auditory. What do we hear? Sound is a compression wave: When speaker is stationary, the air is uniformly dense Speaker Air Molecules.
Hearing and Deafness 1. Anatomy & physiology Chris Darwin Web site for lectures, lecture notes and filtering lab:
The Auditory System. Audition (Hearing)  Transduction of physical sound waves into brain activity via the ear. Sound is perceptual and subjective. 
Chapter 6: The Human Ear and Voice
Unit 4: Sensation & Perception
S 319 < Auditory system >
Hearing.
From Vibration to Sound
EE Audio Signals and Systems Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Beats and Tuning Pitch recognition Physics of Music PHY103.
The Auditory System Dr. Kline FSU. What is the physical stimulus for audition? Sound- vibrations of the molecules in a medium like air. The hearing spectrum.
KTH ROYAL INSTITUTE OF TECHNOLOGY Sound, the Auditory System, and Pitch Perception Roberto Bresin DT2350 Human Perception for Information Technology Copyright.
Hearing: How do we hear?. Hearing: The Nature of Sound Module 9: Sensation.
1 Hearing or Audition Module 14. Hearing Our auditory sense.
The ear and perception of sound (Psychoacoustics) General Physics Version Updated 2014July07 1.
Hearing: How do we hear?. Hearing: The Nature of Sound Module 9: Sensation.
Hearing Chapter 5. Range of Hearing Sound intensity (pressure) range runs from watts to 50 watts. Frequency range is 20 Hz to 20,000 Hz, or a ratio.
The Ear.
IB Assessment Statements Option I-1, The Ear and Hearing: I.1.1.Describe the basic structure of the human ear. I.1.2.State and explain how sound pressure.
Sound and the human ear. Sound Sound radiates from the point source in all directions Sound intensity is power / Area Spherical area is 4πr 2 so sound.
SOUND & THE EAR. Anthony J Greene2 Sound and the Ear 1.Sound Waves A.Frequency: Pitch, Pure Tone. B.Intensity C.Complex Waves and Harmonic Frequencies.
Chapter 11: Hearing.
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Pitch Perception Or, what happens to the sound from the air outside your head to your brain….
HEARING. The Nature of Sound Sound, like light, comes in waves Sound is vibration Features of sound include: –Pitch / Hertz – Loudness / Decibels.
52 The Sense of Hearing Dr. A.R. Jamshidi Fard 2011.
Fundamentals of Sensation and Perception
Hearing: How do we hear?. Hearing: The Nature of Sound Module 9: Sensation.
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
Applied Psychoacoustics Lecture 4: Pitch & Timbre Perception Jonas Braasch.
SOUND PRESSURE, POWER AND LOUDNESS
The Nature of Sound Sound, like light, comes in waves Sound is vibration Features of sound include: –Pitch –Hertz –Decibels.
Physics Acoustics for Musicians selected slides, March 5, 2002 Loudness at different frequencies Critical band Masking The ear Neurological response.
SPATIAL HEARING Ability to locate the direction of a sound. Ability to locate the direction of a sound. Localization: In free field Localization: In free.
4. The Ear and the Perception of Sound (Psychoacoustics) Updated May 13,
PSY2301: Biological Foundations of Behavior The Auditory System Chapter 10.
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
PSYCHOACOUSTICS A branch of psychophysics
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Ear and the hearing process.
Hearing. (Perception of Sound)
Hearing. (Perception of Sound)
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Hearing. (Perception of Sound)
Section 14.3 Hearing and Equilibrium
Psychoacoustics: Sound Perception
How We Hear.
Hearing: How do we hear?.
Fundamentals of Sensation and Perception
Fundamentals of Sensation and Perception
Presentation transcript:

Human Auditory Sensation Topic 3 Human Auditory Sensation (Some slides are adapted from Bryan Pardo’s course slides on Machine Perception of Music)

The Ear Outer Ear Middle Ear Inner Ear Ossicles (Middle ear Bones) Pinna Cochlea (inner ear) Ossicles (Middle ear Bones) Outer Ear Middle Ear Inner Ear Concha Eustachian tube ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Function of the Ear Outer ear: shape the sound spectrum Torso, head, pinna: head-related transfer function (HRTF). Interaural difference. Concha, canal: increase sound level of about 10-15dB between 1.5k-7kHz, due to resonances Middle ear: effective and efficient transfer Eardrum: effective area about 55 mm^2 (where the oval window is about 3 mm^2 size. Three ossicles: a lever system The last ossicle is called stapes, the smallest bone in the human body ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Function of the Ear Inner ear: Vestibular system: sense of balance Eustachian tube: provides ventilation to middle ear Cochlea: the primary auditory organ of the inner ear ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

The Chochlea Apex (wide and flexible) Base (narrow and stiff) Round window Each point on the Basilar membrane resonates to a particular frequency At the resonance point, the membrane moves ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Gammatone Filterbank Impulse response (n=4) 𝑔 𝑡 =𝑎 𝑡 𝑛−1 𝑒 −2𝜋𝑏 cos 2𝜋𝑓𝑡+𝜑 𝑢(𝑡) gamma tone ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Cross Section of Cochlea (from Sacheli et al. 2013) Basilar membrane Outer Hair Cell (OHC) Inner Hair Cell (IHC) When the membrane moves, it moves hairs. When hairs move, they fire nerve impulses. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Hair Cells Inner hair cell Outer hair cell The actual transducer Outer hair cell Makes small but very fast movements (maybe ~100kHz) Feedforward amplifiers, nonlinear They are damaged by age and hard to regrow Let’s look at a dance by an outer hair cell! http://www.youtube.com/watch?v=Xo9bwQuYrRo ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Auditory Nerve Fibers Each IHC -> 10-20 type I fibers ~10 OHCs -> ~6 type II fibers This is the evidence that IHCs are the actual transducers. Fibers are arranged to maintain the tonotopy of basilar membrane. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Frequency Sensitivity single nerve measurements (roughly) symmetric in log of frequency Basilar membrane ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Encoding Loudness More spikes/second = louder More fibers firing = louder ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Phase Locking Half-wave rectification For high frequency tones, the fibers phase lock to low frequency modulations. (from Chris Darwin) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Poststimulus Time Histogram (PSTH) Single fiber firing pattern to a click sound A half-wave rectified gammatone function! time 1/central frequency ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

PSTH to a Tone Burst Tone duration (from Gelfand 1998) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Coding Frequency Information Frequencies under 5 kHz Individual harmonics are resolved by the cochlea Coded by place (which nerve bundles along the cochlea are firing) Coded by time (nerves fire in synchrony to harmonics) Frequencies over 5 kHz Individual harmonics can’t be resolved by the inner ear and the frequency is revealed by temporal modulations of the waveform amplitude (resulting in synched neuron activity) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Loudness Loudness is a subjective measure of sound pressure (or intensity). How does the intensity of a sound relate to its perceived loudness? Does frequency matter? Is broadband noise different from narrow band? How can we find out? ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Measuring Frequency’s Effect Pick a reference frequency (like 1000Hz) Play a sine wave of a defined intensity at that frequency (say, 30 dB SPL) Pick another frequency (any one) Play a sine wave at the new frequency, 𝑓 Adjust the intensity of the sine at 𝑓 until its loudness equals the reference ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Equal Loudness Contours ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Phons The phon is a unit of perceived loudness for pure tones. The purpose of the phon scale is to compensate for the effect of frequency on the perceived loudness of tones. By definition, 1 phon is equal to 1 db SPL at a frequency of 1000 Hz. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Sensitivity to Loudness ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

JND The just noticeable difference (JND) is the smallest difference in sensory input that is detectable by a human being. It is also known as the difference limen (DL) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Difference Limens ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Weber’s Law Weber’s Law (named after Ernst Heinrich Weber, 1795-1878) attempts to describe the relationship between the physical magnitudes of stimuli and the perceived intensity of the stimuli. DL in intensity is proportional to the intensity itself, i.e. ∆𝐼 𝐼 is constant. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

The Sone The sone is a unit of perceived loudness, proposed by S. Stevens in 1936. At 1kHz, 1 sone = 40 phons = 40 dB-SPL A stimulus that is n sones loud is judged to be n times as loud as 1 sone. Twice loudness 10 phones, 10 dB ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Masking A loud tone masks perception of tones at nearby frequencies 1000 Hz 1000_975_20dB 1000_975_6dB 1000_475_20dB ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Critical Band Critical band – the frequency range over which a pure tone interferes with perception of other pure tones. Think about the masked threshold as a bandpass filter. How to measure the bandwidth? ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Equivalent Rectangular Bandwidth (ERB) ERB Hz =24.7 4.37 f kHz +1 The bandwidth increases with frequency. ERB frequency ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Frequency Difference Limen The smallest difference between the frequencies of two sine tones that can be discriminated correctly 75% of the time. 1000Hz 1001Hz 1002Hz 1005Hz ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Pitch (ANSI 1994 Definition) That attribute of auditory sensation in terms of which sounds may be ordered on a scale extending from low to high. Pitch depends mainly on the frequency content of the sound stimulus, but also depends on the sound pressure and waveform of the stimulus. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Pitch (Operational) A sound has a certain pitch if it can be reliably matched to a sine tone of a given frequency at 40 db SPL ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Pitch and Intensity Stevens Rule The pitch of low frequency (below 1000Hz) sine tones decreases with increasing intensity (low loud sounds go flat). The pitch of high frequency tones (over 3000 Hz) increases with intensity (high loud sounds go sharp) 220Hz 7040Hz ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Mel Scale A perceptual scale of pitches judged by listeners to be equal in distance from one another. The reference point between this scale and normal frequency measurement is defined by equating a 1000 Hz tone, 40 dB SPL, with a pitch of 1000 mels. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Mel Scale Mel=2595 log 10 (1+ 𝑓 700 ) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Mel Scale Above about 500 Hz, larger and larger intervals are judged by listeners to produce equal pitch increments. The name mel comes from the word melody to indicate that the scale is based on pitch comparisons. Proposed by Stevens, Volkman and Newman (Journal of the Acoustic Society of America 8(3), pp 185-190, 1937) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Ear Craziness Binaural Diplacusis Otoacoustic Emissions Left ear hears a different pitch from the right. Can be up to 4% difference in perceived pitch Otoacoustic Emissions Ears sometimes make noise. Thought to be a by-product of the sound amplification system in the inner ear. Caused by activity of the outer hair cells in the cochlea. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Harmonic Sound A complex sound with strong sinusoid components at integer multiples of a fundamental frequency. These components are called harmonics or overtones. Sine waves and harmonic sounds are the sounds that may give a perception of “pitch” ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Classify Sounds by Harmonicity Sine wave Strongly harmonic Oboe Clarinet ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Classify Sounds by Harmonicity Somewhat harmonic (quasi-harmonic) Marimba Human voice ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Classify Sounds by Harmonicity Inharmonic Gong ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Frequency (often) equals pitch Complex tones Strongest frequency? Lowest frequency? Something else? Let’s listen and explore… ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Hypothesis Pitch is determined by the lowest strong frequency component in a complex tone. Oboe ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

The Missing Fundamental Frequency (linear) Time ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Hypothesis Pitch is determined by the lowest strong frequency component in a complex tone. The case of the missing fundamental proves that ain’t always so. ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Hypothesis – “It’s complicated” We hear which frequency components are loudest We hear if frequencies are integer multiples of the same common frequency We hear if frequencies are regularly spaced 300:200:2100 Hz 200Hz 300Hz Amp 300 500 700 900 1100 1300 1500 1700 1900 2100 Freq (Hz) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Pitch and Music How do we tune pitch in music? How do we represent pitch in music? How do we represent the relation of pitches in music? ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

𝑓(𝑛)= 2 𝑛 12 𝑓 ref Equal Temperament Octave is a relationship by power of 2. There are 12 half-steps in an octave 𝑓(𝑛)= 2 𝑛 12 𝑓 ref number of half-steps from the reference pitch frequency of desired pitch frequency of the reference pitch ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Measurement 𝑐=1200 log 2 𝑓 𝑓 ref 100 Cents in a half step 2 half steps in a whole step 12 half steps in an octave Number of cents 𝑐=1200 log 2 𝑓 𝑓 ref Virtual keyboard ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

A=440 Equal tempered tuning ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Musical Intervals (from C) ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Interval Names ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Some Magic C  C: 12 half-steps, 2 12 12 = 2 1 0 1 2 3 4 5 6 7 8 9 10 11 12 Half-steps: C  C: 12 half-steps, 2 12 12 = 2 1 C -> G: 7 half-steps, 2 7 12 =1.4983≈ 3 2 C  F: 5 half-steps, 2 5 12 =1.3348≈ 4 3 C  E: 4 half-steps, 2 4 12 =1.2599≈ 5 4 Are these just coincidence? ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Related to Standing Waves How about defining pitches this way, so that they sound more harmonic? ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Pythagorean Tuning Frequency ratios of all intervals are based on the ratio 3:2, i.e. perfect fifth (P5), which is 7 half-steps. F C G D A E B ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Circle of Fifths ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Problem with Pythagorean Tuning One octave = 2f A perfect 5th = (3/2)f What happens if you go around the circle of 5ths to get back to your original pitch class? (3/2)12 = 129.75 Nearest octave is 27 = 128 128 != 129.75 Not convenient for key changes ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013

Overtone Series Approximate notated pitch for the harmonics (overtones) of a frequency f 2f 3f 4f 5f 6f 7f 8f 9f 10f 11f 12f C C G C E G Bb C D E F# G ECE 492 - Computer Audition and Its Applications in Music, Zhiyao Duan 2013