Applied Psychoacoustics Lecture 4: Pitch & Timbre Perception Jonas Braasch.

Slides:



Advertisements
Similar presentations
Week 1: *Fundamentals of Sound *Sound Levels and the Decibel
Advertisements

© 2010 Pearson Education, Inc. Conceptual Physics 11 th Edition Chapter 21: MUSICAL SOUNDS.
Scaling. Scaling seeks to discover how varying the physical parameters of the stimulus affects the psychological parameters. In general, scaling is concerned.
Hearing relative phases for two harmonic components D. Timothy Ives 1, H. Martin Reimann 2, Ralph van Dinther 1 and Roy D. Patterson 1 1. Introduction.
The Physics of Sound Sound begins with a vibration of an object Vibrating object transfers energy to air medium All complex vibration patterns seen as.
Timbre perception. Objective Timbre perception and the physical properties of the sound on which it depends Formal definition: ‘that attribute of auditory.
Periodicity and Pitch Importance of fine structure representation in hearing.
Physics 1251 The Science and Technology of Musical Sound Unit 1 Session 8 Harmonic Series Unit 1 Session 8 Harmonic Series.
Hearing and Deafness 2. Ear as a frequency analyzer Chris Darwin.
CS 551/651: Structure of Spoken Language Lecture 11: Overview of Sound Perception, Part II John-Paul Hosom Fall 2010.
Lecture 8  Perceived pitch of a pure tone  Absolute pitch  Midterm review Instructor: David Kirkby
Pitch Perception.
PITCH AND TIMBRE MUSICAL ACOUSTICS Science of Sound Chapter 7.
EE Audio Signals and Systems Psychoacoustics (Pitch) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Sensation and Perception - audition.ppt © 2001 Laura Snodgrass, Ph.D.1 Audition Anatomy –outer ear –middle ear –inner ear Ascending auditory pathway –tonotopic.
Sound Chapter 13.
A.Diederich– International University Bremen – Sensation and Perception – Fall Frequency Analysis in the Cochlea and Auditory Nerve cont'd The Perception.
A.Diederich– International University Bremen – USC – MMM – Spring 5 1 The Perception of Frequency cont'd.
On Timbre Phy103 Physics of Music. Four complex tones in which all partials have been removed by filtering (Butler Example 2.5) One is a French horn,
The Science of Sound Chapter 8
Hearing & Deafness (4) Pitch Perception 1. Pitch of pure tones 2. Pitch of complex tones.
PH 105 Dr. Cecilia Vogel Lecture 13. OUTLINE  Timbre and graphs:  Time graph  Spectrum graph  Spectrogram  Envelope  scales  units  interval factors.
Rob van der Willigen designed by Stephanie Thái.
Chapter 6: The Human Ear and Voice
© 2010 Pearson Education, Inc. Conceptual Physics 11 th Edition Chapter 21: MUSICAL SOUNDS Noise and Music Musical Sounds Pitch Sound Intensity and Loudness.
Basic Concepts: Physics 1/25/00. Sound Sound= physical energy transmitted through the air Acoustics: Study of the physics of sound Psychoacoustics: Psychological.
Harmonics, Timbre & The Frequency Domain
The ear and perception of sound (Psychoacoustics) Updated 2013Aug05 1.
Chapter 21 Musical Sounds Noise Versus Music Pitch Pitch Loudness Loudness Quality Quality.
Chapter 12 Preview Objectives The Production of Sound Waves
Beats and Tuning Pitch recognition Physics of Music PHY103.
COMBINATION TONES The Science of Sound Chapter 8 MUSICAL ACOUSTICS.
On Timbre Phy103 Physics of Music
Applied Psychoacoustics Lecture 4: Loudness Jonas Braasch.
Chapter 13 - Sound 13.1 Sound Waves.
The ear and perception of sound (Psychoacoustics) General Physics Version Updated 2014July07 1.
David Meredith Aalborg University
Chapter 7: Loudness and Pitch. Loudness (1) Auditory Sensitivity: Minimum audible pressure (MAP) and Minimum audible field (MAF) Equal loudness contours.
Applied Psychoacoustics Lecture 3: Masking Jonas Braasch.
© Houghton Mifflin Harcourt Publishing Company Preview Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect Chapter 12.
Fundamentals of Sensation and Perception EXPLORING PERCEPTION BY STUDYING BEHAVIOUR ERIK CHEVRIER SEPTEMBER 16 TH, 2015.
Applied Psychoacoustics Lecture 3: Masking Jonas Braasch.
MUSICAL ACOUSTICS PITCH AND TIMBRE Science of Sound Chapter 7.
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Hearing: Physiology and Psychoacoustics 9. The Function of Hearing The basics Nature of sound Anatomy and physiology of the auditory system How we perceive.
Applied Psychoacoustics Lecture 2: Basic Measurement Methods, Signal Detection Theory Jonas Braasch.
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Chapter 21 Musical Sounds.
Pitch Perception Or, what happens to the sound from the air outside your head to your brain….
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
Closed Pipe Pipe closed at ONE end: closed end pressure antinode air press. L = /4 L.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.
Pitch What is pitch? Pitch (as well as loudness) is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether.
Response Processes Psych DeShon. Response Elicitation Completion Completion Requires production Requires production Allows for creative responses.
Applied Psychoacoustics Lecture 4: Pitch & Timbre Perception Jonas Braasch.
Chapter 12 Preview Objectives The Production of Sound Waves
SOUND PRESSURE, POWER AND LOUDNESS
Physics Mrs. Dimler SOUND.  Every sound wave begins with a vibrating object, such as the vibrating prong of a tuning fork. Tuning fork and air molecules.
SPATIAL HEARING Ability to locate the direction of a sound. Ability to locate the direction of a sound. Localization: In free field Localization: In free.
4. The Ear and the Perception of Sound (Psychoacoustics) Updated May 13,
Sound, Waves, uh yea.. Sound! Come  Pitch, loudness, and timbre are all perceived attributes of sound.  Pitch is the perceived frequency.
COMBINATION TONES The Science of Sound Chapter 8 MUSICAL ACOUSTICS.
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
PSYCHOACOUSTICS A branch of psychophysics
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Pitch What is pitch? Pitch (as well as loudness) is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether.
Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.
Pitch What is pitch? Pitch (as well as loudness) is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether.
Classical psychophysics
Principles of perception
Presentation transcript:

Applied Psychoacoustics Lecture 4: Pitch & Timbre Perception Jonas Braasch

Homework 2: Raw Data

Homework 2: Mean

Homework 2: Mean+STD

Mean and Standard Deviation mean Standard deviation

Homework 2: Mean vs. Median Mean, blue Median, red

Homework 2: Median

Statistics for non-Gaussian distributions Median: is a number that separates the higher half of a sample, a population, or a probability distribution from the lower half.probability distribution Quartiles: –first quartile (designated Q1) = lower quartile = cuts off lowest 25% of data = 25th percentilepercentile –second quartile (designated Q2) = median = cuts data set in half = 50th percentilemedianpercentile –third quartile (designated Q3) = upper quartile = cuts off highest 25% of data, or lowest 75% = 75th percentile percentile

Homework 2: Median

Homework 2: Median+Quartiles

Homework 2: Mean vs. Median Mean, blue Median, red

Homework 2: Median+Quartiles

HW 2: Median+Quartiles+Range

HW2: left ear vs. right ear left ear, blue right ear, red

Euclid ( BC) Some sounds are higher pitched, being composed of more frequent and more numerous motions

Contents Pitch perception –Pure Tones –Place and Rate Theory –Complex Tones Timbre

IntervalSemitonesFreq. Ratio Prime01:1 Minor second116:15 Major second29:8 Minor third36:5 Major third45:4 Perfect Fourth54:3 Augmented fourth Diminished fifth 645:32 64:45 Perfect Fifth73:2 Minor sixth88:5 Major sixth95:3 Minor seventh1016:9 Major seventh1115:8 Perfect Octave122:1

IntervalEqual temp.Just intonation Prime00 Minor second Major second Minor third Major third Perfect Fourth Augmented fourth Diminished fifth Perfect Fifth Minor sixth Major sixth Minor seventh Major seventh Perfect Octave1200 cents

Equal Temperament One semitone equals: 12 √2=1.0595=5.9463% One cent: 1200 √2=1.0006=0.059% Perfect fifth: 1.5=700 cent=50% Perfect Octave: 2=1200 cent=100%

Psychometric Function Describes the relationship between a physical parameter and its psychological correlate Example: Phon-Sone conversion

Weber-Fechner Law The earliest scientific approach to measuring a psychometric function Ernst H. Weber ( ) investigated just noticeable differences (JNDs) for lifting weights with the hand. The subjects were blindfolded and the weight was gradually increased until they were able to detect a difference. He noticed that the JNDs were proportional to the overall weight. (e.g., if the JND for a 100 g weigth was 10 g, the JND for a 1000 g weight was 100 g). If the mass is doubled, the threshold is also doubled.

Gustav T. Fechner ( ) later developed the Weber-Fechner Law from Weber’s findings: S=klog(I/I 0 ) With I the physical parameter (Intensity), S its psychophysical correlate, and k a constant, and I 0 the detection threshold of I. The JND is then: dS=klog(dI/I) Weber-Fechner Law

Fechner’s indirect scales 0 sensation units (0 JND of sensation) stimulus intensity at absolute detection threshold 1 sensation unit (1 JND of sensation) stimulus intensity that is 1 difference threshold above absolute threshold 2 sensation units (2 JND of sensation) stimulus intensity that is 1 difference threshold above the 1-unit stimulus

Fechner’s Law

Pitch Pitch is often thought to be perceived logarithmically: Frequencyoctave 440 Hz1 st 880 Hz2 nd 1760 Hz3rd But for other psychophysical correlates, this logarithmic relationship does not hold true…

Stevens’ Power Law Stevens’ was able to provide a general formula to relate sensation magnitudes to stimulus intensity: S = aI m Here, the exponent m denotes to what extent the sensation is an expansive or compressive function of stimulus intensity. The purpose of the coefficient a is to adjust for the size of the unit of measurement. log S = m log(I-I0) + log a

Examples for Steven’s Power Law

Examples for Steven’s Power Law Exponents

… and now in the log-log space

Definition of Pitch Pitch is that attribute of auditory sensation in terms of which sounds may be ordered on a scale extending from low to high. Pitch depends mainly on the frequency content of the sound stimulus, but it also depends on the sound pressure and the waveform of the stimulus. ANSI standard 1994

The mel scale Stevens, Volkmann & Newmann, 1937 Five listeners were asked to judge a the frequency of a second sinusoidal tone generator to be perceived half the Magnitude of the first oscillator with constant frequency (method of adjustment) Sound was switched between both oscillators (2-s interval) 60 dB SPL Stevens, Volkmann & Newmann, 1937

Mel Scale - Raw Data Stevens, Volkmann & Newmann, 1937 Geometric means for five observers, and average error for 2 listeners

Def.: 1000 mels= 1000 Hz at 40 dB Stevens, Volkmann & Newmann, 1937

Solid line: mel scale /2.83; Black squares: integrated difference limens; open circles: relative location of the resonant positions on the basilar membrane

Size of Musical interval in terms of Mels Stevens, Volkmann & Newmann, 1937

Hz/mel conversion To convert f hertz into m mel use: m = log e (1 + f / 700). And the inverse: f = 700(e m / − 1).

Frequency JND’s Different symbols show different studies (Fig.:Terhardt 1998)

Frequency Difference Limens Wier et al., 1977

Frequency Difference Limens At low sound pressure levels (<10 dB SPL), the JND for pitch Increases. The hump at 800 Hz was not confirmed in follow-up studies At one 1-kHz the difference limens is about 3 cents (0.2%) At high and low frequencies, we are less sensitive to pitch (e.g., 0.5% at 200 Hz and 1% at 8 kHz. Melody recognition disappears for frequencies above 4-5 kHzMelody recognition disappears for frequencies above 4-5 kHz

DL compared to a semitone

Pure Tone Frequency Discrimination from Cheveigne, 2004

Effect of signal duration Duration (ms) d’ relative to 20 ms Large improvement in F0 discrimination with duration for unresolved harmonics (White and Plack, 1998):

SPINC vs. Bark (Fig.:Terhardt 1998) SPINC=Spectral Pitch Increment based on JNDs

Theories on Pitch Perception Place Theory –Pitch is determined by the location of the firing inner hair cell population on the basilar membrane Rate Theory –Pitch is determined by the rate code of the inner hair cells (phase locking)

Place Theory Excitation on Basilar membrane from: Hartmann, 1996

Place Theory Excitation on Basilar membrane Excitation on Basilar membrane for two sinusoids of same frequency f but 30 dB level difference from: Hartmann, 1996

Pitch shift for level variation according to Terhardt 1982 Pitch variation of a sinusoid as function of SPL

Autocorrelation Cross-Correlation Models  Y (  )= 1/(t 1 -t 0 ) Y(t)Y(t+  )  t=t 0 t1t1 Licklider (1951)

Rate model (Sinusoid analysis) (from: de Cheveigne, 2004)

Rate Pitch Model f=1/t log(f)=log(1/t)

1-kHz sine tone FFT

1-kHz harmonic complex with equally strong harmonics

250-Hz harmonic complex with equally strong harmonics

250-Hz harmonic complex with missing fundamental

250-Hz harmonic complex with decreasing harmonic strength

Cochlear Implants Research on Cochlear Implant users suggest that our auditory system makes use of both the rate and place when determining the pitch. The analysis of the rate code is not possible for high frequencies

Cochlear Implants Illustration from "Functional Replacement of the Ear," by Gerald E. Leob, 1985

Cochlear Implants

Excitation (dB) ResolvedUnresolved Centre Frequency (Hz) Excitation pattern: Auditory filterbank: Frequency (Hz) Input spectrum: Level (dB) from Plack, Oxenham, 2002

Which harmonic determines pitch? from: Chris Darwin Time (t) Period = 1/200 s = 5ms Frequency (Hz) Amplitude Harmonic spacing = 200 Hz Fundamental = 200 Hz Pressure

Pitch remains the same without fundamental (Licklider, 1956): from: Chris Darwin Time (t) Period = 1/200 s = 5ms Frequency (Hz) Harmonic spacing = 200 Hz Pressure Amplitude

Pitch perception of complex sounds (from: de Cheveigne, 2004)

Figure Explanation All spectra (A-E) produce the same magnitude of pitch. Solution: For each harmonic produce subharmonics (F/n) and plot these into frequency histogram (bottom figure). The perceived pitch typically corresponds to the highest value.

Pitch perception of complex sounds (from: de Cheveigne, 2004)

Explanation of Figure (from: de Cheveigne, 2004) “Landmarks do not work well in the time domain” The pitch does not match the period of the envelope if the ratio between carrier frequency and modulation frequency is < 10.

Pitch perception of formant-like sounds (from: de Cheveigne, 2004) Can evoke two pitches Diagonal line sinusoids

Another Pitch Definition “…that attribute of auditory sensation in terms of which sounds may be ordered on a scale extending from low to high” (ANSI, 1994). The perceptual correlate of the repetition rate of a sound. instead of

F0 discrimination for unresolved harmonics Lowest Harmonic Number F0DL (%) (F0 = 200 Hz) Houtsma and Smurzynski, 1990 Low-numbered harmonics (<10) dominate pitch perception

Absolute Pitch "Passive" absolute pitch –Persons who are able to identify individual notes which they hear, –They can typically identify the key of a composition "Active" absolute pitch –Persons with active absolute pitch will be able to sing any given note when asked. –Usually, people with active absolute pitch will not only be able to identify a note, but recognize when that note is slightly sharp or flat. –1 in every 10,000 people in the US posses active absolute pitch possessors (1/20 in some other locations).

Motoric Absolute Pitch Persons who can reproduce an absolute reference tone to determine the pitch of other tones (e.g. professional singer knowing their range, persons who speak a tone language).

Timbre When we hear notes of the same force and same pitch sounded successively on a piano- forte, a violin, clarinet, oboe, or trumpet, or by the human voice, the character of the musical tone of each of these instruments, notwithstanding the identify of force and pitch, is so different that by means of it we recognise with the greatest of ease which of these instruments was used." (p. 19) Helmholtz, 1877

Timbre Textbooks customarily believe that loudness, pitch and timbre correlate directly with sound intensity, fundamental frequency and overtone structure..."but these experiments show that a simple one-to-one relationship does not exist." (p. 59) Fletcher, 1934

Timbre...harmonics manifest themselves in the specific quality or timbre of the complex tone.... Timbre is multidimensional....we do not have a unidimensional scale for comparing the timbres of various sounds. Plomp, 1976

Timbre Timbre is that attribute of auditory sensation in terms of which a listener can judge that two sounds similarly presented and having the same loudness and pitch are dissimilar. ANSI, 1960

Comment Bregman (1990) On the ASA definition: "This is, of course, no definition at all. For example, it implies that there are some sounds for which we cannot decide whether they possess the quality of timbre or not. In order for the definition to apply, two sounds need to be able to be presented at the same pitch, but there are some sounds, such as the scarping of a shovel in a pile of gravel, that have no pitch at all. We obviously have a problem: Either we must assert that only sounds with pitch can have timbre, meaning that we cannot discuss the timbre of a tambourine or of the musical sounds of many African cultures, or there is something terribly wrong with the definition." (p. 92)

Elusive attributes of timbre The range between tonal and noiselike character. The spectral envelope. The time envelope in terms of rise, duration, and decay. The changes both of spectral envelope (formant-glide) and fundamental frequency (micro-intonation). The prefix, an onset of a sound quite dissimilar to the ensuing lasting vibration. Schouten, 1968

Grey (1977) Multidimensional scaling of orchestral instruments

Grey (1977) Stimuli: 16 different orchestral instruments Listeners had to judge similarity Multidimensional scale analysis Three scale axes: –Spectral energy distribution –Synchronicity of attack transients for different harmonics –Presence of low-amplitude, high frequency energy in initial attack segment.

Harmonic Spectra Flue pipe (open) Flue pipe (closed) Reed pipe with long cyl. res. Reed pipe with short cyl. res.

Harmonic Spectra Relationship of low harmonics –Spectral centroid –Dominance of fundamental sound –Existence of even partial tones

Comparison Flue pipe Stops Reed Pipe Stops HM HM: Harmonic centroid

Comparison Flue pipe Reed Pipe Important: synchronicity of attack transients among partial tones, Initial frequency changes

Literature William M. Hartmann (1996): Pitch, periodicity, and auditory organization, J. Acoust. Soc. Am. 100, Alain de Cheveigne (2004) Pitch Perception models, in: Pitch (Plack, Oxenham, eds.), Springer, New York.