How speech sounds are physically represented Chapters 12 and 13

Slides:



Advertisements
Similar presentations
Vowel production Introduction to sound waves
Advertisements

Acoustic/Prosodic Features
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Acoustic Characteristics of Consonants
Vowel Formants in a Spectogram Nural Akbayir, Kim Brodziak, Sabuha Erdogan.
THE LINGUISTIC STUDY OF VOWELS: What is Sound? Sound is a disturbance of air molecules. A vibrating tuning fork is a good example. When it moves in one.
Physics 1251 The Science and Technology of Musical Sound Unit 1 Session 8 Harmonic Series Unit 1 Session 8 Harmonic Series.
Harmonic Series and Spectrograms 220 Hz (A3) Why do they sound different? Instrument 1 Instrument 2Sine Wave.
The frequency spectrum
Speech Science XII Speech Perception (acoustic cues) Version
Physics of Sounds Overview Properties of vibrating systems Free and forced vibrations Resonance and frequency response Sound waves in air Frequency, wavelength,
Chapter 6 (Sections ) Sound. The speed of sound in a substance depends on: the mass of its constituent atoms, and the strength of the forces between.
Sound waves and Perception of sound Lecture 8 Pre-reading : §16.3.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
ACOUSTICAL THEORY OF SPEECH PRODUCTION
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
PHYSICAL PROPERTIES OF SPEECH SOUNDS
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
Vowel Acoustics, part 2 November 14, 2012 The Master Plan Acoustics Homeworks are due! Today: Source/Filter Theory On Friday: Transcription of Quantity/More.
Spectrogram & its reading
Acoustics of Instruments Music Theory Class Gettysburg College.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Pitch changes result from changing the length and tension of the vocal folds The pitch you produce is based on the number of cycles per second Hertz (Hz)
Harmonics, Timbre & The Frequency Domain
CS 551/651: Structure of Spoken Language Lecture 1: Visualization of the Speech Signal, Introductory Phonetics John-Paul Hosom Fall 2010.
Source/Filter Theory and Vowels February 4, 2010.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Resonance, Revisited March 4, 2013 Leading Off… Project report #3 is due! Course Project #4 guidelines to hand out. Today: Resonance Before we get into.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
Standing waves on a string (review) n=1,2,3... Different boundary conditions: Both ends fixed (see above) Both ends free (similar to both ends fixed )
Vowel Acoustics November 2, 2012 Some Announcements Mid-terms will be back on Monday… Today: more resonance + the acoustics of vowels Also on Monday:
ECE 598: The Speech Chain Lecture 7: Fourier Transform; Speech Sources and Filters.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Acoustic phonetics Jan. 27.
Calculate the speed of 25 cm ripples passing through water at 120 waves/s.
Speech Science Oct 7, 2009.
Say “blink” For each segment (phoneme) write a script using terms of the basic articulators that will say “blink.” Consider breathing, voicing, and controlling.
Chapter 14 Waves and Sound
Formants, Resonance, and Deriving Schwa March 10, 2009.
Speech Science VI Resonances WS Resonances Reading: Borden, Harris & Raphael, p Kentp Pompino-Marschallp Reetzp
Resonance October 23, 2014 Leading Off… Don’t forget: Korean stops homework is due on Tuesday! Also new: mystery spectrograms! Today: Resonance Before.
Vowel Acoustics March 10, 2014 Some Announcements Today and Wednesday: more resonance + the acoustics of vowels On Friday: identifying vowels from spectrograms.
Statistical NLP Spring 2011
12/2/2015 Fourier Series - Supplemental Notes A Fourier series is a sum of sine and cosine harmonic functions that approximates a repetitive (periodic)
Resonance January 28, 2010 Last Time We discussed the difference between sine waves and complex waves. Complex waves can always be understood as combinations.
AUDITORY TRANSDUCTION SEPT 4, 2015 – DAY 6 Brain & Language LING NSCI Fall 2015.
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Stop + Approximant Acoustics
P105 Lecture #27 visuals 20 March 2013.
Acoustic Phonetics 3/14/00.
Basic Acoustics. Sound – your ears’ response to vibrations in the air. Sound waves are three dimensional traveling in all directions. Think of dropping.
Phonetics: A lecture Raung-fu Chung Southern Taiwan University
HOW WE TRANSMIT SOUNDS? Media and communication 김경은 김다솜 고우.
PHONETICS AND PHONOLOGY
B. Harpsichord Strings are plucked
The Human Voice. 1. The vocal organs
Structure of Spoken Language
The Human Voice. 1. The vocal organs
Determine the l, f, & T of the 49th overtone of a 4
Pg. 298 – Damping Underdamping Overdamping
Sources of Sound To produce sound, we have to make air molecules move back and forth; we can do this either with acoustic vibrations in an enclosed space.
The Vocal Pedagogy Workshop Session III – Articulation
Intro to Fourier Series
C-15 Sound Physics 1.
Sound & Sound Waves.
Analyzing the Speech Signal
Signals Prof. Choong Seon HONG.
Elements of Acoustic Phonetics
The Production of Speech
Evolution of human vocal production
Speech Perception (acoustic cues)
Presentation transcript:

How speech sounds are physically represented Chapters 12 and 13 Acoustic Phonetics How speech sounds are physically represented Chapters 12 and 13

Sound Energy Travels through a medium to reach the ear Compression waves Information from Phonetics for Dummies. William F. Katz. “Making Waves: An Overview of Sound.” 2013.

Periodic waves Simple (sine; sinusoid) Complex (actually a composite of many overlapping simple waves)

Sinusoid waves Simple periodic motion from perfectly oscillating bodies Found in in nature (e.g., swinging pendulum, sidewinder snake trail, airflow when you whistle) Sinusoids sound ‘cold’ (e.g. flute)

Let’s crank one out! Pg. 175

Frequency - Tones

Simple waves - key properties Frequency = cycles per sec (cps) = Hz Amplitude – measured in decibels (dB), 1/10 of a Bell (Note: dB is on a log scale, increases by powers of 10)

Phase A measure of the position along the sinusoidal vibration These two waveforms are slightly out of phase (approx. 900 difference) Used in sound localization

Damping Loss of vibration due to friction

Quickie Quiz! Q: What is the frequency of this wave ? HINT: It repeats twice in 10 msec

Answer: 200 Hz! (2 cycles in .01 sec = 200 cps)

Physical vs. perceptual Fundamental frequency (F0)  Amplitude/ Intensity  Duration  PERCEPTUAL “Pitch” “Loudness” “Length”

Image from Fetal Hydrocephalus. The Amazing Owen Image from Fetal Hydrocephalus. The Amazing Owen. “Great News from the Audiologist.” March 23, 2009. Accessed June 13, 2016. http://fetalhydrocephalus.com/hydro/SIblog/default.aspx?id=35&t=Great-news-from-the-Audiologist

Complex periodic waves Results from imperfectly oscillating bodies Demonstrate simple harmonic motion Examples - a vibrating string, the vocal folds

Frequency – Tones/ Adding

Another example….. "http://www.askamathematician.com/wpcontent/uploads/2012/09/IndykKatabiPriceHassanieh.jpg">

Waveforms - Male Vowels

Waveforms - Female Vowels

Complex periodic waves – cont’d Consists of a fundamental (F0) and harmonics Harmonics (“overtones”) consist of energy at integer multiples of the fundamental (x2, x3, x4 etc…)

Harmonic series Imagine you pluck a guitar string and could look at it with a really precise strobe light Here is what its vibration will look like

From complex wave to its components… and the frequency spectrum Also known as a “line spectrum” Here, complex wave at the bottom… ..is broken into its component sin waves shown at the top (complex wave)

Complex wave  component sinusoids Fourier analysis 1768-1830 Complex wave  component sinusoids Sound Light

Review of source characteristics Simple waves are a good way to learn about basic properties of frequency, amplitude, and phase. Examples include whistling; not really found much in speech Complex waves are found in nature for oscillating bodies that show simple harmonic motion (e.g., the vocal folds) Information from Phonetics for Dummies. William F. Katz. “Making Waves: An Overview of Sound.” 2013.

Now let’s look at the filter In speech, the filter is the supralaryngeal vocal tract (SLVT) The shape of the oral/pharyngeal cavity determines vowel quality SLVT shape is chiefly determined by tongue movement, but lips, velum and (indirectly) jaw also play a role

Resonance Reinforcement or shaping of frequencies as a function of the boundary conditions through which sound is passed FUN: Try producing a vowel with a paper towel roll placed over your mouth! The ‘extra tube’ changes the resonance properties

Resonance / Formants The SLVT can be modeled as a kind of bottle with different shapes… as sound passes through this chamber it achieves different sound qualities The resonant peaks of speech that relate to vowel quality are called formants. Thus, R1 = F1 (“first formant). R2 = F2, etc. F1 and F2 are critical determinants of vowel quality

Input  SLVT  final output

Vocal tract shape  formant frequencies

Resonance – FOUR basic rules F1 rule – inversely related to jaw height. As the jaw goes down, F1 goes up, etc. F2 rule – directly related to tongue fronting. As the tongue moves forward, F2 increases. F3 rule – F3 drops with r-coloring Lip rounding rule – All formants are lowered by liprounding (because lip protrusion lengthens the vocal tract ‘tube’)

Examples of resonance for /i/, /ɑ/, /u/ /i/ is made with the tongue high (thus, low F1) and fronted (high F2) /ɑ/ is made with the tongue low (high F1) and back (low F2)

American English Vowels (Assmann & Katz, 2000) Tables from Phonetics for Dummies. William F. Katz. “Making Waves: An Overview of Sound.” 2013.

F2 x F1 plot American English Vowels Peterson & Barney, 1952 Figure from Phonetics for Dummies. William F. Katz. “Making Waves: An Overview of Sound.” 2013.

Chap 13 Reading a sound spectrogram

The sound spectrograph Invented in the 1940s First called ‘visible speech’ Originally thought to produce a “speech fingerprint” (?) We now know speech perception is far more complicated and ambiguous..

Basics of spectrogram operation Original systems used bandpass filters Accumulated energy was represented by a dark image burned onto specially-treated paper Nowadays, performed digitally using variety of algorithms (e.g., DFT, LPC)

Relating line spectrum to spectrogram F3 F1 1

Sample of word “spectrogram” Pg. 192 Figure from Phonetics for Dummies. William F. Katz. “Reading a Sound Spectrogram.” 2013.

Vowel basics Here is /i ɑ i ɑ / produced with level pitch Wideband spectrogram (left); narrow band (right) Spectrogram from Ladefoged and Johnson, A course in phonetics

Let’s find some vowels! Figure from Phonetics for Dummies. William F. Katz. “Reading a Sound Spectrogram.” 2013.

Here they are: Figure from Phonetics for Dummies. William F. Katz. “Reading a Sound Spectrogram.” 2013.

Consonants – formant transitions An example of an F1 transition for the syllable /da/ Figure from Phonetics for Dummies. William F. Katz. “Reading a Sound Spectrogram.” 2013.

American English vowels in /b_d/ context TOP ROW (front vowels): “bead bid bade bed bad” BOTTOM ROW (back vowels) “bod bawd bode buhd booed” Spectrograms from Ladefoged and Johnson, A course in phonetics

Stops/ formant transitions Spectrograms of “bab” “dad” and “gag” Labials - point down, alveolars point to ~1700-1800 Hz, velars “pinch” F2 and F3 together Note: bottom-most fuzzy is the voice bar! Spectrogram from Ladefoged and Johnson, A course in phonetics

Voicing (voice of WK)

Fricatives Top row: /f/, theta, s, esh, Bottom row: /v/, ethe, z, long z Distribution of the spectral noise is the key here! Spectrogram from Ladefoged and Johnson, A course in phonetics

The fricative /h/ Commonly excites all the formant cavities May look slightly different in varying vowel contexts Spectrogram from Ladefoged and Johnson, A course in phonetics

Nasal stops Spectrograms of “dinner dimmer dinger” Marked by “ zeroes” or formant regions with little energy Can also result in broadening of formant bandwidths (fuzzying the edges) Spectrogram from Ladefoged and Johnson, A course in phonetics

Approximants /ɹ/ - very low third formant, just above F2 /l/ - formants in the neighborhood of 250, 1200, and 2400 Hz; less apparent in final position. Higher formants considerable reduced in intensity Spectrogram from Ladefoged and Johnson, A course in phonetics

Common allophonic variations “a toe” “a doe” “otto” For full stops, there is about 100 ms of silence For tap, about 10-30 ms Spectrogram from Ladefoged and Johnson, A course in phonetics

Pseudo-colored example Here is an American English /æ/ (male) Analyzed in Wavesurfer “Hot” areas (in green/yellow/red) have more energy

Some “tough cases”…. ALS Healthy (notice loss of formant frequency quality)

Women and children (High F0 can cause problems estimating formants)