Voice Quality + Spectral Analysis Feburary 15, 2011.

Slides:



Advertisements
Similar presentations
Acoustic/Prosodic Features
Advertisements

Harmonics October 29, 2012 Where Were We? Were halfway through grading the mid-terms. For the next two weeks: more acoustics Its going to get worse before.
Acoustic Characteristics of Consonants
Linguistic Voice Quality Patricia Keating University of California, Los Angeles Christina Esposito Macalester College, St. Paul.
Spectral Analysis Feburary 24, 2009 Sorting Things Out 1.TOBI transcription homework rehash. And some structural reminders. 2.On Thursday: back in the.
Phonology, part 5: Features and Phonotactics
Voice Quality October 14, 2014 Practicalities Course Project report #2 is due! Also: I have new guidelines to hand out. The mid-term is on Tuesday after.
The Source January 26, 2010 Voice Quality Review ATLTMCFlow Modalmoderatevariesmoderatemed. Creakyhighlowhighlow Breathylowvarieslowhigh.
Frequency, Pitch, Tone and Length October 15, 2012 Thanks to Chilin Shih for making some of these lecture materials available.
Anatomy of the vocal mechanism
ACOUSTICAL THEORY OF SPEECH PRODUCTION
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
Sonorant Acoustics March 23, 2010 Announcements and Such Collect course reports Give back extra credits Hand out new course project guidelines Also:
Topic 3b: Phonation.
Anatomic Aspects Larynx: Sytem of muscles, cartileges and ligaments.
Laryngeal Physiology.
Fricatives + Voice Onset Time March 31, 2014 In the Year 2000 Today: we’ll wrap up fricatives… and then move on to stops. This Friday, there will be.
Source/Filter Theory and Vowels February 4, 2010.
Laterals + Nasals November 24, 2008.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Voice Quality Feburary 11, 2013 Practicalities Course project reports to hand in! And the next set of guidelines to hand out… Also: the mid-term is on.
Automatic Pitch Tracking September 18, 2014 The Digitization of Pitch The blue line represents the fundamental frequency (F0) of the speaker’s voice.
Resonance, Revisited March 4, 2013 Leading Off… Project report #3 is due! Course Project #4 guidelines to hand out. Today: Resonance Before we get into.
MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
Automatic Pitch Tracking January 16, 2013 The Plan for Today One announcement: Starting on Monday of next week, we’ll meet in Craigie Hall D 428 We’ll.
Phonology, part 4: Distinctive Features
Vowel Acoustics November 2, 2012 Some Announcements Mid-terms will be back on Monday… Today: more resonance + the acoustics of vowels Also on Monday:
Harmonics November 1, 2010 What’s next? We’re halfway through grading the mid-terms. For the next two weeks: more acoustics It’s going to get worse before.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Acoustic phonetics Jan. 27.
Laryngeal Structure & Function; Vocal Fold Vibration
Speech Science V Akustische Grundlagen WS 2007/8.
Voice Quality + Stop Acoustics
The end of vowels + The beginning of fricatives November 19, 2012.
Pitch Tracking + Prosody January 17, 2012 The Plan for Today One announcement: On Thursday, we’ll meet in the Craigie Hall D 428 We’ll be working on.
Frequency, Pitch, Tone and Length October 16, 2013 Thanks to Chilin Shih for making some of these lecture materials available.
Voice Onset Time + Voice Quality
Respiration + Vocal Fold Physiology
Sonorant Acoustics March 24, 2009 Announcements and Such Collect course reports Give back homeworks Hand out new course project guidelines.
Vocal Fold Physiology + Voice Quality October 9, 2014.
Resonance October 23, 2014 Leading Off… Don’t forget: Korean stops homework is due on Tuesday! Also new: mystery spectrograms! Today: Resonance Before.
Voice Quality + Korean Stops October 16, 2014 Don’t Forget! The mid-term is on Tuesday! So I have a review sheet for you. For the mid-term, we will just.
Vowel Acoustics March 10, 2014 Some Announcements Today and Wednesday: more resonance + the acoustics of vowels On Friday: identifying vowels from spectrograms.
Spectral Analysis Feburary 23, 2010 Sorting Things Out 1.On Thursday: back in the computer lab. Craigie Hall D 428 Analysis of Korean stops. 2.Remember:
Frequency, Pitch, Tone and Length February 12, 2014 Thanks to Chilin Shih for making some of these lecture materials available.
Resonance January 28, 2010 Last Time We discussed the difference between sine waves and complex waves. Complex waves can always be understood as combinations.
Tone, Accent and Quantity October 19, 2015 Thanks to Chilin Shih for making some of these lecture materials available.
Fricatives November 20, 2015 The Road Ahead Formant plotting + vowel production exercises are due at 5 pm today! Monday and Wednesday of next week: fricatives,
Sonorant Acoustics + Place Transitions
Phonation + Voice Quality Feburary 11, 2014 Weekday Update Course project report #2 is due right now! I have guidelines for course project report #3,
Voicing + Basic Acoustics October 14, 2015 Agenda Production Exercise #2 is due on Friday! No transcription exercise this Friday! Today, we’ll begin.
Stop + Approximant Acoustics
Airstream Mechanisms + Trills October 7, 2013 Announcements and Such 1.Next transcription homework is due on Wednesday. 2.I’m in the midst of grading.
SPPA 6010 Advanced Speech Science
Phonation.
CONSONANT 1 Pertemuan 3 Matakuliah: G0332/English Phonology Tahun: 2007.
Voice Quality January 19, 2010 Vocal Tract Anatomy Our vocal tracts are shaped in a way that makes it easier to speak… But more dangerous to eat!
Voice Quality Feburary 13, 2014 Practicalities The mid-term is on the Thursday after the break! So I have a review sheet for you. For the mid-term, we.
Fricatives + Voice Onset Time November 25, 2015 In the Year 2000 Today: we’ll wrap up fricatives… and then move on to stops. This Friday, there will.
Phonation Physiology Phonation = series of openings and closings of the vocal folds Two phases 1.Prephonation phase: period during which VFs move from.
Basic Acoustics + Digital Signal Processing January 11, 2013.
Spectral Analysis March 3, 2016 Mini-Rant I have succeeded in grading your course project reports. Things to keep in mind: A table of stop phonemes is.
Resonance October 29, 2015 Looking Ahead I’m still behind on grading the mid-term and Production Exercise #1… They should be back to you by Monday. Today:
Harmonics October 28, Where Were We? Mid-terms: our goal is to get them back to you by Friday. Production Exercise #2 results should be sent to.
Fundamental Frequency Change
Laryngeal correlates of the English tense/lax vowel contrast
Structure of Spoken Language
Voiced sounds Which sounds are fully voiced? baby dog today egg
Review of Catford.
Voice source characterisation
Presentation transcript:

Voice Quality + Spectral Analysis Feburary 15, 2011

Today Today: Wrap up voice quality discussion Begin examination of spectral analysis 1.On the Tuesday after the break: back in the computer lab (SS 020). Analysis of Korean stops. 2.Remember: mid-term on Thursday Review sheet to be passed out today, once we wrap up voice quality… 3.Also note: the last TOBI homework

1. Modal Voice Settings At the low end of a speaker’s F0 range: 1. Adductive tension force is moderate 2. Medial compression force is moderate 3. Vocal folds are short and thick. = longitudinal tension is low 4.Moderate airflow F0 is increased by: 1.Increasing the longitudinal tension  activity of the cricothyroid muscle 2.Increasing airflow

For the Record Contraction of the cricothyroid muscle pulls down the thyroid cartilage. Interestingly: researchers often study the activity of this muscle using EMG.

A Little More Hardcore Increasing Medial Compression of the vocal folds can create tense voice. Remember the Mpi contrasts: Also check out the Steve Sklar video Increasing Medial Compression even further can induce ventricular voice …in which the ventricular folds vibrate along with the (true) vocal folds. (go back to the video + endoscopy evidence) Finally, amping up the intensity of all the laryngeal forces results in harsh voice. Compare with: “death metal voice”

2. Creaky Voice A voice quality that is somewhat similar to ventricular voice is creaky voice. Also known as “glottal fry” Laryngeal settings for creaky voice: 1.Ventricular folds often compressed down on true vocal folds. 2.High medial compression 3.Very little longitudinal tension 4.Low airflow  Air bubbles up sporadically through the folds, near the thyroid arch.

Creaky EGG Note: vocal folds are very short during creaky voicing. Look at the creaky video.

Creaky Quirks Note: creaky voice often emerges at the low end of a speaker’s range. In a language like English, at the ends of utterances In a tone language, for very low tones. Note: creaky voice also often has a “double pulse” effect.

Modal to Creaky [][]

Jitter Creaky voice often exhibits a lot of jitter and shimmer. Jitter = Variation in timing of glottal pulses Defined as a percentage: period deviation/period duration.

Shimmer Shimmer = Variation in amplitude of glottal pulses Note: synthetic speech has to include jitter and shimmer …otherwise the voice won’t sound natural. Check out the “voice report” measures out in Praat.

Harsh Voice A “raucous voice quality” (Holmes, 1932) Acoustically: fundamental frequency is aperiodic = lots of jitter (variability in time) Articulatorily: harsh voice does not add anything new to the voice quality parameters; it just increases the intensity of those already in operation. Harsh voice  “excessive approximation of the vocal folds” = high medial compression and high adductive tension

Harsh, continued “Harshness results from overtensions in the throat and neck; it is often if not usually accompanied by hypertensions of the whole body.” (Gray and Wise, 1959) Harsh F0 is usually > 100 Hz Creaky F0 is usually < 100 Hz

3. Breathy Voice In breathy voice, the vocal folds remain open… and “wave” in the airflow coming up from the lungs. Laryngeal settings for breathy voice: 1.Low medial compression 2.Minimal adductive tension 3.Variable longitudinal tension (for F0 control) 4.Higher airflow Check out the breathy video.

Breathy Voice EGG Also note: closure phases in breathy voice are more symmetrical than in modal voice.

Some Real-Life Examples breathy modal

Contrasts Gujarati contrasts breathy voiced vowels with modal voiced vowels: Hausa contrasts modal [j] with creaky/tense [j]: Hausa is spoken in West Africa (primarily in Nigeria) Creaky consonants are also said to be laryngealized.

All Three Jalapa Mazatec has a three-way contrast between modal, breathy and creaky voiced vowels: Jalapa Mazatec is spoken in southern Mexico, around Oaxaca and Veracruz.

Voiced Aspirated Some languages distinguish between (breathy) voiced aspirated and voiceless aspirated stops and affricates. Check out Hindi:

One Random Thing Breathy voiced segments can “depress” the tone on a following segment. Examples from Tsonga: Tsonga is spoken in South Africa and Mozambique. Voiced stops also “depress” tones more than voiceless stops. depressor consonants Nobody really knows why.

Open Quotient From EGG measures, we can calculate the “open quotient” for any particular voicing cycle = time glottis is open period of voicing cycle EGG measures show that there are reliable differences in open quotient values between the three primary voicing types. Breathy voicing has a high open quotient Creaky voicing has a low open quotient Modal voicing is in between

Open Quotient Traces one period open phase The open quotient in modal voicing is generally around 0.5

Tense Voice Tense voice (from throat singing demo) has a lower open quotient. Result of medial compression. Actual value: about 0.3 one period open phase

OQ Traces, continued OQ for creaky voice is also supposed to be low… but it’s actually quite sporadic. Breathy voice OQ is quite high (0.65 or greater)

4. Whispery Voice When we whisper: The cartilaginous glottis remains open, but the ligamental glottis is closed. Air flow through opening with a “hiss” The laryngeal settings: 1.Little or no adductive tension 2.Moderate to high medial compression 3.Moderate airflow 4.Longitudinal tension is irrelevant…

Nodules One of the more common voice disorders is the development of nodules on either or both of the vocal folds. nodule = callous-like bump What effect might this have on voice quality?

Last but not least What’s going on here? At some point, my voice changes from modal to falsetto.

5. Falsetto The laryngeal specifications for falsetto: 1.High longitudinal tension 2.High adductive tension 3.High medial compression Contraction of thyroarytenoids 4.Lower airflow than in modal voicing The results: Very high F0. Very thin area of contact between vocal folds. Air often escapes through the vocal folds.

Falsetto EGG The falsetto voice waveform is considerably more sinusoidal than modal voice.

Some Real EGGs Modal voice (F0 = 140 Hz) Falsetto voice (F0 = 372 Hz)

Voice Quality Summary ATLTMCFlow Modalmoderatevariesmoderatemed. Tensehighvarieshighhigh Creakyhighlowhighlow WhisperlowN/Ahighmed. Breathylowvarieslowhigh Falsettohighhighhighlow

Last but not least, Korean makes an interesting distinction between “emphatic” (or fortis) obstruents and unaspirated and aspirated (lenis) obstruents.

What’s going on here? A variety of things occur during the articulation of fortis consonants in Korean. 1.Glottis is not open as wide (during closure) as in lenis stops.  Voicing begins more quickly after stop release 2.Increased airflow in fortis stops.  Higher F0 after stop release. 3.Vocal folds are “more tense” than in lenis stops. = greater medial compression = “squarer” glottal waveform

Back to the Source… Modal voicing (by me): Note: completely closed and completely open phases are both actually quite short. Also: closure slope is greater than opening slope. Q: Why might there be differences in slope?

A Different Kind of Voicing The basic voice quality in khoomei is called xorekteer. Notice any differences in the EGG waveforms? This voice quality requires greater medial compression of the vocal folds....and also greater airflow

Why Should You Care? Remember that the most basic kind of sound wave is a sinewave. time pressure Sinewaves can be defined by three basic properties: Frequency, (peak) amplitude, phase

Complex Waves It is possible to combine more than one sinewave together into a complex wave. At any given time, each wave will have some amplitude value. A 1 (t 1 ) := Amplitude value of sinewave 1 at time 1 A 2 (t 1 ) := Amplitude value of sinewave 2 at time 1 The amplitude value of the complex wave is the sum of these values. A c (t 1 ) = A 1 (t 1 ) + A 2 (t 1 ) Note: a harmonic is simply a component sinewave of a complex wave.

Complex Wave Example Take waveform 1: high amplitude low frequency Add waveform 2: low amplitude high frequency The sum is this complex waveform: + =

Another Perspective Sinewaves can also be represented by their power spectra. Frequency on the x-axis Intensity on the y-axis (related to peak amplitude) WaveformPower Spectrum

Putting the two together WaveformPower Spectrum + + = = harmonics

More Combinations What happens if we keep adding more and more high frequency components to the sum? += +=

A Spectral Comparison WaveformPower Spectrum

What’s the Point? Remember our EGG waveforms for the different kinds of voice qualities: The glottal waveform for tense voice resembles a square wave.  lots of high frequency components (harmonics)

What’s the point, part 2 A modal voicing EGG looks like: It is less square and therefore has less high frequency components. Although it is far from sinusoidal...

What’s the point, part 3 Breathy and falsetto voice are more sinusoidal... And therefore the high frequency harmonics have less power, compared to the fundamental frequency.

Let’s Check ‘em out Head over to Praat and check out the power spectra of: a sinewave a square wave a sawtooth wave tense voice modal voice creaky voice breathy voice

Spectral Tilt Spectral tilt = drop-off in intensity of higher harmonics, compared to the intensity of the fundamental.