Behrman Chapter 5, 6 Place less emphasis on… Minor anatomical landmarks and features Extrinsic muscles of the larynx Blood supply to the larynx Central.

Presentation transcript:

Behrman Chapter 5, 6 Place less emphasis on… Minor anatomical landmarks and features Extrinsic muscles of the larynx Blood supply to the larynx Central motor control of larynx Peripheral Sensory control of larynx Stress-Strain Properties of Vocal Folds

Laryngeal Activity in Speech/Song Sound source to excite the vocal tract –Voice –Whisper Prosody –Fundamental frequency (F0) variation –Amplitude variation Realization of phonetic goals –Voicing –Devoicing –Glottal frication (/h/, /ɦ/) –Glottal stop (/ʔ/) –Aspiration Para-linguistic and extra-linguistic roles –Transmit affect –Speaker identity

The vocal fold through life… Newborns –No layered structure of LP –LP loose and pliable Children –Vocal ligament appears 1-4 yrs –3-layered LP is not clear until 15 yrs Old age –Superficial layer becomes edematous & thicker –Thinning of intermediate layer and thickening of deep layer –Changes in LP more pronounced in men –Muscle atrophy

The Glottal Cycle

Complexity of vocal fold vibration Vertical phase difference Longitudinal phase difference

Myoelastic Aerodynamic Theory of Phonation Necessary and Sufficient Conditions Vocal Folds are adducted (Adduction) Vocal Folds are tensed (Longitudinal Tension) Presence of Aerodynamic pressures

2-mass model Lower part of vocal fold Upper part of vocal fold Mechanical coupling stiffness TA muscle Coupling between mucosa & muscle

VF adducted & tensed → myoelastic pressure (Pme) Glottis is closed Subglottal air pressure (Psg) ↑ Psg ≈ 8-10 cm H2O, Psg > Pme L and R M1 separate Transglottal airflow (Utg) = 0 As M1 separates, M2 follows due to mechanical coupling stiffness Psg > Pme: glottis begins to open Psg > Patm, therefore Utg > 0

Utg ↑↑ since glottal aperture << tracheal circumference Utg ↑, Ptg ↓ due to Bernoulli effect Pressure drop across the glottis Bernoulli’s Law: $P + \frac{1}{2}\rho U^2 = K$, where P = air pressure, ρ = air density, U = air velocity
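As a rough numerical illustration of Bernoulli's law, the sketch below computes how far the pressure falls when air speeds up through a constriction. The air density and both velocities are illustrative assumptions rather than values from the slides, and the function name is hypothetical.

```python
AIR_DENSITY = 1.2  # kg/m^3, assumed round value for air; not from the slides


def pressure_drop(u_upstream, u_constriction, rho=AIR_DENSITY):
    """Pressure drop (Pa) along a streamline, from P + 0.5*rho*U^2 = K.

    Since K is constant, P1 - P2 = 0.5 * rho * (U2^2 - U1^2).
    """
    return 0.5 * rho * (u_constriction ** 2 - u_upstream ** 2)


# Hypothetical velocities (m/s): slow air in the trachea, fast air in the glottis.
drop_pa = pressure_drop(u_upstream=1.0, u_constriction=30.0)
print(f"Pressure drop: {drop_pa:.1f} Pa")
```

The calculation assumes steady, frictionless flow. Real glottal flow departs from that ideal, so the result is best read as an order-of-magnitude guide.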

Utg ↑, Ptg ↓ due to Bernoulli effect Plus “other” aerodynamic effects Ptg < Pme M1 returns to midline M2 follows M1 due to mechanical coupling stiffness Utg = 0 Pattern repeats times a second

Limitations of this simple model

The Glottal Cycle

Sound pressure wave (figure axes: time vs. instantaneous sound pressure)

Phonation is actually quasi-periodic Complex Periodic –vocal fold oscillation Aperiodic –Broad frequency noise embedded in signal –Non-periodic vocal fold oscillation –Asymmetry of vocal fold oscillation –Air turbulence Voicing vs. whispering

Glottal Aerodynamics Volume Velocity Driving Pressure Phonation Threshold Pressure –Initiate phonation –Sustain phonation Laryngeal Airway Resistance
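Laryngeal airway resistance is conventionally taken as the driving (translaryngeal) pressure divided by the volume velocity. A minimal sketch of that ratio follows; the input values are made up for illustration and are not clinical norms, and the function name is hypothetical.

```python
def laryngeal_airway_resistance(driving_pressure_cmh2o, flow_l_per_s):
    """Return resistance in cm H2O/(L/s): driving pressure / volume velocity."""
    if flow_l_per_s <= 0:
        raise ValueError("flow must be positive")
    return driving_pressure_cmh2o / flow_l_per_s


# Hypothetical measurement: 8 cm H2O driving pressure at 0.2 L/s airflow.
r_law = laryngeal_airway_resistance(8.0, 0.2)
print(f"Laryngeal airway resistance: {r_law:.1f} cm H2O/(L/s)")
```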

Measuring Glottal Behavior Videolaryngoscopy –Stroboscopy –High speed video

Photoglottography (PGG) (figure axes: time vs. illumination)

Electroglottography (EGG) Human tissue = good conductor Air = poor conductor Electrodes placed on each side of thyroid lamina A high-frequency, low-current signal is passed between them ↑ VF contact = ↓ impedance ↓ VF contact = ↑ impedance

Electroglottogram

Glottal Airflow (volume velocity) Instantaneous airflow is measured as it leaves the mouth Looks similar to a pressure waveform Can be inverse filtered to remove effects of vocal tract Resultant is an estimate of the airflow at the glottis

Flow Glottogram

Synchronous plots Sound pressure waveform (at mouth) Flow glottogram (inverse filtered mask signal) Photoglottogram Electroglottogram

F0 Control Anatomical factors Males: ↑ VF mass and length = ↓ F0 Females: ↓ VF mass and length = ↑ F0 Subglottal pressure adjustment – show example ↑ Psg = ↑ F0 Laryngeal and vocal fold adjustments ↑ CT activity = ↑ F0 TA activity = ↑ F0 or ↓ F0 Extralaryngeal adjustments ↑ height of larynx = ↑ F0

Fundamental Frequency (F0) Average F0: speaking fundamental frequency (SFF) Correlate of pitch Infants –~ Hz Boys & girls (3-10) –~ Hz Young adult females –~220 Hz Young adult males –~120 Hz Older females: F0 ↓ Older males: F0 ↑ F0 variability F0 varies due to –Syllabic & emphatic stress –Syntactic and semantic factors –Phonetic factors (in some languages) Provides a melody (prosody) Measures –F0 standard deviation: ~2-4 semitones for normal speakers –F0 range
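The F0 standard deviation is quoted in semitones, so a contour measured in Hz has to be converted first. The sketch below shows one way to do that. Using the mean F0 as the reference and the population (rather than sample) standard deviation are both assumptions, the function names are hypothetical, and the contour is invented.

```python
import math
import statistics


def hz_to_semitones(f0_hz, ref_hz):
    """Distance of f0_hz from ref_hz in semitones (12 per octave)."""
    return 12.0 * math.log2(f0_hz / ref_hz)


def f0_sd_semitones(f0_values_hz):
    """Standard deviation of an F0 contour, expressed in semitones.

    The reference only shifts every value by the same amount, so the
    standard deviation does not depend on which reference is chosen.
    """
    ref = statistics.mean(f0_values_hz)
    semitones = [hz_to_semitones(f, ref) for f in f0_values_hz]
    return statistics.pstdev(semitones)


# Invented F0 contour (Hz) for illustration only.
contour = [110.0, 118.0, 125.0, 131.0, 122.0, 108.0]
print(f"F0 SD: {f0_sd_semitones(contour):.2f} semitones")
```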

Maximum Phonational Frequency Range Highest possible F0 - lowest possible F0 Not a speech measure Measured in Hz, semitones or octaves Males: ~ Hz (Baken, 1987) Females: ~ Hz (Baken, 1987) 3 octaves often considered normal
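The same frequency range can be reported in Hz, semitones or octaves. The sketch below converts between them. The example frequencies are chosen only because they span exactly three octaves; they are not the normative values cited on the slide, and the function name is hypothetical.

```python
import math


def frequency_range(low_hz, high_hz):
    """Express the span between two frequencies in Hz, semitones and octaves."""
    octaves = math.log2(high_hz / low_hz)  # one octave = a doubling of frequency
    return {
        "hz": high_hz - low_hz,
        "semitones": 12.0 * octaves,       # 12 semitones per octave
        "octaves": octaves,
    }


# Illustrative values only: 80 Hz to 640 Hz is three doublings.
span = frequency_range(80.0, 640.0)
print(span)
```

The Hz figure depends on where the range sits on the frequency axis, while the semitone and octave figures do not, which makes the latter easier to compare across speakers.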

Fundamental Frequency (F0) Control Ways to measure F0 –Time domain vs. frequency domain –Manual vs. automated measurement –Specific Approaches Peak picking Zero crossing Autocorrelation The cepstrum & cepstral analysis

Autocorrelation (figure panels: data, correlation)
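The autocorrelation approach can be sketched as follows: the signal is compared with a delayed copy of itself, and the delay that gives the strongest match is taken as the period. This is a simplified illustration, not a production pitch tracker. The search limits (75-400 Hz), the sample rate, and the synthetic 125 Hz test signal (chosen so the period is a whole number of samples) are all assumptions, and the function name is hypothetical.

```python
import math


def estimate_f0_autocorr(signal, fs, f0_min=75.0, f0_max=400.0):
    """Estimate F0 (Hz) as fs / lag, using the lag with the highest
    normalized correlation between the signal and its delayed copy."""
    min_lag = int(fs / f0_max)
    max_lag = int(fs / f0_min)
    best_lag, best_r = min_lag, -1.0
    for lag in range(min_lag, max_lag + 1):
        head = signal[:-lag]   # original samples
        tail = signal[lag:]    # same samples delayed by `lag`
        num = sum(a * b for a, b in zip(head, tail))
        den = math.sqrt(sum(a * a for a in head) * sum(b * b for b in tail))
        r = num / den if den > 0 else 0.0
        if r > best_r:
            best_lag, best_r = lag, r
    return fs / best_lag


# Synthetic "voice": 125 Hz fundamental plus two weaker harmonics.
FS = 8000
test_signal = [
    math.sin(2 * math.pi * 125 * n / FS)
    + 0.5 * math.sin(2 * math.pi * 250 * n / FS)
    + 0.25 * math.sin(2 * math.pi * 375 * n / FS)
    for n in range(400)
]
f0_estimate = estimate_f0_autocorr(test_signal, FS)
print(f"Estimated F0: {f0_estimate:.1f} Hz")
```

Restricting the lag search matters: a periodic signal also matches itself at two, three, or more periods, so a search range that is too wide invites octave errors.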

Cepstrum

Amplitude Control Subglottal pressure adjustment ↑ Psg = ↑ sound pressure Laryngeal and vocal fold adjustments ↑ medial compression = ↑ sound pressure Supralaryngeal adjustments

Measuring Amplitude Pressure Intensity Decibel Scale
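The decibel scale relates a measured pressure or intensity to a fixed reference on a logarithmic scale. A short sketch of both conversions follows, using the standard references for sound in air (20 µPa and 10^-12 W/m²). The function names and example inputs are illustrative.

```python
import math

P_REF_PA = 20e-6     # reference pressure for dB SPL, in pascals
I_REF_W_M2 = 1e-12   # reference intensity, in watts per square metre


def spl_db(p_rms_pa):
    """Sound pressure level in dB SPL from RMS pressure in pascals."""
    return 20.0 * math.log10(p_rms_pa / P_REF_PA)


def intensity_level_db(intensity_w_m2):
    """Intensity level in dB from intensity in W/m^2."""
    return 10.0 * math.log10(intensity_w_m2 / I_REF_W_M2)


# Illustrative inputs: 0.02 Pa is 1000 times the reference pressure.
print(f"{spl_db(0.02):.1f} dB SPL")
print(f"{intensity_level_db(1e-6):.1f} dB")
```

Pressure uses a factor of 20 and intensity a factor of 10 because intensity is proportional to pressure squared.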

Sound Pressure Level (SPL) Average SPL Correlate of loudness Conversation: ~ dB SPL SPL variability ↑ SPL to mark stress Contributes to prosody Measure –Standard deviation for neutral reading material: ~10 dB SPL

Dynamic Range Amplitude analogue to maximum phonational frequency range ~50 – 115 dB SPL

Vocal Quality No clear acoustic correlates, unlike pitch and loudness However, terms have invaded our vocabulary that suggest distinct categories of voice quality Common Terms Breathy Tense/strained Rough Hoarse

Are there features in the acoustic signal that correlate with these quality descriptors?

Breathiness Perceptual Description Audible air escape in the voice Physiologic Factors Diminished or absent closed phase Increased airflow Potential Acoustic Consequences Change in harmonic (periodic) energy –Sharper harmonic roll off Change in aperiodic energy –Increased level of aperiodic energy (i.e. noise), particularly in the high frequencies

Harmonics (signal)-to-noise ratio (SNR/HNR) Harmonic/noise amplitude ↑ HNR –Relatively more signal –Indicative of normality ↓ HNR –Relatively more noise –Indicative of disorder Normative values depend on method of calculation “Normal” HNR ~ 15
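Because normative values depend on the method of calculation, it helps to see two formulations side by side. The sketch below expresses HNR in dB, first from separately estimated harmonic and noise power, then from the peak of the normalized autocorrelation, which is one common autocorrelation-based formulation. Reporting in dB and working with power rather than amplitude are assumptions here, and the function names are hypothetical.

```python
import math


def hnr_db_from_powers(harmonic_power, noise_power):
    """HNR in dB from harmonic (periodic) power and noise power."""
    return 10.0 * math.log10(harmonic_power / noise_power)


def hnr_db_from_autocorr_peak(r_peak):
    """HNR in dB from the normalized autocorrelation peak, 0 < r_peak < 1.

    r_peak is read as the periodic share of the signal's power,
    so (1 - r_peak) is the noise share.
    """
    if not 0.0 < r_peak < 1.0:
        raise ValueError("r_peak must lie strictly between 0 and 1")
    return 10.0 * math.log10(r_peak / (1.0 - r_peak))


# Illustrative inputs, not measured data.
print(f"{hnr_db_from_powers(100.0, 1.0):.1f} dB")
print(f"{hnr_db_from_autocorr_peak(0.97):.1f} dB")
```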

(Figure: spectrum showing harmonic peaks above the noise ‘floor’; axes: frequency vs. amplitude)

From Hillenbrand et al. (1996) First harmonic amplitude

Prominent Cepstral Peak

Spectral Tilt: Voice Source

Spectral Tilt: Radiated Sound

Peak/average amplitude ratio

From Hillenbrand et al. (1996)

WMU Graduate Students

Tense/Pressed/Effortful/Strained Voice Perceptual Description Sense of effort in production Physiologic Factors Longer closed phase Reduced airflow Potential Acoustic consequences Change in harmonic (periodic) energy –Flatter harmonic roll off

Spectral Tilt: Pressed vs. Breathy

Acoustic Basis of Vocal Effort F0 + RMS + Open Quotient Perception of Effort Tasko, Parker & Hillenbrand (2008)

Roughness Perceptual Description –Perceived cycle-to-cycle variability in voice Physiologic Factors –Vocal folds vibrate, but in an irregular way Potential Acoustic Consequences –Cycle-to-cycle variations in F0 and amplitude –Elevated jitter –Elevated shimmer

Period/frequency & amplitude variability Jitter: variability in the period of each successive cycle of vibration Shimmer: variability in the amplitude of each successive cycle of vibration …

Jitter and Shimmer Sources of jitter and shimmer Small structural asymmetries of vocal folds “material” on the vocal folds (e.g. mucus) Biomechanical events, such as raising/lowering the larynx in the neck Small variations in tracheal pressures “Bodily” events – system noise Measuring jitter and shimmer Variability in measurement approaches Variability in how measures are reported Jitter –Typically reported as % or msec –Normal ~ % Shimmer –Can be % or dB –Norms not well established
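Since measurement approaches and reporting units vary, the sketch below shows only one simple family of definitions: the mean absolute difference between consecutive cycles, reported in the units the slide lists (% or msec for jitter, % or dB for shimmer). Other published definitions exist and give different numbers. The function names and the example cycle data are hypothetical.

```python
import math


def _mean(values):
    return sum(values) / len(values)


def _mean_abs_consecutive_diff(values):
    return _mean([abs(b - a) for a, b in zip(values, values[1:])])


def jitter_ms(periods_s):
    """Mean absolute cycle-to-cycle period difference, in milliseconds."""
    return 1000.0 * _mean_abs_consecutive_diff(periods_s)


def jitter_percent(periods_s):
    """Same difference, as a percentage of the mean period."""
    return 100.0 * _mean_abs_consecutive_diff(periods_s) / _mean(periods_s)


def shimmer_percent(amplitudes):
    """Mean absolute cycle-to-cycle amplitude difference, % of mean amplitude."""
    return 100.0 * _mean_abs_consecutive_diff(amplitudes) / _mean(amplitudes)


def shimmer_db(amplitudes):
    """Mean absolute dB ratio between consecutive cycle amplitudes."""
    ratios = [abs(20.0 * math.log10(b / a))
              for a, b in zip(amplitudes, amplitudes[1:])]
    return _mean(ratios)


# Invented cycle data: periods in seconds, peak amplitudes in arbitrary units.
periods = [0.0100, 0.0101, 0.0099, 0.0100, 0.0102]
amps = [0.80, 0.82, 0.79, 0.81, 0.80]
print(f"Jitter: {jitter_percent(periods):.2f} % ({jitter_ms(periods):.3f} ms)")
print(f"Shimmer: {shimmer_percent(amps):.2f} % ({shimmer_db(amps):.3f} dB)")
```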

Vocal Register What is a vocal register?

Vocal Registers Pulse (Glottal fry) –30-80 Hz, mean ~ 60 Hz –Closed phase very long (90% of cycle) –May see biphasic pattern of vibration (open, close a bit, open and close completely) –Low subglottal pressure (2 cm water) –Energy dies out over the course of a cycle, so parts of the cycle have very little energy –Hear each individual cycle

Vocal Registers Modal –VF are relatively short and thick –Reduced VF stiffness –Large amplitude of vibration –Possesses a clear closed phase –The result is a voice that is relatively loud and low in pitch –Average values cited refer to modal register

Vocal Registers Falsetto – Hz ( Hz males) –VF are relatively long and thin –Increased VF stiffness –Small amplitude of vibration –Vibration less complex –Incomplete closure (no closed phase) –The result is a voice that is high in pitch