SPPA 6010 Advanced Speech Science

Slides:



Advertisements
Similar presentations
Acoustic/Prosodic Features
Advertisements

Digital Signal Processing
Physical modeling of speech XV Pacific Voice Conference PVSF-PIXAR Brad Story Dept. of Speech, Language and Hearing Sciences University of Arizona.
Acoustic Characteristics of Vowels
SPPA 2000 Voice Lecture Stephen Tasko The Voice & Voice Disorders SPPA 2000 Stephen Tasko.
CSD 2230 HUMAN COMMUNICATION DISORDERS Topic 7 Speech Disorders Voice Disorders.
Voice Quality October 14, 2014 Practicalities Course Project report #2 is due! Also: I have new guidelines to hand out. The mid-term is on Tuesday after.
Voice and Voice Disorders
Hillenbrand: Phonation1 Phonation Note: Audio demos made with fsyn: original pitch, monotone, and inverted pitch. FDR demo original pitch and monotone.
Chapter 2 Resonance Perry C. Hanavan, Au.D.. Question What is meant by phonation? A.Whispered speech sound B.Voiced speech sound C.Produce a nasal sound.
ACOUSTICS OF SPEECH AND SINGING MUSICAL ACOUSTICS Science of Sound, Chapters 15, 17 P. Denes & E. Pinson, The Speech Chain (1963, 1993) J. Sundberg, The.
8 VOCE VISTA, ELECTROGLOTTOGRAMS, CLOSED QUOTIENTS
Fundamental Frequency & Jitter Lab 2. Fundamental Frequency Pitch is the perceptual correlate of F 0 Perception is not equivalent to measurement: –Pitch=
Phonation Physiology Chapter 5 Perry C. Hanavan, AuD.
Instrumentation: Vocal Fold Vibration 2/10/00. Glottogram Analyzes the vibratory pattern of the vocal folds Graph of the laryngeal source waveform Graph.
Anatomy of the vocal mechanism
ACOUSTICAL THEORY OF SPEECH PRODUCTION
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
Eva Björkner Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing HUT, Helsinki, Finland KTH – Royal Institute of Technology.
Unit 4 Articulation I.The Stops II.The Fricatives III.The Affricates IV.The Nasals.
Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University.
SPPA 4030 Speech Science1 Phonation SPPA 4030 Speech Science2 Topic Sequence Anatomy review Achieving phonation Capturing glottal and vocal fold behavior.
SPPA 6010 Advanced Speech Science 1 The Source-Filter Theory: The Sound Source.
Topic 3b: Phonation.
ACOUSTICS OF SINGING MUSICAL ACOUSTICS Science of Sound, Chapters 15, 17.
Anatomic Aspects Larynx: Sytem of muscles, cartileges and ligaments.
1 Lab Preparation Initial focus on Speaker Verification –Tools –Expertise –Good example “Biometric technologies are automated methods of verifying or recognising.
Learning Objectives Describe how speakers control frequency and amplitude of vocal fold vibration Describe psychophysical attributes of pitch, loudness.
Laryngeal Physiology.
Physiology of Phonation
Voice Assessment: Instrumental
Pitch Prediction for Glottal Spectrum Estimation with Applications in Speaker Recognition Nengheng Zheng Supervised under Professor P.C. Ching Nov. 26,
Chapter 6: The Human Ear and Voice
Airflows for Speech and Voice
Laryngeal Function and Speech Production
Voice evaluation.
Hoarse meeting in Liverpool April 22, 2005 Subglottal pressure and NAQ variation in Classically Trained Baritone Singers Eva Björkner*†, Johan Sundberg†,
Instrumental Assessment SPPA 6400 Voice Disorders: Tasko.
Voice Quality Feburary 11, 2013 Practicalities Course project reports to hand in! And the next set of guidelines to hand out… Also: the mid-term is on.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
Speech Acoustics1 Clinical Application of Frequency and Intensity Variables Frequency Variables Amplitude and Intensity Variables Voice Disorders Neurological.
Laryngeal Structure & Function; Vocal Fold Vibration
Peter R. LaPine, Ph.D. Department of Audiology and Speech Sciences Michigan State University.
Eva Björkner Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing HUT, Helsinki, Finland KTH – Royal Institute of Technology.
Male Cheerleaders and their Voices. Background Information: What Vocal Folds Look Like.
Voice Quality + Spectral Analysis Feburary 15, 2011.
Structure of Spoken Language
Speech Science VI Resonances WS Resonances Reading: Borden, Harris & Raphael, p Kentp Pompino-Marschallp Reetzp
Voice Quality + Korean Stops October 16, 2014 Don’t Forget! The mid-term is on Tuesday! So I have a review sheet for you. For the mid-term, we will just.
Phonation + Voice Quality Feburary 11, 2014 Weekday Update Course project report #2 is due right now! I have guidelines for course project report #3,
1. SPEECH PRODUCTION MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
Voicing + Basic Acoustics October 14, 2015 Agenda Production Exercise #2 is due on Friday! No transcription exercise this Friday! Today, we’ll begin.
Phonation.
The Speech Chain (Denes & Pinson, 1993)
Voice Quality Feburary 13, 2014 Practicalities The mid-term is on the Thursday after the break! So I have a review sheet for you. For the mid-term, we.
Voicing + Basic Acoustics October 14, 2015 Agenda Production Exercise #2 is due on Friday! No transcription exercise this Friday! Today, we’ll begin.
Behrman Chapter 5, 6 Place less emphasis on… Minor anatomical landmarks and features Extrinsic muscles of the larynx Blood supply to the larynx Central.
Phonation Physiology Phonation = series of openings and closings of the vocal folds Two phases 1.Prephonation phase: period during which VFs move from.
HOW WE TRANSMIT SOUNDS? Media and communication 김경은 김다솜 고우.
P105 Lecture #26 visuals 18 March 2013.
Instrumental Assessment
Fundamental Frequency Change
Laryngeal correlates of the English tense/lax vowel contrast
Breathy Voice Note that you can hear both a buzzy (periodic) component and a hissy (aperiodic) component.
SPPA 6010 Advanced Speech Science
Chapter 5 Vocal Mechanism
Voice source characterisation
1. SPEECH PRODUCTION MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
†Department of Speech Music Hearing, KTH, Stockholm, Sweden
Presentation transcript:

SPPA 6010 Advanced Speech Science Topic Sequence Basic Laryngeal Anatomy Myoelastic-aerodynamic theory of phonation Basic Features of Phonation Frequency and Intensity Control Other Phonatory Parameters Instrumentation SPPA 6010 Advanced Speech Science

Fundamental Frequency (F0) SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science F0 Control Anatomical factors Males ↑ VF mass and length = ↓ Fo Females ↓ VF mass and length = ↑ Fo Subglottal pressure adjustment – show example ↑ Psg = ↑ Fo Laryngeal and vocal fold adjustments ↑ CT activity = ↑ Fo TA activity = ↑ Fo or ↓ Fo Extralaryngeal adjustments ↑ height of larynx = ↑ Fo SPPA 6010 Advanced Speech Science

Fundamental Frequency (F0) Average F0 speaking fundamental frequency (SFF) Correlate of pitch Infants ~350-500 Hz Boys & girls (3-10) ~ 270-300 Hz Young adult females ~ 220 Hz Young adult males ~ 120 Hz Older females: F0 ↓ Older males: F0 ↑ F0 variability F0 varies due to Syllabic & emphatic stress Syntactic and semantic factors Phonetics factors (in some languages) Provides a melody (prosody) Measures F0 Standard deviation ~2-4 semitones for normal speakers F0 Range SPPA 6010 Advanced Speech Science

Maximum Phonational Frequency Range highest possible F0 - lowest possible F0 Not a speech measure measured in Hz, semitones or octaves Males ~ 80-700 Hz Females ~135-1000 Hz 3 octaves often considered normal SPPA 6010 Advanced Speech Science

Fundamental Frequency (F0) Control Ways to measure F0 Time domain vs. frequency domain Manual vs. automated measurement Specific Approaches Peak picking Zero crossing Autocorrelation The cepstrum & cepstral analysis SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Autocorrelation Data Correlation + 1.0 + 0.1 - 0.82 + 0.92 SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Cepstrum SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Amplitude SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Amplitude Control Measured as Pressure or Intensity Subglottal pressure adjustment ↑ Psg = ↑ sound pressure Laryngeal and vocal fold adjustments ↑ medial compression = ↑ sound pressure Supralaryngeal adjustments SPPA 6010 Advanced Speech Science

Sound Pressure Level (SPL) Average SPL Correlate of loudness conversation: ~ 65-80 dBSPL SPL Variability  SPL to mark stress Contributes to prosody Measure Standard deviation for neutral reading material: ~ 10 dBSPL SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Dynamic Range Amplitude analogue to maximum phonational frequency range ~50 – 115 dB SPL SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Topic Sequence Basic Laryngeal Anatomy Myoelastic-aerodynamic theory of phonation Basic Features of Phonation Frequency and Intensity Control Other Phonatory Parameters Instrumentation SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Vocal Quality SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Vocal Quality no clear acoustic correlates like pitch and loudness However, terms have invaded our vocabulary that suggest distinct categories of voice quality Common Terms Breathiness Tense Roughness Strain Hoarseness SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Are there features in the acoustic signal that correlate with these quality descriptors? SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Breathiness Perceptual Description Audible air escape in the voice Physiologic Factors Diminished or absent closed phase Increased airflow Potential Acoustic Consequences Change in harmonic (periodic) energy Sharper harmonic roll off Change in aperiodic energy Increased level of aperiodic energy (i.e. noise), particularly in the high frequencies SPPA 6010 Advanced Speech Science

harmonics (signal)-to-noise-ratio (SNR/HNR) harmonic/noise amplitude  HNR Relatively more signal Indicative of a normality  HNR Relatively more noise Indicative of disorder Normative values depend on method of calculation “normal” HNR ~ 15 SPPA 6010 Advanced Speech Science

Harmonic peak Noise ‘floor’ Harmonic peak Noise ‘floor’ Amplitude Frequency

SPPA 6010 Advanced Speech Science

First harmonic amplitude From Hillenbrand et al. (1996)

How to measure periodicity? Cepstrum revisited SPPA 6010 Advanced Speech Science

Prominent Cepstral Peak SPPA 6010 Advanced Speech Science

Spectral Tilt: Voice Source SPPA 6010 Advanced Speech Science

Spectral Tilt: Radiated Sound SPPA 6010 Advanced Speech Science

Peak/average amplitude ratio SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science From Hillenbrand et al. (1996)

SPPA 6010 Advanced Speech Science WMU Graduate Students SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Tense/Pressed Voice Perceptual Description Sense of effort in production Physiologic Factors Longer closed phase Reduced airflow Potential Acoustic consequences Change in harmonic (periodic) energy Flatter harmonic roll off SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Spectral Tilt Pressed Breathy SPPA 6010 Advanced Speech Science

Tense vs. Pressed vs. Effortful vs. Strain SPPA 6010 Advanced Speech Science

Acoustic Basis of Vocal Effort Perception of Effort F0 + RMS + Open Quotient Tasko, Parker & Hillenbrand (2008) SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Roughness Perceptual Description Perceived cycle-to-cycle variability in voice Physiologic Factors Vocal folds vibrate, but in an irregular way Potential Acoustic Consequences Cycle-to-cycle variations F0 and amplitude Elevated jitter Elevated shimmer SPPA 6010 Advanced Speech Science

Cycle to cycle variability in vibration Vocal fold vibration not strictly periodic ↓ frequency and amplitude fluctuations are normal When variability is excessive, it sounds abnormal SPPA 6010 Advanced Speech Science

Frequency variability Variability in the period of each successive cycle of vibration Termed frequency perturbation or jitter … SPPA 6010 Advanced Speech Science

Amplitude variability Variability in the amplitude of each successive cycle of vibration Termed amplitude perturbation or shimmer SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Jitter and Shimmer Sources of jitter and shimmer Small structural asymmetries of vocal folds “material” on the vocal folds (e.g. mucus) Biomechanical events, such as raising/lowering the larynx in the neck Small variations in tracheal pressures “Bodily” events – system noise Measuring jitter and shimmer Variability in measurement approaches Variability in how measures are reported Jitter Typically reported as % or msec Normal ~ 0.2 - 1% Shimmer Can be % or dB Norms not well established SPPA 6010 Advanced Speech Science

What is a vocal register? SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Vocal Registers Pulse (Glottal fry) 30-80 Hz, mean ~ 60 Hz Closed phase very long (90 % cycle) May see biphasic pattern of vibration (open, close a bit, open and close completely) Low subglottal pressure (2 cm water) Energy dies out over the course of a cycle so parts of the cycle has very little energy Hear each individual cycle SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Vocal Registers Modal VF are relatively short and thick Reduced VF stiffness Large amplitude of vibration Possesses a clear closed phase The result is a voice that is relatively loud and low in pitch Average values cited refer to modal register SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Vocal Registers Falsetto 500-1100 Hz (275-600 Hz males) VF are relatively long and thin Increased VF stiffness Small amplitude of vibration Vibration less complex Incomplete closure (no closed phase) The result is a voice that is high in pitch SPPA 6010 Advanced Speech Science

The larynx as an articulator SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Aspiration of plosives Laryngeal devoicing gesture Rapid abduction of vocal folds by what muscle? SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Phonatory onset Timing of respiratory and phonatory activities Simultaneous vocal attack Hard glottal attack Breathy attack SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Topic Sequence Basic Laryngeal Anatomy Myoelastic-aerodynamic theory of phonation Basic Features of Phonation Frequency and Intensity Control Other Phonatory Parameters Instrumentation SPPA 6010 Advanced Speech Science

Estimating glottal area Videolaryngoscopy Stroboscopy High speed video SPPA 6010 Advanced Speech Science

Photoglottography (PGG) SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Photoglottogram illumination Time SPPA 6010 Advanced Speech Science

Electroglottography (EGG) Human tissue =  conductor Air:  conductor Electrodes placed on each side of thyroid lamina high frequency, low current signal is passed between them VF contact  =  impedance VF contact  =  impedance SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Electroglottogram SPPA 6010 Advanced Speech Science

Glottal Airflow (volume velocity) Instantaneous airflow is measured as it leaves the mouth Looks similar to a pressure waveform Can be inverse filtered to remove effects of vocal tract Resultant is an estimate of the airflow at the glottis SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Flow Glottogram SPPA 6010 Advanced Speech Science

SPPA 6010 Advanced Speech Science Synchronous plots Sound pressure waveform (at mouth) Flow glottogram (inverse filtered mask signal) Photoglottogram Electroglottogram SPPA 6010 Advanced Speech Science