Download presentation
Presentation is loading. Please wait.
Published byLindsay Booker Modified over 9 years ago
1
Assessment of Vocal Noise via Bi-directional Long-term Linear Prediction of Running Speech F. Bettens *, F. Grenez *, J. Schoentgen *,** * Université Libre de Bruxelles ** National Fund for Scientific Research Belgium
2
CauseVocal Dysperiodicities Vocal Fold Dynamics Diplophonia, Bi-Phonation, Random Vibrations Perturbations Vocal Jitter & Shimmer, Frequency & Amplitude Tremor (Audible) Additive Noise Owing to Turbulence Breathiness, Breathy Voice, Whispery Voice, … “Parasitic” Vibrations Vibrations Ventricular Folds or Ary-Epiglottic Ligaments, … Transients Pitch Breaks, Phonation Breaks, Timbre Breaks, …
3
Existing Cues of Vocal Noise Detection of individual vocal cycles (or harmonics) Steady vowel fragments (Pseudo)-Periodicity Period Perturbation Quotient Amplitude Perturbation Quotient Harmonics-to-Noise Ratio
4
Objectives : Analyses of Dysperiodicities Give up request that speech fragments are : (Pseudo)-Periodic Steady Any Speech Fragment : Modal Voices & (Very) Hoarse Voices Sustained Vowels & Running Speech
5
Motivation : Analysis of Running Speech Voicing in running speech Variable acoustic impedance Voicing onsets & offsets Variable pressure drops Variable laryngeal positions Voice Loading
6
Double Linear Predictive Analysis Conventional short-term linear prediction: Long-term linear prediction: remove existing correlations unpredictable noise component (Qi, 1999) forward short-term prediction error forward double prediction error
7
Double Linear Predictive Analysis Drawbacks: –e S [n] is an artificial signal –the dysperiodicities in weighted sum x [n] are omitted –e L [n] is inflated to the right of unvoiced/voiced boundaries Solutions: remove short-term linear predictive analysis stage proceed to bi-directional analysis
8
Forward long-term linear prediction: Backward long-term linear prediction: Bi-directional long-term linear prediction: keep the “best” (frame by frame) Bi-directional Long-term Prediction forward long-term prediction error backward long-term prediction error bi-directional long- term prediction error
9
Long-term Prediction Distance : P Maximum of the auto-correlation function example: steady vowel [a] (dysphonic speaker) P = 184 (2 cycles)
10
Vocal Noise Cue Signal-to-Dysperiodicity Ratio: SDR = 31,2 dB speech signal dysphonic speaker bi-directional long-term prediction error SDR = 10,1 dB healthy speaker x[n]x[n] eL[n]eL[n] example: steady vowel [a]
11
Results 1 : Sentence (1 female speaker; modal phonation type) ( http://www.limsi.fr/VOQUAL/ : “Il est sorti avant le jour”) http://www.limsi.fr/VOQUAL/ speech signal forward long-term prediction error bi-directional long-term prediction error segments [il]
12
Results 2 : Sentence (1 female speaker; 5 phonation types) ( http://www.limsi.fr/VOQUAL/ : “Il est sorti avant le jour”) http://www.limsi.fr/VOQUAL/
13
Conclusion The forward & backward long-term prediction of speech enables the analysis of any speech signal with a view to the assessment of the vocal noise (i.e. vocal dysperiodicities) The analysis is not based on any assumptions regarding the periodicity or stationarity of the speech signals
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.