Presentation is loading. Please wait.

Presentation is loading. Please wait.

Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University.

Similar presentations


Presentation on theme: "Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University."— Presentation transcript:

1

2 Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University

3 Emasters School Leuven 2002Voice Source Characterization2 Voice research To describe and model the properties of the vocal sound source from view points of: –Physiology –Acoustics –Perception

4 Emasters School Leuven 2002Voice Source Characterization3 Importance of the voice Speech synthesis –Towards natural sounding synthesis Speech recognition –Using source properties in recognition Speaker recognition/identification –Voice source characteristics are essential Diagnosis –Pathologies, voice classifications

5 Emasters School Leuven 2002Voice Source Characterization4 Voice possibilities Limited use of voice in speech Range of the fundamental frequency Vocal intensity range Spectral variation

6 Emasters School Leuven 2002Voice Source Characterization5 Focus in this presentation How do acoustic voice source characteristics vary as a function of F 0 and vocal intensity

7 Emasters School Leuven 2002Voice Source Characterization6 Voice profile measurement Thirties: Intensity range as function of various pitches –manual measurement Eighties: Automatic computation of F 0 and Intensity – computer measurement – visual feedback – additional parameters

8 Emasters School Leuven 2002Voice Source Characterization7 Measurement unit One decibel One semi-tone

9 Emasters School Leuven 2002Voice Source Characterization8 Measurement procedure Subject in front of computer screen Microphone on head set (30 cm) Just phonate, sing, and see the result immediately Best results with recording protocol Feed back stimulates extreme phonations

10 Emasters School Leuven 2002Voice Source Characterization9 Fundamental frequency (Hz) Vocal Intensity (dB SPL) Sample density Voice profile / density

11 Emasters School Leuven 2002Voice Source Characterization10 Fundamental frequency (Hz) Vocal Intensity (dB SPL) Sample density Voice profile / speech area

12 Emasters School Leuven 2002Voice Source Characterization11 Acoustic voice quality parameters Jitter –Stability of periodicity –Asymmetry in vocal folds Crest factor –Max amplitude divided by average energy –Relates to spectral slope Many more …

13 Emasters School Leuven 2002Voice Source Characterization12 Crest factor Vocal Intensity (dB SPL) Fundamental frequency (Hz) Crest factor

14 Emasters School Leuven 2002Voice Source Characterization13

15 Emasters School Leuven 2002Voice Source Characterization14 Real time presentation Screen presentation One data point per F 0 -I cell Advanced data storage [new] Full audio signal Full distribution of data per F 0 -I cell Data for screen presentation

16 Emasters School Leuven 2002Voice Source Characterization15 Advantages Reusability of recordings Statistical analysis per F 0 -I cell Study of time-varying behavior

17 Emasters School Leuven 2002Voice Source Characterization16 Crest factor Vocal Intensity (dB SPL) Fundamental frequency (Hz) Crest factor

18 Emasters School Leuven 2002Voice Source Characterization17 Median smoothing of crest factor Vocal Intensity (dB SPL) Fundamental frequency (Hz) Crest factor Crest factor median smoothed

19 Emasters School Leuven 2002Voice Source Characterization18 Vocal Registers Different movement patterns of the vocal folds Pulse register (creaky voice) Modal register Falsetto register

20 Emasters School Leuven 2002Voice Source Characterization19 Pulse register Less than 50 Hz Irregular Long closed period

21 Emasters School Leuven 2002Voice Source Characterization20 Fundamental Frequency (Hz) Vocal Intensity (dB SPL) Pulse register

22 Emasters School Leuven 2002Voice Source Characterization21 Modal register “Normal” use of voice Active role of M. Vocalis Vocal folds thick and completely vibrating Wide range in F 0 and intensity Flat spectrum

23 Emasters School Leuven 2002Voice Source Characterization22 Fundamental frequency (Hz) Vocal Intensity (dB SPL) Modal register

24 Emasters School Leuven 2002Voice Source Characterization23 Falsetto register Higher pitches M. Vocalis passive, tense vocal ligaments through M.Cricothyroidus Edge vibration of vocal volds Sound poor in higher harmonics (in untrained subjects)

25 Emasters School Leuven 2002Voice Source Characterization24 Fundamental frequency (Hz) Vocal Intensity (dB SPL) Falsetto register

26 Emasters School Leuven 2002Voice Source Characterization25 Fundamental frequency (Hz) Vocal Inensity (dB SPL) Register overlap

27 Emasters School Leuven 2002Voice Source Characterization26 Chest- en head voice Refer to secundary vibratory sensations in the body Chest voice: loud modal register Head voice: –males: higher, softer modal register in overlap area with falsetto register –women: falsetto register

28 Emasters School Leuven 2002Voice Source Characterization27 Fundamental frequency (Hz) Vocal Intensity (dB SPL) Chest voice and Head voice chest head

29 Emasters School Leuven 2002Voice Source Characterization28 Registers and voice profiles With a description using Iso-crest factor lines Iso-jitter lines

30 Emasters School Leuven 2002Voice Source Characterization29 Iso-crest factor lines 4 dB 6 dB Vocal Intensity (dB SPL) Crest factor Fundamental frequency (Hz)

31 Emasters School Leuven 2002Voice Source Characterization30 Vocal Intensity (dB SPL) Fundamental frequency (Hz) 3 % Jitter (%) Iso-jitter lines

32 Emasters School Leuven 2002Voice Source Characterization31 New representation Areas defined by iso-parameter lines –crest factor < 4 dB –crest factor > 4 dB, < 6 dB –crest factor > 6 dB –jitter < 3 % –[relative rise time < 6 %]

33 Emasters School Leuven 2002Voice Source Characterization32 Areas in the phonetogram Vocal Intensity (dB SPL) Fundamental frequency (Hz) Jitter > 3%, unstable RRT < 6 % pressed-like Crest factor < 4 dB sine-like

34 Emasters School Leuven 2002Voice Source Characterization33 Fundamental frequency (Hz) Vocal registers in the phonetogram Falsetto upper boundary Modal lower boundary Chest voice boundary Vocal Intensity (dB SPL)

35 Emasters School Leuven 2002Voice Source Characterization34 Comparison of voice profiles Characterisation of Voice pathologies Voice classifications Reuse stored voice profiles of subjects with known voice history

36 Emasters School Leuven 2002Voice Source Characterization35 Important features Contour has limited value –but most research goes into that direction (norm profiles) Distribution of acoustical parameters across the voice profile tells much more

37 Emasters School Leuven 2002Voice Source Characterization36 Unit for comparison Voice profile unit defined by small range of F 0 and Vocal Intensity Distributions of acoustic voice parameters per unit Probability density function per parameter Model Hidden Markov Model We need

38 Emasters School Leuven 2002Voice Source Characterization37 two unconnected states per phonetogram unit vocal registers start and end of phonetion Unit model

39 Emasters School Leuven 2002Voice Source Characterization38 SpeechVoice Profile phoneme modelF 0 /I unit model not labeledlabeled by F 0 and I spectral envelopeacoustic voice parameters language modelunrestricted transitions “forced alignment recognition” Correspondences

40 Emasters School Leuven 2002Voice Source Characterization39 Crest factor distributions

41 Emasters School Leuven 2002Voice Source Characterization40 Fundamental frequency (Hz) Vocal Intensity (dB SPL) Distinctiveness Most distinctive states

42 Emasters School Leuven 2002Voice Source Characterization41 Conclusions Voice profiles can enhance our understanding of vocal behaviour in a visually attractive way Current data storage opens a series of important research topics Market opportunities for “light” versions


Download ppt "Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University."

Similar presentations


Ads by Google