Aging Female Voices: an Acoustic and Perceptive Analysis Technical University of Berlin, Germany Institute for Language and Communication Markus Brückl,

Slides:



Advertisements
Similar presentations
An Approach in Reproducing the Auto-Tune Effect Mentees: Dong-San Choi & Tejas Rawal Mentor: David Jun.
Advertisements

Tom Lentz (slides Ivana Brasileiro)
PF-STAR: emotional speech synthesis Istituto di Scienze e Tecnologie della Cognizione, Sezione di Padova – “Fonetica e Dialettologia”, CNR.
Auditory scene analysis 2
Glottal Source Parameterization: A Comparative Study Authors: Ixone Arroabarren, Alfonso Carlosena UNIVERSIDAD PÚBLICA DE NAVARRA Dpt. Electrical Engineering.
1 A Spectral-Temporal Method for Pitch Tracking Stephen A. Zahorian*, Princy Dikshit, Hongbing Hu* Department of Electrical and Computer Engineering Old.
Liner Predictive Pitch Synchronization Voiced speech detection, analysis and synthesis Jim Bryan Florida Institute of Technology ECE5525 Final Project.
Associations of behavioral parameters of speech emotional prosody perception with EI measures in adult listeners Elena Dmitrieva Kira Zaitseva, Alexandr.
Defining and Measuring Voice Quality Jody Kreiman*, Diana Vanlancker- Sidtis**, and Bruce R. Gerratt* *UCLA and **New York University.
Jessica E. Huber Ph.D. in Speech Science from University at Buffalo MA in Speech-Language Pathology, Certified Speech- Language Pathologist Assistant Professor,
Fundamental Frequency & Jitter Lab 2. Fundamental Frequency Pitch is the perceptual correlate of F 0 Perception is not equivalent to measurement: –Pitch=
Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Speaker Identification Using Pitch Engineering Expo Banquet /08/09.
Vineel Pratap Girish Govind Abhilash Veeragouni. Human listeners are capable of extracting information from the acoustic signal beyond just the linguistic.
CENTER FOR SPOKEN LANGUAGE UNDERSTANDING 1 PREDICTION AND SYNTHESIS OF PROSODIC EFFECTS ON SPECTRAL BALANCE OF VOWELS Jan P.H. van Santen and Xiaochuan.
PHONETICS AND PHONOLOGY
Assessment of Vocal Noise via Bi-directional Long-term Linear Prediction of Running Speech F. Bettens *, F. Grenez *, J. Schoentgen *,** * Université Libre.
Speaking Style Conversion Dr. Elizabeth Godoy Speech Processing Guest Lecture December 11, 2012.
Vocal microtremor in normophonic and mildly dysphonic speakers Jean Schoentgen Université Libre Bruxelles Brussels - Belgium.
Anatomy of the vocal mechanism
Analysis and Synthesis of Shouted Speech Tuomo Raitio Jouni Pohjalainen Manu Airaksinen Paavo Alku Antti Suni Martti Vainio.
Emotions and Voice Quality: Experiments with Sinusoidal Modeling Authors: Carlo Drioli, Graziano Tisato, Piero Cosi, Fabio Tesser Institute of Cognitive.
Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?
Learning Objectives Describe how speakers control frequency and amplitude of vocal fold vibration Describe psychophysical attributes of pitch, loudness.
Laryngeal Physiology.
Maria Cristina Matteucci, Dina Guglielmi
Pitch Prediction for Glottal Spectrum Estimation with Applications in Speaker Recognition Nengheng Zheng Supervised under Professor P.C. Ching Nov. 26,
Acoustic and Linguistic Characterization of Spontaneous Speech Masanobu Nakamura, Koji Iwano, and Sadaoki Furui Department of Computer Science Tokyo Institute.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
Instrumental Assessment SPPA 6400 Voice Disorders: Tasko.
IIT Bombay ICA 2004, Kyoto, Japan, April 4 - 9, 2004   Introdn HNM Methodology Results Conclusions IntrodnHNM MethodologyResults.
CSD 5100 Introduction to Research Methods in CSD The Introduction Section.
Speech Acoustics1 Clinical Application of Frequency and Intensity Variables Frequency Variables Amplitude and Intensity Variables Voice Disorders Neurological.
Male Cheerleaders and their Voices. Background Information: What Vocal Folds Look Like.
The vowel detection algorithm provides an estimation of the actual number of vowel present in the waveform. It thus provides an estimate of SR(u) : François.
Subtractive Sound Synthesis. Subtractive Synthesis Involves subtracting frequency components from a complex tone to produce a desired sound Why is it.
SH 565- Instrumentation in Communicative Disorders Spring ‘02.
UNDERSTANDING EFFECTIVE COMMUNICATION TECHNIQUES.
© 2005, it - instituto de telecomunicações. Todos os direitos reservados. Arlindo Veiga 1,2 Sara Cadeias 1 Carla Lopes 1,2 Fernando Perdigão 1,2 1 Instituto.
Predicting Voice Elicited Emotions
0 / 27 John-Paul Hosom 1 Alexander Kain Brian O. Bush Towards the Recovery of Targets from Coarticulated Speech for Automatic Speech Recognition Center.
SPPA 6010 Advanced Speech Science
Detection of Vowel Onset Point in Speech S.R. Mahadeva Prasanna & Jinu Mariam Zachariah Department of Computer Science & Engineering Indian Institute.
IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
Topic: Pitch Extraction
ASSESSING SEARCH TERM STRENGTH IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone Institute for Signal and Information Processing, Temple University.
               n n n d d d z z z       Elettoglotographic study from Brazilian Portuguese fricative voiced sounds Dra. Luciana de.
Effective Communication Techniques. Interest Approach Give each student a copy of a relevant news article. Explain the importance of skimming and scanning.
Research Methodology Proposal Prepared by: Norhasmizawati Ibrahim (813750)
Spectral subtraction algorithm and optimize Wanfeng Zou 7/3/2014.
Pitch and Amplitude Perturbation (Jitter and Shimmer) Basic idea: Phonated speech is called quasiperiodic, with quasi being Latin for “sort of” or “more-or-less.
Voice Activity Detection Based on Sequential Gaussian Mixture Model Zhan Shen, Jianguo Wei, Wenhuan Lu, Jianwu Dang Tianjin Key Laboratory of Cognitive.
Voice and Speech therapy using VOCALAB From research to practice
Pitch and Amplitude Perturbation (Jitter and Shimmer)
Instrumental Assessment
Automatic Speech Recognition
Laryngeal correlates of the English tense/lax vowel contrast
Chapter 5 Vocal Mechanism
Speech Perception CS4706.
Voice source characterisation
Norm-Based Coding of Voice Identity in Human Auditory Cortex
Speech Perception (acoustic cues)
How (Not) to Add Laughter to Synthetic Speech
Eugeniusz Cyran KUL, Lublin
COPYRIGHT © All rights reserved by Sound acoustics Germany
Analyzing F0 and vowel formants of Persian based on long-term features
Presentation transcript:

Aging Female Voices: an Acoustic and Perceptive Analysis Technical University of Berlin, Germany Institute for Language and Communication Markus Brückl, Walter Sendlmeier

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier2 Introduction Sue Ellen Linville: -“Firm conclusions as to the effect of aging on jitter and shimmer levels are not now possible.” -“Amplitude SD in female speakers with aging has yet to be investigated.” -“Research is necessary to examine spectral noise as a correlate of perceived age estimates from women’s voices.” -“Studies have not been conducted correlating age estimates to speech rate in female speakers.” Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier3 Aims of this Study Investigate Amp SD (and other perturbation measures) Articulation rate Spectral noise, as a function of chronological age and perceived age -Further acoustic parameters: tremor measures F0 -Relevance of vowel onset for age perception and age measurement Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier4 Data -56 speakers, aged from 20 to 87 (AM=49.77, SD=16.01) -8 types of voice samples, assumed to differ in amount and type of age-related information: Spontaneous speech (s-sp) Read speech (r-sp) Sustained vowels /a/, /i/ and /u/ -Onset sample (e.g. /a/-o)/a/-o -Quasi-stationary sample (e.g. /a/-s)/a/-s Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier5 Methods -15 young adult listeners rated perceived age of each sample -estimations are significantly concordant  listeners’ age perceptions are averaged  perceived age for each voice sample -22 Acoustic parameters are extracted separately for each voice sample -Correlation of: acoustic parameters and chronological age acoustic parameters and perceived age perceived age and chronological age Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier6 Accuracy of Perception Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary rp

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier7 Accuracy of Perception Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary rp

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier8 Accuracy of Perception Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary rp

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier9 Accuracy of Perception: Summary: -the most accurate age estimations result from spontaneous speech and read speech -accuracy of age estimations on vowels differs according to vowel type onset criterion: vowels containing onset are rated more accurately Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier10 Acoustic Parameters Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier11 Acoustic Parameters Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary F0

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier12 Acoustic Parameters Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary Jita STD Jitt RAP PPQ sPPQ vF0 F0

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier13 Acoustic Parameters Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary Jita STD Jitt RAP PPQ sPPQ vF0 ShdB Shim APQ sAPQ vAm F0

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier14 Acoustic Parameters Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary Jita STD Jitt RAP PPQ sPPQ vF0 ShdB Shim APQ sAPQ vAm NHR VTI SPI F0

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier15 Acoustic Parameters FTRI ATRI Jita STD Jitt RAP PPQ sPPQ vF0 ShdB Shim APQ sAPQ vAm NHR VTI SPI F0 Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier16 Acoustic Parameters t t(br) N(br) AR FTRI ATRI Jita STD Jitt RAP PPQ sPPQ vF0 ShdB Shim APQ sAPQ vAm NHR VTI SPI F0 Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier17 Acoustic Correlates The estimated age values are generally stronger correlated with the acoustic measures than the chronological age F0-measurements confirm former findings: -F0 is steadily decreasing with increasing age in women’s voices -not correlated to age in /i/ and /u/  intrinsic pitch Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier18 Acoustic Correlates F0 perturbation measures: -minor respectively sporadic correlations with age (compared to amplitude perturbation)  F0 perturbations are rather related to physical fitness Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier19 Acoustic Correlates Amplitude perturbation measures: APQ of spontaneous speech is the best acoustic measure of age in this study Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier20 Acoustic Correlates Amplitude perturbation measures: -The strongest relations of amplitude perturbation measures and age are achieved with a smoothing factor of 5 and 55 cycles (APQ and sAPQ)  AMP SD and shimmer less correlated -relation of amplitude perturbation and age can not be found in read speech and in /i/ and /u/ vowels Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier21 Acoustic Correlates Spectral energy distribution: -SPI (soft phonation) correlates with perceived age in /a/ vowels -NHR (spectral noise) shows only faint correlations with perceived age in /a/ vowels -VTI (breathiness) is not correlated to age Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier22 Acoustic Correlates Vocal Tremor: -FTRI is increasing with age more reliably than other measures in all sustained vowels but not in read and spontaneous speech -ATRI is not correlated to age Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier23 Acoustic Correlates Speech Tempo: AR of read speech is correlated with age Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier24 Summary Acoustic correlates of age: -Amplitude perturbation quotient, best from spontaneous speech samples -Frequency tremor intensity index -Average fundamental frequency Indirectly correlated: -frequency perturbation – fitness -spectral noise – fitness -speech tempo – cognitive performance Relevance of vowel onset Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier25 Questions Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier26 Implications -Are the found acoustic correlates of age perceptively relevant? – synthesis -Why do amplitude perturbation measurements of spontaneous speech correlate to age? – apply measure on segmented speech -What measures the FTRI? Can it be improved? – reproduce and alter algorithm -How is age information decoded in vowel onset? – analysis in different spectral bands Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier27 Accuracy of Perception chron.age = 1.37(perc.age(s-sp)) – 9.63 Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier28 Accuracy of Perception chron.age = 1.37(perc.age(s-sp)) – 9.63 Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier29 Acoustic Correlates -The estimated age values are generally stronger correlated with the acoustic measures than the chronological age -multiple regression explains up to 47% of the variance of chronological age and 40% (corrected? R²) of the variance of perceived age (spontaneous speech) Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier30 Read Text Ich bin zuerst einmal nur geradeaus gegangen. Und dann an der fünften Ampel rechts in die Grabenstraße rein. Die heißt übrigens nach einem halben Kilometer Steinmetzstraße. An der nächsten Ecke bin ich links in die Helenenstraße abgebogen und kurz danach gleich wieder links in die Schloßstraße – ach nein, falsch, da musste ich ja rechts in die Königsberger Straße. Dann lief ich am Schwimmbad vorbei bis zur Überführung – wie Du es mir gesagt hast. (Example: ) Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier31 Described Picture W. E. Hill’s „My wife and my mother-in- law“, demonstrating perceptual ambiguity (Example: ) Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary

Voqual ‘03, Geneva: Aging Female Voices TU Berlin; Institute for Language and Communication; Brückl, Sendlmeier32 Defining Vocal Age Vocal Aging is the process of the long- term alteration of the biological subsystem speaking apparatus. Vocal Age is the sum of information in the acoustical signal on a specific state during the process of vocal aging. Introduction Aims Data Methods Accuracy of Perception Acoustic Parameters Acoustic Correlates Summary