Within-speaker variability in long-term F0

Slides:



Advertisements
Similar presentations
KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)
Advertisements

Tone perception and production by Cantonese-speaking and English- speaking L2 learners of Mandarin Chinese Yen-Chen Hao Indiana University.
© Fraunhofer FKIE Corinna Harwardt Automatic Speaker Recognition in Military Environment.
Philip Harrison J P French Associates & Department of Language & Linguistic Science, York University IAFPA 2006 Annual Conference Göteborg, Sweden Variability.
Speech Productions of French- English Bilingual Speakers in Western Canada Nicole Netelenbos Fangfang Li.
Effects of Competence, Exposure, and Linguistic Backgrounds on Accurate Production of English Pure Vowels by Native Japanese and Mandarin Speakers Malcolm.
The perception of dialect Julia Fischer-Weppler HS Speaker Characteristics Venice International University
Rate of Syllable Production in Selected Languages Aubrey Wilson and Ron Netsell Missouri State University Abstract In different situations and across varying.
Analysis of distorted waveforms using parametric spectrum estimation methods and robust averaging Zbigniew LEONOWICZ 13 th Workshop on High Voltage Engineering.
Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.
Associations of behavioral parameters of speech emotional prosody perception with EI measures in adult listeners Elena Dmitrieva Kira Zaitseva, Alexandr.
Adaption Adjusting Model’s parameters for a new speaker. Adjusting all parameters need a huge amount of data (impractical). The solution is to cluster.
Departments of Medicine and Biostatistics
EBI Statistics 101.
Agricultural and Biological Statistics
Perception of syllable prominence by listeners with and without competence in the tested language Anders Eriksson 1, Esther Grabe 2 & Hartmut Traunmüller.
Sentence Durations and Accentedness Judgments ABSTRACT Talkers in a second language can frequently be identified as speaking with a foreign accent. It.
Vocal Emotion Recognition with Cochlear Implants Xin Luo, Qian-Jie Fu, John J. Galvin III Presentation By Archie Archibong.
Voice Onset Time as a Parameter for Identification of Bilinguals Claire Gurski University of Western Ontario London, ON Canada.
Preliminary F0 Statistics for Young Swedish Males and Forensic Phonetics Jonas Lindh – Department.
Edpsy 511 Homework 1: Due 2/6.
As with averages, researchers need to transform data into a form conducive to interpretation, comparisons, and statistical analysis measures of dispersion.
Maria Cristina Matteucci, Dina Guglielmi
A PRESENTATION BY SHAMALEE DESHPANDE
Acoustic and Linguistic Characterization of Spontaneous Speech Masanobu Nakamura, Koji Iwano, and Sadaoki Furui Department of Computer Science Tokyo Institute.
ACCURATE TELEMONITORING OF PARKINSON’S DISEASE SYMPTOM SEVERITY USING SPEECH SIGNALS Schematic representation of the UPDRS estimation process Athanasios.
Adaption Def: To adjust model parameters for new speakers. Adjusting all parameters requires too much data and is computationally complex. Solution: Create.
Segmental factors in language proficiency: Velarization degree as a signature of pronunciation talent Henrike Baumotte and Grzegorz Dogil {henrike.baumotte,
1 VIU Seminar April 2009 Alcoholized Speech: F0 and Rhythm Florian Schiel Bavarian Archive for Speech Signals Institute of Phonetics and Speech.
ABSTRACT METHODS RESULTS CONCLUSION Background: Georgia rates the quality of early childcare learning centers using a tiered quality improvement system.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
IAFPA 2007 Plymouth, July 22-25, 2007 Developments in automatic speaker recognition at the BKA Michael Jessen, Bundeskriminalamt Franz Broß, Univ. Applied.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
METHODOLOGY INTRODUCTION ACKNOWLEDGEMENTS LITERATURE Low frequency information via a hearing aid has been shown to increase speech intelligibility in noise.
1 Methods for detection of hidden changes in the EEG H. Hinrikus*, M.Bachmann*, J.Kalda**, M.Säkki**, J.Lass*, R.Tomson* *Biomedical Engineering Center.
Speech Perception 4/4/00.
 (Worse) It is a fact that engineers select an appropriate variable and the transformed observations are treated as though they are normally distributed.
A methodology for the creation of a forensic speaker recognition database to handle mismatched conditions Anil Alexander and Andrzej Drygajlo Swiss Federal.
INTRODUCTION Team members: Institution: Professor: Date of submission:
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
Perceptual Analysis of Talking Avatar Head Movements: A Quantitative Perspective Xiaohan Ma, Binh H. Le, and Zhigang Deng Department of Computer Science.
Confidence intervals. Estimation and uncertainty Theoretical distributions require input parameters. For example, the weight of male students in NUS follows.
Speaker Identification by Combining MFCC and Phase Information Longbiao Wang (Nagaoka University of Technologyh, Japan) Seiichi Nakagawa (Toyohashi University.
Performance Comparison of Speaker and Emotion Recognition
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Adaption Def: To adjust model parameters for new speakers. Adjusting all parameters requires an impractical amount of data. Solution: Create clusters and.
Phonetic features in ASR Kurzvortrag Institut für Kommunikationsforschung und Phonetik Bonn 17. Juni 1999 Jacques Koreman Institute of Phonetics University.
Testing a Single Mean Module 16. Tests of Significance Confidence intervals are used to estimate a population parameter. Tests of Significance or Hypothesis.
Anil Alexander 1, Oscar Forth 1, Marianne Jessen 2 and Michael Jessen 3 1 Oxford Wave Research Ltd, Oxford, United Kingdom 2 Stimmenvergleich, Wiesbaden,
RESEARCH MOTHODOLOGY SZRZ6014 Dr. Farzana Kabir Ahmad Taqiyah Khadijah Ghazali (814537) SENTIMENT ANALYSIS FOR VOICE OF THE CUSTOMER.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
Sentence Durations and Accentedness Judgments
Statistics In Research
Noella Handley M.A. Student in Linguistics
4aPPa32. How Susceptibility To Noise Varies Across Speech Frequencies
The Oxford Vocal (OxVoc) Sounds Database:
Acoustics´08 Paris, 29 June – July 2008
Numerical Measures: Centrality and Variability
Data Analysis AMA Collegiate Marketing Research Certificate Program.
Random walk initialization for training very deep feedforward networks
Disseminating statistical data by short quantified sentences of natural language Miroslav Hudec Faculty of Economic Informatics, University of Economics.
Summary descriptive statistics: means and standard deviations:
AP STATISTICS LESSON 10 – 2 (DAY 3)
15.1 The Role of Statistics in the Research Process
A maximum likelihood estimation and training on the fly approach
MATH 6380J Mini-Project 1: Realization of Recent Trends in Machine Learning Community in Recent Years by Pattern Mining of NIPS Words Chan Lok Chun
Chapter 10 Introduction to the Analysis of Variance
Analyzing F0 and vowel formants of Persian based on long-term features
Presentation transcript:

Within-speaker variability in long-term F0 Homa Asadi1, Volker Dellwo2 1Department of Linguistics, Alzahra University, Tehran, Iran 2Institute of Computational Linguistics, University of Zurich, Zurich, Switzerland Center for Forensic Phonetics & Acoustics Introduction Aim of study F0 is considered forensically useful since it conforms to many of the desiderata for FVC parameters (Rose, 2002). F0 is easily extracted and quantified and readily available even in a short stretch of speech. It is also robust to some extent against poor-quality recordings, background noise (in particular non-harmonic noise) and different channel transmissions. Despite its numerous advantages, F0 is sensitive to a large amount of variation within a single speaker. F0 is highly affected by factors such as emotion, state of health, intoxication, voice disguise and recording codecs (Braun, 1995; Gold, 2014). We tried to eliminate factors influencing F0 variation as much as possible. Motivated by the fact that long-term features are less susceptible to within-speaker variability, we aim to analyze long-term features of F0 (namely mean and standard deviation) between recordings of 10 Persian male speakers. Results Participants & speech material LTF extraction Number of speakers= 10 Age= 25 to 28 years old Gender= male Speakers’ language= Persian (Standard Contemporary Persian) 1200 sentences = 10 speakers * 60 sentences * 2 repetitions Average number of syllable per sentence = 12 Measure Factor tested Test Result LTF0-Mean Speaker LM F(141,142)= 39.41, p<0.0001 LTFF0-SD F(141,142)= 0.69, p=0.4 between- and within-speaker variability for the mean of long-term F0 in 10 Persian male speakers Conclusions The results of linear modelling indicated that the effect of speaker is significant for the mean of long-term F0 No main effect was obtained for the standard deviation of LTF0 . The results of within-speaker variability analysis revealed that the variability of long-term distributions of F0 as a function of repetition is not significant. Within-speaker variability was present in speakers 2, 5 ,7 ,8 and 10 which subsequently implies that half of speakers have behaved differently on their two sessions of recording procedures. The results revealed that within-speaker variability was strong across speakers, even though the corpus was extremely controlled in terms of factors influencing variation in F0 measures. References Braun, A. (1995). Fundamental frequency – How speaker-specific is it? In A. Braun and J.P. Köster (eds.) Studies in Forensic Phonetics. Trier: Wissenschaftlicher Verlag, pp. 9-23. Gold, E. and French, J.P. (2011). International practices in forensic speaker comparison. Journal of Speech, Language and the Law, 18, 293-307. Gold, E. (2014). Calculating likelihood ratios for Forensic Speaker Comparisons Using Phonetic and Linguistic Parameters. PhD dissertation, University of York. Rose, P. (2002). Forensic speaker identification. London & New York: Taylor & Francis.