Analyzing F0 and vowel formants of Persian based on long-term features

Slides:



Advertisements
Similar presentations
© Fraunhofer FKIE Corinna Harwardt Automatic Speaker Recognition in Military Environment.
Advertisements

Philip Harrison J P French Associates & Department of Language & Linguistic Science, York University IAFPA 2006 Annual Conference Göteborg, Sweden Variability.
Speech Productions of French- English Bilingual Speakers in Western Canada Nicole Netelenbos Fangfang Li.
Effects of Competence, Exposure, and Linguistic Backgrounds on Accurate Production of English Pure Vowels by Native Japanese and Mandarin Speakers Malcolm.
The perception of dialect Julia Fischer-Weppler HS Speaker Characteristics Venice International University
Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.
Associations of behavioral parameters of speech emotional prosody perception with EI measures in adult listeners Elena Dmitrieva Kira Zaitseva, Alexandr.
Adaption Adjusting Model’s parameters for a new speaker. Adjusting all parameters need a huge amount of data (impractical). The solution is to cluster.
A comparison of rhythms in Jamaican Creole speech and reggae music Project’s long term goals We chose to compare the rhythmic patterns of Jamaican Creole.
Sentence Durations and Accentedness Judgments ABSTRACT Talkers in a second language can frequently be identified as speaking with a foreign accent. It.
Voice Onset Time as a Parameter for Identification of Bilinguals Claire Gurski University of Western Ontario London, ON Canada.
Phonetic Similarity Effects in Masked Priming Marja-Liisa Mailend 1, Edwin Maas 1, & Kenneth I. Forster 2 1 Department of Speech, Language, and Hearing.
Sonority as a Basis for Rhythmic Class Discrimination Antonio Galves, USP. Jesus Garcia, USP. Denise Duarte, USP and UFGo. Charlotte Galves, UNICAMP.
Characterisation of individuals’ formant dynamics using polynomial equations Kirsty McDougall Department of Linguistics University of Cambridge
Advanced Technology Center Stuttgart EMOTIONAL SPACE IMPROVES EMOTION RECOGNITION Raquel Tato, Rocio Santos, Ralf Kompe Man Machine Interface Lab Advance.
-- A corpus study using logistic regression Yao 1 Vowel alternation in the pronunciation of THE in American English.
Acoustic and Linguistic Characterization of Spontaneous Speech Masanobu Nakamura, Koji Iwano, and Sadaoki Furui Department of Computer Science Tokyo Institute.
Adaption Def: To adjust model parameters for new speakers. Adjusting all parameters requires too much data and is computationally complex. Solution: Create.
Topics covered in this chapter
Educational level relative to gender (Ν=28)
Segmental factors in language proficiency: Velarization degree as a signature of pronunciation talent Henrike Baumotte and Grzegorz Dogil {henrike.baumotte,
ABSTRACT METHODS RESULTS CONCLUSION Background: Georgia rates the quality of early childcare learning centers using a tiered quality improvement system.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
The Reliability of Formant Measurements in High Quality Audio Data: The Effect of Agreeing Measurement Procedures Martin Duckworth, Kirsty McDougall,
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Phonetic Variations between Mid-Vowels in Swiss French and Standard French Anna Buffington, Carly Kleiber, Rebecca Kopps, Dr. Jessica Miller
* p
The relationship between objective properties of speech and perceived pronunciation quality in read and spontaneous speech was examined. Read and spontaneous.
A methodology for the creation of a forensic speaker recognition database to handle mismatched conditions Anil Alexander and Andrzej Drygajlo Swiss Federal.
Artificial Intelligence 2004 Speech & Natural Language Processing Speech Recognition acoustic signal as input conversion into written words Natural.
1 Branches of Linguistics. 2 Branches of linguistics Linguists are engaged in a multiplicity of studies, some of which bear little direct relationship.
Speaker Identification by Combining MFCC and Phase Information Longbiao Wang (Nagaoka University of Technologyh, Japan) Seiichi Nakagawa (Toyohashi University.
An exploratory analysis of Latino risk and protective health factors in a community sample Julie Gast, PhD, MSCHES, Terry Peak, MSW, PhD, & Jason J. Leiker,
Performance Comparison of Speaker and Emotion Recognition
Unit 5 Phonetics and Phonology. Phonetics Sounds produced by the human speech organs are called the “phonic/auditory medium” Phonetics is the study of.
Adaption Def: To adjust model parameters for new speakers. Adjusting all parameters requires an impractical amount of data. Solution: Create clusters and.
Lecture 1 Phonetics – the study of speech sounds
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
Anil Alexander 1, Oscar Forth 1, Marianne Jessen 2 and Michael Jessen 3 1 Oxford Wave Research Ltd, Oxford, United Kingdom 2 Stimmenvergleich, Wiesbaden,
RESEARCH MOTHODOLOGY SZRZ6014 Dr. Farzana Kabir Ahmad Taqiyah Khadijah Ghazali (814537) SENTIMENT ANALYSIS FOR VOICE OF THE CUSTOMER.
What to test at C1? Susan Sheehan. Acknowledgement This project was funded by the British Council through the Assessment Research Grant scheme. The views.
Investigating the Relationship between Incoming A Level Grades and Final Degree Classification Tomas James Introduction Universities use A Levels as the.
17th International Conference on Infant Studies Baltimore, Maryland, March 2010 Language Discrimination by Infants: Discriminating Within the Native.
Sentence Durations and Accentedness Judgments
Introduction   Many 3-D pronunciation tutors with both internal and external articulator movements have been implemented and applied to computer-aided.
Statistics In Research
Introduction   Many 3-D pronunciation tutors with both internal and external articulator movements have been implemented and applied to computer-aided.
An Introduction to : a closer look at analysing vowels
Ma Rui Tianjin Normal University
6th International Conference on Language Variation in Europe
Acoustics´08 Paris, 29 June – July 2008
Differentiating Psychopathic and Alexithymic Emotional Traits With Thin Slices of Verbal Behaviour K. Kaseweter, M.Sc. Candidate, University of Northern.
NATURE NEUROSCIENCE 2007 Coordinated memory replay in the visual cortex and hippocampus during sleep Daoyun Ji & Matthew A Wilson Department of Brain.
Comparison of Hindi version of MMSE with its English version:
Conditional Random Fields for ASR
Mean of Subjectivity of Self Mean of Objectivity of Self
International education Perm city target programme
NUMERACY AND MENTAL COMPUTATION WITH ITS PERSPECTIVE ON BRAIN HEMISPHERICITY ASYMMETRY: WHAT’S RIGHT AND WHAT’S LEFT. TSG 6 Ilda Salwani Ameer and Parmjit.
Differences in comprehension strategies for discourse understanding by native Chinese and Korean speakers learning Japanese Katsuo Tamaoka Graduate.
مقدمة في الإحصاء الحيوي مع تطبيقات برنامج الحزم الإحصائية SPSS
Understanding Variation of VOT in spontaneous speech
Brain Mechanisms in Early Language Acquisition
AP STATISTICS LESSON 10 – 2 (DAY 3)
Women in Physics in Japan
15.1 The Role of Statistics in the Research Process
A maximum likelihood estimation and training on the fly approach
Quantitative Research
COMPARING VARIABLES OF ORDINAL OR DICHOTOMOUS SCALES: SPEARMAN RANK- ORDER, POINT-BISERIAL, AND BISERIAL CORRELATIONS.
Chapter 13: Using Statistics
Within-speaker variability in long-term F0
Presentation transcript:

Analyzing F0 and vowel formants of Persian based on long-term features Homa Asadi1, Volker Dellwo2 1Department of Linguistics, Alzahra University, Tehran, Iran 2Institute of Computational Linguistics, University of Zurich, Zurich, Switzerland Center for Forensic Phonetics & Acoustics Introduction Aim of study Acoustic correlates of speech which show high between-speaker variabiolty coupled with low-whithin speaker variability play an essential role in reflecting speaker-specific information encoded in human speech. Calculating measures of F0 and formant structures based on long-term features are assumed to be more succesful in FVC. By taking an average over a long stretches of speech, acoustic features responsible for linguistic content cancel out each other so that the remaining acoustic properties in the signal characterise individuals featuers of the speaker (Nolan: 1983, Rose: 2002). Earlier studies claim that long-term formant distributions (LTF) is language-independent. It is not unlikely to obtain different results from the same parameters in different languages, possibly due to the differences in their phonology (Kinoshita:.2001). F0 is langauge-dependent and must be analyzed separately in each language. In this study, we aim to analyze long-term features of F0 and vowel formants of Persian, a language which has rather different phonological system from previous investigated languages. Participants & speech material LTF extraction Results Number of speakers= 12 Age= 22 to 35 years old Gender= male Speakers’ language= Persian (Standard Contemporary Persian) 1296 sentences = 12 speakers * 54 sentences * 2 repetitions Measure Factor tested Test Result LTF3-Mean Speaker LM F(141,142)= 6.594, p<0.0001 LTFF3-SD F(141,142)= 0.574, p<0.0001 LTF0-Mean F(141,142)= 39.41, p<0.0001 Results Conclusions The results of a linear model indicate that the effect of speaker is significant for the mean and standard deviation of LTF3 and for the mean of LTF0. No main effect was obtained for the standard deviation of LTF0. The results of within-speaker variability analysis show that the variability of all LTF measures as a function of repetition is not significant. As for LTF0, within-speaker variability was shown in speakers 3, 4 ,6 ,7 and 8 . In other words, 5 speakers among 12 speakers showed difference on their two recording sessions which is indicative of within-speaker variability in the measures of LTF0. Pairwise correlation tests revealed non-significant relationships between LTF3 and LTF0 which subsequently indicates that they carry different information about speakers’ individual features. The result of LTF analyses replicates previous findings on German and English (Moss: 2010, Gold: 2013) which provides more support for the language-independency of the LTF measures. References Gold, E. and French, J.P. (2011). International practices in forensic speaker comparison. Journal of Speech, Language and the Law, 18, 293-307. Gold, E., French, J.P and Harrison, P. (2013). Examining long-term formant distributions as a discriminant in forensic speaker comparisons under a likelihood ratio framework. Proceedings of Meetings on Acoustics, Montreal, Canada, Vol: 19, 1–87. Jessen, M. (2008). Forensic phonetics. Language and Linguistics Compass, 2, 671–711. Kinoshita, K. (2001). Testing realistic forensic speaker identification in Japanese: A likelihood ratio based approach using formants. Ph.D. dissertation, Australian National University. Moos, A. (2010). Long-term formant distribution as a measure of speaker characteristics in read and spontaneous speech. The Phonetician, 101,7-24. Nolan, F. (1983). The phonetic bases of speaker recognition. Cambridge: Cambridge University Press. Rose, P. (2002). Forensic speaker identification. London & New York: Taylor & Francis.