Characterisation of individuals’ formant dynamics using polynomial equations Kirsty McDougall Department of Linguistics University of Cambridge

Presentation transcript:

Characterisation of individuals’ formant dynamics using polynomial equations Kirsty McDougall Department of Linguistics University of Cambridge IAFPA 2006

Speaker characteristics and static features of speech: Most previous research has focussed on static features (instantaneous, average). These are straightforward to measure and are a natural progression from other research areas, such as the delineation of different languages and language varieties.

Speaker characteristics and static features of speech (continued): Static features reflect certain anatomical dimensions of a speaker, e.g. formant frequencies relate to the length and configuration of the vocal tract (VT). Instantaneous and average measures demonstrate speaker differences, but are unable to distinguish all members of a population, so we look to dynamic (time-varying) features.

Dynamic features of speech: These carry more information than static features. They reflect the movement of a person’s speech organs as well as their dimensions: people move in individual ways for skilled motor activities such as walking, running, … and speech.

Dynamic features of speech (continued): Speech can be viewed as the achievement of a series of linguistic ‘targets’. Speakers are likely to exhibit similar properties at ‘targets’ (e.g. segment midpoints), but to move between these in individual ways, so we examine formant frequency dynamics.

Formant dynamics [Figure: /aɪ/ in ‘bike’ uttered by two male speakers of Australian English; frequency (Hz) against time (s); annotation: 10%]

Research Questions: How do speakers’ formant dynamics reflect individual differences in the production of the sequence /aɪk/? How can this dynamic information be captured to characterise individual speakers?

Target words (/aɪk/): bike /baɪk/, hike /haɪk/, like /laɪk/, mike /maɪk/, spike /spaɪk/

Data set: e.g. “I don’t want the scooter, I want the bike now.” “Later won’t do, I want the bike now.” 5 repetitions × 5 words (bike, hike, like, mike, spike) × 2 stress levels (nuclear, non-nuclear) × 2 speaking rates (normal, fast) = 100 tokens per subject
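The token count on the data-set slide can be checked by enumerating the design. This is a minimal sketch; the label names and the tuple layout are assumptions made for illustration, not part of the original study materials.

```python
from itertools import product

# Design factors as listed on the slide (label spellings are illustrative).
WORDS = ["bike", "hike", "like", "mike", "spike"]
STRESS_LEVELS = ["nuclear", "non-nuclear"]
SPEAKING_RATES = ["normal", "fast"]
REPETITIONS = range(1, 6)  # 5 repetitions


def token_labels():
    """Return one (word, stress, rate, repetition) tuple per token for one subject."""
    return list(product(WORDS, STRESS_LEVELS, SPEAKING_RATES, REPETITIONS))


tokens = token_labels()

# Tokens belonging to a single stress/rate condition, e.g. fast-nuclear.
fast_nuclear = [t for t in tokens if t[1] == "nuclear" and t[2] == "fast"]

print(len(tokens))        # tokens per subject
print(len(fast_nuclear))  # tokens per subject in one condition
```

Each stress/rate condition holds 5 words × 5 repetitions = 25 tokens per subject, which matches the 25 tokens per speaker mentioned for the fast-nuclear discriminant analysis.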

Subjects: 5 adult male native speakers of Australian English (A, B, C, D, E), aged […]; Brisbane/Gold Coast, Queensland

Speaker A “bike” (normal-nuclear) [Figure: F1, F2 and F3 of the token]

F1 normal-nuclear [Figure: frequency (Hz) at each +10% step of /aɪ/]

F2 normal-nuclear [Figure: frequency (Hz) at each +10% step of /aɪ/]

F3 normal-nuclear [Figure: frequency (Hz) at each +10% step of /aɪ/]

Discriminant Analysis: a multivariate technique used to determine whether a set of predictors (formant frequency measurements) can be combined to predict group (speaker) membership (ref. Tabachnick and Fidell 1996)

Discriminant Analysis, fast-nuclear [Figure: scatterplot of two discriminant functions for speakers A, B, C, D, E]. Each datapoint represents 1 token, and each speaker’s tokens are shown in a different colour, e.g. Speaker E’s 25 tokens of /aɪk/. DA constructs discriminant functions which maximise differences between speakers (each function is a linear combination of the formant frequency predictors). How well the predictors distinguish speakers is assessed by the extent of clustering of tokens and by the classification percentage, here 95%.
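To make the idea of a discriminant function as a linear combination of predictors concrete, here is a deliberately reduced sketch: Fisher's linear discriminant for two groups and two predictors, written in plain Python. It is not the five-speaker analysis reported in the slides, whose software and settings are not stated; the function names are hypothetical and the formant values are invented purely for illustration.

```python
def mean_vector(rows):
    """Mean of a list of 2-element predictor vectors."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(2)]


def scatter_matrix(rows, mean):
    """2x2 within-group scatter matrix around the group mean."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for r in rows:
        d = [r[0] - mean[0], r[1] - mean[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s


def score(weights, x):
    """Discriminant score: a linear combination of the two predictors."""
    return weights[0] * x[0] + weights[1] * x[1]


def fisher_discriminant(group_a, group_b):
    """Return (weights, threshold) with weights = Sw^-1 (mean_a - mean_b)."""
    ma, mb = mean_vector(group_a), mean_vector(group_b)
    sa, sb = scatter_matrix(group_a, ma), scatter_matrix(group_b, mb)
    sw = [[sa[i][j] + sb[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    diff = [ma[0] - mb[0], ma[1] - mb[1]]
    weights = [inv[0][0] * diff[0] + inv[0][1] * diff[1],
               inv[1][0] * diff[0] + inv[1][1] * diff[1]]
    # Cut-off halfway between the two projected group means.
    threshold = 0.5 * (score(weights, ma) + score(weights, mb))
    return weights, threshold


def classification_rate(group_a, group_b):
    """Percentage of tokens assigned to their own group."""
    weights, threshold = fisher_discriminant(group_a, group_b)
    correct = sum(score(weights, x) > threshold for x in group_a)
    correct += sum(score(weights, x) <= threshold for x in group_b)
    return 100.0 * correct / (len(group_a) + len(group_b))


# Invented (F1, F2) values in Hz for two hypothetical speakers.
speaker_a = [(700, 1200), (720, 1250), (690, 1180), (710, 1230)]
speaker_b = [(600, 1400), (620, 1450), (590, 1380), (610, 1420)]
rate = classification_rate(speaker_a, speaker_b)
print(rate)
```

With more than two speakers and many predictors, a full discriminant analysis yields several functions rather than one, which is why the scatterplots show two function axes. Note that this sketch scores the same tokens it was fitted on, so its rate is optimistic.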

Discriminant Analysis: classification rates 95%, 88%, 95%, 89%

Discussion: The DA scatterplots and classification rates are promising. However, the method is not very efficient: it is essentially based on a series of instantaneous measurements, which probably contain dependent information. Recall individuals’ F1 contours of /aɪk/ …

F1 normal-nuclear [Figure: frequency (Hz) at each +10% step of /aɪ/]

A new approach …: The contours differ in their location in the frequency range and in their curvature (location of turning points, convex/concave, steep/shallow). The most defining aspects of the contours need to be captured efficiently, so linear regression is used to parameterise the curves with polynomial equations.

Linear regression: a technique for determining the equation of a line or curve which approximates the relationship between a set of (x, y) points. For a straight line, $y = a_0 + a_1 x$, where $a_0$ is the y-intercept and $a_1$ is the gradient.
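For the straight-line case, the intercept and gradient have a closed-form least-squares solution. The sketch below assumes ordinary least squares, which is the usual meaning of linear regression, and uses made-up points; the function name is hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares straight line y = a0 + a1 * x; returns (a0, a1)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations in x, and of x-y cross products.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a1 = sxy / sxx            # gradient
    a0 = mean_y - a1 * mean_x  # y-intercept
    return a0, a1


# Made-up points lying exactly on y = 1 + 2x.
a0, a1 = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
print(a0, a1)
```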

Linear regression can also be used for curvilinear relationships, e.g. the quadratic $y = a_0 + a_1 x + a_2 x^2$, where $a_0$ is the y-intercept and the remaining coefficients determine the shape and direction of the curve.
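How the quadratic coefficients encode shape and direction can be shown directly: the sign of $a_2$ gives the direction in which the curve opens, and the turning point lies at $x = -a_1 / (2 a_2)$. A small sketch with a hypothetical function name and arbitrary coefficients:

```python
def describe_quadratic(a0, a1, a2):
    """Turning point and opening direction of y = a0 + a1*x + a2*x**2."""
    if a2 == 0:
        raise ValueError("a2 must be non-zero for a quadratic")
    turning_x = -a1 / (2 * a2)
    turning_y = a0 + a1 * turning_x + a2 * turning_x ** 2
    direction = "opens upward" if a2 > 0 else "opens downward"
    return turning_x, turning_y, direction


# Arbitrary example: y = 1 - 4x + 2x^2
print(describe_quadratic(1, -4, 2))
```

A larger absolute value of $a_2$ gives a steeper curve, so two contours with similar frequency ranges can still differ clearly in their coefficients.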

Polynomial Equations: Cubic: $y = a_0 + a_1 x + a_2 x^2 + a_3 x^3$. Quartic: $y = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + a_4 x^4$. Quintic: $y = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + a_4 x^4 + a_5 x^5$.

/aɪk/ data: fit the F1, F2 and F3 contours with polynomial equations, then test the reliability of the polynomial coefficients in distinguishing speakers. Quadratic: $y = a_0 + a_1 t + a_2 t^2$. Cubic: $y = a_0 + a_1 t + a_2 t^2 + a_3 t^3$.
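Fitting a contour with a quadratic or cubic is a least-squares problem that is linear in the coefficients. The sketch below solves the normal equations in plain Python. The sampling of normalised time at 10% steps from 0.1 to 0.9 and the contour values are assumptions for illustration, and the slides do not say which fitting software was used.

```python
def fit_polynomial(ts, ys, degree):
    """Least-squares polynomial fit; returns [a0, a1, ..., a_degree]."""
    m = degree + 1
    # Normal equations (X^T X) a = X^T y, where X[i][j] = ts[i] ** j.
    xtx = [[sum(t ** (i + j) for t in ts) for j in range(m)] for i in range(m)]
    xty = [sum((t ** i) * y for t, y in zip(ts, ys)) for i in range(m)]
    aug = [row + [rhs] for row, rhs in zip(xtx, xty)]
    # Gaussian elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, m):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, m + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution.
    coeffs = [0.0] * m
    for i in reversed(range(m)):
        tail = sum(aug[i][j] * coeffs[j] for j in range(i + 1, m))
        coeffs[i] = (aug[i][m] - tail) / aug[i][i]
    return coeffs


# Assumed normalised time points and an invented contour that lies
# exactly on y = 500 + 800 t - 900 t^2 (values in Hz).
times = [i / 10 for i in range(1, 10)]
contour = [500 + 800 * t - 900 * t ** 2 for t in times]

quadratic = fit_polynomial(times, contour, degree=2)
cubic = fit_polynomial(times, contour, degree=3)
print(quadratic)
print(cubic)
```

Solving the normal equations is adequate for low degrees on a 0 to 1 time axis. For higher degrees a QR-based solver, as used by standard numerical libraries, is numerically safer.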

F1 contour, “bike”, Speaker A (normal-nuclear token 1) [Figure: actual data points with quadratic and cubic fits and their R values; frequency (Hz) against normalised time t]

F2 contour, “bike”, Speaker A (normal-nuclear token 1) [Figure: actual data points with quadratic and cubic fits and their R values; frequency (Hz) against normalised time t]

DA on polynomial coefficients: Quadratic: 3 formants × 3 coefficients = 9 predictors. Cubic: 3 formants × 4 coefficients = 12 predictors. Cubic + duration of /aɪ/ = 13 predictors.
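The predictor counts follow from concatenating the per-formant coefficients for each token. A minimal sketch, with a hypothetical function name and placeholder coefficient values that are not taken from the study:

```python
def predictor_vector(coeffs_by_formant, duration=None):
    """Concatenate F1, F2, F3 polynomial coefficients into one predictor list.

    coeffs_by_formant maps "F1", "F2", "F3" to coefficient lists of equal
    length; duration (optional) is appended as a final predictor.
    """
    vector = []
    for formant in ("F1", "F2", "F3"):
        vector.extend(coeffs_by_formant[formant])
    if duration is not None:
        vector.append(duration)
    return vector


# Placeholder coefficients for one token (illustrative values only).
quadratic_token = {"F1": [1.0, 2.0, 3.0], "F2": [4.0, 5.0, 6.0], "F3": [7.0, 8.0, 9.0]}
cubic_token = {name: values + [0.5] for name, values in quadratic_token.items()}

print(len(predictor_vector(quadratic_token)))              # quadratic
print(len(predictor_vector(cubic_token)))                  # cubic
print(len(predictor_vector(cubic_token, duration=0.18)))   # cubic + duration
```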

Comparison of Classification Rates [Figure: % correct classification for tests using 9, 12, 13 and 20 predictors; values labelled on the chart: 96%, 92%, 89%, 90%]

Summary of findings: Comparing polynomial-based tests with direct measurement-based tests, the reduction in classification accuracy is small in return for the much smaller number of predictors required. Future: aim to develop this approach to enable the inclusion of additional information, i.e. parametrise other dynamic aspects of speech to capture a dense amount of speaker-specific information with a small number of predictors.

Conclusion: Differences in formant dynamics reflect differences in articulatory strategies (and VT dimensions) among speakers, e.g. the speaker-specificity of /aɪk/ formant dynamics: differences in shape and frequency for F1, F2 and F3, preserved across changes in speaking rate and stress.

Conclusion (continued): Trialled a new technique for characterising individuals’ formant contours using polynomial equations on /aɪk/ data. It is able to capture almost the same amount of speaker-specific information with far fewer predictors. The polynomial approach using formant dynamics should make an important contribution to speaker characterisation techniques in future.
