Analyses on IFA corpus Louis C.W. Pols Institute of Phonetic Sciences (IFA) Amsterdam Center for Language and Communication (ACLC) Project meeting INTAS.

Slides:



Advertisements
Similar presentations
2nd level analysis – design matrix, contrasts and inference
Advertisements

Acoustic Model Adaptation Based On Pronunciation Variability Analysis For Non-Native Speech Recognition Yoo Rhee Oh, Jae Sam Yoon, and Hong Kook Kim Dept.
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Nigerian English prosody Sociolinguistics: Varieties of English Class 8.
AN ACOUSTIC PROFILE OF SPEECH EFFICIENCY R.J.J.H. van Son, Barbertje M. Streefkerk, and Louis C.W. Pols Institute of Phonetic Sciences / ACLC University.
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan IEEE 2007 Min-Hsuan.
CENTER FOR SPOKEN LANGUAGE UNDERSTANDING 1 PREDICTION AND SYNTHESIS OF PROSODIC EFFECTS ON SPECTRAL BALANCE OF VOWELS Jan P.H. van Santen and Xiaochuan.
Statistics for Linguistics Students Michaelmas 2004 Week 6 Bettina Braun
PSY 307 – Statistics for the Behavioral Sciences
Development of coarticulatory patterns in spontaneous speech Melinda Fricke Keith Johnson University of California, Berkeley.
AN INTRODUCTION TO PRAAT Tina John M.A. Institute of Phonetics and digital Speech Processing - University Kiel Institute of Phonetics and Speech Processing.
Automatic Prosody Labeling Final Presentation Andrew Rosenberg ELEN Speech and Audio Processing and Recognition 4/27/05.
Lecture 16 Psyc 300A. What a Factorial Design Tells You Main effect: The effect of an IV on the DV, ignoring all other factors in the study. (Compare.
Lecture 16: Factorial ANOVA Interactions Practice Laura McAvinue School of Psychology Trinity College Dublin.
Lecture 15 Psyc 300A. Example: Movie Preferences MenWomenMean Romantic364.5 Action745.5 Mean55.
15b. Accessing Data: Frequencies in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Attribute databases. GIS Definition Diagram Output Query Results.
Basic Concept of Data Coding Codes, Variables, and File Structures.
Review of the paper entitled “The development of a phonetically balanced word recognition test in the Ilocano language” written by Renita Sagon, Doctor.
DIVINES – Speech Rec. and Intrinsic Variation W.S.May 20, 2006 Richard Rose DIVINES SRIV Workshop The Influence of Word Detection Variability on IR Performance.
Building High Quality Databases for Minority Languages such as Galician F. Campillo, D. Braga, A.B. Mourín, Carmen García-Mateo, P. Silva, M. Sales Dias,
Toshiba Update 04/09/2006 Data-Driven Prosody and Voice Quality Generation for Emotional Speech Zeynep Inanoglu & Steve Young Machine Intelligence Lab.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
Acoustic and Linguistic Characterization of Spontaneous Speech Masanobu Nakamura, Koji Iwano, and Sadaoki Furui Department of Computer Science Tokyo Institute.
Inferential Statistics: SPSS
Business Communication: Process and Product, 6e Mary Ellen Guffey Copyright © 2008 Chapter 12 Informal Business Reports.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
STANDARDIZATION OF SPEECH CORPUS Li Ai-jun, Yin Zhi-gang Phonetics Laboratory, Institute of Linguistics, Chinese Academy of Social Sciences.
Introduction –All information systems create, read, update and delete data. This data is stored in files and databases. Files are collections of similar.
ALEXANDER C. LOPILATO R: Because the names of other stat programs don’t make sense so why should this one?
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.7.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Suprasegmentals Segmental Segmental refers to phonemes and allophones and their attributes refers to phonemes and allophones and their attributes Supra-
Analysis of Variance (Two Factors). Two Factor Analysis of Variance Main effect The effect of a single factor when any other factor is ignored. Example.
PHONETICS & PHONOLOGY 3/24/2014. AGENDA GO OVER CORRECTED HOMEWORK IN PAIRS/SMALL GROUPS (5 MIN) MAKE ANY CORRECTIONS TO HWK DUE TODAY, THEN TURN IN (5.
Hands-on tutorial: Using Praat for analysing a speech corpus Mietta Lennes Palmse, Estonia Department of Speech Sciences University of Helsinki.
Using Probability and Discrete Probability Distributions
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
LREC 2008, Marrakech, Morocco1 Automatic phone segmentation of expressive speech L. Charonnat, G. Vidal, O. Boëffard IRISA/Cordial, Université de Rennes.
Recognition of spoken and spelled proper names Reporter : CHEN, TZAN HWEI Author :Michael Meyer, Hermann Hild.
Collaborative Annotation of the AMI Meeting Corpus Jean Carletta University of Edinburgh.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Rundkast at LREC 2008, Marrakech LREC 2008 Ingunn Amdal, Ole Morten Strand, Jørn Almberg, and Torbjørn Svendsen RUNDKAST: An Annotated.
The vowel detection algorithm provides an estimation of the actual number of vowel present in the waveform. It thus provides an estimate of SR(u) : François.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Education 793 Class Notes Multiple Regression 19 November 2003.
A Fully Annotated Corpus of Russian Speech
Mixed ANOVA Models combining between and within. Mixed ANOVA models We have examined One-way and Factorial designs that use: We have examined One-way.
More sophisticated ANOVA applications Repeated measures and factorial PSY SP2003.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
RDF Access to Relational Databases Ashok Malhotra Oracle Corporation.
Chapter 6: Analyzing and Interpreting Quantitative Data
© 2005, it - instituto de telecomunicações. Todos os direitos reservados. Arlindo Veiga 1,2 Sara Cadeias 1 Carla Lopes 1,2 Fernando Perdigão 1,2 1 Instituto.
Two-Way (Independent) ANOVA. PSYC 6130A, PROF. J. ELDER 2 Two-Way ANOVA “Two-Way” means groups are defined by 2 independent variables. These IVs are typically.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
0 / 27 John-Paul Hosom 1 Alexander Kain Brian O. Bush Towards the Recovery of Targets from Coarticulated Speech for Automatic Speech Recognition Center.
Merging Segmental, Rhythmic and Fundamental Frequency Features for Automatic Language Identification Jean-Luc Rouas 1, Jérôme Farinas 1 & François Pellegrino.
Covert Attention Results (Spring). Covert Attention Results (Fall ’05)
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
Experimental Psychology PSY 433 Chapter 5 Research Reports.
Overview When in doubt, follow APA guidelines Say it once, clearly. Everything in its place – don’t overlap information in multiple sections. (discuss.
TEACHING STATISTICS ONLINE Dr Alison Bentley Research Coordinator School of Clinical Medicine Faculty of Health Sciences.
BUS 308 Entire Course (Ash Course) For more course tutorials visit BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308.
Database.
Relational Databases The Relational Model.
Relational Databases The Relational Model.
Chapter 12 Informal Business Reports
Statistics in SPSS Lecture 9
Presentation transcript:

Analyses on IFA corpus Louis C.W. Pols Institute of Phonetic Sciences (IFA) Amsterdam Center for Language and Communication (ACLC) Project meeting INTAS 915 May 23-25, 2003, Jyvaskyla

May 24, 2003INTAS 915, Jyvaskyla2 overview structure of IFA corpus See three reports R. v Son and papers in open lit. why corrected means? how corrected means some results conclusions

May 24, 2003INTAS 915, Jyvaskyla3 structure of IFA corpus 4 male & 4 female; 5 hrs. of speech; 8 styles a. o. informal story telling (I); retelling (R); reading a story (T), reading sentences (S) ~50 K words (AIFC, 44 kHz, 16 bit) label files with annotation tiers phonemic segmentation and labeling (automatically generated, hand corrected; ~200k boundaries; 0.84 word labels/min; 3.3 boundaries/min) description levels: phoneme, demi-syllable, syllable, word, sentence, paragraph tiers: POS, lemma, lexical freq., etc.

May 24, 2003INTAS 915, Jyvaskyla4 access of IFA corpus use of CGN protocols non-speech data in database structure relational DB, SQL query language basic structure = table items (indiv. phoneme occurrences) x attributes (phoneme parent word, duration, position, speaker, etc.) WWW front end to simplify access (automatically generating SQL queries; direct links to relevant files)

May 24, 2003INTAS 915, Jyvaskyla5 why corrected means? non-ideal design (no fixed numbers of observations of all relevant factors; this precludes the use of e.g. ANOVA) confounding (occurrence of factor values is correlated, thus many combinations of values are rare) interaction (one factor being modulated by other factors): additive, multiplicative, or ordinal interaction factors of interest vs. nuisance factors

May 24, 2003INTAS 915, Jyvaskyla6 how corrected means? incidence matrix from basic data rows = combinations of levels on factors of interest columns = comb. of levels on nuisance factors quasi-minimal pairs method mean difference per row pair: by comparing (non-empty) pairs of columns matrix of differences (fitted with additive model) variable sample sizes: use weighting factors corrected means

example: vowel duration (ms) speaking style (I, R, S, T) vs. lexical stress (+, -) common meanscorrected means stress style +-total I97.0 (4209) 84.6 (762) 95.1 (4971) R101.8 (4861) 94.1 (712) (5573) S95.5 (12179) 84.0 (1850) 94.0 (14029) T96.2 (11664) 85.7 (1824) 94.8 (13488) total96.9 (32913) 86.1 (5148) 95.4 (38061) stress style +-total I R S T total total counts13323 row differences

May 24, 2003INTAS 915, Jyvaskyla8 row difference counts + signif. I +I -R +R -S +S -T +Total I -199*199 R *603 R -106*120152*378 S +975*218* *2519 S -234*309232*221452*1448 T +980*216* *2590*444*5557 T -232*324227*223439*742432*2619 Total * significance

May 24, 2003INTAS 915, Jyvaskyla9 conclusions simple averaging of unbalanced data is dangerous free conversational speech data are always unbalanced the corrected means method then is a good alternative can be interpreted as a least RMS-error approximation of ‘balanced’ means with an unbalanced data set