An investigation of cross-language differences in pitch range for speakers of English and German Ineke Mennen*, Felix Schaeffler#, & Gerard Docherty^ *ESRC.

Slides:



Advertisements
Similar presentations
CNBH, Physiology Department, Cambridge University 2. Experimental procedure The experiment is a 2AFC paradigm design in which.
Advertisements

Tone perception and production by Cantonese-speaking and English- speaking L2 learners of Mandarin Chinese Yen-Chen Hao Indiana University.
The Role of F0 in the Perceived Accentedness of L2 Speech Mary Grantham O’Brien Stephen Winters GLAC-15, Banff, Alberta May 1, 2009.
Philip Harrison J P French Associates & Department of Language & Linguistic Science, York University IAFPA 2006 Annual Conference Göteborg, Sweden Variability.
Effects of Competence, Exposure, and Linguistic Backgrounds on Accurate Production of English Pure Vowels by Native Japanese and Mandarin Speakers Malcolm.
The perception of dialect Julia Fischer-Weppler HS Speaker Characteristics Venice International University
Can a prosodic pattern induce/ reduce the perception of a lower- class suburban accent in French? Philippe Boula de Mareüil 1 & Iryna Lehka-Lemarchand.
Pitch range use in speech of Welsh/English bilinguals: Production Study Mikhail Ordin 1,2 Ineke Mennen 1 (1 Bangor University, Centre for Research on Bilingualism.
1 The Effect of Pitch Span on the Alignment of Intonational Peaks and Plateaux Rachael-Anne Knight University of Cambridge.
Two Types of Listeners? Marie Nilsenov á (Tilburg University) 1. Background When you and I listen to the same utterance, we may not perceive the linguistic.
Nuclear Accent Shape and the Perception of Prominence Rachael-Anne Knight Prosody and Pragmatics 15 th November 2003.
Perception of syllable prominence by listeners with and without competence in the tested language Anders Eriksson 1, Esther Grabe 2 & Hartmut Traunmüller.
Analyzing Students’ Pronunciation and Improving Tonal Teaching Ropngrong Liao Marilyn Chakwin Defense.
Niebuhr, D‘Imperio, Gili Fivela, Cangemi 1 Are there “Shapers” and “Aligners” ? Individual differences in signalling pitch accent category.
Prosodic Signalling of (Un)Expected Information in South Swedish Gilbert Ambrazaitis Linguistics and Phonetics Centre for Languages and Literature.
Emotion in Meetings: Hot Spots and Laughter. Corpus used ICSI Meeting Corpus – 75 unscripted, naturally occurring meetings on scientific topics – 71 hours.
Statistics for Linguistics Students Michaelmas 2004 Week 3 Bettina Braun
On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg Weekly Speech Lab Talk 6/27/06.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
CNBH, Physiology Department, Cambridge University The perception of size and sex in vowel sounds P60 David R. R. Smith and Roy.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Chapter 3 Methods for Recording Behavior EDP 7058.
IAEA International Atomic Energy Agency The IAEA Safety Culture Assessment Methodology.
Unit 2: Engineering Design Process
McEnery, T., Xiao, R. and Y.Tono Corpus-based language studies. Routledge. Unit A 2. Representativeness, balance and sampling (pp13-21)
Data Collection & Processing Hand Grip Strength P textbook.
Segmental factors in language proficiency: Velarization degree as a signature of pronunciation talent Henrike Baumotte and Grzegorz Dogil {henrike.baumotte,
Perceived prominence and nuclear accent shape Rachael-Anne Knight LAGB 5 th September 2003.
Suprasegmentals Segmental Segmental refers to phonemes and allophones and their attributes refers to phonemes and allophones and their attributes Supra-
Tone sensitivity & the Identification of Consonant Laryngeal Features by KFL learners 15 th AATK Annual Conference Hye-Sook Lee -Presented by Hi-Sun Kim-
The Reliability of Formant Measurements in High Quality Audio Data: The Effect of Agreeing Measurement Procedures Martin Duckworth, Kirsty McDougall,
CSD 5100 Introduction to Research Methods in CSD Observation and Data Collection in CSD Research Strategies Measurement Issues.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
METHODOLOGY INTRODUCTION ACKNOWLEDGEMENTS LITERATURE Low frequency information via a hearing aid has been shown to increase speech intelligibility in noise.
Signal Detection Theory October 10, 2013 Some Psychometrics! Response data from a perception experiment is usually organized in the form of a confusion.
By: TARUN MEHROTRA 12MCMB11.  More time is spent maintaining existing software than in developing new code.  Resources in M=3*(Resources in D)  Metrics.
Alignment of tonal targets: 30 years on Bob Ladd University of Edinburgh.
Sh s Children with CIs produce ‘s’ with a lower spectral peak than their peers with NH, but both groups of children produce ‘sh’ similarly [1]. This effect.
Descriptive Research: Quantitative Method Descriptive Analysis –Limits generalization to the particular group of individuals observed. –No conclusions.
Evaluating prosody prediction in synthesis with respect to Modern Greek prenuclear accents Elisabeth Chorianopoulou MSc in Speech and Language Processing.
Epenthetic vowels in Japanese: a perceptual illusion? Emmanual Dupoux, et al (1999) By Carl O’Toole.
Pragmatically-guided perceptual learning Tanya Kraljic, Arty Samuel, Susan Brennan Adaptation Project mini-Conference, May 7, 2007.
Data Collection and Processing (DCP) 1. Key Aspects (1) DCPRecording Raw Data Processing Raw Data Presenting Processed Data CompleteRecords appropriate.
The Effect of Pitch Span on Intonational Plateaux Rachael-Anne Knight University of Cambridge Speech Prosody 2002.
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? Jovan Pehcevski, James A. Thom School of CS and IT, RMIT University, Australia.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Data Analysis.
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
1/17/20161 Emotion in Meetings: Business and Personal Julia Hirschberg CS 4995/6998.
Nuclear Accent Shape and the Perception of Syllable Pitch Rachael-Anne Knight LAGB 16 April 2003.
Language and Speech, 2000, 43 (2), THE BEHAVIOUR OF H* AND L* UNDER VARIATIONS IN PITCH RANGE IN DUTCH RISING CONTOURS Carlos Gussenhoven and Toni.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Speech in the DHH Classroom A new perspective. Speech in the DHH Bilingual Classroom Important to look beyond the traditional view of speech Think of.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
/u/-fronting in RP: a link between sound change and diminished perceptual compensation for coarticulation? Jonathan Harrington, Felicitas Kleber, Ulrich.
Does the brain compute confidence estimates about decisions?
A Text-free Approach to Assessing Nonnative Intonation Joseph Tepperman, Abe Kazemzadeh, and Shrikanth Narayanan Signal Analysis and Interpretation Laboratory,
Investigating the Relationship between Incoming A Level Grades and Final Degree Classification Tomas James Introduction Universities use A Levels as the.
Investigating Pitch Accent Recognition in Non-native Speech
Travelling to School.
Research amongst Physical Therapists in the State of Kuwait: Participation, Perception, Attitude and Barriers Presented by Sameera Aljadi, PT, PhD Assistant.
Inferential statistics,
Linguistic Predictors of Cultural Identification in Bilinguals
Speech Perception CS4706.
Statistical Analysis Error Bars
Attentive Tracking of Sound Sources
WLTP CoP Procedure for CO2/FC
WLTP CoP Procedure for CO2/FC
Presentation transcript:

An investigation of cross-language differences in pitch range for speakers of English and German Ineke Mennen*, Felix Schaeffler#, & Gerard Docherty^ *ESRC Centre for Bilingualism, Bangor University, UK; #Speech Science Research Centre, Queen Margaret University, UK; ^School of Education, Communication & Language Sciences, Newcastle University, UK. There has been very little focus on pitch range as a source of cross- language differences Yet, there is some evidence that when groups of speakers of different languages are compared there can be a significant difference in aspects of pitch range - notwithstanding some degree of overlap Two languages where this may well be the case are Southern Standard British English (SSBE) and Northern Standard German (NSG). Anecdotal evidence suggests that people perceive differences between the two languages, with SSBE sounding higher and having more variation Background Pitch range is notoriously difficult to quantify, and many different measures have been reported Pitch range can vary along two dimensions, level and span (cf. [1]): Level = the typical pitch height (register) of a speaker’s voice Span = the range of frequencies covered by a speaker Commonly used measures are long-term distributional (LTD), such as mean or median f0 for level, and maximum minus minimum F0, 90% range, 80% range, or 4 standard deviations around the mean for span Patterson [1] suggests that there are some problems with LTD measures (e.g. pitch tracking errors, non-normal distribution of F0, less perceptual validity) Alternative to LTD measures are ‘linguistic’ measures, linked to specific linguistically- defined landmarks in the F0 contour ([1], based on [2, 3]) Production Study LTD Measures Level (Figure 1): Similar values for mean and median across languages Maximum higher for SSBE (but ns) 95%, 90% and 75% quantiles were significantly higher in SSBE Lower quantiles (25%, 10%, 5%) were not significant. Span (examples in Figure 2): Generally wider span for SSBE (all measured spans except max – min were significant in Hz and ST, max – min was only significant in ST) Acknowledgments This study was funded by the UK Economic & Social Research Council (RES ). We thank Frank Kügler and his colleagues for the collection of the German data. How to measure pitch range? Perception Study Material and participants: First sentence of a short story read by 60 (30 per language) female speakers, between 20 and 40 years of age As this material contained too few instances of linguistic tones, these were also derived from a larger part of the story (5 sentences), for a subset of 50 speakers Procedure: LTD measures were derived from raw F0 time series, corrected for tracking errors: Level: mean, median, maximum, minimum, quantile (95%, 90%, 75%, 25%, 10%, 5%) Span: 100% span (max-min), 90% span, 80% span, 50% span, standard deviation (SD), 4 SD around the mean Linguistic measures were derived from linguistically relevant F0 landmarks: Level: H*i, Hi, H*, H, L*, L, I, FL Span: H*i-FL, H*i-L, H*-FL, H*-L All span measures were measured in Hz and semitones (ST), except SD (only Hz) Mann-Whitney U tests were used for statistical testing, due to indications for non-normal distribution of some variables Production Perception Material, participants and procedure: On-going study, aiming at a number of 60 participants (30 per language) Currently analysed: 23 German participants Participants listen to a delexicalized version of the sentence from the production study (60 stimuli) The sentences were taken from the story readings from the production study Participants decided whether the stimulus was of English or German origin (binary forced choice), and how confident they were about their judgement (5 pt scale) Spearman’s rho was used for individual correlations, due to non-normal distribution of some variables Analysed Variable: PEJ Tool: Praat Original pitch contour was approximated by resynthesized contour, based on pitch targets, changes in slope and interpolation between targets Pitch targets were local maxima and minima of the contour Changes in slope were also marked where necessary, but not included in present analysis Local maxima and minima were labelled H* and L*, if aligned with prominent syllable, and H and L otherwise Initial and final targets were labelled separately. Final lows as FL, and the first peak of a phrase was separately marked as H*i or Hi Stimuli were resynthesised with the ’humming function’ of Praat This maintains the pitch contour and the voiced/voiceless distinction, but removes all lexical information In order to maintain intensity relationships of the original utterances, the intensity contour of the resynthesised stimuli was multiplied with the original contour of the utterance Production Method and Participants Measure typeVariableRho PEJ Sig level p < N LTD level95% quantile % quantile % quantile Maximum Mean LTD span4 SD around mean SD % range % range Range (max – min) Linguistic levelFirst peak L0.203n.s.56 FL0.187n.s.50 Linguistic spanFirst peak – FL (Hz) First peak – L (Hz) First peak – FL (ST) STfirst_peakLdiff PEJ Percentage of English Judgments per stimulus: How often was a certain stimulus judged as being English? There are two primary dimensions to our investigation: 1.To evaluate which measures are best suited to capture any cross-language differences in pitch range 2.To use a range of these measures to attempt to identify the nature of the cross-language differences in performance which underpin the differences that people perceive Aims A range of acoustic variables show high correlations with PEJ (see Table 1 opposite) Span measures do not exceed the correlations for level measures Linguistic measures do not exceed the correlations for LTD measures Results There is clear evidence for pitch range differences between SSBE and NSG. The linguistic measures indicate that this difference is mainly a consequence of higher H*i in SSBE The more frequent use of H* accents in SSBE might also contribute to the impression of a wider pitch range in SSBE (despite the fact that non-initial H* are lower in SSBE) Measures taken from the lower end of the F0 distribution (minimum, lower quantiles, FL) do not differ greatly across the languages The perception study indicates that people are generally sensitive to the aspects of pitch range which differ across the languages, and that variables that are significant in production also play a role in perception in the context of this task Overall, there is no indication that linguistic measures are more perceptually valid than LTD measures (contra [1]). Further work is under way to investigate the extent to which the perceived pitch range difference between SSBE and NSG is a consequence of across-the- board higher amounts of pitch variation in SSBE, or is linked to only a few and rather local events in the F0 contour Perception Discussion & Conclusion Linguistic Measures Level (larger data-set, Figure 3): Significantly higher values in SSBE for H*i and phrase-initial tones (I). Final lows (FL) were virtually equal across languages. Interestingly, non-initial H* are higher for NSG. Span (larger data-set) Significantly wider for SSBE in the case of H*i – FL and H*i – L, and significantly wider for NSG in the case of H*-FL and H*-L Overall, SSBE and NSG show quite different distributions of tonal categories (see Figure 4) This distributional difference had consequences for the linguistic measures in the first sentence: -only two categories (FL and L) showed sufficient numbers for cross-language comparison. Neither was significantly different across languages -In order to compare initial peaks, H*i and Hi had to be combined into a single category (P1). P1 was significantly higher for SSBE -Span measures could only be calculated for P1-FL and P1-L. Both differences were significantly wider in SSBE (when measured in Hz and ST) References [1] Patterson, D., A linguistic approach to pitch range modelling. PhD dissertation, University of Edinburgh. [2] Ladd, D.R.; Terken, J., 1995; Modelling intra- and inter- speaker pitch range variation. Proceedings of ICPhS. Stockholm, [3] Shriberg, E.; Ladd, D.R.; Terken, J.; Stolcke, A., Modeling pitch range variation within and across speakers: predicting f0 targets when ”speaking up”. In Proceedings ICSLP, 1–4, Philadelphia, PA, USA. Fig. 1 Fig. 2 Fig. 3 Fig. 4