Agustín Gravano¹ · Stefan Benus² · Julia Hirschberg¹

The Effect of Contour Type and Epistemic Modality on the Assessment of Speaker Certainty
Agustín Gravano¹ · Stefan Benus² · Julia Hirschberg¹ · Elisa Sneed German³ · Gregory Ward³
¹ Columbia University
² Univerzity Konštantína Filozofa
³ Northwestern University

Agustín Gravano, Speech Prosody 2008
Overview
Previous researchers disagree about the role of epistemic would in utterance interpretation.
  A: Who’s the British woman over there?
  B: That would be J. K. Rowling.
Epistemic would conveys...
  Tentativeness (Palmer 1990, Perkins 1983)
  A high degree of speaker certainty (Ward et al. 2003)

Overview
What is the relation between epistemic would and perceived speaker certainty?
What role does the intonational contour play?
Two perception experiments:
  Textual condition
  Spoken condition

Epistemic modality
Marks the speaker’s estimation of the likelihood that a certain proposition is true in context.
  A: Who’s the British woman over there?
  B: That must be J. K. Rowling.
     That could be J. K. Rowling.
     That might be J. K. Rowling.
How is the perception of speaker certainty affected by the use of epistemic would?
     That would be J. K. Rowling.

Perception Study 1: Textual Condition
Task Overview
Participants were:
  1) Presented with written dialogues.
  2) Asked to assess the speaker certainty of a target utterance.

Perception Study 1: Textual Condition
Materials
[Context: David is at his desk when a co-worker knocks on the door.]
  Co-worker: David, I'm looking for this guy named Frank Jackson.
  David: That’s the new guy. (or) That would be the new guy.

Perception Study 1: Textual Condition
Experiment Design
Each session contained 60 tokens:
  20 stimuli (only one stimulus from each set)
  40 fillers without any of the target constructions
Tokens were presented in a random order.
Participants rated the perceived certainty of each token on a 5-point Likert scale:
  Very uncertain, Somewhat uncertain, Neither certain nor uncertain, Somewhat certain, Very certain.
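The session design above can be sketched in a few lines of Python. This is only an illustration: the function name `build_session`, the token labels, and the random choice of which variant to take from each set are assumptions (the slides give the counts and the random ordering, but not how variants were assigned to participants).

```python
import random

def build_session(stimulus_sets, fillers, rng=None):
    """Pick one variant per stimulus set, add every filler, then shuffle."""
    if rng is None:
        rng = random.Random()
    # Assumption: the variant from each set is drawn at random.
    chosen = [rng.choice(variants) for variants in stimulus_sets]
    session = chosen + list(fillers)
    rng.shuffle(session)
    return session

# Hypothetical labels: 20 sets with two variants each, plus 40 fillers.
stimulus_sets = [[f"set{i:02d}_is", f"set{i:02d}_would_be"] for i in range(1, 21)]
fillers = [f"filler{i:02d}" for i in range(1, 41)]

session = build_session(stimulus_sets, fillers, rng=random.Random(0))
print(len(session))
```

Passing a seeded `random.Random` makes a session reproducible, which helps when a participant's presentation order needs to be reconstructed later.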

Perception Study 1: Textual Condition
Computer Interface
[Figure: computer interface for the textual condition]

Perception Study 1: Textual Condition
Collected Data
12 participants (8 female, 4 male, mean age: 20.3)
240 data points (120 would be, 120 is)
Participants’ responses were:
  1) Converted into numeric values:
       Very uncertain → −2
       Somewhat uncertain → −1
       Neither certain nor uncertain → 0
       Somewhat certain → 1
       Very certain → 2
  2) Normalized using z-scores.
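A minimal sketch of the two preprocessing steps, assuming the scale runs from −2 (Very uncertain) to 2 (Very certain) and that z-scores are computed separately for each participant over all of that participant's ratings. The slides do not say which population the z-scores were taken over, so the per-participant choice and the name `z_normalize` are assumptions.

```python
from statistics import mean, pstdev

LIKERT_TO_SCORE = {
    "Very uncertain": -2,
    "Somewhat uncertain": -1,
    "Neither certain nor uncertain": 0,
    "Somewhat certain": 1,
    "Very certain": 2,
}

def z_normalize(responses):
    """Map Likert labels to numbers and z-score them within one participant."""
    scores = [LIKERT_TO_SCORE[r] for r in responses]
    mu = mean(scores)
    sigma = pstdev(scores)  # population standard deviation of this participant
    if sigma == 0:
        # A participant who gave the same answer everywhere carries no contrast.
        return [0.0 for _ in scores]
    return [(s - mu) / sigma for s in scores]

# Made-up responses from one hypothetical participant.
example = ["Very certain", "Somewhat certain",
           "Neither certain nor uncertain", "Very uncertain"]
print(z_normalize(example))
```

Normalizing within participant removes individual differences in how the scale is used, so a rater who rarely chooses the extremes is still comparable with one who often does.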

Perception Study 1: Textual Condition
Results
Mean certainty of
  would be tokens: 0.13 ± 1.11
  is tokens: 0.03 ± 1.04
One-way ANOVA: no significant difference.
No evidence of a difference in perceived certainty between modal would and indicative be in the textual condition.
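For readers unfamiliar with the test, the F statistic of a one-way ANOVA can be computed by hand as below. The scores are made up for illustration and are not the study's data; the function name `one_way_anova_f` is likewise hypothetical. With the study's 240 data points in two groups, the degrees of freedom would be (1, 238).

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of numeric groups."""
    all_values = [x for g in groups for x in g]
    grand = mean(all_values)
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    # Variation of each observation around its own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Made-up normalized certainty scores, for illustration only.
would_be = [0.5, -0.2, 1.1, 0.0, -0.4]
is_ = [0.3, -0.5, 0.9, 0.1, -0.6]
print(one_way_anova_f([would_be, is_]))
```

A small F means the gap between group means is small relative to the spread within each group, which is the pattern reported for the textual condition.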

What is the case in a spoken condition?
How is the perception of speaker certainty affected by:
  the use of epistemic would?
  the use of a particular intonational contour?

Perception Study 2: Spoken Condition
Task Overview
Participants were:
  1) Presented with written dialogues and a recorded target utterance.
  2) Asked to assess the speaker certainty of each target utterance.

Perception Study 2: Spoken Condition
Computer Interface
[Figure: computer interface for the spoken condition]

Perception Study 2: Spoken Condition
Intonational Contours
  Simple declarative contour (H*): H* L- L%
  Downstepped contour: H* !H* (!H*) L- L%
  Yes-no-question contour (L*): L* H- H%

Perception Study 2: Spoken Condition
Materials
20 stimulus sets, each with six variations of the same utterance (= 120 files).
Variations: modality (would be, is) × contour (declarative, downstepped, yn-question).
Recorded by a non-professional male speaker of American English in a sound-proof booth.
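The 2 × 3 crossing of modality and contour within each set can be enumerated as follows. The file-naming scheme and the function name `stimulus_files` are hypothetical; only the counts (20 sets, 2 modalities, 3 contours) come from the slide.

```python
from itertools import product

MODALITIES = ["would_be", "is"]
CONTOURS = ["declarative", "downstepped", "yn_question"]

def stimulus_files(n_sets=20):
    """List one hypothetical file name per set x modality x contour."""
    return [
        f"set{s:02d}_{modality}_{contour}.wav"
        for s, modality, contour in product(range(1, n_sets + 1), MODALITIES, CONTOURS)
    ]

files = stimulus_files()
print(len(files))
```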

Perception Study 2: Spoken Condition
Experiment Design
Each session contained 60 tokens:
  20 stimuli (only one stimulus from each set)
  40 fillers (with all 3 contours: 13 declarative, 13 downstepped, 14 yn-question)
Tokens were presented in a random order.
Participants rated the perceived certainty of each token on the same 5-point Likert scale:
  Very uncertain, Somewhat uncertain, Neither certain nor uncertain, Somewhat certain, Very certain.

Perception Study 2: Spoken Condition
Collected Data
30 participants (24 female, 6 male, mean age: 21.4)
600 data points, broken down by modality (would be, is) and contour (declarative, downstepped, yn-question); the recoverable cell count is 100 (would be).
Again, participants’ responses were:
  1) Converted into numeric values.
  2) Normalized using z-scores.

Perception Study 2: Spoken Condition
Results
No interaction between Contour and Modality.
For all 3 contours: would be > is
For both modalities: downstepped > declarative >> yn-question
(All statistically significant.)
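The slide reports two main effects and no interaction, but does not name the test. As a rough illustration of what "no interaction" means, here is a sketch of a balanced between-subjects two-way ANOVA. This is a simplification: the study is a repeated-measures design, the function name `two_way_anova` is hypothetical, and the scores are made up so that the two factors are purely additive.

```python
from statistics import mean

def two_way_anova(cells):
    """F statistics for a balanced two-factor design.

    cells maps (level_a, level_b) -> list of scores, same length in every cell.
    """
    levels_a = sorted({a for a, _ in cells})
    levels_b = sorted({b for _, b in cells})
    n = len(next(iter(cells.values())))
    if any(len(v) != n for v in cells.values()):
        raise ValueError("design must be balanced")
    grand = mean([x for v in cells.values() for x in v])
    mean_a = {a: mean([x for b in levels_b for x in cells[(a, b)]]) for a in levels_a}
    mean_b = {b: mean([x for a in levels_a for x in cells[(a, b)]]) for b in levels_b}
    ss_a = len(levels_b) * n * sum((m - grand) ** 2 for m in mean_a.values())
    ss_b = len(levels_a) * n * sum((m - grand) ** 2 for m in mean_b.values())
    ss_cells = n * sum((mean(v) - grand) ** 2 for v in cells.values())
    ss_ab = ss_cells - ss_a - ss_b  # what the cell means add beyond both main effects
    ss_within = sum((x - mean(v)) ** 2 for v in cells.values() for x in v)
    df_a, df_b = len(levels_a) - 1, len(levels_b) - 1
    df_ab, df_within = df_a * df_b, len(cells) * (n - 1)
    ms_within = ss_within / df_within
    return {
        "factor_a": (ss_a / df_a) / ms_within,
        "factor_b": (ss_b / df_b) / ms_within,
        "interaction": (ss_ab / df_ab) / ms_within,
    }

# Made-up additive data: each cell mean is a modality effect plus a contour effect.
modality_effect = {"would be": 1, "is": 0}
contour_effect = {"downstepped": 2, "declarative": 1, "yn-question": -3}
cells = {
    (m, c): [modality_effect[m] + contour_effect[c] + d for d in (-1, 0, 1)]
    for m in modality_effect
    for c in contour_effect
}
print(two_way_anova(cells))
```

Because the made-up cell means are exactly additive, the interaction F comes out as zero while both main effects are nonzero, which mirrors the qualitative pattern on the slide.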

Conclusions
Epistemic would conveys a high degree of speaker certainty, not tentativeness:
  would be > is
However, the choice of intonational contour has a stronger impact on perceived certainty:
  downstepped > declarative >> yn-question

Future Work
Production study
  Before the textual perception study, the same 12 participants recorded each target utterance.
  What contours were used to convey different degrees of speaker certainty?

Extra slide: Sample Stimuli
  A: I think the kids are tired of the water park. Maybe we should take them someplace else.
  B: What's the Six Flags theme park located in Gurnee?
  A: That {is, would be} Great America.

  A: What a great party!
  B: Yeah, but we're stuck cleaning up all the crap.
  A: Hey, somebody left their iPod out on the floor.
  B: That {is, would be} my roommate.

Extra slide: Certainty Mean and StDev
[Figure: mean and standard deviation of certainty ratings]

Extra slide: Fillers
                 token count    certainty (mean ± stdev)
  downstepped    390            0.667 ± 0.435
  declarative    390            0.605 ± 0.459
  yn-question    420            −1.299 ± 0.392
ANOVA: Significant difference (F(2, 1197) = 2778.2, p ≈ 0)
Tukey test: Difference is significant (95%) for ds > yn and dec > yn, and approaches significance for ds > dec.
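The filler token counts and the ANOVA degrees of freedom on this slide follow from the design of Study 2 (30 participants, each hearing 13 declarative, 13 downstepped and 14 yn-question fillers). A short arithmetic check, using only those figures:

```python
participants = 30
fillers_per_session = {"declarative": 13, "downstepped": 13, "yn-question": 14}

# Each participant rated every filler in a session once.
token_counts = {contour: participants * n for contour, n in fillers_per_session.items()}
total = sum(token_counts.values())

# One-way ANOVA over k groups and N observations: df = (k - 1, N - k).
df_between = len(token_counts) - 1
df_within = total - len(token_counts)

print(token_counts, total, (df_between, df_within))
```

The result, 1200 ratings with degrees of freedom (2, 1197), matches the reported F(2, 1197).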

Extra slide: Epistemic would: Form
Restricted to intransitive sentences:
  SUBJECT + would + VERB + POST-VERBAL CONSTITUENT
Corpus study (Birner et al. ’07):
  246 naturally-occurring tokens, from oral and written sources
  Most frequent subjects are demonstratives (79%)
  Nearly all verbs are be (98%)
  Post-verbal constituent is typically, but not necessarily, a noun phrase.