On the role of context and prosody in the interpretation of ‘okay’ Julia Agustín Gravano, Stefan Benus, Julia Hirschberg Héctor Chávez, and Lauren Wilcox.

Slides:



Advertisements
Similar presentations
Information structuring in English dialogue class 4
Advertisements

Phonetics as a scientific study of speech
Agustín Gravano 1,2 Julia Hirschberg 1 (1)Columbia University, New York, USA (2) Universidad de Buenos Aires, Argentina Backchannel-Inviting Cues in Task-Oriented.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
“Effect of Genre, Speaker, and Word Class on the Realization of Given and New Information” Julia Agustín Gravano & Julia Hirschberg {agus,
Function words are often reduced or even deleted in casual conversation (Fig. 1). Pairs may neutralize: he’s/he was, we’re/we were What sources of information.
Prosody Modeling (in Speech) by Julia Hirschberg Presented by Elaine Chew QMUL: ELE021/ELED021/ELEM March 2012.
“Downstepped contours in the given/new distinction” Agustín Gravano Spoken Language Processing Group Columbia University, New York On the Role of Prosody.
Can a prosodic pattern induce/ reduce the perception of a lower- class suburban accent in French? Philippe Boula de Mareüil 1 & Iryna Lehka-Lemarchand.
Using prosody to avoid ambiguity: Effects of speaker awareness and referential context Snedeker and Trueswell (2003) Psych 526 Eun-Kyung Lee.
Nuclear Accent Shape and the Perception of Prominence Rachael-Anne Knight Prosody and Pragmatics 15 th November 2003.
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan IEEE 2007 Min-Hsuan.
1 Spoken Dialogue Systems Dialogue and Conversational Agents (Part IV) Chapter 19: Draft of May 18, 2005 Speech and Language Processing: An Introduction.
Agustín Gravano 1 · Stefan Benus 2 · Julia Hirschberg 1 Elisa Sneed German 3 · Gregory Ward 3 1 Columbia University 2 Univerzity Konštantína Filozofa.
Combining Prosodic and Text Features for Segmentation of Mandarin Broadcast News Gina-Anne Levow University of Chicago SIGHAN July 25, 2004.
Outline Project markers and backchanneling – statistics and functions of new tokens secondary functionality of certain tokens as project markers Nonconventional.
Comparing American and Palestinian Perceptions of Charisma Using Acoustic-Prosodic and Lexical Analysis Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg,
Prosodic Cues to Discourse Segment Boundaries in Human-Computer Dialogue SIGDial 2004 Gina-Anne Levow April 30, 2004.
Spoken Language Processing Lab Who we are: Julia Hirschberg, Stefan Benus, Fadi Biadsy, Frank Enos, Agus Gravano, Jackson Liscombe, Sameer Maskey, Andrew.
Automatic Prosody Labeling Final Presentation Andrew Rosenberg ELEN Speech and Audio Processing and Recognition 4/27/05.
Modeling Other Speaker State COMS 4995/6998 Julia Hirschberg Thanks to William Wang.
Extracting Social Meaning Identifying Interactional Style in Spoken Conversation Jurafsky et al ‘09 Presented by Laura Willson.
On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg, Julia Hirschberg Columbia University Interspeech /14/06.
On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg Weekly Speech Lab Talk 6/27/06.
High Frequency Word Entrainment in Spoken Dialogue ACL, June Columbus, OH Department of Computer and Information Science University of Pennsylvania.
Context and Prosody in the Interpretation of Cue Phrases in Dialogue Julia Hirschberg Columbia University and KTH 11/22/07 Spoken Dialog with Humans and.
Turn-taking in Mandarin Dialogue: Interactions of Tone and Intonation Gina-Anne Levow University of Chicago October 14, 2005.
Classification of Discourse Functions of Affirmative Words in Spoken Dialogue Julia Agustín Gravano, Stefan Benus, Julia Hirschberg Shira Mitchell, Ilia.
9/5/20051 Acoustic/Prosodic and Lexical Correlates of Charismatic Speech Andrew Rosenberg & Julia Hirschberg Columbia University Interspeech Lisbon.
Intonation September 18, 2014 The Plan for Today Also: I have posted a couple of readings on TOBI (an intonation transcription system) to the course.
-- A corpus study using logistic regression Yao 1 Vowel alternation in the pronunciation of THE in American English.
Agustín Gravano 1,2 Julia Hirschberg 1 (1)Columbia University, New York, USA (2) Universidad de Buenos Aires, Argentina Turn-Yielding Cues in Task-Oriented.
Discourse Markers Discourse & Dialogue CS November 25, 2006.
A Study in Cross-Cultural Interpretations of Back-Channeling Behavior Yaffa Al Bayyari Nigel Ward The University of Texas at El Paso Department of Computer.
AUTOMATIC DETECTION OF REGISTER CHANGES FOR THE ANALYSIS OF DISCOURSE STRUCTURE Laboratoire Parole et Langage, CNRS et Université de Provence Aix-en-Provence,
Intonation in Communication Skill: Recent Research Discourse, both in theoretical linguistics and in foreign language pedagogy,has focused on describing.
Turn-taking Discourse and Dialogue CS 359 November 6, 2001.
Automatic Cue-Based Dialogue Act Tagging Discourse & Dialogue CMSC November 3, 2006.
Recognizing Discourse Structure: Speech Discourse & Dialogue CMSC October 11, 2006.
TOBI Basics April 13, 2010.
The Games Corpus Design, implementation and annotation Agustín Gravano Spoken Language Processing Group Columbia University.
Discourse & Dialogue CS 359 November 13, 2001
Lexical, Prosodic, and Syntactics Cues for Dialog Acts.
INTONATION Islam M. Abu Khater.
Acoustic Cues to Emotional Speech Julia Hirschberg (joint work with Jennifer Venditti and Jackson Liscombe) Columbia University 26 June 2003.
Pitch Tracking + Prosody January 19, 2012 Homework! For Tuesday: introductory course project report Background information on your consultant and the.
A Text-free Approach to Assessing Nonnative Intonation Joseph Tepperman, Abe Kazemzadeh, and Shrikanth Narayanan Signal Analysis and Interpretation Laboratory,
Recognizing Structure: Dialogue Acts and Segmentation
Studying Intonation Julia Hirschberg CS /21/2018.
Studying Intonation Julia Hirschberg CS /21/2018.
Intonational and Its Meanings
Intonational and Its Meanings
The American School and ToBI
THE NATURE OF SPEAKING Joko Nurkamto UNS Solo.
Agustín Gravano1,2 Julia Hirschberg1
Dialogue Acts Julia Hirschberg CS /18/2018.
Comparing American and Palestinian Perceptions of Charisma Using Acoustic-Prosodic and Lexical Analysis Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg,
Information Structure and Prosody
Studying Spoken Language Text 17, 18 and 19
Advanced NLP: Speech Research and Technologies
“Downstepped contours in the given/new distinction”
Dialogue Acts Julia Hirschberg LSA /29/2018.
High Frequency Word Entrainment in Spoken Dialogue
Agustín Gravano & Julia Hirschberg {agus,
Advanced NLP: Speech Research and Technologies
Agustín Gravano1,2 Julia Hirschberg1
Agustín Gravano1 · Stefan Benus2 · Julia Hirschberg1
Recognizing Structure: Dialogue Acts and Segmentation
Acoustic-Prosodic and Lexical Entrainment in Deceptive Dialogue
Guest Lecture: Advanced Topics in Spoken Language Processing
Presentation transcript:

On the role of context and prosody in the interpretation of ‘okay’ Julia Agustín Gravano, Stefan Benus, Julia Hirschberg Héctor Chávez, and Lauren Wilcox ACL, June 2007, Prague Spoken Language Processing Group Columbia University

Agustín Gravano - ACL - June Overview What information do subjects use to interpret the word ‘okay’ in dialogue? Perception Study. Findings: Contextual cues stronger predictors than acoustic / prosodic / phonetic cues. Final rising pitch: Strongest prosodic cue.

Agustín Gravano - ACL - June Overview that’s pretty much okay Speaker 1: between the yellow mermaid and the whale Speaker 2: okay Speaker 1: and it is okay we gonna be placing the blue moon

Agustín Gravano - ACL - June Cue Words Linguistic expressions that can be used to convey information about the discourse structure, or to make a semantic contribution. Discourse markers, cue phrases, clue words, … Examples: now, well, so, alright, and, okay, first, on the other hand, by the way, for example, …

Agustín Gravano - ACL - June Research Questions In spoken dialogue, how do hearers disambiguate cue words? How important is acoustic/prosodic information? What is the role of phonetic variation? What is the role of discourse context?

Agustín Gravano - ACL - June Why do we care? Spoken dialogue systems Need to convey potentially ambiguous terms with a particular intended meaning. Must interpret the user’s input correctly.

Agustín Gravano - ACL - June Previous Work Cues to cue phrase disambiguation Hirschberg & Litman ’87, ’93; Hockey ’93; Litman ’94 Cues to Dialogue Act identification Jurafsky et al ’98; Rosset & Lamel ’04 Contextual cues to the production of backchannels Ward & Tsukahara ’00; Sanjanhar & Ward ’06

Agustín Gravano - ACL - June The Columbia Games Corpus 12 spontaneous task-oriented dyadic conversations in Standard American English. 2 subjects playing a computer game, no eye contact. Describer: Follower:

Agustín Gravano - ACL - June The Columbia Games Corpus Annotation of Affirmative Cue Words Cue Words alright gotcha huh mm-hm okay right uh-huh yeah yep yes yup Functions Acknowledgment / Agreement Backchannel Cue beginning discourse segment Cue ending discourse segment Check with the interlocutor Stall / Filler Back from a task Literal modifier Pivot beginning Pivot ending count 1. the of okay and like 753 …

Agustín Gravano - ACL - June Perception Study Experiment Design okay Speaker 1: but it's gonna be below the onion Speaker 2: okay Cue beginning discourse segment Backchannel Acknowledgment / Agreement Speaker 1: okay alright I'll try it okay Speaker 2: okay the owl is blinking Speaker 1: yeah um there's like there's some space there's Speaker 2: okay I think I got it

Agustín Gravano - ACL - June contextualized ‘okay’ Perception Study Experiment Design 54 instances of ‘okay’ (18 for each function). 2 tokens for each ‘okay’:  Isolated condition: Only the word ‘okay’.  Contextualized condition: 2 full speaker turns: The turn containing the target ‘okay’; and The previous turn by the other speaker. speakersokay

Agustín Gravano - ACL - June Perception Study Experiment Design Two parts: Part 1: 54 isolated tokens Part 2: 54 contextualized tokens Subjects asked to classify each token of ‘okay’ as: Acknowledgment / Agreement, or Backchannel, or Cue beginning discourse segment.

Agustín Gravano - ACL - June Perception Study Experiment Implementation Subjects: 20 paid subjects (10 female, 10 male). Ages between 20 and 60. Native speakers of English. No hearing problems. GUI on a laboratory workstation with headphones.

Agustín Gravano - ACL - June Results Inter-Subject Agreement Kappa measure of agreement with respect to chance (Fleiss ’71) Isolated ConditionContextualized Condition Overall Ack / Agree vs. Other Backchannel vs. Other Cue beginning vs. Other

Agustín Gravano - ACL - June Results Cues to Interpretation Phonetic transcription of okay: Isolated Condition Strong correlation for realization of  :  Backchannel  Ack/Agree, Cue Beginning Contextualized Condition No strong correlations found for phonetic variants.

Agustín Gravano - ACL - June Results Cues to Interpretation Isolated ConditionContextualized Condition Ack / Agree Shorter /k/Shorter latency between turns Shorter pause before okay Backchannel Lower intensity Higher final pitch slope Longer 2 nd syllable Higher final pitch slope More words by S2 before okay Fewer words by S1 after okay Cue beginning Lower final pitch slope Lower overall pitch slope Longer latency between turns More words by S1 after okay Lower final pitch slope S1 = Utterer of the target ‘okay’. S2 = The other speaker.

Agustín Gravano - ACL - June Final intonation using the ToBI conventions. (Both isolated and contextualized conditions.) H-H%  Backchannel H-L% L-H%  Ack/Agree, Backchannel L-L%  Ack/Agree, Cue beginning Results Cues to Interpretation

Agustín Gravano - ACL - June Conclusions Agreement: Availability of context improves inter-subject agreement. Cue beginnings easier to disambiguate than the other two functions. Cues to interpretation: Contextual features trump features of word okay. Exception: Final pitch slope of okay in both conditions.

Agustín Gravano - ACL - June Further Work Benus et al, 2007 “The prosody of backchannels in American English”, ICPhS 2007, Saarbrücken, Germany, August Gravano et al, 2007 “Classification of discourse functions of affirmative words in spoken dialogue”, Interspeech 2007, Antwerp, Belgium, August 2007.

On the role of context and prosody in the interpretation of ‘okay’ Julia Agustín Gravano, Stefan Benus, Julia Hirschberg Héctor Chávez, and Lauren Wilcox ACL, June 2007, Prague Spoken Language Processing Group Columbia University

Agustín Gravano - ACL - June The Columbia Games Corpus Annotation Orthographic transcription and alignment. Laughs, coughs, breaths, smacks, throat-clearings. Self repairs. Intonation, using the ToBI convention. Function of affirmative cue words (alright, mm-hm, okay, right, uh-huh, yeah, yes, …). Question form and function.

Agustín Gravano - ACL - June The Columbia Games Corpus Annotation of Intonation: ToBI Tones: Pitch accents:L*, H*, L*+H, H+!H*, … Phrase accents:L-, H-, !H- Boundary tones:L%, H% Break Indices: Degrees of junction 0 = no word boundary... 4 = full intonational phrase boundary Miscellanea: Disfluencies, non-speech sounds, etc.

Agustín Gravano - ACL - June The Columbia Games Corpus Annotation of Intonation: ToBI waveform fundamental frequency (F0)

Agustín Gravano - ACL - June Acknowledge/Agreement: The function of ‘okay’ that indicates “I believe what you said” and/or “I agree with what you say”. Backchannel: The function of ‘okay’ in response to another speaker's utterance that indicates only “I’m still here” or “I hear you and please continue”. Cue beginning discourse segment The function of ‘okay’ that marks a new segment of a discourse or a new topic. This use of ‘okay’ could be replaced by ‘now’. Perception Study Definitions Given to the Subjects