Dialogue Act Tagging Discourse and Dialogue CMSC 35900-1 November 4, 2004.

Slides:



Advertisements
Similar presentations
Three Basic Problems 1.Compute the probability of a text (observation) language modeling – evaluate alternative texts and models P m (W 1,N ) 2.Compute.
Advertisements

Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 8: Creating and Using Assessments, Surveys, and Objective Measures 1.
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan IEEE 2007 Min-Hsuan.
Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods William W. Cohen, Sunita Sarawagi.
ISBN Chapter 3 Describing Syntax and Semantics.
CS 355 – Programming Languages
Dialogs on Dialogs reading group March, 19 th 2004 Dialog Structure Design and Annotation Ananlada Chotimongkol Language Technologies Institute School.
Dialogue Management Ling575 Discourse and Dialogue May 18, 2011.
Emotion in Meetings: Hot Spots and Laughter. Corpus used ICSI Meeting Corpus – 75 unscripted, naturally occurring meetings on scientific topics – 71 hours.
Combining Prosodic and Text Features for Segmentation of Mandarin Broadcast News Gina-Anne Levow University of Chicago SIGHAN July 25, 2004.
Methodologies for Evaluating Dialog Structure Annotation Ananlada Chotimongkol Presented at Dialogs on Dialogs Reading Group 27 January 2006.
Information, action and negotiation in dialogue systems Staffan Larsson Kings College, Jan 2001.
Classification of Discourse Functions of Affirmative Words in Spoken Dialogue Julia Agustín Gravano, Stefan Benus, Julia Hirschberg Shira Mitchell, Ilia.
Describing Syntax and Semantics
ICS 463, Intro to Human Computer Interaction Design: 8. Evaluation and Data Dan Suthers.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
1 Chapter 19: Dialogue and Conversational Agents Nadia Hamrouni and Ahmed Abbasi 12/5/2006.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Toshiba Update 04/09/2006 Data-Driven Prosody and Voice Quality Generation for Emotional Speech Zeynep Inanoglu & Steve Young Machine Intelligence Lab.
Ashish Vaswani Speech acts for Dialogue agents, Coding schemes and dialogue act taxonomies.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
Data Annotation for Classification. Prediction Develop a model which can infer a single aspect of the data (predicted variable) from some combination.
Discourse Markers Discourse & Dialogue CS November 25, 2006.
7-Speech Recognition Speech Recognition Concepts
Chapter 10: Compilers and Language Translation Invitation to Computer Science, Java Version, Third Edition.
DEPARTMENT of COMPUTER SCIENCE University of Rochester  Activities  Abductive Inference of Multi-Agent Interaction  Capture the Flag Data Collection.
Discriminative Models for Spoken Language Understanding Ye-Yi Wang, Alex Acero Microsoft Research, Redmond, Washington USA ICSLP 2006.
Crowdsourcing for Spoken Dialogue System Evaluation Ling 575 Spoken Dialog April 30, 2015.
ISBN Chapter 3 Describing Semantics -Attribute Grammars -Dynamic Semantics.
Presented by: Fang-Hui Chu Boosting HMM acoustic models in large vocabulary speech recognition Carsten Meyer, Hauke Schramm Philips Research Laboratories,
CS 363 Comparative Programming Languages Semantics.
LML Speech Recognition Speech Recognition Introduction I E.M. Bakker.
Turn-taking Discourse and Dialogue CS 359 November 6, 2001.
1 Determining query types by analysing intonation.
Transformation-Based Learning Advanced Statistical Methods in NLP Ling 572 March 1, 2012.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Speech, Perception, & AI Artificial Intelligence CMSC February 13, 2003.
1 Natural Language Processing Lecture Notes 14 Chapter 19.
Margin-Sparsity Trade-off for the Set Covering Machine ECML 2005 François Laviolette (Université Laval) Mario Marchand (Université Laval) Mohak Shah (Université.
Automatic Cue-Based Dialogue Act Tagging Discourse & Dialogue CMSC November 3, 2006.
Semi-supervised Dialogue Act Recognition Maryam Tavafi.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Recognizing Discourse Structure: Speech Discourse & Dialogue CMSC October 11, 2006.
1 Modeling Long Distance Dependence in Language: Topic Mixtures Versus Dynamic Cache Models Rukmini.M Iyer, Mari Ostendorf.
Semantics In Text: Chapter 3.
Hidden Markov Models: Decoding & Training Natural Language Processing CMSC April 24, 2003.
Building & Evaluating Spoken Dialogue Systems Discourse & Dialogue CS 359 November 27, 2001.
Supertagging CMSC Natural Language Processing January 31, 2006.
Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Presented by: Fang-Hui Chu Discriminative Models for Speech Recognition M.J.F. Gales Cambridge University Engineering Department 2007.
Challenges in Dialogue Discourse and Dialogue CMSC October 27, 2006.
1/17/20161 Emotion in Meetings: Business and Personal Julia Hirschberg CS 4995/6998.
HMM vs. Maximum Entropy for SU Detection Yang Liu 04/27/2004.
Lexical, Prosodic, and Syntactics Cues for Dialog Acts.
Adapting Dialogue Models Discourse & Dialogue CMSC November 19, 2006.
A Maximum Entropy Language Model Integrating N-grams and Topic Dependencies for Conversational Speech Recognition Sanjeev Khudanpur and Jun Wu Johns Hopkins.
Maximum Entropy techniques for exploiting syntactic, semantic and collocational dependencies in Language Modeling Sanjeev Khudanpur, Jun Wu Center for.
N-Gram Model Formulas Word sequences Chain rule of probability Bigram approximation N-gram approximation.
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
Agent-Based Dialogue Management Discourse & Dialogue CMSC November 10, 2006.
Discriminative n-gram language modeling Brian Roark, Murat Saraclar, Michael Collins Presented by Patty Liu.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
Challenges in Dialogue
Investigating Pitch Accent Recognition in Non-native Speech
CSC 594 Topics in AI – Natural Language Processing
Dialogue Acts Julia Hirschberg CS /18/2018.
Challenges in Dialogue
Chapter 10: Compilers and Language Translation
Huawei CBG AI Challenges
Presentation transcript:

Dialogue Act Tagging Discourse and Dialogue CMSC November 4, 2004

Roadmap Maptask overview Coding –Transactions –Games –Moves Assessing agreement

Maptask Conducted by HCRC – Edinburgh/Glasgow Task structure: –2 participants: Giver, follower –2 slightly different maps Giver guides follower to destination on own map –Forces interaction, ambiguities, disagreements, etc –Conditions: Familiar/not; Visible/not

Dialogue Tagging Goal: Represent dialogue structure as generically as possible Three level scheme: –Transactions Major subtasks in participants overall task –Conversational Games Correspond to G&S discourse segments –Conversational Moves Initiation and response steps

Basic Dialogue Moves Initiations and responses Cover acts observed in dialogue – generalized Initiations: Instruct: tell to carry out some action; Explain: give unelicited information; Check: ask for confirmation; Align:check attention; Query-yn: Query-wh Responses:Acknowledge: signal understand & accept; Reply-y; Reply-n; Reply-wh; Clarify Ready:Inter-game moves

Game Coding Initiation: –Identified by first move Purpose – carry through to completion –May embed other games – Mark level –Mark completion/abandonment

Interrater Agreement How good is tagging? A tagset? Criterion: How accurate/consistent is it? Stability: –Is the same rater self-consistent? Reproducibility: –Do multiple annotators agree with each other? Accuracy: –How well do coders agree with some “gold standard”?

Agreement Measure Kippendorf’s Kappa (K) –Applies to classification into discrete categories –Corrects for chance agreement K<0 : agree less than expected by chance –Quality intervals: >= 0.8: Very good; 0.6<K<0.8: Good, etc Maptask: K=0.92 on segmentation, –K = 0.83 on move labels

Dialogue Act Tagging Other tagsets –DAMSL, SWBD-DAMSL, VERBMOBIL, etc Many common move types –Vary in granularity Number of moves, types Assignment of multiple moves

Dialogue Act Recognition Goal: Identify dialogue act tag(s) from surface form Challenge: Surface form can be ambiguous –“Can you X?” – yes/no question, or info-request “Flying on the 11t h, at what time?” – check, statement Requires interpretation by hearer –Strategies: Plan inference, cue recognition

Plan-inference-based Classic AI (BDI) planning framework –Model Belief, Knowledge, Desire Formal definition with predicate calculus –Axiomatization of plans and actions as well –STRIPS-style: Preconditions, Effects, Body –Rules for plan inference Elegant, but.. –Labor-intensive rule, KB, heuristic development –Effectively AI-complete

Cue-based Interpretation Employs sets of features to identify –Words and collocations: Please -> request –Prosody: Rising pitch -> yes/no question –Conversational structure: prior act Example: Check: Syntax: tag question “,right?” Syntax + prosody: Fragment with rise N-gram: argmax d P(d)P(W|d) –So you, sounds like, etc Details later ….

Recognizing Maptask Acts Assume: – Word-level transcription – Segmentation into utterances, –Ground truth DA tags Goal: Train classifier for DA tagging –Exploit: Lexical and prosodic cues Sequential dependencies b/t Das –14810 utts, 13 classes

Features for Classification Acoustic-Prosodic Features: –Pitch, Energy, Duration, Speaking rate Raw and normalized, whole utterance, last 300ms 50 real-valued features Text Features: –Count of Unigram, bi-gram, tri-grams Appear multiple times features, sparse Features z-score normalized

Classification with SVMs Support Vector Machines –Create n(n-1)/2 binary classifiers Weight classes by inverse frequency Learn weight vector and bias, classify by sign –Platt scaling to convert outputs to probabilities

Incorporating Sequential Constraints Some sequences of DA tags more likely: –E.g. P(affirmative after y-n-Q) = 0.5 – P(affirmative after other) = 0.05 Learn P(yi|yi-1) from corpus –Tag sequence probabilities –Platt-scaled SVM outputs are P(y|x) Viterbi decoding to find optimal sequence

Results SVM OnlySVM+Seq Text Only Prosody Only Text+Prosody

From Human to Computer Conversational agents –Systems that (try to) participate in dialogues –Examples: Directory assistance, travel info, weather, restaurant and navigation info Issues: –Limited understanding: ASR errors, interpretation –Computational costs: broader coverage -> slower, less accurate

Dialogue Manager Tradeoffs Flexibility vs Simplicity/Predictability –System vs User vs Mixed Initiative –Order of dialogue interaction –Conversational “naturalness” vs Accuracy –Cost of model construction, generalization, learning, etc Models: FST, Frame-based, HMM, BDI Evaluation frameworks