Spoken Dialogue Systems: Human and Machine

Slides:



Advertisements
Similar presentations
Atomatic summarization of voic messages using lexical and prosodic features Koumpis and Renals Presented by Daniel Vassilev.
Advertisements

Conversational Implicature (Based on Paltridge, chapter 3)
Conversations  Conversation are cooperative events:  Without cooperation, interaction would be chaotic. Would be no reason to communicate  Grice's.
Topic 10: conversational implicature Introduction to Semantics.
The Cooperative Principle
1 MODULE 2 Meaning and discourse in English COOPERATION, POLITENESS AND FACE Lecture 14.
CS 224S / LINGUIST 285 Spoken Language Processing Dan Jurafsky Stanford University Spring 2014 Lecture 9: Human conversation, frame-based dialogue systems.
U1, Speech in the interface:2. Dialogue Management1 Module u1: Speech in the Interface 2: Dialogue Management Jacques Terken HG room 2:40 tel. (247) 5254.
People & Speech Interfaces CS 260 Wednesday, October 4, 2006.
Speech and Language Processing
6/28/20151 Spoken Dialogue Systems: Human and Machine Julia Hirschberg CS 4706.
Towards Natural Clarification Questions in Dialogue Systems Svetlana Stoyanchev, Alex Liu, and Julia Hirschberg AISB 2014 Convention at Goldsmiths, University.
Pragmatics.
Theories of Discourse and Dialogue. Discourse Any set of connected sentences This set of sentences gives context to the discourse Some language phenomena.
1 Spoken Dialogue Systems Dialogue and Conversational Agents (Part I) Chapter 19: Draft of May 18, 2005 Speech and Language Processing: An Introduction.
Department of English Introduction To Linguistics Level Four Dr. Mohamed Younis.
16.0 Spoken Dialogues References: , Chapter 17 of Huang 2. “Conversational Interfaces: Advances and Challenges”, Proceedings of the IEEE,
Pragmatics.
1 Natural Language Processing Lecture Notes 14 Chapter 19.
Introduction to Dialogue Systems. User Input System Output ?
Dr. Katie Welch LING  Heretofore, we have talked about the form of language  But, this is only half the story.  We must also consider the.
Presentation about pragmatic concepts Implicatures Presuppositions
Welcome Back, Folks! We’re travelling to a littele bit far-end of Language in Use Studies EAA remains your faithful companion.
UNIT 2 - IMPLICATURE.
Integrating Multiple Knowledge Sources For Improved Speech Understanding Sherif Abdou, Michael Scordilis Department of Electrical and Computer Engineering,
NATURAL LANGUAGE PROCESSING
Implicature. I. Definition The term “Implicature” accounts for what a speaker can imply, suggest or mean, as distinct from what the speaker literally.
Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System Diane J. Litman AT&T Labs -- Research
SEMANTICS ??? aardvark SEMANTICS ??? aardvark. SEMANTICS: word and sentence meaning. PRAGMATICS: speaker meaning. The semiotic triangle:
ENGLISH FOR PROFESSIONAL COMMUNICATION
COMMUNICATION OF MEANING
CS 224S / LINGUIST 285 Spoken Language Processing
MODULE 2 Meaning and discourse in English
Natural Language Processing and Speech Enabled Applications
DELEGATION, ACCOUNTABILITY and COACHING
Telephone etiquette Ref: Pinner D & Pinner D 2003 Communication Skills Pearson Longman, New Zealand Ch. 15: pp
COOPERATIVE PRINCIPLE:
Information and Advice
COOPERATION and IMPLICATURE
GRICE’S CONVERSATIONAL MAXIMS
Competency – 3 Engages in active listening an response appropriately
Grice’s Maxims LO: to understand the co-operative principle and how we can use it within our own analysis.
Discourse and Pragmatics
Why Study Spoken Language?
Spoken Dialogue Systems
Spoken Language Processing
Issues in Spoken Dialogue Systems
Spoken Dialogue Systems: Managing Interaction
Automatic Speech Recognition
Specifying, Compiling, and Testing Grammars
Introduction to Linguistics
Lecture 3: October 5, 2004 Dan Jurafsky
Why Study Spoken Language?
Advanced NLP: Speech Research and Technologies
Lecture 29 Word Sense Disambiguation-Revision Dialogues
Managing Dialogue Julia Hirschberg CS /28/2018.
Guidelines for Group Projects and Papers
The Cooperative Principle
COOPERATIVE PRINCIPLE.
Nofsinger. R., Everyday Conversation, Sage, 1991
Lecture 30 Dialogues November 3, /16/2019.
PROJ2: Building an ASR System
How to Avoid Redundant Object-References
Spoken Dialogue Systems: System Overview
The study of meaning in context
The Cooperative Principle
Pragmatics Predmetni nastavnik: doc. dr Valentna Boskovic Markovic
Gricean Cooperative Principle (Maxim) and Implicature
Nofsinger. R., Everyday Conversation, Sage, 1991
Spoken Language Processing
Presentation transcript:

Spoken Dialogue Systems: Human and Machine Julia Hirschberg CS 4706 9/22/2018

Today Introduction to Spoken Dialogue Systems (SDS) Automatic Speech Recognition (ASR) Natural Language Understand (NLU) Natural Language Generation (NLG) and Text-to-Speech Synthesis (TTS) Cooperative Question-Answering Dialogue Representations Issues to consider in designing your dialogue systems 9/22/2018

Spoken Dialogue Systems AKA: Interactive Voice Response Systems Dialogue Systems Spoken Dialogue Systems Conversational Agents Applications: Travel arrangements (Amtrak, United airlines) Telephone call routing Tutoring Communicating with robots Anything with limited screen/keyboard 9/22/2018

A Travel Dialogue: Communicator 9/22/2018

Call routing: ATT HMIHY 9/22/2018

A tutorial dialogue: ITSPOKE 9/22/2018

Conversational Structure Telephone conversations Stage 1: Enter a conversation Stage 2: Identification Stage 3: Establish joint willingness to converse Stage 4: First topic is raised, usually by caller 9/22/2018

Is this a simple conversation? Customer: (rings) Operator: Directory Enquiries, for which town please? Customer: Could you give me the phone number of um: Mrs. um: Smithson? Operator: Yes, which town is this at please? Customer: Huddleston. Operator: Yes. And the name again? Customer: Mrs. Smithson 9/22/2018

Is this? A: And, what day in May did you want to travel? C: OK, uh, I need to be there for a meeting that’s from the 12th to the 15th. Client did not answer question. Meaning of client’s sentence: Meeting Start-of-meeting: 12th End-of-meeting: 15th Doesn’t say anything about flying! How does the agent infer client is informing him/her of travel dates? 9/22/2018

Is this? B: I want a non-stop flight from New York La Guardia to Tucson. A: … there’s 3 non-stops today. True if, in fact, there are 7 non-stops today. But agent means: 3 and only 3. How can client infer that agent means: only 3 9/22/2018

Grice: Conversational Implicature Implicature: a particular class of licensed inferences Grice (1975): how do conversational participants draw inferences beyond what is actually asserted? Cooperative Principle "Make your contribution such as it is required, at the stage at which it occurs, by the accepted purpose or direction of the talk exchange in which you are engaged." A tacit agreement by speakers and listeners to cooperate in communication 9/22/2018

4 Gricean Maxims Relevance: Be relevant Quantity: Do not make your contribution more or less informative than required Quality: Try to make your contribution one that is true (don’t say things that are false or for which you lack adequate evidence) Manner: Avoid ambiguity and obscurity; be brief and orderly 9/22/2018

Maxim of Relevance A: Is Regina here? B: Her car is outside. Implicature: yes Hearer thinks: Why would B mention the car? It must be relevant. How could it be relevant? It could since if her car is here she is probably here. Client: I need to be there for a meeting that’s from the 12th to the 15th Hearer thinks: Speaker is following maxims, would only have mentioned meeting if it was relevant. How could meeting be relevant? If client meant me to understand that he had to depart in time to attend the meeting. 9/22/2018

Maxim of Quantity A: How much money do you have on you? B: I have 5 dollars Implicature: not 6 dollars Similarly, 3 non-stops means ‘only 3’ Hearer thinks: If speaker meant 7 non-stops she would have said 7 non-stops A: Did you do the reading for today’s class? B: I meant to Implicature: No B’s answer would be true if B intended to do the reading AND in fact did do the reading, but would then violate Maxim of Quantity 9/22/2018

Dialogue System Architecture 9/22/2018

Speech Recognition Input: acoustic waveform Output: string of words Basic components: A recognizer for phones, small sound units like [k] or [ae]. A pronunciation dictionary mapping orthography to sounds, e.g. can = [k ae n] A language model or grammar telling us what words are likely to follow what words I {want|need|would like} to {leave|fly} from {La Guardia|JFK|Newark} A search algorithm to find the best string of words 9/22/2018

Natural Language Understanding Or “NLU” Or “Computational Semantics” Many ways to represent the meaning of sentences For speech dialogue systems, most common is Frame and Slot Semantics 9/22/2018

A Sample Frame Show me morning flights from Boston to SF on Tuesday. ORIGIN: CITY: Boston DATE: Tuesday TIME: morning DEST: CITY: San Francisco 9/22/2018

How do we Generate this Semantics? Many methods Simplest: “semantic grammars” E.g. a simple Context Free Grammar LHS of rules defining semantic categories: LIST -> show me | I want | can I see|… DEPARTTIME -> (after|around|before) HOUR | morning | afternoon | evening HOUR -> one|two|three…|twelve (am|pm) FLIGHTS -> (a) flight|flights ORIGIN -> from CITY DESTINATION -> to CITY CITY -> Boston | San Francisco | Denver | Washington 9/22/2018

Semantics for a sentence LIST FLIGHTS ORIGIN Show me flights from Boston DESTINATION DEPARTDATE to San Francisco on Tuesday DEPARTTIME morning 9/22/2018

Generation and TTS Generation component TTS component Chooses concepts to express to user Plans how to express these concepts in words Assigns appropriate prosody to the words TTS component Takes words and prosodic annotations Synthesizes a waveform 9/22/2018

Generation Component Content Planner Language Generation Decides what information to express to user (ask a question, present an answer, etc.) Often merged with dialogue manager Language Generation Chooses syntactic structures and words to express meaning. Simplest method Pre-specify basic sentences and fill in the blanks as appropriate Template-based generation Can have variables: What time do you want to leave CITY-ORIG? Will you return to CITY-ORIG from CITY-DEST? 9/22/2018

More sophisticated language generation component Natural Language Generation Approach: Dialogue manager builds representation of meaning of utterance to be expressed Passes this to a “generator” Generators have three components Sentence planner Surface realizer Prosody assigner 9/22/2018

Architecture of a generator for a dialogue system (after Walker and Rambow 2002) 9/22/2018

HCI constraints on generation for dialogue: Coherence Discourse markers and pronouns (Coherence”): Please say the date.… Please say the start time.… Please say the duration… Please say the subject… First, tell me the date.… Next, I’ll need the time it starts.… Thanks. <pause> Now, how long is it supposed to last?… Last of all, I just need a brief description Bad! Good! 9/22/2018

HCI Constraints on Dialogue Generation Coherence: Tapered Prompts Prompts which get incrementally shorter: System: Now, what’s the first company to add to your watch list? Caller: Cisco System: What’s the next company name? (Or, you can say, “Finished”) Caller: IBM System: Tell me the next company name, or say, “Finished.” Caller: Intel System: Next one? Caller: America Online. System: Next? Caller: … 9/22/2018

Issues in Building SDS Which architecture to use? How much discourse history to collect? What type of dialogue strategy to follow? What type of confirmation strategy to use? What kind of help to provide? How to recover from errors? 9/22/2018

Discourse History What information to keep track of? What you need to find out from the user What you think the user knows What the user has told you What else? 9/22/2018

Dialogue Strategy System initiative User initiative Mixed initiative Please tell me your destination city. User initiative How may I help you? I’d like to go from Boston to Baltimore on Monday, January 24th. Mixed initiative May I give you flight information? Yes, I’d like to go to Boston. Where would you be leaving from? 9/22/2018

Confirmation Strategy Explicit I want to transfer $500. Did you say you want to transfer $500? Implicit I want to transfer $500 What account do you want to transfer $500 to? None at all Okay. 9/22/2018

Providing Help General help module Context-dependent help What kinds of things can I do in this SDS? What kinds of things can I say? What do I do when I get totally stuck? Can I back up without starting over? Can I start over from the beginning? Can I talk to a person? Context-dependent help Make help specific to where the user is in the dialogue 9/22/2018

How can users recover from errors? Clarification subdialogues Did you say you need 47 lettuces shipped to Kandajar? No! That’s not what I said. What do you want shipped to Kandajar? Ladders, not lettuce! Did you say ‘ladders’? Yes. You want ladders shipped to Kandajar? Yes, 47 ladders. 9/22/2018

Next Class Speech sounds  phonetics and phonology Make appointments with me or your TAs to discuss your project Bring a one-paragraph description At least one team member present Project descriptions now due February 6 9/22/2018