Download presentation
Presentation is loading. Please wait.
Published byCorey Fox Modified over 9 years ago
1
Chapter 8. Situated Dialogue Processing for Human-Robot Interaction Greert-Jan M. Kruijff, Pierre Lison, Trevor Benjamin, Henril Jacobsson, Hendrik Zender, Ivana Kruijff-Korbayova, and Nick Hawes Kim, Jong In Oct,2, 2015 College of Interdisciplinary Program in Cognitive Science Seoul National University
2
Contents 8.4. Talking about What You Can See(Small Scale) What you see and what you mean What you All know, and What you are Saying Using What you see to Rank Alternative Interpretations Referring to What you see 8.5. Talking about places You can Visit(Large Scale) Talking about places Representing Places to Talk about Referring to Elsewhere Determining the Appropriate contrast set Understanding References to Elsewhere 8.6 Talking about things you can do 8.7. Conclusions 2
3
Overview 3
4
Talking about What you Can see Talking about What You Can See 1) A small-scale space or closed context 2) Perception – Symbol 3) Grounding – determines the meaning of the linguistic symbol 4) Old information(common ground) vs New information (Guide Learning) “The red mug is Big” 4
5
Guide Learning © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 5
6
Translation of Spatial expressions. 6 Talking about What you Can see
7
Ontology © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 7
8
Talking about What you Can see What you see and What you mean 1) Semantic – Ontologically Sorted 2) Delimitation, Quantification (Grounding) 3) Dealing with instances, update 4) Evaluation whether the new information is added Motivation Working memoryBinding Working memory Prompt to take action Intentional contents Operating modality Visuo-Spatial information Indexical content
9
Talking about What you Can see Using What you see to Rank Alternative Interpretations :assign a score to each possible semantic interpretation of a given spoken input 1) Linguistic feature -The acoustic feature (ASR score ) -The syntactic Level (Parsing) -The semantic Level (Logical form) 1) Contextual Feature -Situated context(the objects in the visual scene) -Dialogue context(previously referred entities in the dialogue history) 2) Current data set -195 individual utterance -Check exact match, partial match, error match -Depending on Grammar relaxation, Activated feature
10
Talking about What you Can see Using What you see to Rank Alternative Interpretations
11
Talking about What you Can see 1) Indexical Ambiguity 2) “Put the ball near the mug to the left of the box” Using What you see to figure out what is meant
12
Talking about What you Can see 1) Robot could refer to objects and the spatial relations 2) Q1. If the number of Object is too much? 3) Problem) Inter-object Spatial Relation 4) Define the set of objects –function as landmark – Referring to What you see(small scale)
13
Talking about Places you can visit Talking about Places(Large Scale) 1) How people tend to employ many different strategies to introduce new location to Robot? (“Space which cannot be perceived at once”) 2) How human presents a familiar indoor environment to a robot (Guided tour scenario) -Human guided the robot around and names and objects
14
Talking about Places you can visit Talking about Places(Large Scale) Strategies to introduce new locations. 1) Naming whole rooms (“this it the kitchen” –referring to the room itself) 2) Naming specific locations (“this is the kitchen” –referring to cooking area) 3) Naming specific locations by the objects (“this is the coffee machine”) ->Personalizing the representati on of the environment that robot Constructs)
15
15 Multi-Layer Spatial Map 1)The first layer of SLAM SLAM: Simultaneous localization and mapping 1) Abstract form. 2) Topological segmentation is represented by the coloring of the nodes 3) In order to determine category of an area, we take a majority vote approach of the classification 1) On the highest level of Abstraction 2) Spatial unit(e.g. room) ->human concept(e.g. kitchen)
16
Overview 16 Conceptual Map is represented as a Description logic ontology -conceptual taxonomy (hand-written commonsense ontololgy representing various aspects of indoor environ ment) -a storage of instances (T-Box and A-Box of a Description Logics reaso ning framework)
17
SLAM – Metric map © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 17 Video https://www.youtube.com/watch?v=mQQL8 pmztb4
18
Talking about Places you can visit Referring to Elsewhere 1.“the kitchen around the corner” 2.“the red mug left of the china plate” 3.“Peter’s office” 4.“the large hall on the first floor” Communicate goal issue. 1) Robot are good at measuring exact distances, but humans are not 2) Infinite recursion 3) The robot might have a vast knowledge but have to separate uniquely the referent from all entities.
19
Talking about Places you can visit Issue1) Determining proper contrast Set 1) If contrast set is too much? 2) if contrast set is too little? Issue2) Robot viewpoints? 1) Exact measure or Topological Abstraction -The context for a dialogue situated in large-scale space can be determined on the basis of a topological representation like human Determining the Appropriate Contrast set
20
8.6 Talking about things you can do 1) Action Planning: the goal of action planning is to choose actions and ordering relations among these actions to achieve a suitable location and plans for executing the action - Action planning and Dialogue processing should interact. 2) Event nucleus -it models the action as an event with temporal and casual dimension 3) Indexicality, intentionality, and information structure “First take the mug, and put it in near the plate” Things you can do
21
Small Scale Guide Tour -Ambiguity, Semantic Grounding -Motivation working memory -Binding working memory Large Scale Guide Tour -Multi-Layered Spatial mapping -topological map, conceptual map Action -Action planning, Summary
22
Debate issue 1. How to solve ambiguity problem in Situated Dialogue. 2. Drone and Robot cooperation
23
Q & A © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 23
24
Appendix 질문 대비 © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 24
25
APPENDIX( 질문 대비 )
26
Ambiguity © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 26
27
Human Augmented Mapping © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 27
28
28 Ambiguity
29
Topological Partitioning © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 29 Solution> Topological Partitioning -Large red stars indicate doorways and the different coloring of the nodes depicts the topological partitioning of the environment
30
30 Ambiguity
31
31 CCG grammar
32
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 32 Clarification
33
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 33 Clarification
34
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 34 Clarification
35
Event nucleus © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 35
36
Generation of Referring Expressions © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 36
37
Basic Incremental Algorithm for GRE © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 37
38
SLAM © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 38
39
Conceptual map © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 39
40
Multi-Layered Conceptual Spatial MAP © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 40
41
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 41
42
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 42
43
Ontolgoy © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 43
44
Semantic Ontology © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 44
45
Parsing © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 45
46
Parsing © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 46
47
Simple Parse © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 47
48
Parsing © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 48
49
Parse © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 49
50
Referential Anchoring © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 50
51
51
52
Temporal-Causal Sequence © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 52
53
Temporal © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 53
54
Robotic Platform © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 54
55
Guide Learning © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 55
56
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 56
57
Guided Learning © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 57
58
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 58
59
Topology © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 59
60
Multi-Layered Conceptual Spatial Map © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 60
61
Perceptual Control System © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 61
62
Robotic Architecture © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 62
63
Spatial Interpretation © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 63
64
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 64
65
T-box © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 65
66
Delimitation © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 66
67
Human Augmented Mapping © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 67
68
Proxy- Reasoner © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 68
69
© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 69 Wizard of Oz experiment, / nominal constructs
70
Indexicality © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 70
71
Intentionality © 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 71
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.