Presentation is loading. Please wait.

Presentation is loading. Please wait.

Creating a Corpus for A Conversational Assistant for Everyday Tasks Henry Kautz, Young Song, Ian Pereira, Mary Swift, Walter Lasecki, Jeff Bigham, James.

Similar presentations


Presentation on theme: "Creating a Corpus for A Conversational Assistant for Everyday Tasks Henry Kautz, Young Song, Ian Pereira, Mary Swift, Walter Lasecki, Jeff Bigham, James."— Presentation transcript:

1 Creating a Corpus for A Conversational Assistant for Everyday Tasks Henry Kautz, Young Song, Ian Pereira, Mary Swift, Walter Lasecki, Jeff Bigham, James Allen University of Rochester

2 Goals Fine-grained activity recognition combining speech and RGBD vision Learning and recognizing multi-step activities from (one-shot) instruction Learning names and properties of objects from instruction Tracking and assistance using task model

3 kinect video overhead micslapel mic open/close sensors RFID sensors power meter

4

5 Language Logical Form “I’m going to make a cup of tea.”

6 Extracted Events “I put it on the stove.” :event ont::put :agent user :theme v123 :start 0 :end 32 :utt 2 :speechtime/eventtime reln: overlap

7

8

9

10 Domains Making tea - 12 subjects x 3 episodes Making sandwiches Building things with blocks Coarse-grained home activities Snack bar surveillance

11 Labeling Corpus Need to label data for Supervised learning methods Evaluating supervised or unsupervised methods “Gold standard” Define event ontology Hand label Review / correct by second investigator 1 hour per 2 minutes Alternative?

12 Crowd AR Idea Try to recognize activities using current model When confidence is low, ask human workers to label video segment Mediate response Update model with new labels

13 Workers watch a live video stream of an activity and enter open-ended text labels into the bottom text field They can see the responses of other workers and the learningmodel (HMM) on to the right of the video, and agree with them by clicking on them. Worker Interface

14 Mediator An example of the graph created by the input mediator Green nodes represent sufficient agreement between multiple workers (here N = 2). The final sequence matches the baseline despite incorrect (over-specific) submissions by 2 out of the 3 workers, and a spelling error by one worker on “walk”the word ‘walk’.

15 Interactive Recognition and Labeling Experiments Domain: coarse-grained activities Model: HMM

16 Privacy

17 Monitoring Multi-Agent Scenarios Surveillance of department honor snack bar 85% correct on 11 trials

18 Parameterized & Complex Activities Average number of objects and actions correctly labeled by worker groups of different sizes over two different activity sequences. As the group size increases, more objects and actions are labeled.


Download ppt "Creating a Corpus for A Conversational Assistant for Everyday Tasks Henry Kautz, Young Song, Ian Pereira, Mary Swift, Walter Lasecki, Jeff Bigham, James."

Similar presentations


Ads by Google