David Chen Supervising Professor: Raymond J. Mooney Doctoral Dissertation Proposal December 15, 2009 Learning Language from Perceptual Context 1.

Slides:

Advertisements

Similar presentations

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Advertisements

1 CS 388: Natural Language Processing: N-Gram Language Models Raymond J. Mooney University of Texas at Austin.

Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.

Online Max-Margin Weight Learning for Markov Logic Networks Tuyen N. Huynh and Raymond J. Mooney Machine Learning Group Department of Computer Science.

Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart

Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.

1 Learning Language from its Perceptual Context Ray Mooney Department of Computer Sciences University of Texas at Austin Joint work with David Chen Joohyun.

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.

Adapting Discriminative Reranking to Grounded Language Learning Joohyun Kim and Raymond J. Mooney Department of Computer Science The University of Texas.

Expert System Human expert level performance Limited application area Large component of task specific knowledge Knowledge based system Task specific knowledge.

Chapter 2: Algorithm Discovery and Design

A computational study of cross-situational techniques for learning word-to- meaning mappings Jeffrey Mark Siskind Presented by David Goss-Grubbs March.

Presented by Zeehasham Rasheed

Maximum Entropy Model LING 572 Fei Xia 02/07-02/09/06.

Chapter 2: Algorithm Discovery and Design

Chapter 2: Algorithm Discovery and Design

Introduction to Machine Learning Approach Lecture 5.

1 Statistical NLP: Lecture 13 Statistical Alignment and Machine Translation.

Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.

1 Learning to Interpret Natural Language Navigation Instructions from Observation Ray Mooney Department of Computer Science University of Texas at Austin.

1 Learning Natural Language from its Perceptual Context Ray Mooney Department of Computer Science University of Texas at Austin Joint work with David Chen.

Annual SERC Research Review - Student Presentation, October 5-6, Extending Model Based System Engineering to Utilize 3D Virtual Environments Peter.

Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.

DCT 1123 PROBLEM SOLVING & ALGORITHMS INTRODUCTION TO PROGRAMMING.

More Machine Learning Linear Regression Squared Error L1 and L2 Regularization Gradient Descent.

Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.

Machine Learning Group Department of Computer Sciences University of Texas at Austin Learning Language Semantics from Ambiguous Supervision Rohit J. Kate.

Author: James Allen, Nathanael Chambers, etc. By: Rex, Linger, Xiaoyi Nov. 23, 2009.

1 David Chen Advisor: Raymond Mooney Research Preparation Exam August 21, 2008 Learning to Sportscast: A Test of Grounded Language Acquisition.

David L. Chen Supervisor: Professor Raymond J. Mooney Ph.D. Dissertation Defense January 25, 2012 Learning Language from Ambiguous Perceptual Context.

David Chen Advisor: Raymond Mooney Research Preparation Exam August 21, 2008 Learning to Sportscast: A Test of Grounded Language Acquisition.

Chapter 2: Algorithm Discovery and Design Invitation to Computer Science, C++ Version, Third Edition.

Probabilistic Context Free Grammars for Representing Action Song Mao November 14, 2000.

Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.

Learning to Transform Natural to Formal Language Presented by Ping Zhang Rohit J. Kate, Yuk Wah Wong, and Raymond J. Mooney.

David L. Chen Fast Online Lexicon Learning for Grounded Language Acquisition The 50th Annual Meeting of the Association for Computational Linguistics (ACL)

1 Using Perception to Supervise Language Learning and Language to Supervise Perception Ray Mooney Department of Computer Sciences University of Texas at.

1 Semi-Supervised Approaches for Learning to Parse Natural Languages Rebecca Hwa

Unsupervised Constraint Driven Learning for Transliteration Discovery M. Chang, D. Goldwasser, D. Roth, and Y. Tu.

1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.

Using a Story-Based Approach to Teach Grammar

David L. Chen and Raymond J. Mooney Department of Computer Science The University of Texas at Austin Learning to Interpret Natural Language Navigation.

1 David Chen & Raymond Mooney Department of Computer Sciences University of Texas at Austin Learning to Sportscast: A Test of Grounded Language Acquisition.

Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.

LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.

University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Learning for Semantic Parsing Raymond.

A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.

The Software Development Process

Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.

Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏

University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Learning a Compositional Semantic Parser.

Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.

1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.

Natural Language Generation with Tree Conditional Random Fields Wei Lu, Hwee Tou Ng, Wee Sun Lee Singapore-MIT Alliance National University of Singapore.

Wei Lu, Hwee Tou Ng, Wee Sun Lee National University of Singapore

Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart

1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

A Simple English-to-Punjabi Translation System By : Shailendra Singh.

Progression in KS3/4 Algorithms MONDAY 30 TH NOVEMBER SUE SENTANCE.

1 Learning Language from its Perceptual Context Ray Mooney Department of Computer Sciences University of Texas at Austin Joint work with David Chen Rohit.

Grounded Language Learning

Statistical Machine Translation Part II: Word Alignments and EM

Semantic Parsing for Question Answering

Using String-Kernels for Learning Semantic Parsers

Learning to Transform Natural to Formal Languages

Joohyun Kim Supervising Professor: Raymond J. Mooney

Integrating Learning of Dialog Strategies and Semantic Parsing

Learning to Sportscast: A Test of Grounded Language Acquisition

Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.

Presentation transcript:

David Chen Supervising Professor: Raymond J. Mooney Doctoral Dissertation Proposal December 15, 2009 Learning Language from Perceptual Context 1

Semantics of Language The meaning of words, phrases, etc Learning semantics of language is one of the ultimate goals in natural language processing The meanings of many words are grounded in our perception of the physical world: red, ball, cup, run, hit, fall, etc. [Harnad, 1990] Computer representation should also be grounded in real world perception 2

Grounding Language Block(Casillas) Spanish goalkeeper Casillas blocks the ball 3

Grounding Language Block(Casillas) Spanish goalkeeper Casillas blocks the ball 4

Natural Language and Meaning Representation Spanish goalkeeper Casillas blocks the ball Block(Casillas) 5

Natural Language and Meaning Representation Natural Language (NL) NL: A language that has evolved naturally, such as English, German, French, Chinese, etc Block(Casillas) Spanish goalkeeper Casillas blocks the ball 6

Natural Language and Meaning Representation NL: A language that has evolved naturally, such as English, German, French, Chinese, etc MRL: Formal languages such as logic or any computer-executable code Meaning Representation Language (MRL) Block(Casillas) Spanish goalkeeper Casillas blocks the ball Natural Language (NL) 7

Semantic Parsing and Tactical Generation NL Semantic Parsing: maps a natural-language sentence to a complete, detailed semantic representation MRL Semantic Parsing (NL  MRL) Block(Casillas) Spanish goalkeeper Casillas blocks the ball 8

Semantic Parsing and Tactical Generation NL Semantic Parsing: maps a natural-language sentence to a complete, detailed semantic representation Tactical Generation: Generates a natural-language sentence from a meaning representation. MRL Semantic Parsing (NL  MRL) Tactical Generation (NL  MRL) Block(Casillas) Spanish goalkeeper Casillas blocks the ball 9

Strategic Generation Strategic Generation (Content Selection): Given a set of meaning representations, select a subset 10 Pass(Beck, Hunt) Pass(Hunt, Podolski) Shoot(Podolski) Block(Casillas) … Block(Casillas)

Applications Natural language interface – Issue commands and queries in natural language – Computer responds with answer in natural language Knowledge acquisition Computer assisted tasks 11

Traditional Learning Approach Manually Annotated Training Corpora (NL/MRL pairs) Semantic Parser MRLNL Semantic Parser Learner 12

Traditional Learning Approach Manually Annotated Training Corpora (NL/MRL pairs) Tactical Generator MRLNL Tactical Generator Learner 13

Example of Annotated Training Corpus 14 Natural Language (NL) Meaning Representation Language (MRL) Alice passes the ball to BobPass(Alice, Bob) Bob turns the ball over to JohnTurnover(Bob, John) John passes to FredPass(John, Fred) Fred shoots for the goalKick(Fred) Paul blocks the ballBlock(Paul) Paul kicks off to NancyPass(Paul, Nancy) ……

Learning Language from Perceptual Context Constructing annotated corpora for language learning is difficult Children acquire language through exposure to linguistic input in the context of a rich, relevant, perceptual environment Ideally, a computer system can learn language in the same manner 15

Learning in Virtual Environment Many schools use 3D virtual environments to support language learning – Immersive: Surrounded by a stimulating environment – Social: Language learners can interact with others – Creative: Constructing objects as part of learning Online worlds including Second Life Different ways of learning – Task-based learning – Collaborative construction – Virtual tourism 16

Learning in Virtual Environment Growing video game industry – $9.5 billion in the US in 2007, $11.7 billion in 2008 (Entertainment Software Association annual report) Serious games – DARWARS: Military training systems – SimPort: Simulated construction and management of a sea port project 17

Navigation Example 18 在餐廳的地方右轉 Alice: Bob

Navigation Example 19 在醫院的地方右轉 Alice: Bob

Navigation Example 20 在醫院的地方右轉在餐廳的地方右轉 Scenario 1 Scenario 2

Navigation Example 21 在的地方右轉 Scenario 1 Scenario 2 Make a right turn

Navigation Example 22 在醫院的地方右轉在餐廳的地方右轉 Scenario 1 Scenario 2

Navigation Example 23 餐廳 Scenario 1 Scenario 2

Navigation Example 24 在醫院的地方右轉在餐廳的地方右轉 Scenario 1 Scenario 2

Navigation Example 25 醫院 Scenario 1 Scenario 2

Overview Background and related works Completed work: Sportscasting – Tactical generation – Strategic generation – Human evaluation Proposed work: Navigation instructions Conclusions 26

Semantic Parser Learners Learn a function from NL to MR NL: “Purple3 passes the ball to Purple5” MR: Pass ( Purple3, Purple5 ) Semantic Parsing (NL  MR) Tactical Generation (MR  NL) We experiment with two semantic parser learners –WASP [Wong & Mooney, 2006; 2007] –KRISP [Kate & Mooney, 2006] 27

Uses statistical machine translation techniques – Synchronous context-free grammars (SCFG) [Wu, 1997; Melamed, 2004; Chiang, 2005] – Word alignments [Brown et al., 1993; Och & Ney, 2003] Capable of both semantic parsing and tactical generation WASP: Word Alignment-based Semantic Parsing 28

KRISP: Kernel-based Robust Interpretation by Semantic Parsing Productions of MR language are treated like semantic concepts SVM classifier is trained for each production with string subsequence kernel These classifiers are used to compositionally build MRs of the sentences More resistant to noisy supervision but incapable of tactical generation 29

KRISPER: KRISP with EM-like Retraining Extension of K RISP that learns from ambiguous supervision [Kate & Mooney, 2007] Uses an iterative EM-like method to gradually converge on a correct meaning for each sentence. 30

Overview Background and related works Completed work: Sportscasting – Tactical generation – Strategic generation – Human evaluation Proposed work: Navigation instructions Conclusions 31

Tractable Challenge Problem: Learning to Be a Sportscaster Goal: Learn from realistic data of natural language used in a representative context while avoiding difficult issues in computer perception (i.e. speech and vision). Solution: Learn from textually annotated traces of activity in a simulated environment. Example: Traces of games in the RoboCup simulator paired with textual sportscaster commentary. 32

RoboCup Simulation League 33

RoboCup Simulation League Purple goalie blocked the ball 34

Learning to Sportscast Learn to sportscast by observing sample human sportscasts Build a function that maps between natural language (NL) and meaning representation (MR) – NL: Textual commentaries about the game – MR: Predicate logic formulas that represent events in the game 35

Robocup Sportscaster Trace Natural Language CommentaryMeaning Representation Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 36

Robocup Sportscaster Trace Natural Language CommentaryMeaning Representation Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 37

Robocup Sportscaster Trace Natural Language CommentaryMeaning Representation Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 38

Robocup Sportscaster Trace Natural Language CommentaryMeaning Representation Purple goalie turns the ball over to Pink8 P6 ( C1, C19 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 P5 ( C1, C19 ) P2 ( C22, C19 ) P2 ( C19, C22 ) P0 P2 ( C19, C22 ) P1 ( C22 ) P1( C19 ) P1 ( C22 ) P1 ( C19 ) 39

Robocup Data Collected human textual commentary for the 4 Robocup championship games from –Avg # events/game = 2,613 –Avg # English sentences/game = 509 –Avg # Korean sentences/game = 499 Each sentence matched to all events within previous 5 seconds. –Avg # MRs/sentence = 2.5 (min 1, max 12) Manually annotated with correct matchings of sentences to MRs (for evaluation purposes only). 40

Overview Background and related works Completed work: Sportscasting – Tactical generation – Strategic generation – Human evaluation Proposed work: Navigation instructions Conclusions 41

Tactical Generation Learn how to generate NL from MR Example: Two steps 1.Disambiguate the training data 2.Learn a language generator Pass(Pink2, Pink3)  “Pink2 kicks the ball to Pink3” 42

WASPER WASP with EM-like retraining to handle ambiguous training data. Same augmentation as added to KRISP to create KRISPER. 43

First train KRISPER to disambiguate the data Then train WASP on the resulting unambiguously supervised data. KRISPER-WASP 44

WASPER-GEN Determines the best matching based on generation (MR→NL). Score each potential NL/MR pair by using the currently trained WASP -1 generator. Compute NIST MT score [NIST report, 2002] between the generated sentence and the potential matching sentence. 45

BLEU/NIST scores Target: Purple2 quickly passes to Purple3 Candidate: Purple2 passes to Purple3 1-grams: Purple2, passes, to, Purple3 2-grams: Purple2 passes, passes to, to Purple3 3-grams: Purple2 passes to, passes to Purple3 4-gram: Purple2 passes to Purple3 46

BLEU/NIST scores Target: Purple2 quickly passes to Purple3 Candidate: Purple2 passes to Purple3 1-grams: Purple2, passes, to, Purple3 2-grams: Purple2 passes, passes to, to Purple3 3-grams: Purple2 passes to, passes to Purple3 4-gram: Purple2 passes to Purple3 4/4 47

BLEU/NIST scores Target: Purple2 quickly passes to Purple3 Candidate: Purple2 passes to Purple3 1-grams: Purple2, passes, to, Purple3 2-grams: Purple2 passes, passes to, to Purple3 3-grams: Purple2 passes to, passes to Purple3 4-gram: Purple2 passes to Purple3 4/4 2/3 48

BLEU/NIST scores Target: Purple2 quickly passes to Purple3 Candidate: Purple2 passes to Purple3 1-grams: Purple2, passes, to, Purple3 2-grams: Purple2 passes, passes to, to Purple3 3-grams: Purple2 passes to, passes to Purple3 4-gram: Purple2 passes to Purple3 4/4 2/3 1/2 49

BLEU/NIST scores Target: Purple2 quickly passes to Purple3 Candidate: Purple2 passes to Purple3 1-grams: Purple2, passes, to, Purple3 2-grams: Purple2 passes, passes to, to Purple3 3-grams: Purple2 passes to, passes to Purple3 4-gram: Purple2 passes to Purple3 4/4 2/3 1/2 0/1 50

BLEU/NIST scores Target: Purple2 quickly passes to Purple3 Candidate: Purple2 passes to Purple3 1-grams: Purple2, passes, to, Purple3 2-grams: Purple2 passes, passes to, to Purple3 3-grams: Purple2 passes to, passes to Purple3 4-gram: Purple2 passes to Purple3 4/4 2/3 1/2 0/1 51 BLEU:

BLEU/NIST scores Target: Purple2 quickly passes to Purple3 Candidate: Purple2 passes to Purple3 1-grams: Purple2, passes, to, Purple3 2-grams: Purple2 passes, passes to, to Purple3 3-grams: Purple2 passes to, passes to Purple3 4-gram: Purple2 passes to Purple3 4/4 2/3 1/2 0/1 52 BLEU: NIST:

System Overview Purple7 loses the ball to Pink2 SportscasterRobocup Simulator Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) 53

System Overview SportscasterRobocup Simulator Semantic Parser Learner Initial Semantic Parser 54 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 )

System Overview SportscasterRobocup Simulator Initial Semantic Parser Unambiguous Training Data 55 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) X Purple7 loses the ball to Pink2Kick ( pink2 ) O Pink2 kicks the ball to Pink5 Pass ( pink2, pink5 ) X Pink5 makes a long pass to Pink8Kick ( pink5 ) O Pink8 shoots the ballKick ( pink8 )

System Overview SportscasterRobocup Simulator Semantic Parser Semantic Parser Learner 56 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) Unambiguous Training Data X Purple7 loses the ball to Pink2Kick ( pink2 ) O Pink2 kicks the ball to Pink5 Pass ( pink2, pink5 ) X Pink5 makes a long pass to Pink8Kick ( pink5 ) O Pink8 shoots the ballKick ( pink8 )

System Overview SportscasterRobocup Simulator Semantic Parser Semantic Parser Learner 57 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) Unambiguous Training Data O Purple7 loses the ball to Pink2 Turnover ( purple7, pink2 ) O Pink2 kicks the ball to Pink5 Pass ( pink2, pink5 ) X Pink5 makes a long pass to Pink8 Kick ( pink5 ) O Pink8 shoots the ballKick ( pink8 )

System Overview SportscasterRobocup Simulator Semantic Parser Semantic Parser Learner 58 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) Unambiguous Training Data O Purple7 loses the ball to Pink2 Turnover ( purple7, pink2 ) O Pink2 kicks the ball to Pink5 Pass ( pink2, pink5 ) X Pink5 makes a long pass to Pink8 Kick ( pink5 ) O Pink8 shoots the ballKick ( pink8 )

System Overview SportscasterRobocup Simulator Semantic Parser Semantic Parser Learner 59 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) Unambiguous Training Data O Purple7 loses the ball to Pink2 Turnover ( purple7, pink2 ) O Pink2 kicks the ball to Pink5 Pass ( pink2, pink5 ) O Pink5 makes a long pass to Pink8 Pass ( pink5, pink8 ) O Pink8 shoots the ballKick ( pink8 )

KRISPER and WASPER SportscasterRobocup Simulator Semantic Parser Semantic Parser Learner (KRISP/WASP) 60 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) Unambiguous Training Data O Purple7 loses the ball to Pink2 Turnover ( purple7, pink2 ) O Pink2 kicks the ball to Pink5 Pass ( pink2, pink5 ) X Pink5 makes a long pass to Pink8 Kick ( pink5 ) O Pink8 shoots the ballKick ( pink8 )

WASPER-GEN SportscasterRobocup Simulator Tactical Generator Tactical Generator Learner (WASP) 61 Purple7 loses the ball to Pink2 Ambiguous Training Data Pink2 kicks the ball to Pink5 Pink5 makes a long pass to Pink8 Pink8 shoots the ball Turnover ( purple7, pink2 ) Pass ( pink5, pink8) Pass ( purple5, purple7 ) Kick ( pink2 ) Pass ( pink2, pink5 ) Kick ( pink5 ) Ballstopped Kick ( pink8 ) Unambiguous Training Data O Purple7 loses the ball to Pink2 Turnover ( purple7, pink2 ) O Pink2 kicks the ball to Pink5 Pass ( pink2, pink5 ) X Pink5 makes a long pass to Pink8 Kick ( pink5 ) O Pink8 shoots the ballKick ( pink8 )

Matching 4 Robocup championship games from – Avg # events/game = 2,613 – Avg # English sentences/game = 509 Leave-one-game-out cross-validation Metric: – Precision: % of system’s annotations that are correct – Recall: % of gold-standard annotations produced – F-measure: Harmonic mean of precision and recall 62

Matching Results 63

Tactical Generation Measure how accurately NL generator produces English sentences for chosen MRs in the test games. Use gold-standard matches to determine the correct sentence for each MR that has one. Leave-one-game-out cross-validation Metric: – BLEU score: [Papineni et al, 2002], N=4 64

Tactical Generation Results 65

Overview Background and related works Completed work: Sportscasting – Tactical generation – Strategic generation – Human evaluation Proposed work: Navigation instructions Conclusions 66

Strategic Generation Generation requires not only knowing how to say something (tactical generation) but also what to say (strategic generation). For automated sportscasting, one must be able to effectively choose which events to describe. 67

Example of Strategic Generation pass ( purple7, purple6 ) ballstopped kick ( purple6 ) pass ( purple6, purple2 ) ballstopped kick ( purple2 ) pass ( purple2, purple3 ) kick ( purple3 ) badPass ( purple3, pink9 ) turnover ( purple3, pink9 ) 68

Example of Strategic Generation pass ( purple7, purple6 ) ballstopped kick ( purple6 ) pass ( purple6, purple2 ) ballstopped kick ( purple2 ) pass ( purple2, purple3 ) kick ( purple3 ) badPass ( purple3, pink9 ) turnover ( purple3, pink9 ) 69

Strategic Generation For each event type (e.g. pass, kick) estimate the probability that it is described by the sportscaster. Requires correct NL/MR matching – Use estimated matching from tactical generation – Iterative Generation Strategy Learning 70

Iterative Generation Strategy Learning (IGSL) Directly estimates the likelihood of an event being commented on Self-training iterations to improve estimates Uses events not associated with any NL as negative evidence 71

IGSL Example Natural Language CommentaryMeaning Representation 72 Purple7 passes to Purple 3 Pink4 passes out to Pink8 Pass ( purple7, purple3 ) Turnover ( purple3, pink4 ) Kick ( purple7 ) Ballstopped Kick ( pink3 ) Kick ( pink4 ) Pass ( pink4, pink8 ) Purple3 loses the ball to Pink4 Ballstopped

IGSL Example Natural Language CommentaryMeaning Representation 73 Purple7 passes to Purple 3 Pink4 passes out to Pink8 Pass ( purple7, purple3 ) Turnover ( purple3, pink4 ) Kick ( purple7 ) Ballstopped Kick ( pink3 ) Kick ( pink4 ) Pass ( pink4, pink8 ) 1/3 1/2 1 1/3 Purple3 loses the ball to Pink4 Ballstopped 1/3

IGSL Example Natural Language CommentaryMeaning Representation 74 Purple7 passes to Purple 3 Pink4 passes out to Pink8 Pass ( purple7, purple3 ) Turnover ( purple3, pink4 ) Kick ( purple7 ) Ballstopped Kick ( pink3 ) Kick ( pink4 ) Pass ( pink4, pink8 ) 1/3 1/2 1 1/3 Purple3 loses the ball to Pink4 P ( Kick ) = (1/3 + 1/2) / 3 = Ballstopped 1/3

IGSL Example Natural Language CommentaryMeaning Representation 75 Purple7 passes to Purple 3 Pink4 passes out to Pink8 Pass ( purple7, purple3 ) Turnover ( purple3, pink4 ) Kick ( purple7 ) Ballstopped Kick ( pink3 ) Kick ( pink4 ) Pass ( pink4, pink8 ) 1/3 1/2 1 1/3 Purple3 loses the ball to Pink4 P ( Kick ) = (1/3 + 1/2) / 3 = P ( Pass ) = (1/3 + 1/2) / 2 = P ( Turnover) = (1) / 1 = 1 P (Ballstopped ) = (1/3) / 2 = 0.17 Ballstopped 1/3

IGSL Example Natural Language CommentaryMeaning Representation 76 Purple7 passes to Purple 3 Pink4 passes out to Pink8 Pass ( purple7, purple3 ) Turnover ( purple3, pink4 ) Kick ( purple7 ) Ballstopped Kick ( pink3 ) Kick ( pink4 ) Pass ( pink4, pink8 ) / ( ) = Purple3 loses the ball to Pink4 Ballstopped P ( Kick ) = (1/3 + 1/2) / 3 = P ( Pass ) = (1/3 + 1/2) / 2 = P ( Turnover) = (1) / 1 = 1 P (Ballstopped ) = (1/3) / 2 = 0.17

IGSL Example Natural Language CommentaryMeaning Representation 77 Purple7 passes to Purple 3 Pink4 passes out to Pink8 Pass ( purple7, purple3 ) Turnover ( purple3, pink4 ) Kick ( purple7 ) Ballstopped Kick ( pink3 ) Kick ( pink4 ) Pass ( pink4, pink8 ) Purple3 loses the ball to Pink4 P ( Kick ) = ( ) / 3 = P ( Pass ) = ( ) / 2 = P ( Turnover) = (1) / 1 = 1 P (Ballstopped ) = (0.194) / 2 = Ballstopped 0.323

IGSL Example Natural Language CommentaryMeaning Representation 78 Purple7 passes to Purple 3 Pink4 passes out to Pink8 Pass ( purple7, purple3 ) Turnover ( purple3, pink4 ) Kick ( purple7 ) Ballstopped Kick ( pink3 ) Kick ( pink4 ) Pass ( pink4, pink8 ) Purple3 loses the ball to Pink4 P ( Kick ) = P ( Pass ) = P ( Turnover) = (1) / 1 = 1 P (Ballstopped ) = (1/3) / 2 = Ballstopped 0.160

Strategic Generation Performance Evaluate how well the system can predict which events a human comments on Metric: – Precision: % of system’s annotations that are correct – Recall: % of gold-standard annotations correctly produced – F-measure: Harmonic mean of precision and recall 79

Strategic Generation Results 80

Overview Background and related works Completed work: Sportscasting – Tactical generation – Strategic generation – Human evaluation Proposed work: Navigation instructions Conclusions 81

Used Amazon’s Mechanical Turk to recruit human judges (~40 judges per video) 8 commented game clips – 4 minute clips randomly selected from each of the 4 games – Each clip commented once by a human, and once by the machine Presented in random counter-balanced order Judges were not told which ones were human or machine generated 82 Human Evaluation

Score English Fluency Semantic Correctness Sportscasting Ability 5FlawlessAlwaysExcellent 4GoodUsuallyGood 3Non-nativeSometimesAverage 2DisfluentRarelyBad 1GibberishNeverTerrible 83

Demo Clip Game clip commentated using WASPER-GEN with IGSL, since this gave the best results for generation. FreeTTS was used to synthesize speech from textual output. English: Korean: 84

Human Evaluation SyntaxSemanticOverallHuman? 2001 Human % 2001 Machine % 2002 Human % 2002 Machine % 2003 Human % 2003 Machine % 2004 Human % 2004 Machine % Average Human % Average Machine % 85

Overview Background and related works Completed work: Sportscasting – Tactical generation – Strategic generation – Human evaluation Proposed work: Navigation instructions Conclusions 86

Referential vs. Functional Meanings Referential meanings – Describe objects and events in the world – Completed work on learning to sportscast Functional meanings – Aim to achieve some actions in the world – Proposed work on learning navigation instructions 87

Challenge on Generating Instructions in Virtual Environments (GIVE) 88

Learning Approach Passive learning – Observes human instructor guiding a human follower Interactive learning as follower – Tries to follow human instructions Interactive learning as instructor – Generates instructions to guide human follower 89

Navigation Task Two participants: instructor and follower Given: starting location and destination Instructor: Give directions for navigating Follower: Follows direction Success if follower reaches the intended destination Data contains ~800 instructions for 3 virtual environments [MacMahon et al., 2006] 90

Evaluations Task completion – Did the follower reach the destination? Efficiency – How long and how many steps did it take to complete the task Partial correctness – How much of the task did the follower complete 91

Challenges Many different instructions for the same task – Describe different actions – Use different parameters – Different ways to describe the same parameters Hidden MRs – Needs to infer the MR from observed actions Exponential number of possible MRs 92

Environment 93

H C L S S B C H E L E Environment H – Hat Rack L – Lamp E – Easel S – Sofa B – Barstool C - Chair 94

Example of Instructions Take your first left. Go all the way down until you hit a dead end. Go towards the coat hanger and turn left at it. Go straight down the hallway and the dead end is position 4. Walk to the hat rack. Turn left. The carpet should have green octagons. Go to the end of this alley. This is p-4. Walk forward once. Turn left. Walk forward twice. Start: 3, End: H H

Example of Instructions 3 3 H H 4 4 Take your first left. Go all the way down until you hit a dead end. Go towards the coat hanger and turn left at it. Go straight down the hallway and the dead end is position 4. Walk to the hat rack. Turn left. The carpet should have green octagons. Go to the end of this alley. This is p-4. Walk forward once. Turn left. Walk forward twice. Start: 3, End: 4 Observed primitive actions: Forward, Left, Forward, Forward 96

Overall System … Walk forward twice … … Forward, Forward … Instructions Primitive actions MR… Travel( Distance = 2) … 97

Overall System … Walk forward twice … … Forward, Forward … Instructions Primitive actions MR… Travel( Distance = 2) … 98

Overall System … Walk forward twice … … Forward, Forward … Instructions Primitive actions MR… Travel( Distance = 2) … Semantic ParsingTactical Generation NavigationStrategic Generation 99

Overall System … Walk forward twice … … Forward, Forward … Instructions Primitive actions MR… Travel( Distance = 2) … Semantic ParsingTactical Generation NavigationStrategic Generation 100

Modeling the Instruction Parsing Process Use semantic parser to produce a set of good MRs from the instructions Use the navigation component to verify which of these MRs result in the correct actions Refine the MRs if none of them are correct 101

Training Initial Semantic Parser Construct the most specific MR Overestimates the details 3 3 H H 4 4 Travel( Precondition=(Right=Wall, Left=Concrete Hall, Front=Blue Hall, Back=Blue Hall), Distance=1, Until=(Intersection(Order=1, Current Path=Blue Hallway, Cross Path=Yellow Hallway), Hat Rack), Postcondition=(Right=Yellow Hall, Left=Yellow Hall,Front=Blue Hall, Back=Blue Hall) ) 102

Refinement Modifies an MR until it produces the correct actions First remove any parts that do not appear in the most specific MR Then systematically add parts of the most specific MR Prefers the least amount of modification – Want a MR closer to the original parse Prefers shortest MR – Avoid superfluous connections 103

Interactive Learning The system can participate in the navigation task as instructor or follower Feedback from human partner helps fix errors in understanding Reweigh the rules that led to the positive or negative feedback 104

Overview Background and related works Completed work: Sportscasting – Tactical generation – Strategic generation – Human evaluation Proposed work: Navigation instructions Conclusions 105

Conclusion Current language learning work uses expensive, annotated training data. We have developed a language learning system that can learn from language paired with an ambiguous perceptual environment. We have evaluated it on the task of learning to sportscast simulated RoboCup games. The proposed future work aims to solve the problem of learning how to give and receive navigational instructions in a virtual world 106

Backup Slides 107

KRISPER Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 1. Assume every possible meaning for a sentence is correct 108

KRISPER Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 1. Assume every possible meaning for a sentence is correct 109

Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 1/2 1/3 1/2 KRISPER 2. Resulting NL-MR pairs are weighted and given to semantic parser learner 110

KRISPER Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 3. Estimate the confidence of each NL-MR pair using the resulting trained semantic parser

KRISPER Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Purple team is very sloppy today Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 4. Use maximum weighted matching on a bipartite graph to find the best NL-MR pairs [Munkres, 1957]

Purple team is very sloppy today KRISPER Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 4. Use maximum weighted matching on a bipartite graph to find the best NL-MR pairs [Munkres, 1957]

Purple team is very sloppy today KRISPER Purple goalie turns the ball over to Pink8 badPass ( Purple1, Pink8 ) Pink11 looks around for a teammate Pink8 passes the ball to Pink11 Pink11 makes a long pass to Pink8 Pink8 passes back to Pink11 turnover ( Purple1, Pink8 ) pass ( Pink11, Pink8 ) pass ( Pink8, Pink11 ) ballstopped pass ( Pink8, Pink11 ) kick ( Pink11 ) kick ( Pink8) kick ( Pink11 ) kick ( Pink8 ) 5. Give the best pairs to the semantic parser learner in the next iteration, and repeat until convergence 114