Interactively Learning Game Formulations in a Physically Instantiated Environment James Kirk Soar Workshop 2013 June 6, 2013 1.

Slides:

Advertisements

Similar presentations

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California USA

Advertisements

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California USA

Cognitive Systems, ICANN panel, Q1 What is machine intelligence, as beyond pattern matching, classification and prediction. What is machine intelligence,

Partially Observable Markov Decision Process (POMDP)

CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Mar, 4, 2015 Slide credit: some slides adapted from Stuart.

Learning Tasks through Situated Interactive Instruction James Kirk, John Laird Soar Workshop

Problem Solving and Search in AI Part I Search and Intelligence Search is one of the most powerful approaches to problem solving in AI Search is a universal.

Yiannis Demiris and Anthony Dearden By James Gilbert.

Introduction to SOAR Based on “a gentle introduction to soar: an Architecture for Human Cognition” by Jill Fain Lehman, John Laird, Paul Rosenbloom. Presented.

Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.

Artificial Intelligence and Lisp Lecture 13 Additional Topics in Artificial Intelligence LiU Course TDDC65 Autumn Semester, 2010

Emergent Adaptive Lexicons Luc Steels 1996 Artificial Intelligence Laboratory Vrije Universiteit Brussel Presented by Achim Ruopp University of Washington.

The Importance of Architecture for Achieving Human-level AI John Laird University of Michigan June 17, th Soar Workshop

Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 7: Expert Systems and Artificial Intelligence Decision Support.

1 New York State Mathematics Core Curriculum 2005.

Biointelligence Laboratory School of Computer Science and Engineering Seoul National University Cognitive Robots © 2014, SNU CSE Biointelligence Lab.,

ACOS 2010 Standards of Mathematical Practice

Introduction to AI, H. Feili 1 Introduction to Artificial Intelligence LECTURE 1: Introduction What is AI? Foundations of AI The.

Protein Tertiary Structure Prediction

1 AI and Agents CS 171/271 (Chapters 1 and 2) Some text and images in these slides were drawn from Russel & Norvig’s published material.

Learning Prepositions for Spatial Relationships in BOLT Soar Workshop 2012 James Kirk, John Laird 6/21/

Author: James Allen, Nathanael Chambers, etc. By: Rex, Linger, Xiaoyi Nov. 23, 2009.

A Multi-Domain Evaluation of Scaling in Soar’s Episodic Memory Nate Derbinsky, Justin Li, John E. Laird University of Michigan.

Artificial Intelligence Chapter 25 Agent Architectures Biointelligence Lab School of Computer Sci. & Eng. Seoul National University.

1 Solving problems by searching This Lecture Chapters 3.1 to 3.4 Next Lecture Chapter 3.5 to 3.7 (Please read lecture topic material before and after each.

Synthetic Cognitive Agent Situational Awareness Components Sanford T. Freedman and Julie A. Adams Department of Electrical Engineering and Computer Science.

Chapter 13 Artificial Intelligence and Expert Systems.

Outline Intro to Representation and Heuristic Search Machine Learning (Clustering) and My Research.

Joseph Xu Soar Workshop Learning Modal Continuous Models.

MODEL-BASED SOFTWARE ARCHITECTURES.  Models of software are used in an increasing number of projects to handle the complexity of application domains.

Chapter 10. The Explorer System in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans On, Kyoung-Woon Biointelligence Laboratory.

Rational Agency CSMC Introduction to Artificial Intelligence January 8, 2007.

CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Oct, 30, 2015 Slide credit: some slides adapted from Stuart.

Rational Agency CSMC Introduction to Artificial Intelligence January 8, 2004.

RULES Patty Nordstrom Hien Nguyen. "Cognitive Skills are Realized by Production Rules"

Introduction to State Space Search

Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.

Cognitive Architectures and General Intelligent Systems Pay Langley 2006 Presentation : Suwang Jang.

1 Representation and Evolution of Lego-based Assemblies Maxim Peysakhov William C. Regli ( Drexel University) Authors: {umpeysak,

Slide no 1 Cognitive Systems in FP6 scope and focus Colette Maloney DG Information Society.

Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.

Strategic Reasoning with Game Description Language Ji Ruan, Prof. W. van der Hoek, Prof. M. Wooldridge Department of Computer Science From Specific Game.

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, CA

G5AIAI Introduction to AI

1 A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.

Introduction to Math Methods Math Standards. Why can math be fun? Math can be fun because… it can have so much variety in topics. many different ways.

Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 12: Artificial Intelligence and Expert Systems.

NTT-MIT Collaboration Meeting, 2001Leslie Pack Kaelbling 1 Learning in Worlds with Objects Leslie Pack Kaelbling MIT Artificial Intelligence Laboratory.

Interactive Task Learning: Language Processing for Rosie John E. Laird and Peter Lindes University of Michigan 1.

Bob Wray, Charles Newton, Victor Hung, Norb Timpko 7 Jun 2017

Knowledge Representation and Reasoning

Cognitive Language Processing for Rosie

Learning Fast and Slow John E. Laird

How can Rosie tell me what it can do for me?

Artificial Intelligence Chapter 25 Agent Architectures

Knowledge Representation and Reasoning

Constraint Satisfaction Problems vs. Finite State Problems

Artificial Intelligence and Lisp Lecture 13 Additional Topics in Artificial Intelligence LiU Course TDDC65 Autumn Semester,

Representation and Evolution of Lego-based Assemblies

Cognitive Language Comprehension in Rosie

CSc4730/6730 Scientific Visualization

Learning the Task Definition of Games through ITL

CASE − Cognitive Agents for Social Environments

Symbolic cognitive architectures

AI and Agents CS 171/271 (Chapters 1 and 2)

Artificial Intelligence Chapter 25. Agent Architectures

Knowledge Representation I (Propositional Logic)

Artificial Intelligence Chapter 25 Agent Architectures

Presentation transcript:

Interactively Learning Game Formulations in a Physically Instantiated Environment James Kirk Soar Workshop 2013 June 6,

General Motivation How can an agent be taught a novel problem in a real-world environment? – Sufficient specification of the problem for agent to attempt to solve – Specifically focusing on games Long Term Goals – Robots with teachable extendable behavior – Flexible interactive instruction – Grounded knowledge acquisition General Requirements – Effective means to communicate problem space Problem space defines legal actions, state representation, terminal states, and goals No policy information – Sufficient representation of problem specification – Grounding of knowledge in shared environment – Integration of perception, communication, reasoning, and action in one agent – Generality-can learn a variety of games Ex: Towers of Hanoi, Tic-Tac-Toe, Frogs and Toads puzzle 2

System Overview 3 Instructive Dialog acquires problem space almost from scratch Starts with some primitive knowledge about: – Primitive verbs: pick-up(obj), put-down(xyz) – Primitive spatial relations: alignment along axes (ex: aligned along X axis) – Feature space knowledge of color, size, and shape Acquires: – Verb-action knowledge (move) – Spatial prepositions (in) – Object attributes (red)

System Overview 4 Instructive Dialog to acquire problem space and needed concepts Game Concept Network Interpret perception to find legal actions and internally search for goal Manipulate environment using discovered solution Game A1 C1 Tic-Tac-Toe P1 blocklocation C11C12 place move

Shortcomings of Existing Approaches Communication of problem space – Limited to formal languages, like C, STRIPS, or GDL – Cannot learn spatial relations for describing problem space – Do not share learned representations across multiple games – Focuses on learning through observation of game play Representation of problem space – Problem space specifications, like STRIPS or GDL, do not ground their representations and are acquired programmatically – Require full action models and initial state descriptions Integration – Few projects have attempted to integrate all of these components for end-to-end behavior – Knowledge must be grounded not only in perception, but across components 5

Major Contributions 1.A system that integrates the following components for end-to-end behavior for learning a subset of 2D grid-based games 2.A method for acquiring grounded concepts of spatial relationships for prepositions, which are used in communicating the problem description 3.The Game Concept Network (GCN) a)A representation of the game, including the problem space and goal/failure states b)The process to acquire the GCN through mixed-initiative structured dialog interaction c)The procedural knowledge to interpret the GCN to extract necessary information from the world 4.A capability to internally simulate actions, search forward for the solutions, and produce action commands to manipulate the environment to achieve the goals. 6

Characterization of Games that can be Learned Fully observable, deterministic, turn-based Playable with discrete actions – No multi-verb actions (like replace) Game encoded in current visual state – No rules based on history Game state defined by – locations – spatial constraints between those locations – pieces that occupy locations Covers many board games – Games such as Tic-Tac-Toe, Connect4, N Queens puzzle – Also games/puzzles that can be described as an isomorphism (Towers of Hanoi) 7

Major Contributions 1.A system that integrates the following components for end-to-end behavior for learning a subset of 2D grid based games 2.A method for acquiring grounded concepts of spatial relationships for prepositions 3.Game Concept Network (GCN) a)A representation of the game, including the problem space and goal/failure states b)The process to acquire the GCN through mixed-initiative structured dialog interaction c)Procedural knowledge to interpret the GCN to extract necessary information from the world 4.A capability to internally simulate actions, search forward for the solutions, and produce action commands to manipulate the environment to achieve the goals. 8

Prepositions for Spatial Relationships Prepositions are necessary for describing the spatial constraints of board games Concepts must be grounded in shared representation- simulator/real-world Basic Requirements – Learned with few examples – Cover basic prepositions between two objects in Euclidean space SVS primitives – Axis(X,Y,Z) alignment(aligned, greater than, less than) of two objects – Distance between objects along axes Can learn/represent prepositions such as – Left/right – Front/behind – Outside/inside – Near/far – Below/Above – Diagonal – Next to 9

Spatial relation representation y-aligned z-aligned x-greater than Z Y X “right of” “Inside” y-aligned z-aligned x-aligned “Above” y-greater-than z-aligned x-(any) Other potential compositions: “Next to” y-aligned z-aligned x-(less-than or greater-than) distance distance 10

Spatial Projection “Put the object to the right of the blue block.” Use average distance information to calculate XYZ projection coordinate – Randomly selects alignment if there are multiple possible alignments along an axis Critical for actions and for simulation Z Y X 11

Major Contributions 1.A system that integrates the following components for end-to-end behavior for learning a subset of 2D grid based games 2.A method for acquiring grounded concepts of spatial relationships for prepositions 3.Game Concept Network (GCN) a)A representation of the game, including the problem space and goal/failure states b)The process to acquire the GCN through mixed-initiative structured dialog interaction c)Procedural knowledge to interpret the GCN to extract necessary information from the world 4.A capability to internally simulate actions, search forward for the solutions, and produce action commands to manipulate the environment to achieve the goals. 12

Representing Tic-Tac-Toe What is a sufficient representation for playing Tic-Tac-Toe? Recognize legal actions – Place your piece (red) in an empty location Detect Goal – Three of your pieces on the board in a line (row, column, or diagonal) 13

Acquisition of GCN 14 Game A1 C1 Tic-Tac-Toe P1 blocklocation C11C12 S1red S2 inlocationtrueinblocktrue 1 2 parameters constraints 1 2 name action prep withnot prep withnot spatial attribute M: The game is tic-tac-toe. A: What is the name of a legal action in the game? M: place A: What is the verb associated with this action? M: move A: Tell me the name of a parameter for this action. M: a block A: What is a constraint for this parameter M: the block is red A: What is a constraint for this parameter M: the block is not in a location A: What is a constraint for this parameter M: finished. A: Tell me the name of another parameter M: a location A: What is a constraint for this parameter M: the location is not in a block place name move verb

Interpret Tic-Tac-Toe Index potential objects for each parameter Apply descriptive constraints Apply spatial constraints Construct full match sets 15

Simulating Tic-Tac-Toe 16 Visible WorldInternal SVS representation Goal Not DetectedGoal Detected!

Evaluation 1.GCN representation is sufficient to describe a variety of games 2.Grounded knowledge representation is sufficient for perceiving and acting in the real world 3.Knowledge acquisition is incremental, and transfers to other learning interactions 17

Games Learned Towers of Hanoi 18 Frogs and Toads puzzle 5-Puzzle Tic-Tac-Toe Connect-3 Bishop swap 4 Queens puzzle

Concepts learned for Games 19 GameSpatial- Prep(s) Action(s)GoalFailure Tic-Tac-Toein, linearplace3-in-a-rownone Connect-3in, linear, above stack, bottom-place3-in-a-rownone Towers-of- Hanoi in, above, smaller-than shift-stack, shift-to- bottom Right-sidenone 5-Puzzlein, diagonal, near slideMatching- location none Frogs and Toads left of, right of, in slide-left, slide-right, jump-left, jump-right Side-swapnone 4 Queensin, linearplaceAll-on-boardNone- attacking

Towers of Hanoi Demo 20

Incremental Knowledge Acquisition The interactive dialog can be long and tedious – Tediousness also do to the inflexibility of language use – Human instructional interactions can also be long/tedious As the agent acquires knowledge of concepts, like prepositions, this interaction becomes quicker Added additional capability to learn common concepts – For example action: place and goal: three-in-a-row – Relies on unique names for actions, goals, and failures 21

Results Experiment: Three games taught separately and sequentially 22

Tic-Tac-Toe after Queen-4 and Connect-3 If Tic-Tac-Toe is taught last, only 5 interactions are needed! M: the game is tic-tac-toe A: I do not know how to play tic-tac-toe. Tell me the name of a legal action in this game. M: place (learned in 4 Queens) A: Ok tell me the name of another legal action in this game, or finished if no more actions. M: finished A: Ok tell me the name of the goal in the game. M: three-in-a-row (learned in Connect-3) A: Ok tell me the name of a failure state in the game. (or none) Mentor: none A: Ok I have now learned the basics of the game. 23

Nuggets and Coals Nuggets Can learn and play many different games Works in real world environment Concepts transfer between games Coals Limitations in object permanence, preposition learning, verb learning Currently limited to 2-Dimensional board games Iterative deepening insufficient for handling many games/puzzles 24

Questions? 25

References Barbu, A.; Narayanaswamy, S.; and Siskind, J. M Learning physically-instantiated game play through visual observation. In Proc. of ICRA’10, 1879–1886. Genesereth, M., and Love, N General game playing: Game description language speciﬁcation. Technical report, Computer Science Department, Stanford University, Stanford, CA, USA. Genesereth, M. and Love, N. General game playing: Overview of the AAAI competition. AI Magazine, 26(2), Hinrichs, T., and Forbus, K Learning Game Strategies by Experimentation. Paper presented atthe IJCAI-09 Workshop on Learning Structural Knowledge from Observations. Pasadena, CA, July 12. Kaiser, Ł. Learning Games from Videos Guided by Descriptive Complexity. In Proceedings of the 26th Conference on Artificial Intelligence, AAAI-12, pp. 963–970. AAAI Press, Laird, J. (2012). The Soar cognitive architecture. Cambridge, MA: MIT Press. Mohan, S., Mininger, A., Kirk, J., & Laird, J. (2012). Acquiring Grounded Representation of Words with Situated Interactive Instruction. Advances in Cognitive Systems. Roy, D. (2005). Grounding words in perception and action: computational insights. Trends in Cognitive Sciences, 9, 389–396. Thielscher., M. A general game description language for incomplete information games. In Proc. of AAAI, 994–999, Thielscher, M. 2011a. The general game playing description language is universal. In Proceedings of IJCAI. Thielscher, M. (2011). General Game Playing in AI Research and Education. In J. Bach & S. Edelkamp (Eds.), Proceedings of the German Annual Conference on Artiﬁcial Intelligence (KI) (Vol. 7006, pp. 26–37). Berlin, Germany: Springer 26

Extra slides 27

N Queens Game 28 4 Queens puzzle: Place each queen(blue object) on the board so that none are attacking. Border locations reduce specification complexity.

5 Puzzle 29 5 puzzle: Slide pieces so that they end in their matching location (here: color). Can express adjacent relationship for slide action with multiple prepositions.

Connect-3 30 Connect-3: Another game described with an isomorphism like Towers of Hanoi