Mazin Assanie University of Michigan Soar 9.5 Beta and Explanation-Based Chunking.

Slides:



Advertisements
Similar presentations
An Indispensable Quality Assurance Tool for Dairy Processing Plants.
Advertisements

Information Processing Technology Office Learning Workshop April 12, 2004 Seedling Overview Learning Hierarchical Reactive Skills from Reasoning and Experience.
Cookies, Sessions. Server Side Includes You can insert the content of one file into another file before the server executes it, with the require() function.
How did we get here? (CMIS v0.5) F2F, January 2009.
Introducing Constrained Heuristic Search to the Soar Cognitive Architecture (demonstrating domain independent learning in Soar) The Second Conference on.
Software Development Life Cycle
Templates and Styles Excel Advanced. Templates are pre- designed and formatted spreadsheets –They provide consistency of layout/structure –They.
Feature requests for Case Manager By Spar Nord Bank A/S IBM Insight 2014 Spar Nord Bank A/S1.
Searching Kruse and Ryba Ch and 9.6. Problem: Search We are given a list of records. Each record has an associated key. Give efficient algorithm.
Specialized Understanding of Mathematics: A Study of Prospective Elementary Teachers Meg Moss.
Raster Based GIS Analysis
The National Geography Standards
Impact of Working Memory Activation on Agent Design John Laird, University of Michigan 1.
Knowledge in intelligent systems So far, we’ve used relatively specialized, naïve agents. How can we build agents that incorporate knowledge and a memory?
1 Soar Semantic Memory Yongjia Wang University of Michigan.
Label in the Margin System
Ai in game programming it university of copenhagen Reinforcement Learning [Intro] Marco Loog.
A New Soar Debugger in Java Douglas Pearson ThreePenny Software
Abstraction and ACT-R. Outline Motivations for the Theory –Architecture –Abstraction Production Rules ACT-R Architecture of Cognition.
Instance Based Learning Bob Durrant School of Computer Science University of Birmingham (Slides: Dr Ata Kabán) 1.
Soar-RL: Reinforcement Learning and Soar Shelley Nason.
Arbitrage in Combinatorial Exchanges Andrew Gilpin and Tuomas Sandholm Carnegie Mellon University Computer Science Department.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
From C++ to C#. Web programming The course is on web programming using ASP.Net and C# The course is on web programming using ASP.Net and C# ASP.Net is.
1 Machine Learning: Lecture 11 Analytical Learning / Explanation-Based Learning (Based on Chapter 11 of Mitchell, T., Machine Learning, 1997)
Session 12 Applying the Class Diagram to the Case Study Written by Thomas A. Pender Published by Wiley Publishing, Inc. October 12, 2011 Presented by Hyewon.
A Multi-Domain Evaluation of Scaling in Soar’s Episodic Memory Nate Derbinsky, Justin Li, John E. Laird University of Michigan.
Interpreted Declarative Representations of Task Knowledge June 21, 2012 Randolph M. Jones, PhD.
New SVS Implementation Joseph Xu Soar Workshop 31 June 2011.
Critical Chain From “Theory of Constraints” Developed by Elihu Goldratt in 1984 Goldratt – Many of his key ideas were first explained via “business novels”
Soar Mazin Assanie, John Laird, Joseph Xu 1.
Problem of the Day  Why are manhole covers round?
DEBUGGING. BUG A software bug is an error, flaw, failure, or fault in a computer program or system that causes it to produce an incorrect or unexpected.
ES Design, Development and Operation Dr. Ahmed Elfaig Knowledge model, knowledge structure, presentation and organization are the bottleneck of expert.
GRASP: Designing Objects with Responsibilities
ES Model development Dr. Ahmed Elfaig The ES attempts to predict results from available information, data and knowledge The model should be able to infer.
The development of common partnership and interuniversitary scientific research according to the principle of sustainable development TRAINING COURSE Romania,
Survey Propagation. Outline Survey Propagation: an algorithm for satisfiability 1 – Warning Propagation – Belief Propagation – Survey Propagation Survey.
File Organization and Processing Week Tree Tree.
Intro to Computation and AI Dr. Jill Fain Lehman School of Computer Science Lecture 6: December 4, 1997.
Machine Learning Chapter 5. Artificial IntelligenceChapter 52 Learning 1. Rote learning rote( โรท ) n. วิถีทาง, ทางเดิน, วิธีการตามปกติ, (by rote จากความทรงจำ.
QA and Testing. QA Activity Processes monitoring Standards compliance monitoring Software testing Infrastructure testing Documentation testing Usability.
CS1101: Programming Methodology Aaron Tan.
Automated Reasoning Early AI explored how to automated several reasoning tasks – these were solved by what we might call weak problem solving methods as.
 A plan of attack for your games content  Or (more specifically)  A detailed description of all games mechanics, objects, characters, stats, ect… that.
COMP3241 E-Commerce Technologies Richard Henson University of Worcester November 2014.
Chp. 1 - Managers & Management
CpSc 810: Machine Learning Analytical learning. 2 Copy Right Notice Most slides in this presentation are adopted from slides of text book and various.
TRIALS AND TRIBULATIONS Architectural Constraints on Modeling a Visuomotor Task within the Reinforcement Learning Paradigm.
Soar Expert System Tools Team W09 Daniel Nelson, Emily Schwarz, and Sean Lydon.
Autonomous Mission Management of Unmanned Vehicles using Soar Scott Hanford Penn State Applied Research Lab Distribution A Approved for Public Release;
Competence-Preserving Retention of Learned Knowledge in Soar’s Working and Procedural Memories Nate Derbinsky, John E. Laird University of Michigan.
3/14/20161 SOAR CIS 479/579 Bruce R. Maxim UM-Dearborn.
Lists in Python Lists as Arguments/Parameters. Lists as arguments to functions Just like other data types, lists can be sent to functions as arguments.
Action Modeling with Graph-Based Version Spaces in Soar
Modeling Primitive Skill Elements in Soar
GALLERIA DIAMOND JSWT AGENDA
Phil Tayco Slide version 1.0 Created Sep 18, 2017
Games Programming in Scratch
Soar 9.6.0’s Instance-Based Model of Semantic Memory
ICOTS Helpdesk Training
Routing and Logistics Arc Routing 2018/11/19.
Forward Chaining DFS can supply compatible bindings for forward chaining.
Spreading With Edge Weights
Announcements Homework 3 due today (grace period through Friday)
Human Knot/Cross the Line Cornell Notes-Claim, Evidence and Reasoning
Data Structures & Algorithms
1D Arrays and Lots of Brackets
Lecture 9 Using Objects.
1D Arrays and Lots of Brackets
Presentation transcript:

Mazin Assanie University of Michigan Soar 9.5 Beta and Explanation-Based Chunking

Previously on Soar Releases… At the 2014 Soar Workshop, we announced three releases for the upcoming year: – (June 2014), (October 2014), 9.5.beta (now) What’s new? Explanation-based chunking GQ-Lambda reinforcement learning policy Bug fixes and lots of more technical changes 2

Explanation-Based Chunking Motivation Chunking’s utility was limited in many domains because it was very easy for agents to learn a large number of overly-specific rules. The problem occurs because chunking is not able to generalize knowledge involving numbers and strings. 3

A Chunk sp {chunk*9.4.0 :chunk (state ^operator ) ( ^name fill) ( ^fill-jug ) ( ^filled-jug yes) ( ^picked-up yes) ( ^volume 5) ( ^contents 3) --> ( ^picked-up yes -) ( ^filled-jug yes -) ( ^contents 5 +) ( ^contents 3 -)} 4

Very Specific sp {chunk*9.4.0 :chunk (state ^operator ) ( ^name fill) ( ^fill-jug ) ( ^filled-jug yes) ( ^picked-up yes) ( ^volume 5) ( ^contents 3) --> ( ^picked-up yes -) ( ^filled-jug yes -) ( ^contents 5 +) ( ^contents 3 -)} 5

Chunk Comparison sp {chunk*9.4.0 :chunk (state ^operator ) ( ^name fill) ( ^fill-jug ) ( ^filled-jug yes) ( ^picked-up yes) ( ^volume 5) ( ^contents 3) --> ( ^picked-up yes -) ( ^filled-jug yes -) ( ^contents 5 +) ( ^contents 3 -) ( ^rhs 8 +)} sp {chunk*9.5 :chunk (state ^operator ) ( ^name ) ( ^fill-jug ) ( ^filled-jug yes) ( ^picked-up yes) ( ^volume {> }) ( ^contents ) --> ( ^picked-up yes -) ( ^filled-jug yes -) ( ^contents +) ( ^contents -) ( ^rhs (+ ) +)} 6 Chunk learned in Soar 9.4.0What we want

Chunk Comparison sp {chunk*9.4.0 :chunk (state ^operator ) ( ^name fill) ( ^fill-jug ) ( ^filled-jug yes) ( ^picked-up yes) ( ^volume 5) ( ^contents 3) --> ( ^picked-up yes -) ( ^filled-jug yes -) ( ^contents 5 +) ( ^contents 3 -) ( ^rhs 8 +)} sp {chunk*9.5 :chunk (state ^operator ) ( ^name ) ( ^fill-jug ) ( ^filled-jug yes) ( ^picked-up yes) ( ^volume {> }) ( ^contents ) --> ( ^picked-up yes -) ( ^filled-jug yes -) ( ^contents +) ( ^contents -) ( ^rhs (+ ) +)} 7 Chunk learned in Soar What we want

How does EBC differ from chunking? Chunking learned all its knowledge purely by analyzing the working memory trace. 8

9

How does EBC differ from chunking? EBC learns more general knowledge by also analyzing the explanation trace. – Original human-written rules are superimposed over the WME trace to create the explanation trace. 10

11

12

13

Why call it explanation-based? First, the rules explain the reasons why things matched and hence why they occurred in the problem-solving. The relationships between elements in different conditions What constraints on values had to be met 14

Why call it explanation-based? The rules also explain how relationships and constraints in one rule affect affect relationships and constraints in other rules Via the connection between a right-hand side action in one rule to the working memory element it created to a condition in another rule that later matched the working memory element. 15

How Does EBC Work? EBC analyzes the explanation trace to build four sets of mappings that are needed to achieve these types of chunks 1.Identity sets 2.Identity unification sets 3.A constraint set 4.A literalization set 16

1. Identity A set of variablizable elements in an instantiation that must have the same value They had the same variable in the original rule EBC assigns an instantiation-specific id for each element in an identity set 17

2. Identity unification sets A set of identity sets in a trace that must have the same value EBC builds a mapping from identities to identity sets while Soar backtraces through the working memory trace. Uses propagation rules talk won’t cover. 18

3. Identity Literalization Set A set of identities in a trace that must have some literal value – Technically, a very large set in most agents, because most attributes in rules are literals – EBC handles this efficiently by propagating a null identity unification set 19

4. The Constraint Set The set of all constraints that needed to be met for the problem-solving to occur. – These are constraints on identity unification sets 20

Soar 9.5 EBC Summary 1.Creates an explanation trace 2.Assigns identities to identity unification sets Using identity propagation rules 3.Builds up a constraint set 4.Attaches constraints 5.Variablizes elements in condition based on membership in identity unification sets Items in the null literalization set retain their match value 6.Cleans up chunk Removes ungrounded STIs and merges certain conditions 21

Nuggets We got it to do everything we wanted it to do, and are excited about trying it on many of our agents. Fixed all known bugs we’ve seen so far and few long-standing general bugs Has been tested with complex game learning agents Should not require changes to agents 22

Coals Just started analyzing and improving performance, so there’s a hit right now. – We’ve already improved it to the point where we’re at least in the ball park. – Does affect performance when learning is off. Was expected and necessary Finished last target feature and bug fixes this week. – No documentation yet – No command-line explanation mechanism 23