Competence-Preserving Retention of Learned Knowledge in Soar’s Working and Procedural Memories Nate Derbinsky, John E. Laird University of Michigan.

Slides:

Advertisements

Similar presentations

Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona USA Modeling Social Cognition in a Unified Cognitive Architecture.

Advertisements

Bob Marinier 27 th Soar Workshop May 24, SESAME is a theory of human cognition Stephan Kaplan (University of Michigan) Modeled at the connectionist.

Learning To Use Memory Nick Gorski & John Laird Soar Workshop 2011.

Integrated Episodic and Semantic Memory in Robotics Steve Furtwangler, with Robert Marinier, Jacob Crossman.

Playing Dice with Soar Soar Workshop June 2011 John E. Laird Nate Derbinsky, Miller Tinkerhess University of Michigan 1.

DARPA Mobile Autonomous Robot SoftwareMay Adaptive Intelligent Mobile Robotics William D. Smart, Presenter Leslie Pack Kaelbling, PI Artificial.

Introduction to SOAR Based on “a gentle introduction to soar: an Architecture for Human Cognition” by Jill Fain Lehman, John Laird, Paul Rosenbloom. Presented.

1 Episodic Memory for Soar Andrew Nuxoll 15 June 2005.

12 June, STD( ): learning state temporal differences with TD( ) Lex Weaver Department of Computer Science Australian National University Jonathan.

1 Chunking with Confidence John Laird University of Michigan June 17, th Soar Workshop

1 Learning from Behavior Performances vs Abstract Behavior Descriptions Tolga Konik University of Michigan.

1 Episodic Memory for Soar Agents Andrew Nuxoll 11 June 2004.

Impact of Working Memory Activation on Agent Design John Laird, University of Michigan 1.

1 Soar Semantic Memory Yongjia Wang University of Michigan.

The Importance of Architecture for Achieving Human-level AI John Laird University of Michigan June 17, th Soar Workshop

A Soar’s Eye View of ACT-R John Laird 24 th Soar Workshop June 11, 2004.

Soar-RL Discussion: Future Directions, Open Questions, and Why You Should Use It 31 st Soar Workshop 1.

Polyscheme John Laird February 21, Major Observations Polyscheme is a FRAMEWORK not an architecture – Explicitly does not commit to specific primitives.

Models of Human Performance Dr. Chris Baber. 2 Objectives Introduce theory-based models for predicting human performance Introduce competence-based models.

Intelligent Agents revisited.

Soar-RL: Reinforcement Learning and Soar Shelley Nason.

Mazin Assanie University of Michigan Soar 9.5 Beta and Explanation-Based Chunking.

Curious Characters in Multiuser Games: A Study in Motivated Reinforcement Learning for Creative Behavior Policies * Mary Lou Maher University of Sydney.

Reinforcement Learning in the Presence of Hidden States Andrew Howard Andrew Arnold {ah679

COMPUTATIONAL MODELING OF INTEGRATED COGNITION AND EMOTION Bob MarinierUniversity of Michigan.

Conference Paper by: Bikramjit Banerjee University of Southern Mississippi From the Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence.

Learning Prepositions for Spatial Relationships in BOLT Soar Workshop 2012 James Kirk, John Laird 6/21/

Reinforcement Learning

A Multi-Domain Evaluation of Scaling in Soar’s Episodic Memory Nate Derbinsky, Justin Li, John E. Laird University of Michigan.

Interpreted Declarative Representations of Task Knowledge June 21, 2012 Randolph M. Jones, PhD.

New SVS Implementation Joseph Xu Soar Workshop 31 June 2011.

Using Soar for an indoor robotic search mission Scott Hanford Penn State University Applied Research Lab 1.

Efficient Interaction Strategies for Adaptive Reminding Julie S. Weber & Martha E. Pollack Adaptive Reminder Generation SignalingIntended Approach Learning.

Soar Mazin Assanie, John Laird, Joseph Xu 1.

Integrating Background Knowledge and Reinforcement Learning for Action Selection John E. Laird Nate Derbinsky Miller Tinkerhess.

Bob Marinier Advisor: John Laird Functional Contributions of Emotion to Artificial Intelligence.

Soar Technology, Inc. Proprietary 6/4/ Using Soar to Teach Probabilistic Reasoning Jim Thomas 6/5/2013.

Future Memory Research in Soar Nate Derbinsky University of Michigan.

Verve: A General Purpose Open Source Reinforcement Learning Toolkit Tyler Streeter, James Oliver, & Adrian Sannier ASME IDETC & CIE, September 13, 2006.

Curiosity-Driven Exploration with Planning Trajectories Tyler Streeter PhD Student, Human Computer Interaction Iowa State University

POMDPs: 5 Reward Shaping: 4 Intrinsic RL: 4 Function Approximation: 3.

OntoSoar: Soar Finds Facts in Text Peter Lindes, Deryle Lonsdale, David Embley Brigham Young University 33 rd Soar Workshop, June 2013 pl 6/6/201333rd.

Spring, 2005 CSE391 – Lecture 1 1 Introduction to Artificial Intelligence Martha Palmer CSE391 Spring, 2005.

Cognitive Architectures For Physical Agents Sara Bolduc Smith College CSC 290.

SOAR A cognitive architecture By: Majid Ali Khan.

Beyond Chunking: Learning in Soar March 22, 2003 John E. Laird Shelley Nason, Andrew Nuxoll and a cast of many others University of Michigan.

1 Learning through Interactive Behavior Specifications Tolga Konik CSLI, Stanford University Douglas Pearson Three Penny Software John Laird University.

Stochastic Grammars: Overview Representation: Stochastic grammar Representation: Stochastic grammar Terminals: object interactions Terminals: object interactions.

Learning Behavioral Parameterization Using Spatio-Temporal Case-Based Reasoning Maxim Likhachev, Michael Kaess, and Ronald C. Arkin Mobile Robot Laboratory.

Learning Procedural Knowledge through Observation -Michael van Lent, John E. Laird – 인터넷 기술 전공 022ITI02 성유진.

Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.

Cognitive Modeling Cogs 4961, Cogs 6967 Psyc 4510 CSCI 4960 Mike Schoelles

Use of Soar for Modeling Cyber Operations 36 th Soar Workshop Ann Arbor, Michigan Denise Nicholson, Ph.D., Director of X Ryan O’Grady, Software Engineer.

Action Modeling with Graph-Based Version Spaces in Soar

Knowledge Representation and Reasoning

Cognitive Language Processing for Rosie

Learning Fast and Slow John E. Laird

Miller Tinkerhess University of Michigan

Knowledge Representation and Reasoning

Intelligent Adaptive Mobile Robots

Soar 9.6.0’s Instance-Based Model of Semantic Memory

John Laird, Nate Derbinsky , Jonathan Voigt University of Michigan

Cognitive Language Comprehension in Rosie

John E. Laird 32nd Soar Workshop

Spreading With Edge Weights

Joseph Xu Soar Workshop 31 June 2011

Bryan Stearns University of Michigan Soar Workshop - May 2018

Bryan Stearns University of Michigan Soar Workshop - May 2018

Symbolic cognitive architectures

An Integrated Theory of the Mind

Presentation transcript:

Competence-Preserving Retention of Learned Knowledge in Soar’s Working and Procedural Memories Nate Derbinsky, John E. Laird University of Michigan

Motivation Goal. Agents that exhibit human-level intelligence and persist autonomously for long periods of time (days – months). Problem. Extended tasks that involve amassing large amounts of knowledge can lead to performance degradation. 20 June 2012Soar Workshop Ann Arbor, MI2

Common Approach Forgetting. Selectively retain learned knowledge. Challenge. Balance… – agent task competence & – computational resource growth across a variety of tasks. 20 June 2012Soar Workshop Ann Arbor, MI3

This Work Hypothesis. Useful to forget a memory if… 1.not useful (via base-level activation) & 2.likely can reconstruct if necessary Evaluation. 2 complex tasks, 2 memories in Soar 20 June 2012Soar Workshop Ann Arbor, MI4 Mobile Robot Navigation Working Memory bounds decision time completes task  1 hour Multi-Player Dice Procedural Memory 50% memory reduction competitive play  days Task Independent; Implemented in Soar v9.3.2

Base-Level Activation (Anderson et al., 2004) Predict future usage via history Used to bias ambiguous semantic- memory retrievals (AAAI ‘11) 20 June θ tdtd E FFICIENT & C ORRECT O(# memories) Soar Workshop Ann Arbor, MI 2-phase prediction of future decay Novel approximation Binary parameter search

Task #1: Mobile Robotics Simulated Exploration & Patrol – 3 rd floor, BBB Building, UM 110 rooms 100 doorways – Builds map in memory from experience 20 June 20126Soar Workshop Ann Arbor, MI

Problem: Decision Time Issue. Large working memory – Minor: rule matching (Forgy, 1982) – Major: episodic reconstruction episode size ~ working-memory size Forgetting Policy. Memory hierarchy 1.Automatically remove from WM the o-supported augmentations of LTIs that have not been tested recently/frequently (all or nothing w.r.t. LTI) 2.Agent deliberately performs retrieve commands from SMem as necessary 20 June 20127Soar Workshop Ann Arbor, MI

Map Knowledge Room Features Position, size Walls, doorways Objects Waypoints 20 June Usage Exploration (-->SMem) Planning/navigation (<--SMem) Soar Workshop Ann Arbor, MI

Results: Working-Memory Size 20 June No Forgetting Rules BLA: d=0.3 BLA: d=0.4 BLA: d=0.5 Soar Workshop Ann Arbor, MI

Results: Decision Time 20 June No Forgetting Rules BLA: d=0.3 BLA: d=0.4 BLA: d=0.5 Soar Workshop Ann Arbor, MI

Task #2: Liar’s Dice Complex rules, hidden state, stochasticity – Rampant uncertainty Agent learns via reinforcement learning (RL) – Large state space ( for 2-4 players) 20 June Soar Workshop Ann Arbor, MI

Reasoning --> Action Knowledge 20 June State Bid 6 4’s Bid 3 1’s Challenge! Soar Workshop Ann Arbor, MI

Problem: Memory Growth Issue. RL value-function representation: (s,a)-># – Soar: procedural knowledge (RL rules) – Many possible actions per turn; at most feedback for a single action -> many RL rules representing redundant knowledge Forgetting Policy. Keep what you can’t reconstruct 1.Automatically excise RL chunks that have not been updated via RL and haven’t fired recently/frequently 2.New chunks are learned via reasoning as necessary 20 June Soar Workshop Ann Arbor, MI

Forgetting Action Knowledge 20 June State Bid 6 4’s Bid 3 1’s Challenge! … … Soar Workshop Ann Arbor, MI

Results: Memory Usage 20 June (Kennedy & Trafton ‘07) + BLA Max. Dec. Time = 6 msec. Soar Workshop Ann Arbor, MI

Results: Competence 20 June ~(KT ’07) Soar Workshop Ann Arbor, MI

Evaluation Nuggets Pragmatic forgetting policies for Soar: extends space and temporal extent of domains – Implemented in Soar v9.3.2 Efficient forgetting code can be applied to any instance- based memory Coal Limited domain evaluation Yet another set of parameters (d, θ) x 2 EpMem/SMem? 20 June 2012Soar Workshop Ann Arbor, MI17 For more details, see two papers in proceedings of ICCM 2012 For more details, see two papers in proceedings of ICCM 2012