Bob Marinier Based on paper by Lehman, Laird, Rosenbloom NSF, DARPA, ONR University of Michigan April 18, 2008.

Slides:



Advertisements
Similar presentations
Computer Networks TCP/IP Protocol Suite.
Advertisements

Sugar 2.0 Formal Specification Language D ana F isman 1,2 Cindy Eisner 1 1 IBM Haifa Research Laboratory 1 IBM Haifa Research Laboratory 2 Weizmann Institute.
Reinforcement Learning
Logical Model and Specification of Usage Control Xinwen Zhang, Jaehong Park Francesco Parisi-Presicce, Ravi Sandhu George Mason University.
A Logic Specification for Usage Control Xinwen Zhang, Jaehong Park Francesco Parisi-Presicce, Ravi Sandhu George Mason University SACMAT 2004.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 1 Embedded Computing.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Right Brain vs. Left Brain. Directions Get a blank sheet of lined paper. Every time you read a description or characteristic that applies to you, write.
UNITED NATIONS Shipment Details Report – January 2006.
By John E. Hopcroft, Rajeev Motwani and Jeffrey D. Ullman
1 Hyades Command Routing Message flow and data translation.
Importance-Driven Focus of Attention and Meister Eduard Gröller 1 1 Vienna University of Technology, Austria 2 University of Girona, Spain 3 University.
Writing Pseudocode And Making a Flow Chart A Number Guessing Game
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
3. Sensorimotor Intelligence
Year 6 mental test 5 second questions
Year 6 mental test 10 second questions
Baseball Umpire Training
Solve Multi-step Equations
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
Bob Marinier 27 th Soar Workshop May 24, SESAME is a theory of human cognition Stephan Kaplan (University of Michigan) Modeled at the connectionist.
Chapter 11: Models of Computation
David Luebke 1 6/7/2014 ITCS 6114 Skip Lists Hashing.
By Waqas Over the many years the people have studied software-development approaches to figure out which approaches are quickest, cheapest, most.
Cognitive Systems, ICANN panel, Q1 What is machine intelligence, as beyond pattern matching, classification and prediction. What is machine intelligence,
VOORBLAD.
Name Convolutional codes Tomashevich Victor. Name- 2 - Introduction Convolutional codes map information to code bits sequentially by convolving a sequence.
COMPUTER INTERFACES.
How to convert a left linear grammar to a right linear grammar
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Welcome To The Four Umpire System. © 2007 Softball Canada All Rights Reserved Softball Canada Four Umpire System FP & SP 2 Tips for the Four Umpire System.
Welcome To The Three Umpire System. © 2007 Softball Canada All Rights Reserved Softball Canada Three Umpire System FP 2 Tips for the Three Umpire System.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Executional Architecture
Global Analysis and Distributed Systems Software Architecture Lecture # 5-6.
25 seconds left…...
H to shape fully developed personality to shape fully developed personality for successful application in life for successful.
Januar MDMDFSSMDMDFSSS
OFFICIAL BASKETBALL RULES
A Cognitive Architecture Theory of Comprehension and Appraisal: Unifying Cognitive Functions and Appraisal Bob Marinier John Laird University of Michigan.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
1 Soar Emote Bob Marinier John Laird University of Michigan.
Instructor: Shengyu Zhang 1. Content Two problems  Minimum Spanning Tree  Huffman encoding One approach: greedy algorithms 2.
From Model-based to Model-driven Design of User Interfaces.
The Basics of Baseball for middle school physical education class’s
Slow Pitch Softball Origin: Slow Pitch softball was created by George Hancock in 1887 in Chicago, IL. Equipment: Glove Bat 12” Ball ( 11” ball also used)
Outline Introduction Soar (State operator and result) Architecture
Introduction to SOAR Based on “a gentle introduction to soar: an Architecture for Human Cognition” by Jill Fain Lehman, John Laird, Paul Rosenbloom. Presented.
The Importance of Architecture for Achieving Human-level AI John Laird University of Michigan June 17, th Soar Workshop
COMPUTATIONAL MODELING OF INTEGRATED COGNITION AND EMOTION Bob MarinierUniversity of Michigan.
Bob Marinier Advisor: John Laird Functional Contributions of Emotion to Artificial Intelligence.
Baseball Quiz Question 1: How many outs are in a half an inning in Baseball A). 6 B). 3 C). 2 D). 5.
The Basics of Baseball for middle school physical education class’s
SOAR 1/18/2019.
Presentation transcript:

Bob Marinier Based on paper by Lehman, Laird, Rosenbloom NSF, DARPA, ONR University of Michigan April 18, 2008

2

3

4

Create separate systems for each capability –e.g., language, planning, learning, etc. Create single system of simpler mechanisms out of which these capabilities arise –Cognitive architecture –Inspired by psychology (study of how people think) 5

Computer architectures –Differ in processor type, memory size, commands, etc. –Differences reflect designs intended to be optimal under different assumptions about usage 6

7 APPLICATION: word processor TASK: write a paper HARDWARE: PC architecture for content for BEHAVIOR = ARCHITECTURE + CONTENT

A theory of the fixed mechanisms and structures that underlie human cognition Said another way: A theory of what is common to the wide array of behaviors we think of as intelligent Soar is one such theory (there are others) –Soar is a computational theory (it actually runs on computers) 8

Goal-oriented Takes place in rich, complex, detailed environment Requires a large amount of knowledge Requires use of symbols and abstractions Flexible, and a function of the environment Requires learning from the environment and experience 9

10 Pitcher (Joe) Batter (Sam) First baseman Second baseman Short stop Third baseman Catcher Left-fielder Center-fielder Right-fielder

Behaves in goal-oriented manner –Joe’s goal is to win the game –He adopts several subgoals to help him achieve this Operates in a rich, complex, detailed environment –Positions and movements of the players, current state of the game, etc. Uses a large amount of knowledge –Choosing the pitch draws on his own pitching record, Sam’s batting record, etc. Behaves flexibly as a function of the environment –Choosing the pitch takes into account handed-ness of the batter, etc. –When pitch is hit, Joe must change his subgoal to respond to the new situation Uses symbols and abstractions –Since Joe has never played this particular game before, he must draw on previous experience by abstracting away from this day and place Learns from environment and experience –Joe needs to learn from this experience in order to do better when Sam bats in the future 11

K1: Knowledge of the objects in the game –E.g., baseball, infield, base line, inning, out, etc. K2: Knowledge of abstract events and particular episodes –E.g., how batters hit, how this guy batted last time K3: Knowledge of rules of the game –E.g., number of outs, balk, infield fly K4: Knowledge of objectives –E.g., get batter out, throw strikes K5: Knowledge of actions or methods for attaining objectives –E.g., use a curve ball, throw to first, walk batter K6: Knowledge of when to choose actions or methods –E.g., if behind in the count, throw a fast ball K7: Knowledge of the component physical actions –E.g., how to throw a curve ball, catch, run 12

Knowledge is organized as a sequence of decisions through a problem space 13 Joe is standing on the mound. Sam is at bat. Joe has the goal of getting Sam out. He chooses a curve ball He chooses a fast ball He chooses a slider Strike… Hit Foul… Ball… Strike Hit… Foul Ball… Strike… Single Foul… Homerun Joe faces the next batter He chooses another fast ball He changes to a curve ball He catches it … … … …

14 Initial state f1 v1 f2 v2 S0S0 f1 v1 f2 v1 S1S1 f1 v2 f2 v2 S2S2 f1 v2 f2 v1 S3S3 f1 v5 f2 v1 S 12 f1 v3 f2 v6 S 91 f1 v4 f2 v8 S 30 f1 v2 f2 v6 S 80 … Goal state operator

15 Initial state f1 v1 f2 v2 S0S0 f1 v1 f2 v1 S1S1 f1 v5 f2 v1 S 12 f1 v3 f2 v6 S 91 Goal state operator

16 Current State: batter name Sam batter status not out balls 0 strikes 0 outs 0 … goal batter out problem space pitch Operator: throw-curve

How do operators get proposed and compared? –Knowledge determines when an operator is relevant to the current goal and state Joe’s goal is to get the batter out, and he’s the pitcher, so his available operators are kinds of pitches Knowledge represented in the state may influence the choice of pitch (e.g., is the batter right or left handed) 17

How are operators selected? –Principle of Rationality “If an agent has knowledge that an operator application will lead to one of its goals then the agent will select that operator” –That is: Rational agents behave in a goal-oriented way 18

How is a selected operator applied? –Can execute operator in external world Joe throws a pitch –Can result in internal changes to the state Joe thinks about throwing a pitch –Using states and operators allows us to model both acting and thinking as a function of knowledge 19

How do we know if execution of an operator has achieved the goal? –In baseball, rely on knowledge of rules of the game –Can also have external signals (e.g., umpire) How do goals and problem spaces change over time? –Via the application of operators 20

So how do we: –Represent knowledge so agent acts in a goal-oriented way? –Represent knowledge in a way that is independent of baseball? Represent knowledge in terms of problem spaces, goals, states, and operators Guide operator choice by the principle of rationality 21

What are the architectural processes for using knowledge to create and change states and operators? 22

23 Long-term (Procedural) Memory Short-term (Working) Memory Some knowledge is not specific to the current situation… …and some is

24 Working Memory Procedural Memory Decision Procedure PerceptionAction

IF I am the pitcher, the other team is at bat, and I perceive that I am at the mound THEN suggest a goal to get the batter out via pitching (Pitch). IF the problem space is to Pitch and I perceive a new batter who is left/right handed THEN add batter not out, balls 0, strikes 0, and batter left/right- handed to the state. IF the problem space is to Pitch and the batter is not out THEN suggest the throw-curve-ball operator. 25

(r1) If I am the pitcher, the other team is at bat, and I perceive that I am at the mound then suggest a goal to get the batter out via pitching (Pitch). (r2) If the problem space is to Pitch and I perceive a new batter who is left/right handed then add batter not out, balls 0, strikes 0, and batter left/right-handed to the state. (r3) If the problem space is to Pitch and the batter is not out then suggest the throw-curve-ball operator. (r4) If the problem space is to Pitch and the batter is not out and the batter is left-handed then suggest the throw-fast-ball operator. (r5) If both throw-fast-ball and throw-curve-ball are suggested then consider throw-curve-ball to be better than throw-fast-ball. (r6) If the throw-curve-ball operator has been selected then send throw-curve to the motor system and add pitch thrown to the state. (r7) If the pitch was thrown and I perceive that it was called a ball, then increment the balls count. (r8) If the pitch was thrown and I perceive that it was called a strike, then increment the strikes count. (r9) If the pitch was thrown and I perceive a hit then add pitch hit to the state 26

Each matching rule “maps” from current goal, state and operator to changes to those objects There can be dependencies among rules –Can’t choose a pitch until you’ve decided to pitch to the batter, which you can’t do if you’re not on the mound, etc. –Soar doesn’t recognize dependencies; it just “fires” rules as they match All parts of rules are expressed in terms of perceptions, actions, states and operators 27

Elaboration Decide Application Input Output Working Memory Procedural Memory Decision Procedure PerceptionAction

29 Elaboration Decide Application Input Output Procedural Memory (rules) Perception/Action Interface Working Memory Get- batter-out Pitch 0/0 left … New batter Decision Procedure throw-fast throw-curve Throw curve 0/0 left … throw-curve Hit

How is (ST) knowledge represented in the state? –As sets of features and values How is general (LT) knowledge represented? –Rules that map one set of feature-values to another What are the architectural processes for using knowledge in LTM? –Decision cycle (input, elaborate, decide, apply, output) What are the mechanisms for interacting with the world? –Perception and action go through interfaces embedded in the decision cycle 30

Soar also has ways to deal with a lack of knowledge, including learning Recent work on Soar has focused on new mechanisms to accommodate new kinds of problems –New long-term memories with different properties –New learning mechanisms –Non-symbolic ways of representing knowledge 31

Learn from rewards –Reinforcement learning Learn facts –What you know –Semantic memory Learn events –What you remember –Episodic memory Basic drives and … –Emotions, feelings, mood Non-symbolic reasoning –Mental imagery Working memory relevance –Activation Learn from regularities –Spatial and temporal clusters 32 Symbolic Long-Term Memories Procedural Symbolic Short-Term Memory Decision Procedure Chunking Reinforcement Learning Semantic Learning Episodic Learning Perception Action Visual Imagery Feeling Generation Reinforcement Learning Clustering

Soar homepage – –Read the full Gentle Introduction to Soar –Download Soar and tutorials 28 th Soar Workshop –May 5-7 (Mon-Wed) in Ann Arbor –Invited speakers on Cognitive Robotics Greg Trafton (NRL) and Paul Benjamin (Pace University) –It’s not too late to register! – John Laird’s new book: The Soar Cognitive Architecture (due out Summer 2009) 33