The imperfect brain: Rationality and the limits of knowledge Quentin Huys Wellcome Trust Neuroimaging Centre Gatsby Computational Neuroscience Unit Medical.

Slides:



Advertisements
Similar presentations
Adversarial Search Chapter 6 Sections 1 – 4. Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
Advertisements

Adversarial Search Chapter 6 Section 1 – 4. Types of Games.
Alpha-Beta Search. 2 Two-player games The object of a search is to find a path from the starting position to a goal position In a puzzle-type problem,
1 1 Deep Thought BA 592 Lesson I.3 Sequential Move Theory At first I thought, if I were Superman, a perfect secret identity would be “Clark Kent, Dentist”,
An Exploration of Decision Processes in an Evolutionary Perspective: the Case of the Framing Effect.
Games & Adversarial Search Chapter 5. Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent’s reply. Time.
Techniques for Dealing with Hard Problems Backtrack: –Systematically enumerates all potential solutions by continually trying to extend a partial solution.
Samantha Nicholas & Khrys Nugent Hanover College
For Friday Finish chapter 5 Program 1, Milestone 1 due.
AI for Connect-4 (or other 2-player games) Minds and Machines.
Thinking: Concept Formation Concept formation: identifying commonalities across stimuli that unite them into a common category Rule learning: identifying.
Rationality Alan Kaylor Cline Department of Computer Sciences The University of Texas at Austin Based upon classic decision puzzlers collected by Gretchen.
Artificial Intelligence Adversarial search Fall 2008 professor: Luigi Ceccaroni.
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2008.
CS 484 – Artificial Intelligence
Adversarial Search: Game Playing Reading: Chapter next time.
Adversarial Search Chapter 6.
Games CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Chess AI’s, How do they work? Math Club 10/03/2011.
The Disordered Brain what happens when decision making goes wrong? Neil Harrison University of Sussex Formerly: Institute of Cognitive Neuroscience & Wellcome.
Artificial Intelligence in Game Design
Survey.
Decision making and economics. Economic theories Economic theories provide normative standards Expected value Expected utility Specialized branches like.
Computer Simulation (2) Game Playing. Different Types of Games How many players? –One –Two Element of chance? –Deterministic –Nondeterministic Outcome.
Or Why We’re Not Really As Rational As We’d Like to Believe.
Games with Chance Other Search Algorithms CPSC 315 – Programming Studio Spring 2008 Project 2, Lecture 3 Adapted from slides of Yoonsuck Choe.
Game Playing CSC361 AI CSC361: Game Playing.
AI – Week 8 AI + 2 Player Games Lee McCluskey, room 3/10
MAE 552 – Heuristic Optimization Lecture 28 April 5, 2002 Topic:Chess Programs Utilizing Tree Searches.
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2006.
Adversarial Search: Game Playing Reading: Chess paper.
Alpha-Beta Search. 2 Two-player games The object of a search is to find a path from the starting position to a goal position In a puzzle-type problem,
Episodic Control: Singular Recall and Optimal Actions Peter Dayan Nathaniel Daw Máté Lengyel Yael Niv.
Copyright © 2000 by Harcourt, Inc. All rights reserved. What is Perception? Perception: The process of recognizing and understanding others By understanding.
Review of Related Literature Different decision-making: – Budget decisions of managers – Irrationality of continuing the risk of losing a prospect – Decision-making.
Decision making Making decisions Optimal decisions Violations of rationality.
Decision Making choice… maximizing utility framing effects
Framing Effects From Chapter 34 ‘Frame and Reality’ of Thinking Fast and Slow, by D. Kahneman.
Game Playing Chapter 5. Game playing §Search applied to a problem against an adversary l some actions are not under the control of the problem-solver.
Adversarial Search Chapter 6 Section 1 – 4. Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
A Heuristic Solution To The Allais Paradox And Its Implications Seán Muller, University of Cape Town.
Introduction to Artificial Intelligence CS 438 Spring 2008 Today –AIMA, Ch. 6 –Adversarial Search Thursday –AIMA, Ch. 6 –More Adversarial Search The “Luke.
Game Playing. Towards Intelligence? Many researchers attacked “intelligent behavior” by looking to strategy games involving deep thought. Many researchers.
RISK BENEFIT ANALYSIS Special Lectures University of Kuwait Richard Wilson Mallinckrodt Professor of Physics Harvard University January 13th, 14th and.
RISK BENEFIT ANALYSIS Special Lectures University of Kuwait Richard Wilson Mallinckrodt Professor of Physics Harvard University January 13th, 14th and.
Experimental Economics NSF short course David Laibson August 11, 2005.
Games. Adversaries Consider the process of reasoning when an adversary is trying to defeat our efforts In game playing situations one searches down the.
Decision Making choice… maximizing utility framing effects.
Lecture 15 – Decision making 1 Decision making occurs when you have several alternatives and you choose among them. There are two characteristics of good.
GAME PLAYING 1. There were two reasons that games appeared to be a good domain in which to explore machine intelligence: 1.They provide a structured task.
Adversarial Search Chapter Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent reply Time limits.
Psychology 485 March 23,  Intro & Definitions Why learn about probabilities and risk?  What is learned? Expected Utility Prospect Theory Scalar.
PSY 323 – Cognition Chapter 13: Judgment, Decisions & Reasoning.
Today’s Topics Playing Deterministic (no Dice, etc) Games –Mini-max –  -  pruning –ML and games? 1997: Computer Chess Player (IBM’s Deep Blue) Beat Human.
1 DECISION MAKING Suppose your patient (from the Brazilian rainforest) has tested positive for a rare but serious disease. Treatment exists but is risky.
Game tree search Chapter 6 (6.1 to 6.3 and 6.6) cover games. 6.6 covers state of the art game players in particular. 6.5 covers games that involve uncertainty.
Learning to Play the Game of GO Lei Li Computer Science Department May 3, 2007.
Explorations in Artificial Intelligence Prof. Carla P. Gomes Module 5 Adversarial Search (Thanks Meinolf Sellman!)
1 BAMS 517 – 2011 Decision Analysis -IV Utility Failures and Prospect Theory Martin L. Puterman UBC Sauder School of Business Winter Term
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
Rationality Alan Kaylor Cline Department of Computer Sciences
Effects of Foreign Language on Decision Making
PSY 323 – Cognition Chapter 13: Judgment, Decisions & Reasoning.
Alpha-Beta Search.
DIS 280 Social Science Research Methodology: Problem Framing
Alpha-Beta Search.
Choices, Values and Frames
Alpha-Beta Search.
Alpha-Beta Search.
Alpha-Beta Search.
Presentation transcript:

The imperfect brain: Rationality and the limits of knowledge Quentin Huys Wellcome Trust Neuroimaging Centre Gatsby Computational Neuroscience Unit Medical School UCL

Why are choices hard? Time present and time past Are both perhaps present in time future, And time future contained in time past. T. S. Eliot

The future, in the long term goodness of an action = immediate reward + all future reward

Making optimal decisions Niv et al. 2007

Many decision systems in parallel Goal-directed system Tree search Habit system Experience average Innate system Evolutionary strategy

Evaluating the future: Think hard Goal-directed decisions L -> S2 ; L -> cheese +4 L -> S2 ; R -> X0 R -> S3 ; L -> water+1 R -> S3 ; R -> carrot+2 General solution: search a tree

Often, this is not feasible. Take chess. Each move 30 odd choices ? MANY!!! – Legal boards ~ Can’t just do full tree search.

So…? How do players do it? How did Deep Blue beat Kasparov?

Evaluating the future… just try it out? Habits Choose action more if expected value higher V(state) = V(state) + learning rate x prediction error Prediction should be: reward + V(next state) But it is: V(state) Prediction error reward + V(next state) - V(state)

Evaluating the future… just try it out? Habits V of all states = 0RVRV V(state) = V(state) + learning rate x prediction error T1V(s1) = V(s1) +.1 x (0 + V(s2) - V(s1)) = 0 T2V(s2) = V(s2) +.1 x (4 - V(s2)) =.4 T3V(s1) = V(s1) +.1 x (0 + V(s2) - V(s1) =.04

Evaluating the future: habit learning over time

Use clever heuristics, position evaluation…

Devaluation Goal-directed vs. habitual behaviour

Evaluating the future… actually, let’s not! Innate simple strategies Choose randomly at S1 Then just go for food if hungry Or for water if thirsty

Innate evolutionary strategies

more survive more survive fewer survive Hirsch and Bolles 1980

Many decision systems in parallel Multiple decision systems “Controllers” Competition and collaboration Goal-directed system Tree search Habit system Experience average Innate system Evolutionary strategy

Poor economists. People are pretty irrational. Imagine that the United States is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed. Assume that the exact scientific estimates of the consequences of the programs are as follows:

Poor economists. People are pretty irrational. Imagine that the United States is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed. Assume that the exact scientific estimates of the consequences of the programs are as follows: If Program A is adopted, 200 people will be saved If Program B is adopted, there is a one-third probability that 600 people will be saved and a two-thirds probability that no people will be saved.

Poor economists. People are pretty irrational. Imagine that the United States is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed. Assume that the exact scientific estimates of the consequences of the programs are as follows: If Program A' is adopted, 400 people will die If Program B' is adopted, there is a one-third probability that nobody will die and a two-thirds probability that 600 people will die

Poor economists. People are pretty irrational. Imagine that the United States is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed. Assume that the exact scientific estimates of the consequences of the programs are as follows:

Poor economists. People are pretty irrational. Imagine that the United States is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed. Assume that the exact scientific estimates of the consequences of the programs are as follows: If Program A' is adopted, 400 people will die If Program B' is adopted, there is a one-third probability that nobody will die and a two-thirds probability that 600 people will die If Program A is adopted, 200 people will be saved If Program B is adopted, there is a one-third probability that 600 people will be saved and a two-thirds probability that no people will be saved. A B’ Tversky & Kahnemann 1981

And chicken pretty stupid? Hershberger 1986

Clever innate strategies If Program A' is adopted, 400 people will die If Program B' is adopted, there is a one-third probability that nobody will die and a two-thirds probability that 600 people will die If Program A is adopted, 200 people will be saved If Program B is adopted, there is a one-third probability that 600 people will be saved and a two-thirds probability that no people will be saved. X

Simple is better at times: cars Car A: 75% +ve Car B: 50% +ve Car C: 50% +ve Car D: 25% +ve Dijksterhuis et al Asian disease: time

Sometimes knowledge hurts “We added balsamic vinegar to one of these” “We added balsamic vinegar to the light one” +BV

Simple is better at times: doctors 20 cases for which truth known Cardiologists General physicians A&E physicians Physicians overly cautious, but still miss many -> complications Melly et al. 2002

Discussion Decisions are often hard – Time future in time present – Uncertainty Multiple decision systems – Complex to simple – Neither is perfect Brains has to decide between deciders – Not always easy, not always right Psychiatry Goal-directed system Tree search Habit system Experience average Innate system Evolutionary strategy +BV

I can resist anything but temptation Ariely and Loewenstein 2006 “locking away” money Getting to bed early Alcohol, drugs The pub, disulfiram Buying sweets Long-term vs impulsive behaviours: imagined arousal real arousal How many excuses does the drunkard find when each new temptation comes! It is a new brand of liquor which the interests of intellectual culture in such matters oblige him to test; moreover it is poured out and it would be a sin to waste it; or it isn’t drinking but because he is cold; or it is Christmas day; or it is just this once… William James Get rid of p’s

Innate evolutionary strategies in the brain Bandler and Shipley 1994 Old brain

… 9 initial choices … x 8 x 7 Let’s play XOX Can go through all possible board settings – 9! to 230 symmetries etc. For each, consider all following positions Chose move that gets you closest to winning or keeps you furthest from winning (minimax/maximin) (really just 3) (really just 5)