Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010.

Slides:

Advertisements

Similar presentations

Bayesian inference Lee Harrison York Neuroimaging Centre 01 / 05 / 2009.

Advertisements

Linear Regression.

Brief introduction on Logistic Regression

Statistical Decision Theory Abraham Wald ( ) Wald’s test Rigorous proof of the consistency of MLE “Note on the consistency of the maximum likelihood.

Biointelligence Laboratory, Seoul National University

Institute for Theoretical Physics and Mathematics Tehran January, 2006 Value based decision making: behavior and theory.

CMPUT 466/551 Principal Source: CMU

Chapter 4: Linear Models for Classification

Quasi-Continuous Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Joel Lachter, Greg Corrado, and Jim Johnston.

1 Reinforcement Learning Introduction & Passive Learning Alan Fern * Based in part on slides by Daniel Weld.

Ai in game programming it university of copenhagen Statistical Learning Methods Marco Loog.

Decision Dynamics and Decision States: the Leaky Competing Accumulator Model Psychology 209 March 4, 2013.

The loss function, the normal equation,

Classification and Prediction: Regression Via Gradient Descent Optimization Bamshad Mobasher DePaul University.

Visual Recognition Tutorial

HON207 Cognitive Science Sequential Sampling Models.

0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.

Statistical Decision Theory, Bayes Classifier

Classification with reject option in gene expression data Blaise Hanczar and Edward R Dougherty BIOINFORMATICS Vol. 24 no , pages

Sequential Hypothesis Testing under Stochastic Deadlines Peter Frazier, Angela Yu Princeton University TexPoint fonts used in EMF. Read the TexPoint manual.

Machine Learning CMPT 726 Simon Fraser University

1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.

From T. McMillen & P. Holmes, J. Math. Psych. 50: 30-57, MURI Center for Human and Robot Decision Dynamics, Sept 13, Phil Holmes, Jonathan.

Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research.

Distinguishing Evidence Accumulation from Response Bias in Categorical Decision-Making Vincent P. Ferrera 1,2, Jack Grinband 1,2, Quan Xiao 1,2, Joy Hirsch.

Ensemble Learning (2), Tree and Forest

Collaborative Filtering Matrix Factorization Approach

Theory of Decision Time Dynamics, with Applications to Memory.

An Integrated Model of Decision Making and Visual Attention Philip L. Smith University of Melbourne Collaborators: Roger Ratcliff, Bradley Wolfgang.

Seeing Patterns in Randomness: Irrational Superstition or Adaptive Behavior? Angela J. Yu University of California, San Diego March 9, 2010.

Statistical Decision Theory

ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.

Statistical learning and optimal control: A framework for biological learning and motor control Lecture 4: Stochastic optimal control Reza Shadmehr Johns.

Decision Making Theories in Neuroscience Alexander Vostroknutov October 2008.

Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Annual Workshop Introduction August, 2008.

D ECIDING WHEN TO CUT YOUR LOSSES Matt Cieslak, Tobias Kluth, Maren Stiels & Daniel Wood.

1 Markov Decision Processes Infinite Horizon Problems Alan Fern * * Based in part on slides by Craig Boutilier and Daniel Weld.

Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.

Ch15: Decision Theory & Bayesian Inference 15.1: INTRO: We are back to some theoretical statistics: 1.Decision Theory –Make decisions in the presence of.

1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.

The Computing Brain: Focus on Decision-Making

Modeling interactions between visually responsive and movement related neurons in frontal eye field during saccade visual search Braden A. Purcell 1, Richard.

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

What’s optimal about N choices? Tyler McMillen & Phil Holmes, PACM/CSBMB/Conte Center, Princeton University. Banbury, Bunbury, May 2005 at CSH. Thanks.

Optimal Eye Movement Strategies In Visual Search.

Simultaneous integration versus sequential sampling in multiple-choice decision making Nate Smith July 20, 2008.

Giansalvo EXIN Cirrincione unit #4 Single-layer networks They directly compute linear discriminant functions using the TS without need of determining.

Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”

Bayesian inference Lee Harrison York Neuroimaging Centre 23 / 10 / 2009.

Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.

Overfitting, Bias/Variance tradeoff. 2 Content of the presentation Bias and variance definitions Parameters that influence bias and variance Bias and.

Bump Hunting The objective PRIM algorithm Beam search References: Feelders, A.J. (2002). Rule induction by bump hunting. In J. Meij (Ed.), Dealing with.

Dynamics of Reward Bias Effects in Perceptual Decision Making Jay McClelland & Juan Gao Building on: Newsome and Rorie Holmes and Feng Usher and McClelland.

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

Optimal Decision-Making in Humans & Animals Angela Yu March 05, 2009.

Bayesian Estimation and Confidence Intervals Lecture XXII.

Bayesian Estimation and Confidence Intervals

Deep Feedforward Networks

Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.

A Classical Model of Decision Making: The Drift Diffusion Model of Choice Between Two Alternatives At each time step a small sample of noisy information.

Collaborative Filtering Matrix Factorization Approach

Dynamic Causal Model for evoked responses in M/EEG Rosalyn Moran.

Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.

Using Time-Varying Motion Stimuli to Explore Decision Dynamics

Probabilistic Population Codes for Bayesian Decision Making

Banburismus and the Brain

Decision Making as a Window on Cognition

The loss function, the normal equation,

Ch 3. Linear Models for Regression (2/2) Pattern Recognition and Machine Learning, C. M. Bishop, Previously summarized by Yung-Kyun Noh Updated.

Presentation transcript:

Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010

Introduction Speed Accuracyvs. Faster responses save time, but more errors What is optimal policy? What computations are involved? Are humans/animals optimal? How is it neurally implemented?

Outline Review of Bayesian decision theory: examples Quantifying temporal cost Optimal policy for 2-alternative forced choice – optimal policy (SPRT) – behavioral & neural data Decision under deadline

Bayesian Decision Theory: Estimation s x hidden variable observation ? loss function

Bayesian Decision Theory Decision policy Loss function: Expected loss: A policy  : x  d is a mapping from input x into decision d, where x is a sampled from the distribution p(x|s). The loss function l(d(x); s) depends on the decision d(x) and the hidden variable s. The expected loss associated with a policy  is averaged over x and s: The optimal policy minimizes this expected loss.

Examples Ex. 1: loss function is mean square error For each observation x, we need to minimize: Differentiating and setting to 0, we find loss is minimized if: Optimal policy is to report the mean of posterior p(s|x).

Examples Ex. 2: loss function is absolute error: Optimal policy is to report the median of posterior p(s|x). Ex. 3: loss function is 0-1 error: 0 if correct, 1 all other choices Optimal policy is to report the mode of posterior p(s|x).

Time Matters  Repeated Decisions ABABAB wait +: more info -: more time

Hypothesis Testing is a Stopping Problem Decision Problem: xx … x1x1 s x2x2 x3x3 Incorporate evidence iteratively (Bayes’ Rule):

Loss function penalizes both error and delay: Optimal Policy (Bellman equation): Loss Function and Optimal Policy Stopping cost: Expected loss: Continuation cost: After observing inputs x 1, …, x t, continue (observe x t+1 ) only if the continuation cost < stopping cost

An Intractable Solution in Practice Stopping Cost Too hard!

b a Useful Properties of Q s and Q c ? Wald & Wolfowitz (1948): Continuation region is an interval (a, b) on the unit interval: At time t, continue if a < q t < b, stop and report =1 if q t >b, report =0 if q t <a.

Intuition for Optimality of SPRT Cost qtqt 01 c ab Q s (0) = w q t Q s (1) = v (1 - q t ) Q c is concave decide 0decide 1continue

Sequential Probability Ratio Test Log posterior ratio (monotonic to q t ): b’ a’ 0 rtrt Additive updates (random walk):

Do People/Animals Behave Optimally? A simple decision-making (DM) task binary decision rate of information flow constant over time task difficulty easily manipulated

Random-Dot Coherent Motion Paradigm 30% Coherence5% Coherence

Do People/Animals Behave Optimally? Accuracy vs. Coherence vs. Coherence (Roitman & Shadlen, 2002) Model: SPRT

What is Happening in the Brain? Saccade generation system (Roitman & Shalden, 2002) Model: SPRT Neural Activities in LIP

Mid-way Summary Optimal binary DM = evidence accumulation to thresholds Perceptual DM behavior similar to that expected from SPRT Neural activities in LIP  Bayesian evidence integration SPRT a formal link between biology and behavior in DM

Outline Review of Bayesian decision theory: examples Quantifying temporal cost Optimal policy for 2-alternative forced choice – optimal policy (SPRT) – behavioral & neural data Decision under deadline

Caveat: Model Fit Imperfect Mean RTRT Distributions (Data from Roitman & Shadlen, 2002; analysis from Ditterich, 2007) SPRT predicts same RT distribution for correct & error trials Real error trials are slower

b a SPRT: Same RT for Correct and Errors

Fix 1: Variable Drift Rate (Data from Roitman & Shadlen, 2002; analysis from Ditterich, 2007) (idea from Ratcliff & Rouder, 1998) AccuracyMean RTRT Distributions

Fix 2: Increasing Drift Rate (Data from Roitman & Shadlen, 2002; analysis from Ditterich, 2007) AccuracyMean RTRT Distributions

A Principled Approach Trials aborted w/o response or reward if monkey breaks fixation (Data from Roitman & Shadlen, 2002) (Analysis from Ditterich, 2007) More “urgency” over time as risk of aborting trial increases Can we model this in a principled way?

Optimal Policy is a pair of of monotonically decaying thresholds Probability 1 0 Generalized SPRT Classic SPRT Time Intuition for decay With limited time, there is limited opportunity to observe enough evidence to bias the evidence in the opposite direction Optimal Decision Making Under a Deadline (Frazier & Yu, 2007) Loss function depends on error, delay, and whether deadline exceed

Threshold Decay  Fast RT & Slower Errors Probability 1 0 Generalized SPRT Classic SPRT Time (Frazier & Yu, 2007) Earlier deadline  greater urgency Time Probability

Timing Uncertainty (Frazier & Yu, 2007) Timing uncertainty  more “urgency” (lower threhsolds) Time Probability This leads to a set of testable predictions

Summary Bayesian decision theory as a formal framework for DM Inadequacies of SPRT to account for behavior Inherent time pressure  decaying thresholds Neural implementation of decaying thresholds?

Perceptual DM + Saccade Planning Task: find motion target sensory processing cost (time, accuracy) learning/memory attention Current/Future Project BehaviorTheoryNeurobiology Human Rats Bayesian modeling Stochastic control theory fMRI Recording

Generalize Loss Function We assumed loss is linear in error and delay: Wald also proved a dual statement: Amongst all decision policies satisfying the criterion SPRT (with some thresholds) minimizes the expected sample size. This implies that the SPRT is optimal for all loss functions that increase with inaccuracy and (linearly in) delay (proof by contradiction). (Bogacz et al, 2006) What if it’s non-linear, e.g. maximize reward rate:

Some Intuitions for the Proof (Frazier & Yu, 2007) Continuation Region Shrinking!

Generalizing the Cost Function Loss function increases faster than linear in time f(  ) is convex and increasing. Nonlinear trade-off between accuracy and time (e.g. maximizing reward rate)