Quasi-Continuous Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Joel Lachter, Greg Corrado, and Jim Johnston.

Slides:



Advertisements
Similar presentations
Remembering to decide: discrimination of temporally separated stimuli (selecting the best apple) Paul Miller Brandeis University.
Advertisements

Decision Dynamics and Decision States: the Leaky Competing Accumulator Model Psychology 209 March 4, 2013.
How well can we learn what the stimulus is by looking at the neural responses? We will discuss two approaches: devise and evaluate explicit algorithms.
For stimulus s, have estimated s est Bias: Cramer-Rao bound: Mean square error: Variance: Fisher information How good is our estimate? (ML is unbiased:
CS 8751 ML & KDDEvaluating Hypotheses1 Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal.
Sequential Hypothesis Testing under Stochastic Deadlines Peter Frazier, Angela Yu Princeton University TexPoint fonts used in EMF. Read the TexPoint manual.
The Decisive Commanding Neural Network In the Parietal Cortex By Hsiu-Ming Chang ( 張修明 )
Reward processing (1) There exists plenty of evidence that midbrain dopamine systems encode errors in reward predictions (Schultz, Neuron, 2002) Changes.
From T. McMillen & P. Holmes, J. Math. Psych. 50: 30-57, MURI Center for Human and Robot Decision Dynamics, Sept 13, Phil Holmes, Jonathan.
Distinguishing Evidence Accumulation from Response Bias in Categorical Decision-Making Vincent P. Ferrera 1,2, Jack Grinband 1,2, Quan Xiao 1,2, Joy Hirsch.
Theory of Decision Time Dynamics, with Applications to Memory.
An Integrated Model of Decision Making and Visual Attention Philip L. Smith University of Melbourne Collaborators: Roger Ratcliff, Bradley Wolfgang.
Chapter 9 Large-Sample Tests of Hypotheses
1 / 41 Inference and Computation with Population Codes 13 November 2012 Inference and Computation with Population Codes Alexandre Pouget, Peter Dayan,
Biological Modeling of Neural Networks: Week 12 – Decision models: Competitive dynamics Wulfram Gerstner EPFL, Lausanne, Switzerland 12.1 Review: Population.
Decision Making Theories in Neuroscience Alexander Vostroknutov October 2008.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Annual Workshop Introduction August, 2008.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Progress and Future Directions November 17, 2009.
BCS547 Neural Decoding. Population Code Tuning CurvesPattern of activity (r) Direction (deg) Activity
BCS547 Neural Decoding.
The Computing Brain: Focus on Decision-Making
© Copyright McGraw-Hill 2004
What’s optimal about N choices? Tyler McMillen & Phil Holmes, PACM/CSBMB/Conte Center, Princeton University. Banbury, Bunbury, May 2005 at CSH. Thanks.
Decision Dynamics and Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Juan Gao, Marius Usher and others.
Machine Learning 5. Parametric Methods.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
Progress in MURI 15 ( ) Mathematical modeling of decision behavior. AFOSR, Alexandria, VA, Nov 17th, 2009 Phil Holmes 1. Optimizing monkeys? Balancing.
Dynamics of Reward and Stimulus Information in Human Decision Making Juan Gao, Rebecca Tortell & James L. McClelland With inspiration from Bill Newsome.
Neural correlates of risk sensitivity An fMRI study of instrumental choice behavior Yael Niv, Jeffrey A. Edlund, Peter Dayan, and John O’Doherty Cohen.
Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010.
Does the brain compute confidence estimates about decisions?
Dynamics of Reward Bias Effects in Perceptual Decision Making Jay McClelland & Juan Gao Building on: Newsome and Rorie Holmes and Feng Usher and McClelland.
Optimal Decision-Making in Humans & Animals Angela Yu March 05, 2009.
Dynamics of Reward Bias Effects in Perceptual Decision Making
Jay McClelland Stanford University
Piercing of Consciousness as a Threshold-Crossing Operation
Sequential sampling models of the choice process
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
A Classical Model of Decision Making: The Drift Diffusion Model of Choice Between Two Alternatives At each time step a small sample of noisy information.
On the Nature of Decision States: Theory and Data
Comparison of observed switching behavior to ideal switching performance. Comparison of observed switching behavior to ideal switching performance. Conventions.
The Uncertainty Accumulation model.
Changing environment task.
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
Decision Making during the Psychological Refractory Period
Using Time-Varying Motion Stimuli to Explore Decision Dynamics
Human Reward / Stimulus/ Response Signal Experiment: Data and Analysis
Marius Usher, Phil Holmes, Juan Gao, Bill Newsome and Alan Rorie
Recency vs Primacy -- an ongoing project
Choice Certainty Is Informed by Both Evidence and Decision Time
Braden A. Purcell, Roozbeh Kiani  Neuron 
Perceptual Decision Making in Rodents, Monkeys, and Humans
Decision Making and Sequential Sampling from Memory
Ariel Zylberberg, Daniel M. Wolpert, Michael N. Shadlen  Neuron 
Probabilistic Population Codes for Bayesian Decision Making
Variance as a Signature of Neural Computations during Decision Making
Banburismus and the Brain
Confidence as Bayesian Probability: From Neural Origins to Behavior
C. Shawn Green, Alexandre Pouget, Daphne Bavelier  Current Biology 
Decision Making as a Window on Cognition
Volume 24, Issue 13, Pages (July 2014)
Volume 36, Issue 5, Pages (December 2002)
Jay and Juan building on Feng and Holmes
Ralf M. Haefner, Pietro Berkes, József Fiser  Neuron 
Liu D. Liu, Christopher C. Pack  Neuron 
Confidence Is the Bridge between Multi-stage Decisions
Moran Furman, Xiao-Jing Wang  Neuron 
Redmond G. O’Connell, Michael N. Shadlen, KongFatt Wong-Lin, Simon P
by Kenneth W. Latimer, Jacob L. Yates, Miriam L. R
Volume 23, Issue 11, Pages (June 2013)
Presentation transcript:

Quasi-Continuous Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Joel Lachter, Greg Corrado, and Jim Johnston as well as Juan Gao and Marius Usher

Is the rectangle longer toward the northwest or longer toward the northeast?

Longer toward the Northeast! 2.00” 1.99”

A Classical Model of Decision Making: The Drift Diffusion Model of Choice Between Two Alternative Decisions At each time step a small sample of noisy information is obtained; each sample adds to a cumulative relative evidence variable. Mean of the noisy samples is + for one alternative, – for the other, with standard deviation . When a bound is reached, the corresponding choice is made. Alternatively, in ‘time controlled’ or ‘interrogation’ tasks, respond when signal is given, based on value of the relative evidence variable.

The DDM is an optimal model, and it is consistent with neurophysiology It achieves the fastest possible decision on average for a given level of accuracy It can be tuned to optimize performance under different kinds of task conditions –Different prior probabilities –Different costs and payoffs –Variation in the time between trials… The activity of neurons in a brain area associated with decision making seems to reflect the DD process

Neural Basis of Decision Making in Monkeys (Shadlen & Newsome; Roitman & Shadlen, 2002) RT task paradigm of R&T. Motion coherence and direction is varied from trial to trial.

Neural Basis of Decision Making in Monkeys: Results Data are averaged over many different neurons that are associated with intended eye movements to the location of target.

Hard Prob. Correct Easy A Problem with the DDM Accuracy should gradually improve toward ceiling levels as more time is allowed, even for very hard discriminations, but this is not what is observed in human data. Two possible fixes: –Trial-to-trial variance in the direction of drift –Evidence accumulation may reach a bound and stop, even if more time is available

Usher and McClelland (2001) Leaky Competing Accumulator Model Addresses the process of deciding between two alternatives based on external input, with leakage, mutual inhibition, and noise: dy 1 /dt = I 1 -y 1 –f(y 2 )+ 1 dy 2 /dt = I 2 -y 2 –f(y 1 )+ 2 f(y) = [y] + Participant chooses the most active accumulator when the go cue occurs This is equivalent to choosing response 1 iff y 1 -y 2 > 0 Let y = (y 1 -y 2 ). While y 1 and y 2 are positive, the model reduces to: dy/dt = I-y+ I=I 1 -I 2 =-=  -   11 22 y1y1 y2y2

Wong & Wang (2006) ~Usher & McClelland (2001)

Time-accuracy curves for different |k-| or || |k-  = 0 |k-  =.2 |k-  =.4

Prob. Correct

Kiani, Hanks and Shadlen 2008 Random motion stimuli of different coherences. Stimulus duration follows an exponential distribution. ‘go’ cue can occur at stimulus offset; response must occur within 500 msec to earn reward.

The earlier the pulse, the more it matters (Kiani et al, 2008)

These results rule out leak dominance X Still viable

The Full Non-Linear LCA i Model y1y1 y2y2 Although the value of the difference variable is not well-captured by the linear approximation, the sign of the difference is approximated very closely.

Three Studies Related to these Issues Integration of reward and payoff information under time controlled conditions –Gao, Tortell & McClelland Investigations of decision making with non- stationary stimulus information –Usher, Tsetsos & McClelland Does the confidence of a final decision state vary continuously with the strength of the evidence? –Lachter, Corrado, Johnston & McClelland

Timeline of the Experiment

Proportion of Choices toward Higher Reward

Sensitivity varies with time

Three Hypotheses 1.Reward acts as an input from reward cue onset til the end of the integration period 2.Reward influences the state of the accumulators before the onset of the stimulus 3.Reward introduces an offset into the decision

Fits Based on Linear Model

Fits based on full LCA i

Relationship between response speed and choice accuracy

Different levels of activation of correct and incorrect responses in Inhibition-dominant LCA Final time slice correct errors

High-Threshold LCA i

Preliminary Simulation Results

Three Studies Related to these Issues Integration of reward and payoff information under time controlled conditions –Gao, Tortell & McClelland Investigations of decision making with non- stationary stimulus information –Usher, Tsetsos & McClelland Does the confidence of a final decision state vary continuously with the strength of the evidence? –Lachter, Corrado, Johnston & McClelland

Continuous Report of Confidence Lachter, Corrado, Johnston & McClelland (in progress) Expt 1: Probability bias included; trial could end at any time Expt 2: No probability bias; observers terminate trial when they have determined their best answer

Trial Duration Protocol and performance as a function of time for participants grouped by performance

Results and Descriptive Model of Data from 1 Participant