Download presentation
Presentation is loading. Please wait.
1
From T. McMillen & P. Holmes, J. Math. Psych. 50: 30-57, 2006. MURI Center for Human and Robot Decision Dynamics, Sept 13, 2007. Phil Holmes, Jonathan Cohen + many collaborators. The diffuse dynamics of choice
2
A really simple decision task: “ On each trial you will be shown one of two stimuli, drawn at random. You must identify the direction (L or R) in which the majority of dots are moving. ” The experimenter can vary the coherence of movement (% moving L or R) and the delay between response and next stimulus. Correct decisions are rewarded. “ Your goal is to maximize rewards over many trials in a fixed period. ” A speed-accuracy tradeoff.. 30% coherence 5% coherence Courtesy: W. Newsome Behavioral measures: reaction time distributions, error rates. More complex decisions: buy or sell? Neural economics.
3
An optimal decision procedure for noisy data: the Sequential Probability Ratio Test Mathematical idealization: During the trial, we draw noisy samples from one of two distributions p L (x) or p R (x) (left or right-going dots). The SPRT works like this: set up two thresholds and keep a running tally of the ratio of likelihood ratios: When first exceeds or falls below, declare victory for R or L. Theorem: (Wald, Barnard) Among all fixed sample or sequential tests, SPRT minimizes expected number of observations n for given accuracy. [For fixed n, SPRT maximises accuracy (Neyman-Pearson lemma).] p L (x)p R (x)
4
The continuum limit is a DD process Take logarithms: multiplication in becomes addition. Take a continuum limit: addition becomes integration. The SPRT becomes a drift-diffusion (DD) process (a cornerstone of 20th century physics): drift rate noise strength Here is the accumulated evidence (the log likelihood ratio). When reaches either threshold, declare R or L the winner. But do humans (or monkeys, or rats) drift and diffuse? Evidence comes from three sources: behavior, direct neural recordings, and mathematical models.
5
Behavioral evidence: RT distributions Human reaction time data in free response mode can be fitted to the first passage threshold crossing times of a DD process. Prior or bias toward one alternative can be implemented by starting point. (Ratcliff et al., Psych Rev. 1978, 1999, 2004, … ) thresh. +Z thresh. -Z drift A
6
Neural evidence: firing rates Spike rates of neurons in oculomotor areas rise during stimulus presentation, monkeys signal their choice after a threshold is crossed. J. Schall, V. Stuphorn, J. Brown, Neuron, 2002. Frontal eye field (FEF) recordings. J.I Gold, M.N. Shadlen, Neuron, 2002. Lateral interparietal area (LIP) recordings. thresholds
7
An optimal speed-accuracy tradeoff 1 Threshold too low Too high Optimal D RT $ $ DDD D D D $$ XXX $$$ X DDD The task: maximize rewards for a succession of free response trials in a fixed period. Reward Rate: response-to-stimulus interval RT DD (% correct/average time between resps.)
8
An optimal speed-accuracy tradeoff 2 How fast to be? How careful? The DDM delivers an explicit solution to the speed-accuracy tradeoff in terms of just 3 parameters: normalized threshold and signal-to-noise ratio and D. So, setting the threshold we can express RT in terms of ER and calculate a unique, parameter-free Optimal Performance Curve: RT/D = F(ER)
9
A behavioral test Do people adopt the optimal strategy? Some do; some don ’ t. Is this because they are optimizing a different function? e.g. weighting accuracy more? Or are they trying, but unable to adjust their thresholds? OPC
10
Biasing choices via initial conditions 1 Prior expectations can be incorporated into DD models by biasing the initial conditions. For example, in case of biased stimuli S1 with probability and S2 with probability, we set: Unbiased DD: Biased DD: Simen et al. In preparation, 2007. Biased rewards can be treated similarly, with dependence on D also. Use similar ideas to model social interactions?
11
Biasing choices via initial conditions 2 More frequent stimuli (S1 > S2) or biased rewards also affect optimal thresholds, and initial conditions and thresholds can collide, predicting instant responding. Biased stimuli Biased rewards Simen et al. In preparation, 2007. Thresholds Initial conds SNR RSI Always choose more probable alternative if above critical surface.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.