Cortical Signals for Rewarded Actions and Strategic Exploration

Slides:



Advertisements
Similar presentations
Soyoun Kim, Jaewon Hwang, Daeyeol Lee  Neuron 
Advertisements

Heterogeneous Coding of Temporally Discounted Values in the Dorsal and Ventral Striatum during Intertemporal Choice  Xinying Cai, Soyoun Kim, Daeyeol.
Heather L. Dean, Maureen A. Hagan, Bijan Pesaran  Neuron 
Neuronal Correlates of Metacognition in Primate Frontal Cortex
A Source for Feature-Based Attention in the Prefrontal Cortex
Volume 95, Issue 1, Pages e3 (July 2017)
René Quilodran, Marie Rothé, Emmanuel Procyk  Neuron 
Volume 51, Issue 6, Pages (September 2006)
Choosing Goals, Not Rules: Deciding among Rule-Based Action Plans
Volume 95, Issue 5, Pages e5 (August 2017)
Jung Hoon Sul, Hoseok Kim, Namjung Huh, Daeyeol Lee, Min Whan Jung 
Value Representations in the Primate Striatum during Matching Behavior
Decision Making: From Neuroscience to Psychiatry
Heather L. Dean, Maureen A. Hagan, Bijan Pesaran  Neuron 
Martin O'Neill, Wolfram Schultz  Neuron 
Exploiting Exploration: Past Outcomes and Future Actions
Volume 94, Issue 2, Pages e6 (April 2017)
Volume 87, Issue 1, Pages (July 2015)
Volume 94, Issue 4, Pages e7 (May 2017)
Volume 81, Issue 6, Pages (March 2014)
Timothy J. Vickery, Marvin M. Chun, Daeyeol Lee  Neuron 
Volume 93, Issue 2, Pages (January 2017)
Developmental Inhibition of Gsk3 Rescues Behavioral and Neurophysiological Deficits in a Mouse Model of Schizophrenia Predisposition  Makoto Tamura, Jun.
Vincent B. McGinty, Antonio Rangel, William T. Newsome  Neuron 
Roman F. Loonis, Scott L. Brincat, Evan G. Antzoulatos, Earl K. Miller 
Feature- and Order-Based Timing Representations in the Frontal Cortex
Activity in Posterior Parietal Cortex Is Correlated with the Relative Subjective Desirability of Action  Michael C. Dorris, Paul W. Glimcher  Neuron 
Decision Making: From Neuroscience to Psychiatry
Volume 82, Issue 5, Pages (June 2014)
Volume 71, Issue 4, Pages (August 2011)
Volume 94, Issue 2, Pages e6 (April 2017)
Huihui Zhou, Robert Desimone  Neuron 
Multiple Timescales of Memory in Lateral Habenula and Dopamine Neurons
Human Orbitofrontal Cortex Represents a Cognitive Map of State Space
Differences between Neural Activity in Prefrontal Cortex and Striatum during Learning of Novel Abstract Categories  Evan G. Antzoulatos, Earl K. Miller 
Franco Pestilli, Marisa Carrasco, David J. Heeger, Justin L. Gardner 
A. Saez, M. Rigotti, S. Ostojic, S. Fusi, C.D. Salzman  Neuron 
Functional Microcircuit Recruited during Retrieval of Object Association Memory in Monkey Perirhinal Cortex  Toshiyuki Hirabayashi, Daigo Takeuchi, Keita.
Action Selection and Action Value in Frontal-Striatal Circuits
Independent Category and Spatial Encoding in Parietal Cortex
Caleb E. Strait, Tommy C. Blanchard, Benjamin Y. Hayden  Neuron 
Ethan S. Bromberg-Martin, Masayuki Matsumoto, Okihide Hikosaka  Neuron 
Sharon C. Furtak, Omar J. Ahmed, Rebecca D. Burwell  Neuron 
Ryo Sasaki, Takanori Uka  Neuron  Volume 62, Issue 1, Pages (April 2009)
Volume 95, Issue 5, Pages e5 (August 2017)
Volume 89, Issue 6, Pages (March 2016)
Serial, Covert Shifts of Attention during Visual Search Are Reflected by the Frontal Eye Fields and Correlated with Population Oscillations  Timothy J.
Volume 54, Issue 2, Pages (April 2007)
Guilhem Ibos, David J. Freedman  Neuron 
Franco Pestilli, Marisa Carrasco, David J. Heeger, Justin L. Gardner 
Daniel E. Winkowski, Eric I. Knudsen  Neuron 
Orbitofrontal Cortex Uses Distinct Codes for Different Choice Attributes in Decisions Motivated by Curiosity  Tommy C. Blanchard, Benjamin Y. Hayden,
Volume 97, Issue 1, Pages e3 (January 2018)
Volume 76, Issue 4, Pages (November 2012)
Volume 72, Issue 6, Pages (December 2011)
Ethan S. Bromberg-Martin, Okihide Hikosaka  Neuron 
Volume 88, Issue 4, Pages (November 2015)
Posterior Parietal Cortex Encodes Autonomously Selected Motor Plans
Prefrontal Neurons Coding Suppression of Specific Saccades
Masayuki Matsumoto, Masahiko Takada  Neuron 
John T. Serences, Geoffrey M. Boynton  Neuron 
Volume 92, Issue 2, Pages (October 2016)
Encoding of Stimulus Probability in Macaque Inferior Temporal Cortex
Kristy A. Sundberg, Jude F. Mitchell, John H. Reynolds  Neuron 
Volume 78, Issue 4, Pages (May 2013)
Social Information Signaling by Neurons in Primate Striatum
The Postsaccadic Unreliability of Gain Fields Renders It Unlikely that the Motor System Can Use Them to Calculate Target Position in Space  Benjamin Y.
Volume 99, Issue 1, Pages e4 (July 2018)
Distributed Coding of Actual and Hypothetical Outcomes in the Orbital and Dorsolateral Prefrontal Cortex  Hiroshi Abe, Daeyeol Lee  Neuron  Volume 70,
Orbitofrontal Cortex Uses Distinct Codes for Different Choice Attributes in Decisions Motivated by Curiosity  Tommy C. Blanchard, Benjamin Y. Hayden,
Presentation transcript:

Cortical Signals for Rewarded Actions and Strategic Exploration Christopher H. Donahue, Hyojung Seo, Daeyeol Lee  Neuron  Volume 80, Issue 1, Pages 223-234 (October 2013) DOI: 10.1016/j.neuron.2013.07.040 Copyright © 2013 Elsevier Inc. Terms and Conditions

Figure 1 Behavioral task (A) Matching pennies task. (B) Visual search task. The timing for each epoch was identical for the two tasks. Neuron 2013 80, 223-234DOI: (10.1016/j.neuron.2013.07.040) Copyright © 2013 Elsevier Inc. Terms and Conditions

Figure 2 Reinforcement Learning during Matching-Pennies Task (A) Win-stay versus lose-switch behavior. The fraction of trials in which the monkeys chose the same target as in the previous trial after receiving a reward, p(win-stay), is plotted on the ordinate, and the fraction of trials in which they switched to the other target after not receiving a reward, p(lose-switch), is plotted on the abscissa. Different symbols and colors indicate different animals and cortical regions, respectively. (B) Asymmetric effects of reward versus no-reward estimated by a modified reinforcement learning model. Values for θwin + θloss that are greater than zero indicate that rewarded trials had a greater effect on the animals' future behavior than nonrewarded trials. Error bars indicate mean ± SEM. See also Table S1. Neuron 2013 80, 223-234DOI: (10.1016/j.neuron.2013.07.040) Copyright © 2013 Elsevier Inc. Terms and Conditions

Figure 3 Cortical Signals Related to Previous Choice and Reward during the Matching-Pennies Task The histograms show the fraction of neurons that modulated their activity significantly according to the previous outcome (rewarded or unrewarded, A), previous choice (left or right, B), or their interaction (C) in the fore-period and cue-period of the matching-pennies task. Colors indicate different cortical regions. Dotted lines correspond to the significance level used (p = 0.05). Asterisks, p < 0.05 (binomial test). See also Figures S1 and S3. Neuron 2013 80, 223-234DOI: (10.1016/j.neuron.2013.07.040) Copyright © 2013 Elsevier Inc. Terms and Conditions

Figure 4 Activity of Example Neurons from SEF, DLPFC, and LIP during the Matching-Pennies Task (A–C) Spike density functions (SDF) for single neurons for trials separated by the animal’s choice (left or right) and outcome (rewarded, +, or nonrewarded, −) in the previous trial. For all three neurons (A, SEF; B, DLPFC; C, LIP), encoding of the previous choice was more robust when the previous trial was rewarded (two-way ANOVA, previous choice × previous reward interaction, p < 10−4). (D–F) SDF for the same neurons for trials separated by the animal’s choice in the current trial and the reward in the previous trial. The signals related to the upcoming choice was not significantly affected by the previous reward for any neuron (two-way ANOVA, previous reward × current choice interaction, p > 0.25). The shaded area indicates mean ± SEM. Neuron 2013 80, 223-234DOI: (10.1016/j.neuron.2013.07.040) Copyright © 2013 Elsevier Inc. Terms and Conditions

Figure 5 Population Summary for the Effect of Reward on the Neural Encoding of Previous Choice during the Matching-Pennies Task (A) Top. Average decoding accuracy for the previous choice estimated using a sliding window separately according to whether the animal’s previous choice was rewarded or not. The shaded area indicates mean ± SEM. Bottom. Scatter plots show classification accuracy for each neuron after rewarded versus unrewarded trials. Symbol colors indicate whether the decoding accuracy was significantly above chance for both (green), either (orange), or neither (gray) of rewarded and unrewarded outcomes (z-test, p < 0.05). Open and closed symbols, respectively, indicate that classification was or was not significantly different for the two outcomes. (B) The decoding accuracy for the animal’s upcoming choices, shown in the same format as in (A). See also Figures S2 and S3. Neuron 2013 80, 223-234DOI: (10.1016/j.neuron.2013.07.040) Copyright © 2013 Elsevier Inc. Terms and Conditions

Figure 6 Activity of an Example SEF Neuron during the Visual Search and Matching-Pennies Task The number of trials used for calculating the spike density functions was equated for the two tasks (n = 128). Same format as in Figures 4A–4C. Neuron 2013 80, 223-234DOI: (10.1016/j.neuron.2013.07.040) Copyright © 2013 Elsevier Inc. Terms and Conditions

Figure 7 Population Summary for the Task-Specific Effect of Reward on the Neural Encoding of Previous Choice (A) Top. Decoding accuracy for the animal’s previous choice in the visual search task. Same format as in Figure 5A. (B) Average decoding accuracies for previous choice shown separately for each region, task, and previous reward. Saturated colors indicate that the accuracy was significantly higher than the chance level (t test, p < 0.05). Error bars indicate ± SEM. See also Figure S3. Neuron 2013 80, 223-234DOI: (10.1016/j.neuron.2013.07.040) Copyright © 2013 Elsevier Inc. Terms and Conditions