Action strengths State values Prediction error.

Slides:



Advertisements
Similar presentations
Module 5 Assessment, Intake, Supervision and Case Management.
Advertisements

Flexible Shaping: How learning in small steps helps Hierarchical Organization of Behavior, NIPS 2007 Kai Krueger and Peter Dayan Gatsby Computational Neuroscience.
NSF Science of Learning Lila Davachi Dept Psychology and Neural Science New York University.
Omri Barak Collaborators: Larry Abbott David Sussillo Misha Tsodyks Sloan-Swartz July 12, 2011 Messy nice stuff & Nice messy stuff.
Fast Readout of Object Identity from Macaque Inferior Tempora Cortex Chou P. Hung, Gabriel Kreiman, Tomaso Poggio, James J.DiCarlo McGovern Institute for.
G052 : Publishing A04 Evaluation of Publication. Evaluation of Publication (G052) Your Publication.
Neural Correlates of Variations in Event Processing during Learning in Basolateral Amygdala Matthew R. Roesch*, Donna J. Calu, Guillem R. Esber, and Geoffrey.
Computational models of cognitive control (II) Matthew Botvinick Princeton Neuroscience Institute and Department of Psychology, Princeton University.
Journal club Marian Tsanov Reinforcement Learning.
How to do high quality research on middleware? Frank Eliassen professor University of Oslo and Simula Research Laboratory Oslo, Norway.
Business Minor grade adjustments on Midterm 2 Opportunity to participate in Cognitive Neuroscience and Perception experiment - sign up for Tuesday, Wednesday.
Neural correlates of processing musical meaning („semantics“) Stefan Koelsch.
Serotonin and Impulsivity Yufeng Zhang. Serotonin Originate from the median and dorsal raphe nuclei. Serotonin has been implicated in a variety of motor,
Computational models of cognitive control (I) Matthew Botvinick Princeton Neuroscience Institute and Department of Psychology, Princeton University.
1 ECE-517 Reinforcement Learning in Artificial Intelligence Lecture 11: Temporal Difference Learning (cont.), Eligibility Traces Dr. Itamar Arel College.
Operant Conditioning of Cortical Activity E Fetz, 1969.
fMRI Methods Lecture 12 – Adaptation & classification
Chapter 50 The Prefrontal Cortex and Executive Brain Functions Copyright © 2014 Elsevier Inc. All rights reserved.
Brain Rhythms: key questions  Coordination of processing across space  Coherence in perception, e.g. synchrony & binding, cognitive moment, causation/objects.
Introduction to Neural Networks and Example Applications in HCI Nick Gentile.
Metacognition Neil H. Schwartz Psych 605. Three Terms: Organizing the Concept Exogenous Constructivism Endogenous Constructivism Spence Piaget Dialectical.
A View from the Bottom Peter Dayan Gatsby Computational Neuroscience Unit.
Pattern Classification of Attentional Control States S. G. Robison, D. N. Osherson, K. A. Norman, & J. D. Cohen Dept. of Psychology, Princeton University,
Neural Modeling - Fall NEURAL TRANSFORMATION Strategy to discover the Brain Functionality Biomedical engineering Group School of Electrical Engineering.
Design and Implementation of General Purpose Reinforcement Learning Agents Tyler Streeter November 17, 2005.
I’d also like to acknowledge the presence and outstanding help of Dr
Adolescent Brain Development: Part 2
Part 3 PLANNING Chapter 6 PLANNING
Commentary: Memory for spatial locations, motor responses, and objects: triple dissociation among the hippocampus, caudate nucleus, and extrastriate visual.
Agroforestry Adoption: Keynes’ Animal Spirits are Alive and Well in the Yucatan James F. Casey Washington & Lee University Prepared for SAPPCREE Session.
A B SAMPLE DELAY CHOICE A B A B 1st Reversal2nd Reversal etc… Trial Time (ms) Conditional Visuomotor Learning Task Asaad, W.F., Rainer, G. and.
Cognitive control, hierarchy, & the rostro-caudal organization of the prefrontal cortex David Badre Department of Cognitive & Linguistic Sciences Department.
Myers 5e Chapter 2 Neuroscience, Genetics, and Behavior Neural Communication Neurons How Neurons Communicate How Neurotransmitters Influence Us Summing.
Attitudes and Attitude Change
Neural correlates of risk sensitivity An fMRI study of instrumental choice behavior Yael Niv, Jeffrey A. Edlund, Peter Dayan, and John O’Doherty Cohen.
Edward Taylor Al Renner Lois Robinson Training Supervisors To Avoid Performance Appraisal Pitfalls.
Lecture 12. Outline of Rule-Based Classification 1. Overview of ANN 2. Basic Feedforward ANN 3. Linear Perceptron Algorithm 4. Nonlinear and Multilayer.
Final Projects The final project is expected to be an independent project of your own devising addressing a topic related to cognition or neuroscience.
Looking at connections between brain regions
Chapter 6: Temporal Difference Learning
An Overview of Reinforcement Learning
Neuroimaging of associative learning
Risk-Responsive Orbitofrontal Neurons Track Acquired Salience
Computational models for imaging analyses
מוטיבציה והתנהגות free operant
Spatial differences of burstiness in the temporal occurrence
Institute of New Khmer And Motivation Prepared by: Nouv Brosh/ BBA.
تلفيق اطلاعات سنسوري: مکانيسم و مدلها
Emotion, Decision Making, and the Amygdala
Volume 39, Issue 5, Pages (August 2003)
Neuroimaging of associative learning
Reinforcement Learning Signal Predicts Social Conformity
Chapter 6: Temporal Difference Learning
Wallis, JD Helen Wills Neuroscience Institute UC, Berkeley
Orbitofrontal Cortex: A Neural Circuit for Economic Decisions
Neuroimaging of associative learning
ESF EVALUATION PARTNERSHIP
Network topology. Network topology. Nodes are linked by edges. Node size represents a quantifiable node property (e.g. fold-change in two different experimental.
Neural and Computational Mechanisms of Action Processing: Interaction between Visual and Motor Representations  Martin A. Giese, Giacomo Rizzolatti  Neuron 
Map by Glenda baker, 2016 Tutorial overview
Volume 39, Issue 5, Pages (August 2003)
August 8, 2006 Danny Budik, Itamar Elhanany Machine Intelligence Lab
Structural Emergence in Partially Ordered Sets
JJ Orban de Xivry Hands on session JJ Orban de Xivry
Streptozotocin treatment increased phospho–Thr231-tau immunoreactivity in mouse brain. Streptozotocin treatment increased phospho–Thr231-tau immunoreactivity.
Brain-computer interfaces.
Orbitofrontal Cortex as a Cognitive Map of Task Space
Decomposing the motor system
Remembrance of Odors Past
World models and basis functions
Presentation transcript:

Action strengths State values Prediction error

1 2 3 4

Extension 1: Support for representing option identifiers

White & Wise, Exp Br Res, 1999 (See also: Assad, Rainer & Miller, 2000; Bunge, 2004; Hoshi, Shima & Tanji, 1998; Johnston & Everling, 2006; Wallis, Anderson & Miller, 2001; White, 1999…)

Miller & Cohen, Ann. Rev. Neurosci, 2001

From Curtis & D’Esposito, TICS, 2003, after Funahashi et al. , J From Curtis & D’Esposito, TICS, 2003, after Funahashi et al., J. Neurophysiol,1989.

Fuster, Neuron, 2001

Extension 2: Option-specific policies

Bear, Connors, & Paradiso, 2001

O’Reilly & Frank, Neural Computation, 2006

Aldridge & Berridge, J Neurosci, 1998

Extension 3: Option-specific state values

Kringelbach, Nature Rev Neurosci, 2005

Padoa-Schioppa & Assad, Nature, 2006

Schoenbaum, et al. J Neurosci. 1999 See also: O’Doherty, Critchley, Deichmann, Dolan, 2003

Extension 4: Temporal scope of the prediction error

Schoenbaum, Roesch & Stalnaker, TICS, 2006

Roesch, Taylor & Schoenbaum, Neuron, 2006

Daw, NIPS, 2003

--- objectives/overview -- actor-critic as link to brain -- HRL implementation in actor-critic -- opportunity to further introduce hrl and brain… -- apologies to both 1/2s of the audience (for covering stuff each learned in 1st year grad school)

--- Overview of actor critic -- main putative neural correlates -- then talk about 4 changes demanded by HRL -- 1. option rep’n -- 2. (actor) option-specific policies (and selection of options) -- 3 (critic) option-specific value functions -- 4 (critic) change in scope of prediction error …then move on to neural correlates… “selective activity in OFC did not consistently represent the identity of particular odors, the motivational characteristics of the associated reinforcer, or preparation for the motoric response. Instead it would appear that the selective activity in OFC during accurate performance represents the integration of information regarding the significance of a particular cue (or cues) with subsequent behavior.”

Additional questions: --Hierarchical representations in PFC? -- Origins of options (subgoals)? -- Purposive vs. habit?