Read Montague Baylor College of Medicine Houston, TX www.hnl.bcm.tmc.edu Reward Processing and Social Exchange.

Slides:



Advertisements
Similar presentations
Week 4 Performance evaluation and performance reporting.
Advertisements

Congestion Control and Fairness Models Nick Feamster CS 4251 Computer Networking II Spring 2008.
Neural Correlates of Trust and Adaptation in a Dynamic Neuroeconomic Task Project Presentation Adam P.R. Smith, Esther Kessler, Chiara Fiorentini, Fiona.
Institute for Theoretical Physics and Mathematics Tehran January, 2006 Value based decision making: behavior and theory.
Neural Correlates of Variations in Event Processing during Learning in Basolateral Amygdala Matthew R. Roesch*, Donna J. Calu, Guillem R. Esber, and Geoffrey.
Hormones and Behavior Hormones Modulate Behavior Hormones Can Be Necessary for a Behavior A particular level of hormone in an animal does not insure a.
Working Memory Active short-term memory – Maintenance of task-relevant information online. Keeps relevant information available. Like RAM in a computer.
Dopamine, Uncertainty and TD Learning CNS 2004 Yael Niv Michael Duff Peter Dayan Gatsby Computational Neuroscience Unit, UCL.
Returns Economics 71a: Spring 2007 Lecture notes 3.2 (extra)
Decision making. ? Blaise Pascal Probability in games of chance How much should I bet on ’20’? E[gain] = Σgain(x) Pr(x)
1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.
Software Agents in Economic Environments Robert S. Gazzale Ph.D. Candidate, Department of Economics Jeffrey MacKie Mason Professor, Dept. of Economics.
Introduction: What does phasic Dopamine encode ? With asymmetric coding of errors, the mean TD error at the time of reward is proportional to p(1-p) ->
Dopamine, Uncertainty and TD Learning CoSyNe’04 Yael Niv Michael Duff Peter Dayan.
Reward processing (1) There exists plenty of evidence that midbrain dopamine systems encode errors in reward predictions (Schultz, Neuron, 2002) Changes.
Biologically Inspired Robotics Group,EPFL Associative memory using coupled non-linear oscillators Semester project Final Presentation Vlad TRIFA.
Reinforcement Learning: Computational Roles for Dopamine, Striatum, and Hippocampus 1:00-1:05 Welcome to the symposium (Jay McClelland) 1:05-1:30 Introduction.
Stakeholders’ Centrality in Modern Corporations and Their Management Antonio M. Chiesi Department of Social and Political Studies University of Milan Chiesi.
Social Behavior- Game Theory and Reciprocity
© 2015 Center for Creative Leadership. All rights reserved. Culture Eats Strategy for Breakfast 1 Impact of Culture on Leadership: Managing Across Cultures.
Efficient Market Hypothesis EMH Presented by Inderpal Singh.
Reinforcement learning and human behavior Hanan Shteingart and Yonatan Loewenstein MTAT Seminar in Computational Neuroscience Zurab Bzhalava.
Reinforcers and Punishers versus Incentives Reinforcers and punishers refer to good and bad behavior consequences.
Dopamine enhances model-based over model-free choice behavior Peter Smittenaar *, Klaus Wunderlich *, Ray Dolan.
CS344 : Introduction to Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 26- Reinforcement Learning for Robots; Brain Evidence.
Experimental Economics and Neuroeconomics. An Illustration: Rules.
Neuroeconomics and Rationality Ben Ollen Indiana University.
SR-1108 | © 2007 Institute for the Future. All rights reserved. Jane McGonigal Research Affiliate & Resident Game Designer INSTITUTE FOR THE FUTURE Technology.
Trust and Communications in HSCB Simulations LT Shawn Pollock Advisor: Dr. Chris Darken Co-Advisor: LTC Jon Alt.
Chapter 50 The Prefrontal Cortex and Executive Brain Functions Copyright © 2014 Elsevier Inc. All rights reserved.
Voluntary National Content Standards For Economics Presented by Joe Lockerd.
A View from the Bottom Peter Dayan Gatsby Computational Neuroscience Unit.
Summary of results. Reiterate goal of investigation: How general is anticipatory behavior observed by Land & McCleod? Found: Untrained subjects exhibit.
Punishment and Norm Compliance Daniel Houser Professor of Economics Director, Interdisciplinary Center for Economics Science George Mason University, Fairfax,
Pattern Classification of Attentional Control States S. G. Robison, D. N. Osherson, K. A. Norman, & J. D. Cohen Dept. of Psychology, Princeton University,
Uncertainty and Consumer Behavior Chapter 5. Uncertainty and Consumer Behavior 1.In order to compare the riskiness of alternative choices, we need to.
Motivation and HR DIFD 321. STEVE JOBS ON MANAGEMENT
Design and Implementation of General Purpose Reinforcement Learning Agents Tyler Streeter November 17, 2005.
CULTURAL STEREOTYPES AND NORM OF RECIPROCITY 11/03/
Sports and Entertainment Marketing © Thomson/South-Western Do Now List the three note taking strategies that work best for you. Chapter 2 Slide 1.
What is meant by “top-down” and “bottom-up” processing? Give examples of both. Bottom up processes are evoked by the visual stimulus. Top down processes.
Reverse engineering the brain Prof. Jan Lauwereyns Advanced Engineering A.
Experiments and “Rational” Behavior, 5/1/07. Beauty Contest Game Each person choose a number from 0 to 100. We will average these numbers. The person.
Neural correlates of risk sensitivity An fMRI study of instrumental choice behavior Yael Niv, Jeffrey A. Edlund, Peter Dayan, and John O’Doherty Cohen.
Robot Intelligence Technology Lab. Evolutionary Robotics Chapter 3. How to Evolve Robots Chi-Ho Lee.
Social Networks and Trust: not the Experimental Evidence you may Expect Daniela Di Cagno Emanuela Sciubba Luiss Guido Carli, Rome Birkbeck College, London.
Does the brain compute confidence estimates about decisions?
Manipulating the teaching signal: effects of dopamine-related drugs on human learning systems Wellcome Trust Centre for NeuroImaging University College.
Neural Coding of Basic Reward Terms of Animal Learning Theory, Game Theory, Microeconomics and Behavioral Ecology Wolfram Schultz Current Opinion in Neurobiology.
Cognition and Decision-Making
An Overview of Reinforcement Learning
SOCIAL EXCHANGE & RELATIONSHIPS
Neuroimaging of associative learning
Computer-Mediated Communication
To Detect and Correct: Norm Violations and Their Enforcement
Emotion, Decision Making, and the Amygdala
Challenges to Aid Effectiveness: Recipient Institutions
Neuroeconomic Approaches to Mental Disorders
Neuroimaging of associative learning
Volume 57, Issue 3, Pages (February 2008)
Ronald Keiflin, Patricia H. Janak  Neuron 
Wallis, JD Helen Wills Neuroscience Institute UC, Berkeley
Kerstin Preuschoff, Peter Bossaerts, Steven R. Quartz  Neuron 
Neuroimaging of associative learning
Samuel M McClure, Gregory S Berns, P.Read Montague  Neuron 
Human-centered Machine Learning
The Neural Circuitry of a Broken Promise
Luke J. Chang, Alec Smith, Martin Dufwenberg, Alan G. Sanfey  Neuron 
Economic Choice as an Untangling of Options into Actions
Volume 27, Issue 6, Pages (March 2017)
Presentation transcript:

Read Montague Baylor College of Medicine Houston, TX Reward Processing and Social Exchange

I love you Let’s do lunch B A A B A B

Natural visual statistics Specific algorithms evolved to solve efficiently the array of problems of early vision

Recharge or die The real world imposes another difficult requirement on all mobile organisms

Natural reward-harvesting statistics? generic woodland creature

Models of reinforcement learning can provide insight What computations should we expect? Reward harvesting is an economic problem involving both valuation and choice

Sustain goal Select goal Pursue goal Goal-directed choice Actual experience Counterfactual experience ‘what could have been’ guidance signals

Midbrain dopamine neurons burst pause R time naive R time After learning time After learning (catch trial) Pause, burst, and ‘no change’ responses represent reward prediction errors – ongoing emission of information ERROR SIGNAL = current reward +  next prediction - current prediction

Can we detect a reward prediction error signal in a human subject?

Neural correlates of these error signals have been observed in conditioning and decision tasks Passive conditioning tasks Instrumental conditioning tasks Sequential decision tasks Social and economic exchange tasks

Midbrain dopamine neurons burst and pause responses encode reward prediction errors ERROR SIGNAL = current reward +  next prediction - current prediction Can these systems be re-deployed for abstractly defined rewards (ideas)? “I want to solve Fermat’s last theorem”

What about the something more abstract like the expression and repayment of trust?

Trust Modeling Must involve risk (uncertainty) Harvesting returns from another agent

TRUST I love you Let’s do lunch B A A B A B

Simplifying and quantifying Trust (Berg et al., 1995; Weigelt and Camerer, 1988) Trust is the amount of money a sender sends to a receiver without external enforcement. Agent 2Agent 1 I R

X 3X 3 InvestorTrustee $20 A dynamic version of the Trust game (10 rounds)

Structure of a round

What is the behavioral signal that most strongly influences changes in trust (money sent) ? Agent 2Agent 1 I R Reciprocity = TIT-FOR-TAT

money sent to partner + Benevolent signal - Malevolent signal Neutral signal

Questions about brain response Responses that differentiate benevolent from neutral? Responses that differentiate malevolent from neutral? Responses that differentiate benevolent from malevolent? Surprise response

* time (sec) Trustee Brain: ‘intention to increase trust’ shifts with reputation building Reputation develops Trustee will increase trust on next move Trustee will decrease trust on next move submitreveal * increases or decreases in future trust by trustee time (sec) Signal is now anticipating the outcome

Temporal shift resembles value transfer in reward learning experiments burst pause R time naive R time After learning time After learning (catch trial)

r =.00 TRUSTEE decreases repayment Trustee decreases  No information r =.27 Change in next investment TRUSTEE increases repayment Trustee decreases  positive correlation Why should intentions to increase repayment burn more energy? Change in next investment

1.Social trust is about modeling others - always some underlying currency. 2.Re-deploy reward-harvesting machinery that we share with all vertebrates (abstractions gain reward status) 3.Use to probe pathologies (addicted state, autism spectrum disorder, borderline…) Trust?

Baylor College of Medicine Pearl Chiu Amin Kayali Brooks King-Casas Terry Lohrenz Sam McClure (Princeton) Read Montague Damon Tomlin Caltech Cedric Anen Colin Camerer Steve Quartz UCL Peter Dayan Nathaniel Daw Salk Institute Terry Sejnowski Princeton University Jon Cohen Emory University Greg Berns University of Alabama Laura Klinger Mark Klinger Families of autistic subjects Funding Sources NIDA NIMH Dana Foundation Kane Family Foundation Angel Williamson Imaging Center Institute for Advanced Study, Princeton NJ Collaborators

rounds fraction of highly accurate guesses by trustee Investor events Trustee events How do we know a reputation is forming? 8 s10 s4 s... invest period cue to invest investment revealed to both brains delay period guess period cue to guess Investment phase synchronized on submission time of both partners Trustee

What is Fair? i i i Split the profit Split ‘loaf’ Split the total 10 + i each i Investor Trustee 20 -i collective ownership? common goods?