Reasons to be careful about reward A flow (policy) cannot be specified with a scalar function of states: the fundamental theorem of vector calculus – aka.

Slides:



Advertisements
Similar presentations
The Helmholtz Machine P Dayan, GE Hinton, RM Neal, RS Zemel
Advertisements

Thomas Trappenberg Autonomous Robotics: Supervised and unsupervised learning.
How much about our interaction with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
How much about our interaction with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
Bayesian models for fMRI data
How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
Vector integrals Line integrals Surface integrals Volume integrals Integral theorems The divergence theorem Green’s theorem in the plane Stoke’s theorem.
Free Energy Workshop - 28th of October: From the free energy principle to experimental neuroscience, and back The Bayesian brain, surprise and free-energy.
How much about our interaction with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
The free-energy principle: a rough guide to the brain? Karl Friston
How much about our interaction with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
Visual Recognition Tutorial
Computational aspects of motor control and motor learning Michael I. Jordan* Mark J. Buller (mbuller) 21 February 2007 *In H. Heuer & S. Keele, (Eds.),
CDB Exploring Science and Society Seminar Thursday 19 November 2009 at 5.30pm Host: Prof Giorgio Gabella The Bayesian brain, surprise and free-energy.
How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
Free energy and active inference Karl Friston Abstract How much about our interaction with – and experience of – our world can be deduced from basic principles?
Optimality in Motor Control By : Shahab Vahdat Seminar of Human Motor Control Spring 2007.
Statistical learning and optimal control:
Abstract We start with a statistical formulation of Helmholtz’s ideas about neural energy to furnish a model of perceptual inference and learning that.
The free-energy principle: a rough guide to the brain? K Friston Summarized by Joon Shik Kim (Thu) Computational Models of Intelligence.
Population Coding Alexandre Pouget Okinawa Computational Neuroscience Course Okinawa, Japan November 2004.
Prediction in Human Presented by: Rezvan Kianifar January 2009.
Abstract This talk summarizes recent attempts to integrate action and perception within a single optimization framework. We start with a statistical formulation.
Learning Theory Reza Shadmehr Optimal feedback control stochastic feedback control with and without additive noise.
Motor Control. Beyond babbling Three problems with motor babbling: –Random exploration is slow –Error-based learning algorithms are faster but error signals.
Input Single-state DCM Intrinsic (within- region) coupling Extrinsic (between- region) coupling Multi-state DCM with excitatory and inhibitory connections.
Abstract This talk summarizes our recent attempts to integrate action and perception within a single optimization framework. We start with a statistical.
Cognition Through Imagination and Affect Murray Shanahan Imperial College London Department of Computing.
Recent advances in the theory of brain function
Abstract How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts.
Abstract We offer a formal treatment of choice behaviour based on the premise that agents minimise the expected free energy of future outcomes. Crucially,
Abstract Predictive coding models and the free-energy principle, suggests that cortical activity in sensory brain areas reflects the precision of prediction.
Abstract This presentation questions the need for reinforcement learning and related paradigms from machine-learning, when trying to optimise the behavior.
Zangwill Club Seminar - Lent Term The Bayesian brain, surprise and free-energy Karl Friston, Wellcome Centre for Neuroimaging, UCL Abstract Value-learning.
Abstract How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts.
How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
Free energy and active inference
BCS547 Neural Decoding.
Causality, symmetry, brain, evolution, DNA, and a new theory of Physics Sergio Pissanetzky 1.
Free-energy and active inference
Abstract We suggested recently that attention can be understood as inferring the level of uncertainty or precision during hierarchical perception. In.
Abstract This presentation will look at action, perception and cognition as emergent phenomena under a unifying perspective: This Helmholtzian perspective.
Workshop on Mathematical Models of Cognitive Architectures December 5-9, 2011 CIRM, Marseille Workshop on Mathematical Models of Cognitive Architectures.
Abstract In this presentation, I will rehearse the free-energy formulation of action and perception, with a special focus on the representation of uncertainty:
How much about our interaction with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
Abstract In this presentation, I will rehearse the free-energy formulation of action and perception, with a special focus on the representation of uncertainty:
Tracking with dynamics
(Example) Class Presentation: John Desmond
Workshop on: The Free Energy Principle (Presented by the Wellcome Trust Centre for Neuroimaging) July 5 (Thursday) - 6 (Friday) 2012 Workshop on: The.
How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand.
Abstract How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to.
Use the next two pairs of slides to understand what is meant by motion ambiguity when the eye is the frame of reference. Use the arrow keys to go back.
Autonomous Mobile Robots Autonomous Systems Lab Zürich Probabilistic Map Based Localization "Position" Global Map PerceptionMotion Control Cognition Real.
Tutorial Session: The Bayesian brain, surprise and free-energy Value-learning and perceptual learning have been an important focus over the past decade,
Dynamic Causal Modeling of Endogenous Fluctuations
Variational filtering in generated coordinates of motion
Free energy and active inference
Free energy and life as we know it
Computational models for imaging analyses
Emerging Frontiers of Science Information
Wellcome Trust Centre for Neuroimaging University College London
Aims Research aim to provide internally consistent, practically applicable, methodology of dynamic decision making (DM) Talk aims to provide DM-based.
Case Study on Robotic Systems Using Intelligent Approach
Interoceptive inference, emotion, and the embodied self
The free-energy principle: a rough guide to the brain? K Friston
Probabilistic Map Based Localization
Chapter 1 lesson 1.
Biology Overview The Study of Life What does the term Biology mean?
DCM Demo – Model Specification, Inversion and 2nd Level Inference
Presentation transcript:

Reasons to be careful about reward A flow (policy) cannot be specified with a scalar function of states: the fundamental theorem of vector calculus – aka the Helmholtz decomposition Any (curl free) flow specified with reward can only have a fixed point attractor: reward cannot specify itinerant movement or policies Value is produced by flow – not its cause: reward is a consequence of (defined by) behaviour not its cause The inherent tautology of reward: explaining behaviour in terms of maximising reward is like explaining the evolution of the eye by saying it maximises adaptive value Unresolved questions in motor control: A UCL-JHU workshop

A PhysicistAn EngineerAn Economist

Random dynamical systems Random attractors with small measure Kolmogorov forward equation Free-energy formulation Ergodic theorem Helmholtz decomposition Value and reward Free energy upper bounds expected cost

Value and reward Helmholtz decomposition Optimal control theory

Value and reward

Forward models in motor control Intrinsic frame of reference Extrinsic frame of reference hidden states control Optimal control Motor commands Efference copy Forward model State estimation Sensory mapping Cost function Plant kinetics

Predictive coding in motor control Intrinsic frame of reference Extrinsic frame of reference Optimal control Motor commands Efference copy Sensory mapping Cost function Plant kinetics Forward model Top-down predictions Bottom up prediction error sensationscontrol

Active inference Intrinsic frame of reference Extrinsic frame of reference sensations Classical reflex Corollary discharge Sensory mapping Prior beliefs Plant kinetics movements Forward model Bottom up prediction error Proprioceptive predictions

visual input proprioceptive input Action with point attractors cf., equilibria point hypothesis Descending proprioceptive predictions Exteroceptive predictions

action position (x) position (y) observation position (x) Heteroclinic cycle Action with heteroclinic cycles Descending proprioceptive predictions

Unresolved questions in motor control: A UCL-JHU workshop Reasons to be careful about reward A flow (policy) cannot be specified with a scalar function of states: the fundamental theorem of vector calculus – aka the Helmholtz decomposition Any (curl free) flow specified with reward can only have a fixed point attractor: reward cannot specify itinerant movement or policies Value is produced by flow – not its cause: reward is a consequence of (defined by) behaviour not its cause The inherent tautology of reward: explaining behaviour in terms of maximising reward is like explaining the evolution of the eye by saying it maximises adaptive value – cf., Intelligent design