How much about our interactions with – and experience of – our world can be deduced from basic principles? This talk reviews recent attempts to understand the self-organised behaviour of embodied agents, like ourselves, as satisfying basic imperatives for sustained exchanges with the environment. In brief, one simple driving force appears to explain many aspects of action and perception. This driving force is the minimisation of surprise or prediction error that – in the context of perception – corresponds to Bayes-optimal predictive coding (that suppresses exteroceptive prediction errors) and – in the context of action – reduces to classical motor reflexes (that suppress proprioceptive prediction errors). We will look at some of the implications for the anatomy of this active inference, in terms of large-scale anatomical graphs and canonical microcircuits, and then turn to some examples of active inference – such as perceptual categorisation, action and its perception. Hierarchal brain architectures and the Bayesian brain Karl Friston, University College London
Overview The free-energy principle action and perception predictive coding with reflexes The anatomy of inference graphical models canonical microcircuits Some examples perceptual categorization omission responses action observation
“Objects are always imagined as being present in the field of vision as would have to be there in order to produce the same impression on the nervous mechanism” - von Helmholtz Thomas Bayes Geoffrey Hinton Richard Feynman From the Helmholtz machine to the Bayesian brain and self-organization Richard Gregory Hermann von Helmholtz Ross Ashby
Minimizing prediction error Change sensations sensations – predictions Prediction error Change predictions Action Perception
Prior distribution Posterior distribution Likelihood distribution temperature Action as inference – the ‘Bayesian thermostat’ Perception Action
Overview The free-energy principle action and perception predictive coding with reflexes The anatomy of inference graphical models canonical microcircuits Some examples perceptual categorization omission responses action observation
A simple hierarchy Generative models whatwhere Sensory fluctuations
Generative model Model inversion (inference) A simple hierarchy Expectations: Predictions: Prediction errors: Descending predictions Descending predictions Ascending prediction errors From models to perception
Haeusler and Maass: Cereb. Cortex 2006;17: Bastos et al: Neuron 2012; 76: Canonical microcircuits for predictive coding
frontal eye fields geniculate visual cortex retinal input pons oculomotor signals Errors (superficial pyramidal cells) Expectations (deep pyramidal cells) Top-down or backward predictions Bottom-up or forward prediction error proprioceptive input reflex arc Perception David Mumford Predictive coding with reflexes Action
superficial deep Errors (superficial pyramidal cells) Expectations (deep pyramidal cells) x frequency (Hz) spectral power V1 Autospectra (SPC) spectral power Forward transfer function frequency (Hz) spectral power V4 Autospectra (DPC) frequency (Hz) spectral power Backward transfer function
Errors (superficial pyramidal cells) Expectations (deep pyramidal cells) Linear or driving connections Nonlinear or modulatory connections superficial deep NMDA receptor density
Biological agents resist the second law of thermodynamics They must minimize their average surprise (entropy) They minimize surprise by suppressing prediction error (free-energy) Prediction error can be reduced by changing predictions (perception) Prediction error can be reduced by changing sensations (action) Perception entails recurrent message passing in the brain to optimize predictions Action makes predictions come true (and minimizes surprise)
Overview The free-energy principle action and perception predictive coding with reflexes The anatomy of inference graphical models canonical microcircuits Some examples perceptual categorization omission responses action observation
Generating bird songs with attractors Syrinx HVC time (sec) Frequency Sonogram Hidden causesHidden states
prediction and error hidden states Backward predictions Forward prediction error causal states Predictive coding stimulus time (seconds)
Perceptual categorization Frequency (Hz) Song a time (seconds) Song bSong c
Perceptual inference and sequences of sequences SyrinxNeuronal hierarchy Time (sec) Frequency (KHz) sonogram Prediction errorSensory statesHidden states
omission and violation of predictions Stimulus but no percept Percept but no stimulus Frequency (Hz) stimulus (sonogram) Time (sec) Frequency (Hz) percept peristimulus time (ms) LFP (micro-volts) ERP (prediction error) without last syllable Time (sec) percept peristimulus time (ms) LFP (micro-volts) with omission
Overview The free-energy principle action and perception predictive coding with reflexes The anatomy of inference graphical models canonical microcircuits Some examples perceptual categorization omission responses action observation
Prior distribution temperature Action as inference – the “Bayesian thermostat” Perception: Action:
visual input proprioceptive input Action with point attractors Descending proprioceptive predictions Descending proprioceptive predictions Exteroceptive predictions
action position (x) position (y) observation position (x) Heteroclinic cycle (central pattern generator) Descending proprioceptive predictions Descending proprioceptive predictions
“Each movement we make by which we alter the appearance of objects should be thought of as an experiment designed to test whether we have understood correctly the invariant relations of the phenomena before us, that is, their existence in definite spatial relations.” 'The Facts of Perception' (1878) in The Selected Writings of Hermann von Helmholtz, Ed. R. Karl, Middletown: Wesleyan University Press, 1971 p. 384 Hermann von Helmholtz
Thank you And thanks to collaborators: Rick Adams Andre Bastos Sven Bestmann Harriet Brown Jean Daunizeau Mark Edwards Xiaosi Gu Lee Harrison Stefan Kiebel James Kilner Jérémie Mattout Rosalyn Moran Will Penny Lisa Quattrocki Knight Klaas Stephan And colleagues: Andy Clark Peter Dayan Jörn Diedrichsen Paul Fletcher Pascal Fries Geoffrey Hinton James Hopkins Jakob Hohwy Henry Kennedy Paul Verschure Florentin Wörgötter And many others
Overview The free-energy principle action and perception predictive coding with reflexes The anatomy of inference graphical models canonical microcircuits Some examples perceptual categorization omission responses action observation visual searches
If percepts are hypotheses, where do we look for evidence? Richard Gregory
saliencevisual inputstimulussampling Sampling the world to minimise uncertainty Perception as hypothesis testing – saccades as experiments Free energy minimisationminimise uncertainty
Frontal eye fields Pulvinar salience map Fusiform (what) Superior colliculus Visual cortex oculomotor reflex arc Parietal (where)
Saccadic fixation and salience maps Visual samples Conditional expectations about hidden (visual) states And corresponding percept Saccadic eye movements Hidden (oculomotor) states
Perception and Action: The optimisation of neuronal and neuromuscular activity to suppress prediction errors (or free- energy) based on generative models of sensory data. Learning and attention: The optimisation of synaptic gain and efficacy over seconds to hours, to encode the precisions of prediction errors and causal structure in the sensorium. This entails suppression of free-energy over time. Neurodevelopment: Model optimisation through activity- dependent pruning and maintenance of neuronal connections that are specified epigenetically Evolution: Optimisation of the average free-energy (free-fitness) over time and individuals of a given class (e.g., conspecifics) by selective pressure on the epigenetic specification of their generative models. Time-scale Free-energy minimisation leading to…
Searching to test hypotheses – life as an efficient experiment Free energy principleminimise uncertainty
Epilogue (what we have not covered)
Synaptic gain Synaptic activity Synaptic efficacy Perception and inference Learning and memory Posterior beliefs and sufficient statistics Attention and precision Perception and inference Learning and memory Attention and affordance Sensory attenuation
Random dynamical attractors and ergodic theorem (path integral formulations and principle of least action) Random dynamical attractors and ergodic theorem (path integral formulations and principle of least action) Discrete formulations and Markovian processes (optimal decision theory) Discrete formulations and Markovian processes (optimal decision theory) Continuous formulations and dynamical systems theory (self-organised criticality) Continuous formulations and dynamical systems theory (self-organised criticality) The free energy principle Variational Bayes = ensemble learning Generalized Bayesian filtering = predictive coding Fokker-Planck equation = ensemble dynamics
Sleeping and dreaming (complexity minimisation and synaptic homoeostasis) Sleeping and dreaming (complexity minimisation and synaptic homoeostasis) Interoception and predictive coding (emotional valence and self-awareness) Interoception and predictive coding (emotional valence and self-awareness) Neuropsychiatry (false inference and failures of sensory attenuation) Neuropsychiatry (false inference and failures of sensory attenuation) The free energy principle Predictive coding and embodied cognition (philosophy) Predictive coding and embodied cognition (philosophy)