Image Stabilization by Bayesian Dynamics Yoram Burak Sloan-Swartz annual meeting, July 2009.

Slides:

Advertisements

Similar presentations

Spike Based Visual Encoding Activity level (a m ) Visual encoder implemented in the NEF as network of 1024 laterally inhibiting neural columns Network.

Advertisements

Bayesian Belief Propagation

What is the neural code? Puchalla et al., What is the neural code? Encoding: how does a stimulus cause the pattern of responses? what are the responses.

What do we know about Primary Visual Cortex (V1)

The linear/nonlinear model s*f 1. The spike-triggered average.

Biological Modeling of Neural Networks: Week 9 – Coding and Decoding Wulfram Gerstner EPFL, Lausanne, Switzerland 9.1 What is a good neuron model? - Models.

What is vision Aristotle - vision is knowing what is where by looking.

Neuronal Coding in the Retina and Fixational Eye Movements Christian Mendl, Tim Gollisch Max Planck Institute of Neurobiology, Junior Research Group Visual.

Introduction: Neurons and the Problem of Neural Coding Laboratory of Computational Neuroscience, LCN, CH 1015 Lausanne Swiss Federal Institute of Technology.

Marseille, Jan 2010 Alfonso Renart (Rutgers) Jaime de la Rocha (NYU, Rutgers) Peter Bartho (Rutgers) Liad Hollender (Rutgers) Néstor Parga (UA Madrid)

Chapter 6 The Visual System

The visual system II Eye and retina. The primary visual pathway From perret-optic.ch.

For stimulus s, have estimated s est Bias: Cramer-Rao bound: Mean square error: Variance: Fisher information How good is our estimate? (ML is unbiased:

How does the visual system represent visual information? How does the visual system represent features of scenes? Vision is analytical - the system breaks.

Unrelated vs. Related Color Unrelated color: color perceived to belong to an area in isolation (CIE 17.4) Related color: color perceived to belong to.

The Human Visual System Short Overview. Terms: LGN, cortex, primary visual cortex, V1.

Spike Train decoding Summary Decoding of stimulus from response –Two choice case Discrimination ROC curves –Population decoding MAP and ML estimators.

The visual system Lecture 1: Structure of the eye

Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.

Connected Populations: oscillations, competition and spatial continuum (field equations) Lecture 12 Course: Neural Networks and Biological Modeling Wulfram.

Cracking the Population Code Dario Ringach University of California, Los Angeles.

Machine learning Image source:

-Gaurav Mishra -Pulkit Agrawal. How do neurons work Stimuli  Neurons respond (Excite/Inhibit) ‘Electrical Signals’ called Spikes Spikes encode information.

Abstract We start with a statistical formulation of Helmholtz’s ideas about neural energy to furnish a model of perceptual inference and learning that.

Active Vision Key points: Acting to obtain information Eye movements Depth from motion parallax Extracting motion information from a spatio-temporal pattern.

Neuronal Coding in the Retina and Fixational Eye Movements Friday Seminar Talk November 6, 2009 Friday Seminar Talk November 6, 2009 Christian Mendl Tim.

Neural Information in the Visual System By Paul Ruvolo Bryn Mawr College Fall 2012.

1 / 41 Inference and Computation with Population Codes 13 November 2012 Inference and Computation with Population Codes Alexandre Pouget, Peter Dayan,

Low Level Visual Processing. Information Maximization in the Retina Hypothesis: ganglion cells try to transmit as much information as possible about the.

1 Perception, Illusion and VR HNRS , Spring 2008 Lecture 3 The Eye.

2 2  Background  Vision in Human Brain  Efficient Coding Theory  Motivation  Natural Pictures  Methodology  Statistical Characteristics  Models.

THE VISUAL SYSTEM: EYE TO CORTEX Outline 1. The Eyes a. Structure b. Accommodation c. Binocular Disparity 2. The Retina a. Structure b. Completion c. Cone.

Projects: 1.Predictive coding in balanced spiking networks (Erwan Ledoux). 2.Using Canonical Correlation Analysis (CCA) to analyse neural data (David Schulz).

Encoding/Decoding of Arm Kinematics from Simultaneously Recorded MI Neurons Y. Gao, E. Bienenstock, M. Black, S.Shoham, M.Serruya, J. Donoghue Brown Univ.,

Xiao-Jing Wang Department of Neurobiology Yale University School of Medicine The Concept of a Decision Threshold in Sensory-Motor Processes.

The “ ” Paige in Kalman Filtering K. E. Schubert.

Biological Modeling of Neural Networks: Week 12 – Decision models: Competitive dynamics Wulfram Gerstner EPFL, Lausanne, Switzerland 12.1 Review: Population.

Visual Acuity Adler’s Physiology of the Eye 11th Ed. Chapter 33 - by Dennis Levi

1 Computational Vision CSCI 363, Fall 2012 Lecture 5 The Retina.

Human vision Jitendra Malik U.C. Berkeley. Visual Areas.

1 Perception and VR MONT 104S, Fall 2008 Lecture 2 The Eye.

The Computing Brain: Focus on Decision-Making

What’s optimal about N choices? Tyler McMillen & Phil Holmes, PACM/CSBMB/Conte Center, Princeton University. Banbury, Bunbury, May 2005 at CSH. Thanks.

Motivation and Overview

The Eye: III. Central Neurophysiology of Vision L12

Neural Coding: Integrate-and-Fire Models of Single and Multi-Neuron Responses Jonathan Pillow HHMI and NYU Oct 5, Course.

Several strategies for simple cells to learn orientation and direction selectivity Michael Eisele & Kenneth D. Miller Columbia University.

6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,

Sensation & Perception. Motion Vision I: Basic Motion Vision.

CHARACTERIZATION OF NONLINEAR NEURON RESPONSES AMSC 664 Final Presentation Matt Whiteway Dr. Daniel A. Butts Neuroscience.

Simultaneous integration versus sequential sampling in multiple-choice decision making Nate Smith July 20, 2008.

Biological Modeling of Neural Networks: Week 10 – Neuronal Populations Wulfram Gerstner EPFL, Lausanne, Switzerland 10.1 Cortical Populations - columns.

Dynamic Causal Models Will Penny Olivier David, Karl Friston, Lee Harrison, Andrea Mechelli, Klaas Stephan Mathematics in Brain Imaging, IPAM, UCLA, USA,

Dynamic Causal Models Will Penny Olivier David, Karl Friston, Lee Harrison, Stefan Kiebel, Andrea Mechelli, Klaas Stephan MultiModal Brain Imaging, Copenhagen,

Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010.

Reconstructing Visual Experiences from Brain Activity Evoked by Natural Movies Shinji Nishimoto, An T. Vu, Thomas Naselaris, Yuval Benjamini, Bin Yu, Jack.

The Neural Code Baktash Babadi SCS, IPM Fall 2004.

Optimal Decision-Making in Humans & Animals Angela Yu March 05, 2009.

Processing visual information - pathways

A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition Gustavo Deco Edmund T. Rolls Vision Research, 2004.

Bayesian Brain - Chapter 11 Neural Models of Bayesian Belief Propagation Rajesh P.N. Rao Summary by B.-H. Kim Biointelligence Lab School of.

Mechanisms of Simple Perceptual Decision Making Processes

Randomness in Neural Networks

Spontaneous activity in V1: a probabilistic framework

Article Review Todd Hricik.

fMRI and neural encoding models: Voxel receptive fields (continued)

Effective Connectivity

Space Perception and Binocular Vision

Effective Connectivity

Presentation transcript:

Image Stabilization by Bayesian Dynamics Yoram Burak Sloan-Swartz annual meeting, July 2009

What does neural activity represent? In Bayesian models: probabilities Direction of motion: single, static variable Accumulated evidence in area LIP Shadlen and Newsome (2001)

What does neural activity represent? In Bayesian models: probabilities Direction of motion: single, static variable What about multi-dimensional, dynamic quantities? Accumulated evidence in area LIP Shadlen and Newsome (2001)

Foveal vision and fixational drift

- between micro-saccades - ~20 receptive fields Image from: X. Pitkow - between spikes (100 Hz) - ~2-4 receptive fields ! Fixational drift is large in the fovea: cone separation: 0.5 arcmin

Foveal vision and fixational drift - between micro-saccades - ~20 receptive fields Image from: X. Pitkow - between spikes (100 Hz) - ~2-4 receptive fields ! Downstream areas require knowledge of trajectory to interpret spikes Fixational drift is large in the fovea: cone separation: 0.5 arcmin

Joint decoding of image and position Bayesian: Discrimination task: vs. X. Pitkow et al, Plos Biology (2007) N x 2 probabilities # positions

Bayesian: Discrimination task: vs. X. Pitkow et al, Plos Biology (2007) N x 2 probabilities Unconstrained image 30 x 30 binary pixels # positions N x probabilities Joint decoding of image and position

Bayesian: Discrimination task: vs. X. Pitkow et al, Plos Biology (2007) N x 2 probabilities Unconstrained image 30 x 30 binary pixels # positions N x probabilities Can the brain apply a Bayesian approach to this problem? Joint decoding of image and position

Can the brain apply a Bayesian approach to this problem? Decoding strategy Performance in parameter space What are the biological implications?

Can the brain apply a Bayesian approach to this problem? Decoding strategy Performance in parameter space What are the biological implications?

Decoding strategy Discards information about correlations Factorized representation:

Decoding strategy Discards information about correlations minimize D KL Factorized representation: Exact if trajectory is known. evidence, diffusion Update dynamics:

Decoding strategy Discards information about correlations minimize D KL Factorized representation: Exact if trajectory is known. evidence, diffusion evidence - Poisson spiking (rate λ 1 for on pixels, λ 0 for off) diffusion - Random walk (diffusion coefficient D) Retinal encoding model: Update dynamics:

Decoding strategy Discards information about correlations Neural Implementation - Two populations: where, what For 30 x 30 pixels: N × → N quantities. Factorized representation:

Update rules Update of what neurons: multiplicative gating Ganglion cells What Where nonlinearity

Update rules Update of what neurons: Update of where neurons: multiplicative gating Ganglion cells What Where What multiplicative gating Ganglion cells + diffusion nonlinearity

Demo image retina m x m binary pixels 2d diffusion (D) Poisson spikes: 100 Hz (on), 10 Hz (off) Decoder

Demo

Decoding strategy Performance in parameter space What are the biological implications? Can the brain apply a Bayesian approach to this problem?

Performance DD Convergence time [s] accuracy Performance degrades with larger D (and smaller λ)

Performance DD Convergence time [s] Faster and more accurate for larger images m = 5, 10, 30, 50, 100 accuracy

Demo

Performance DD Convergence time [s] Faster and more accurate for larger images accuracy m = 5, 10, 30, 50, 100

Performance DD Convergence time [s] Faster and more accurate for larger images accuracy m = 5, 10, 30, 50, 100

Performance DD Convergence time [s] Faster and more accurate for larger images accuracy m = 5, 10, 30, 50, 100

Performance D/m Convergence time [s] accuracy scales with linear image size m m x m pixels

Performance D/m Convergence time [s] accuracy scales with linear image size m Analytical scaling: D* m x m pixels

Performance Performance improves with image size. Success for images 10 x 10 or larger Prediction for psychophysics: Degradation in high acuity tasks when visual scene contains little background detail.

Temporal response of Ganglion cells Common view: fixational motion important to activate cells, due to biphasic response f(t) t Temporal response makes decoding much more difficult. 50 ms Need history Non-Markovian:

Temporal response of Ganglion cells Approach: Choose decoder that is Bayes optimal if the trajectory is known. What Ganglion “filtered trajectory” Where history dependent decoder / naive decoder Convergence time [s] accuracy D D

Temporal response of Ganglion cells Is fixational motion beneficial? Known trajectory, perfect inhibitory balance Convergence time [s] D Optimal D - order of magnitude smaller than biological value

Can the brain apply a Bayesian approach to this problem? Decoding strategy Performance in parameter space What are the biological implications?

Network architecture Each ganglion cell innervates multiple what & where cells (spread: ~10 arcmin) WhereWhat Ganglion Reciprocal, multiplicative gating

Activity: What neurons Slow dynamics, evidence accumulation Where neurons Fewer. Highly dynamic activity Tonic, sparse in retinal stabilization conditions.

Activity: What neurons Slow dynamics, evidence accumulation Where neurons Fewer. Highly dynamic activity Tonic, sparse in retinal stabilization conditions. Where in the brain? Monocular LGN? V1? If so, suggests LGN or V1 Modulatory inputs to relay cells (gating?) Lateral connectivity in where network, Increase in number of neurons.

Summary Strategy for stabilization of foveal vision Factorized Bayesian approach to multi-dimensional inference

Summary Strategy for stabilization of foveal vision Explicit representation of stabilized image “What” and “where” populations Factorized Bayesian approach to multi-dimensional inference

Summary Strategy for stabilization of foveal vision Explicit representation of stabilized image “What” and “where” populations Good performance at 1 arcmin resolution Problem is easier for large images, for coarser reconstruction Factorized Bayesian approach to multi-dimensional inference

Summary Strategy for stabilization of foveal vision Explicit representation of stabilized image “What” and “where” populations Good performance at 1 arcmin resolution Problem is easier for large images, for coarser reconstruction Factorized Bayesian approach to multi-dimensional inference Network architecture: Many-to-one inputs from retina, multiplicative gating (what/where)

Uri Rokni Haim Sompolinsky Markus Meister Special thanks - the Swartz foundation Acknowledgments