Class 21, 1999 CBCl/AI MIT Neuroscience II T. Poggio.

Slides:



Advertisements
Similar presentations
Chapter 4: The Visual Cortex and Beyond
Advertisements

Read this article for Friday next week [1]Chelazzi L, Miller EK, Duncan J, Desimone R. A neural basis for visual search in inferior temporal cortex. Nature.
Standard Brain Model for Vision
Midterm 1 Oct. 21 in class. Read this article by Wednesday next week!
Electrophysiology of Visual Attention. Does Visual Attention Modulate Visual Evoked Potentials? The theory is that Visual Attention modulates visual information.
V1 Physiology. Questions Hierarchies of RFs and visual areas Is prediction equal to understanding? Is predicting the mean responses enough? General versus.
The Visual System: Feature Detection Model Lesson 17.
Gabor Filter: A model of visual processing in primary visual cortex (V1) Presented by: CHEN Wei (Rosary) Supervisor: Dr. Richard So.
A Neural Model for Detecting and Labeling Motion Patterns in Image Sequences Marc Pomplun 1 Julio Martinez-Trujillo 2 Yueju Liu 2 Evgueni Simine 2 John.
Object recognition and scene “understanding”
Visual Attention Attention is the ability to select objects of interest from the surrounding environment A reliable measure of attention is eye movement.
Fast Readout of Object Identity from Macaque Inferior Tempora Cortex Chou P. Hung, Gabriel Kreiman, Tomaso Poggio, James J.DiCarlo McGovern Institute for.
HMAX Models Architecture Jim Mutch March 31, 2010.
Gain Modulation Huei-Ju Chen Papers: Chance, Abbott, and Reyes(2002) E. Salinas & T. Sejnowski(2001) E. Salinas & L.G. Abbott (1997, 1996) Pouget & T.
Read this article for Friday Oct 21! Trends in Neuroscience (2000) 23, Hint #1: there are at least 3 ways of getting this article Hint #2: none.
Electrophysiology of Visual Attention. Moran and Desimone (1985) “Classical” RF prediction: there should be no difference in responses in these two conditions.
Visual Pathways W. W. Norton Primary cortex maintains distinct pathways – functional segregation M and P pathways synapse in different layers Ascending.
Test Oct. 21 Review Session Oct 19 2pm in TH201 (that’s here)
Use a pen on the test. The distinct modes of vision offered by feedforward and recurrent processing Victor A.F. Lamme and Pieter R. Roelfsema.
Writing Workshop Find the relevant literature –Use the review journals as a first approach e.g. Nature Reviews Neuroscience Trends in Neuroscience Trends.
Searching for the NCC We can measure all sorts of neural correlates of these processes…so we can see the neural correlates of consciousness right? So what’s.
Read this article for Friday [1]Chelazzi L, Miller EK, Duncan J, Desimone R. A neural basis for visual search in inferior temporal cortex. Nature 1993;
Attention as Information Selection. Early Selection Early Selection model postulated that attention acted as a strict gate at the lowest levels of sensory.
How does the visual system represent visual information? How does the visual system represent features of scenes? Vision is analytical - the system breaks.
Computational Vision: Object Recognition Object Recognition Jeremy Wyatt.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Robust Object Recognition with Cortex-Like Mechanisms Thomas Serre, Lior Wolf, Stanley Bileshi, Maximilian Riesenhuber, and Tomaso Poggio, Member, IEEE.
Levels in Computational Neuroscience Reasonably good understanding (for our purposes!) Poor understanding Poorer understanding Very poorer understanding.
Visual Cognition I basic processes. What is perception good for? We often receive incomplete information through our senses. Information can be highly.
A.F. Lamme and Pieter R. Roelfsema
Overview 1.The Structure of the Visual Cortex 2.Using Selective Tuning to Model Visual Attention 3.The Motion Hierarchy Model 4.Simulation Results 5.Conclusions.
Michael Arbib & Laurent Itti: CS664 – USC, spring Lecture 6: Object Recognition 1 CS664, USC, Spring 2002 Lecture 6. Object Recognition Reading Assignments:
Basic Processes in Visual Perception
Consequences of Attentional Selection Single unit recordings.
Multiclass object recognition
Low Level Visual Processing. Information Maximization in the Retina Hypothesis: ganglion cells try to transmit as much information as possible about the.
2 2  Background  Vision in Human Brain  Efficient Coding Theory  Motivation  Natural Pictures  Methodology  Statistical Characteristics  Models.
1 Computational Vision CSCI 363, Fall 2012 Lecture 31 Heading Models.
To study changes in the brain... Multiple memory system Top down approach Lesion of part(s) of CNS Behaviours (system level) Cortical involvement in memory.
THE VISUAL SYSTEM: EYE TO CORTEX Outline 1. The Eyes a. Structure b. Accommodation c. Binocular Disparity 2. The Retina a. Structure b. Completion c. Cone.
The Visual Cortex: Anatomy
fMRI Methods Lecture 12 – Adaptation & classification
Chapter 7. Network models Firing rate model for neuron as a simplification for network analysis Neural coordinate transformation as an example of feed-forward.
Introduction ATTENTION SPANS MULTIPLE STIMULUS DIMENSIONS IN MACAQUE VISUAL CORTEX Jitendra Sharma*, James Schummers, Hiroki Sugihara, Paymon Hosseini.
Neuronal Adaptation to Visual Motion in Area MT of the Macaque -Kohn & Movshon 지각 심리 전공 박정애.
Chapter 3: Neural Processing and Perception. Neural Processing and Perception Neural processing is the interaction of signals in many neurons.
Human vision Jitendra Malik U.C. Berkeley. Visual Areas.
Pattern Classification of Attentional Control States S. G. Robison, D. N. Osherson, K. A. Norman, & J. D. Cohen Dept. of Psychology, Princeton University,
Expertise, Millisecond by Millisecond Tim Curran, University of Colorado Boulder 1.
Object and face recognition
High-Level Vision Object Recognition.
The Human Visual System Background on Vision Human vision – the best system around Deep network models.
The visual system eye thalamus lateral geniculate nucleus (LGN) visual cortex The visual system s.
Maxlab proprietary information – 5/4/09 – Maximilian Riesenhuber CT2WS at Georgetown: Letting your brain be all that.
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Basics of Computational Neuroscience. What is computational neuroscience ? The Interdisciplinary Nature of Computational Neuroscience.
Activity Recognition Journal Club “Neural Mechanisms for the Recognition of Biological Movements” Martin Giese, Tomaso Poggio (Nature Neuroscience Review,
A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition Gustavo Deco Edmund T. Rolls Vision Research, 2004.
1 How the Visual Cortex Recognizes Objects, The Tale of the Standard Model Greg McChesney Texas Tech University Jan 23, 2009 CS5331:
9.012 Presentation by Alex Rakhlin March 16, 2001
Visual Computation and Learning Lab
Article Review Todd Hricik.
Crowding by a single bar
The Visual System: Feature Detection Model
Nicholas J. Priebe, David Ferster  Neuron 
Object Recognition in Biological Vision Systems
Psychophysical and Physiological Evidence for Viewer-centered Object Representations in the Primate N.K. Logothetis and J. Pauls Cerebral Cortex (1995)
Volume 50, Issue 1, Pages (April 2006)
Volume 27, Issue 2, Pages (August 2000)
Volume 27, Issue 1, Pages (January 2017)
Presentation transcript:

Class 21, 1999 CBCl/AI MIT Neuroscience II T. Poggio

Class 21, 1999 CBCl/AI MIT Neuroscience Brain Overview?

Class 21, 1999 CBCl/AI MIT The Ventral Visual Pathway modified from Ungerleider and Haxby, 1994

Class 21, 1999 CBCl/AI MIT

Class 21, 1999 CBCl/AI MIT Visual Areas

Class 21, 1999 CBCl/AI MIT Face-tuned cells in IT

Class 21, 1999 CBCl/AI MIT Model of view-invariant recognition: learning from views VIEW ANGLE  Poggio, Edelman Nature, A graphical rewriting of mathematics of regularization (GRBF), a learning technique

Class 21, 1999 CBCl/AI MIT Learning to Recognize 3D Objects in IT Cortex Logothetis, Pauls, Poggio 1995 Examples of Visual Stimuli After human psychophysics (Buelthoff, Edelman, Tarr, Sinha,…), which supports models based on view-tuned units...physiology!

Class 21, 1999 CBCl/AI MIT Task Description Stimulus Blue Fixspot Response Stimulus Yellow Fixspot Response Left Lever Right Lever T = Target D = Distractor Logothetis, Pauls, Poggio 1995

Class 21, 1999 CBCl/AI MIT Recording Sites in Anterior IT Logothetis, Pauls, and Poggio, 1995; Logothetis, Pauls, 1995

Class 21, 1999 CBCl/AI MIT Model’s predictions: View-tuned Neurons VIEW ANGLE VIEW- TUNED UNITS

Class 21, 1999 CBCl/AI MIT The Cortex: Neurons Tuned to Object Views Logothetis, Pauls, Poggio 1995

Class 21, 1999 CBCl/AI MIT A View Tuned Cell Logothetis, Pauls, Poggio 1995

Class 21, 1999 CBCl/AI MIT Model’s predictions : View- invariant, Object-specific Neurons View Angle VIEW- INVARIANT, OBJECT- SPECIFIC UNIT 

Class 21, 1999 CBCl/AI MIT The Cortex: View-invariant, Object-specific Neurons Logothetis, Pauls, Poggio,1995

Class 21, 1999 CBCl/AI MIT Recognition of Wire Objects

Class 21, 1999 CBCl/AI MIT Generalization Field

Class 21, 1999 CBCl/AI MIT

Class 21, 1999 CBCl/AI MIT

Class 21, 1999 CBCl/AI MIT

Class 21, 1999 CBCl/AI MIT View-dependent Response of an IT Neuron

Class 21, 1999 CBCl/AI MIT Sparse Representations in IT About 400 view tuned cells per object Perhaps 20 view-invariant cells per object In the recording area in AMTS -- a specialized region for paperclips (!) -- we estimate that there are, after training, we estimate that there are, after training, (within an order of magnitude or two) … Logothetis, Pauls, Poggio, 1997

Class 21, 1999 CBCl/AI MIT Previous glimpses: cells tuned to face identity and view Perrett, 1989

Class 21, 1999 CBCl/AI MIT 2. View-tuned IT neurons View-tuned cells in IT Cortex: how do they work? How do they achieve selectivity and invariance? Max Riesenhuber and T. Poggio, Nature Neuroscience, just published

Class 21, 1999 CBCl/AI MIT max Some of our funding is from Honda...

Class 21, 1999 CBCl/AI MIT Model’s View-tuned Neurons VIEW ANGLE VIEW- TUNED UNITS

Class 21, 1999 CBCl/AI MIT Scale-Invariant Responses of an IT Neuron (training on one size only!) Logothetis, Pauls and Poggio, 1995 Scale Invariant Responses of an IT Neuron

Class 21, 1999 CBCl/AI MIT Invariance around training view Invariance while maintaining specificity Spike Rate (Target Response)/ (Mean of Best Distractors) Invariances: Overview Logothetis, Pauls and Poggio, 1995

Class 21, 1999 CBCl/AI MIT Our quantitative model builds upon previous hierarchical models Hubel & Wiesel (1962): Simple to complex to ``higher order hypercomplex cells’’ Fukushima (1980): Alternation of “S” and “C” layers to build up feature specificity and translation invariance, resp. Perrett & Oram (1993): Pooling as general mechanism to achieve invariance

Class 21, 1999 CBCl/AI MIT Model of view tuned cells MAX Riesenhuber and Tommy Poggio, 1999

Class 21, 1999 CBCl/AI MIT Model Diagram “IT” “V4” “V1”... w View-specific learning: synaptic plasticity

Class 21, 1999 CBCl/AI MIT Max (or “softmax”) key mechanism in the model computationally equivalent to selection (and scanning in our object detection system)

Class 21, 1999 CBCl/AI MIT V1: Simple Features, Small Receptive Fields Simple cells respond to bars Hubel & Wiesel, 1959 “Complex Cells”: translation invariance; pool over simple cells of the same orientation (Hubel&Wiesel)

Class 21, 1999 CBCl/AI MIT Two possible Pooling Mechanisms thanks to Pawan Sinha Nn nn

Class 21, 1999 CBCl/AI MIT An Example: Simple to Complex Cells “simple”cells “complex” cell ?

Class 21, 1999 CBCl/AI MIT Simple to Complex: Invariance to Position and Feature Selectivity ? “simple”cells “complex” cell

Class 21, 1999 CBCl/AI MIT 3. Some predictions of the model Scale and translation invariance of view-tuned AIT neurons Response to pseudomirror views Effect of scrambling Multiple objects Robustness to clutter Consistent with K. Tanaka’s simplification procedure More and more complex features from V1 to AIT

Class 21, 1999 CBCl/AI MIT Testing Selectivity and Invariance of Model Neurons Test specificity AND transformation tolerance of view-tuned model neurons Same objects as in Logothetis’ experiment 60 distractors

Class 21, 1999 CBCl/AI MIT Invariances of IT (view-tuned) Model Neuron

Class 21, 1999 CBCl/AI MIT Invariances: Experiment vs. Model (view-tuned cells) *

Class 21, 1999 CBCl/AI MIT MAX vs. Summation

Class 21, 1999 CBCl/AI MIT Response to Pseudo-Mirror Views As in experiment, some model neurons show tuning to pseudo-mirror image

Class 21, 1999 CBCl/AI MIT Robustness to scrambling: model and IT neurons Experiments: Vogels, 1999

Class 21, 1999 CBCl/AI MIT Recognition in Context: Two Objects

Class 21, 1999 CBCl/AI MIT Recognition in Context: some experimental support Sato: Response of IT cells to two stimuli in RF Sato, 1989

Class 21, 1999 CBCl/AI MIT Recognition in Clutter:data How does response of IT neurons change if background is introduced? Missal et al., 1997

Class 21, 1999 CBCl/AI MIT Recognition in Clutter: model average model neuron response recognition rates

Class 21, 1999 CBCl/AI MIT Further Support: Keiji just mentioned his simplification paradigm... Wang et al., 1998

Class 21, 1999 CBCl/AI MIT Consistent behaviour of the model

Class 21, 1999 CBCl/AI MIT Higher complexity and invariances in Higher Areas Kobatake & Tanaka, 1994

Class 21, 1999 CBCl/AI MIT Fujita and Tanaka’s Dictionary of Shapes (about 3000) in posterior IT (columnar organization)

Class 21, 1999 CBCl/AI MIT Similar properties in the model... M. Tarr, Nature Neuroscience

Class 21, 1999 CBCl/AI MIT Layers With Linear Pooling and With Max Pooling Linear pooling: yields more complex features (e.g. from LGN inputs to simple cells and -- perhaps -- from PIT to AIT cells) Max pooling: yields invariant (position, scale) features over a larger receptive field(e.g. from simple to complex V1 cells)

Class 21, 1999 CBCl/AI MIT 4. Hypothetical circuitry for Softmax The max operation is at the core of the model properties Which biophysical mechanisms and circuitry underlies the max operation?

Class 21, 1999 CBCl/AI MIT Softmax circuitry The SOFTMAX operation may arise from cortical microcircuits of lateral inhibition between neurons in a cortical layer. An example:a circuit based on feed forward (or recurrent) shunting presynaptic (or post synaptic) inhibition. Key elements: 1) shunting inhibition 2) nonlinear transformation of the signals (synaptic nonlinearities or active membrane properties). The circuit performs: a gain control operation (as in the canonical microcircuit of Martin and Douglas…) and -- for certain values of the parameters -- a softmax operation:

Class 21, 1999 CBCl/AI MIT Summary Model is consistent with: psychophysics of Buelthoff, Edelman,Tarr Logothetiss’ physiology on view-tuned IT cells One-view invariance ranges in AIT to shift, scale and rotation Response to pseudo-mirror views Sato’s data on 2 stimuli in IT cells Simplification data from Tanaka Clutter experiments in Orban’s lab Scrambling in Orban’s lab Biederman’s psychophysics with different “geons”

Class 21, 1999 CBCl/AI MIT Summary: main points of model Max-like operation, computationally similar to scanning and selecting Hypothetical inhibitory microcircuit for Softmax in cortex Easy grafting of top-down attentional effect on circuitry Segmentation is a byproduct of recognition No binding problem, syncrhonization not needed Model is extension of classical hierarchical H-W scheme Model deals with nice object classes (e.g. faces) and can be extended to object classification (rather then subordinate level recognition). Just a plausibility proof! Experiments wanted (to prove it wrong)!

Class 21, 1999 CBCl/AI MIT Category boundary Prototypes 100% Cat 80% Cat Morphs 60% Cat Morphs 60% Dog Morphs 80% Dog Morphs Prototypes 100% Dog Novel 3D morphing system to create new objects that are linear combinations of 3D prototypes

Class 21, 1999 CBCl/AI MIT Fixation Sample Delay Test (Nonmatch) Delay (Match) Test (Match) 600 ms ms. 500 ms. Object classification task for monkey physiology

Class 21, 1999 CBCl/AI MIT dog100% dog80% dog60% cat60% cat80% cat100% l04.spk 1301 time (msec) spike rate (Hz) Dog activity Cat activity Sample on Delay period Fixation Preliminary results from Prefrontal Cortex Recordings This suggests that prefrontal neurons carry information about the category of objects

Class 21, 1999 CBCl/AI MIT

Class 21, 1999 CBCl/AI MIT Recognition in Context: some experimental support Sato: Response of IT cells to two stimuli in RF Sato, 1989 Summation index for Max is 0 for Sum is 1 Sato finds -0.1 in the average

Class 21, 1999 CBCl/AI MIT Simulation