Physiology of Vision: a swift overview Learning-Based Methods in Vision A. Efros, CMU, Spring 2012 Some figures from Steve Palmer
Tale of Martians with an old PC
Understanding the Brain Anatomy versus Physiology Anatomy: The biological study of the physical structure of organisms. Physiology: The biological study of the functional structure of organisms. © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
The Brain is tricky business Aristotle thought it’s for cooling the blood Localized or distributed?
Phrenology One of the first application of statistics This is how bad Machine Learning got started…
Localized or Distributed? Evidence from patients with partial brain damage Lots of useful data from soldiers in the Russo-Japanese war But also evidence for distributed nature of processing E.g. [Lashley] showed “graceful degradation” of memory performance in rats
The Visual System Both eye and brain are required for functional vision Two kinds of blindness: Normal blindness (eye dysfunction) Cortical blindness (brain dysfunction) © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
The Big Picture The Visual System Eyes register optical information Pathways to occipital cortex Two pathways from V1 “What” pathway to temporal cortex “Where” pathway to parietal cortex Convergence on frontal cortex © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
Pathways to the Brain Anatomy of Pathway to Visual Cortex © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
Pathways to the Brain The Lateral Geniculate Nucleus (LGN) Waystation in Thalamus Projections from both eyes Six layers Projects to cortical area V1 © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
Visual Cortex Map of Visual Areas in Cortex “Unfolded” view of visual areas in the macaque cortex (sizes not to scale). © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
Visual Cortex What/Where Pathways Evidence from lesions of monkey cortex © Stephen E. Palmer, 2002
Evidence from Neuropsychology Visual Cortex What/Where Pathways Evidence from Neuropsychology Visual agnosia: Inability to identify objects and/or people Caused by damage to inferior (lower) temporal lobe Disruption of the “what” pathway Visual neglect: Inability to see objects in the left visual field Caused by damage to right parietal lobe Disruption of the “where” pathway © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
Feature-based Pathways Hypothesis Visual Cortex Feature-based Pathways Hypothesis Visual Features Color Shape Depth Motion Featural Pathways Separate neural pathways in which different features are processed. © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
The Gross Summary The visual system is composed of many interactive functional parts: Eye (optics of image formation) Retina (light transduction) LGN (waystation?) Area V1 (hypercolumns) Higher cortical areas (features) Cortical pathways (what/where) © Stephen E. Palmer, 2002 © Stephen E. Palmer, 2002
Image Formation Digital Camera Film The Eye
Monocular Visual Field: 160 deg (w) X 135 deg (h) Binocular Visual Field: 200 deg (w) X 135 deg (h)
What do we see? 3D world 2D image Figures © Stephen E. Palmer, 2002
What do we see? 3D world 2D image Painted backdrop
The Plenoptic Function Figure by Leonard McMillan Q: What is the set of all things that we can ever see? A: The Plenoptic Function (Adelson & Bergen) Let’s start with a stationary person and try to parameterize everything that he can see…
Grayscale snapshot P(q,f) is intensity of light Seen from a single view point At a single time Averaged over the wavelengths of the visible spectrum (can also do P(x,y), but spherical coordinate are nicer)
Color snapshot P(q,f,l) is intensity of light Seen from a single view point At a single time As a function of wavelength
Spherical Panorama All light rays through a point form a ponorama See also: 2003 New Years Eve http://www.panoramas.dk/fullscreen3/f1.html All light rays through a point form a ponorama Totally captured in a 2D array -- P(q,f) Where is the geometry???
A movie P(q,f,l,t) is intensity of light Seen from a single view point Over time As a function of wavelength
Space-time images t y x
Holographic movie P(q,f,l,t,VX,VY,VZ) is intensity of light Seen from ANY viewpoint Over time As a function of wavelength
The Plenoptic Function P(q,f,l,t,VX,VY,VZ) Can reconstruct every possible view, at every moment, from every position, at every wavelength Contains every photograph, every movie, everything that anyone has ever seen! it completely captures our visual reality! Not bad for a function…
The Eye is a camera The human eye is a camera! Iris - colored annulus with radial muscles Pupil - the hole (aperture) whose size is controlled by the iris What’s the “film”? photoreceptor cells (rods and cones) in the retina
The Retina
Retina up-close Light
Two types of light-sensitive receptors Cones cone-shaped less sensitive operate in high light color vision Rods rod-shaped highly sensitive operate at night gray-scale vision © Stephen E. Palmer, 2002
Rod / Cone sensitivity The famous sock-matching problem…
Distribution of Rods and Cones Night Sky: why are there more stars off-center? © Stephen E. Palmer, 2002
Electromagnetic Spectrum At least 3 spectral bands required (e.g. R,G,B) Human Luminance Sensitivity Function http://www.yorku.ca/eye/photopik.htm
…because that’s where the Visible Light Why do we see light of these wavelengths? …because that’s where the Sun radiates EM energy © Stephen E. Palmer, 2002
The Physics of Light Any patch of light can be completely described physically by its spectrum: the number of photons (per time unit) at each wavelength 400 - 700 nm. © Stephen E. Palmer, 2002
The Physics of Light Some examples of the spectra of light sources © Stephen E. Palmer, 2002
The Physics of Light Some examples of the reflectance spectra of surfaces Red 400 700 Yellow 400 700 Blue 400 700 Purple 400 700 % Photons Reflected Wavelength (nm) © Stephen E. Palmer, 2002
The Psychophysical Correspondence There is no simple functional description for the perceived color of all lights under all viewing conditions, but …... A helpful constraint: Consider only physical spectra with normal distributions mean area variance © Stephen E. Palmer, 2002
The Psychophysical Correspondence Mean Hue # Photons Wavelength © Stephen E. Palmer, 2002
The Psychophysical Correspondence Variance Saturation Wavelength # Photons © Stephen E. Palmer, 2002
The Psychophysical Correspondence Area Brightness # Photons Wavelength © Stephen E. Palmer, 2002
Physiology of Color Vision Three kinds of cones: Why are M and L cones so close? © Stephen E. Palmer, 2002
Retinal Processing © Stephen E. Palmer, 2002
Single Cell Recording Microelectrode Electrical response Amplifier Electrical response (action potentials) mV © Stephen E. Palmer, 2002
Single Cell Recording © Stephen E. Palmer, 2002
Retinal Receptive Fields Receptive field structure in ganglion cells: On-center Off-surround Stimulus condition Electrical response © Stephen E. Palmer, 2002
Retinal Receptive Fields Receptive field structure in ganglion cells: On-center Off-surround Stimulus condition Electrical response © Stephen E. Palmer, 2002
Retinal Receptive Fields Receptive field structure in ganglion cells: On-center Off-surround Stimulus condition Electrical response © Stephen E. Palmer, 2002
Retinal Receptive Fields Receptive field structure in ganglion cells: On-center Off-surround Stimulus condition Electrical response © Stephen E. Palmer, 2002
Retinal Receptive Fields Receptive field structure in ganglion cells: On-center Off-surround Stimulus condition Electrical response © Stephen E. Palmer, 2002
Retinal Receptive Fields Receptive field structure in ganglion cells: On-center Off-surround Stimulus condition Electrical response © Stephen E. Palmer, 2002
Retinal Receptive Fields RF of On-center Off-surround cells © Stephen E. Palmer, 2002
Retinal Receptive Fields RF of Off-center On-surround cells Surround Center © Stephen E. Palmer, 2002
Retinal Receptive Fields
Retinal Receptive Fields Receptive field structure in bipolar cells Light © Stephen E. Palmer, 2002
Retinal Receptive Fields Receptive field structure in bipolar cells © Stephen E. Palmer, 2002
Visual Cortex Cortical Area V1 aka: Primary visual cortex Striate cortex Brodman’s area 17 © Stephen E. Palmer, 2002
Cortical Receptive Fields Single-cell recording from visual cortex David Hubel & Thorston Wiesel © Stephen E. Palmer, 2002
Cortical Receptive Fields Single-cell recording from visual cortex http://www.youtube.com/watch?v=IOHayh06LJ4 © Stephen E. Palmer, 2002
Cortical Receptive Fields Three classes of cells in V1 Simple cells Complex cells Hypercomplex cells © Stephen E. Palmer, 2002
Cortical Receptive Fields Simple Cells: “Line Detectors” http://www.youtube.com/watch?v=8VdFf3egwfg © Stephen E. Palmer, 2002
Cortical Receptive Fields Simple Cells: “Edge Detectors” © Stephen E. Palmer, 2002
Cortical Receptive Fields Constructing a line detector © Stephen E. Palmer, 2002
Cortical Receptive Fields Complex Cells 0o © Stephen E. Palmer, 2002
Cortical Receptive Fields Complex Cells 60o © Stephen E. Palmer, 2002
Cortical Receptive Fields Complex Cells 90o © Stephen E. Palmer, 2002
Cortical Receptive Fields Complex Cells 120o © Stephen E. Palmer, 2002
Cortical Receptive Fields Constructing a Complex Cell © Stephen E. Palmer, 2002
Cortical Receptive Fields Hypercomplex Cells © Stephen E. Palmer, 2002
Cortical Receptive Fields Hypercomplex Cells © Stephen E. Palmer, 2002
Cortical Receptive Fields Hypercomplex Cells © Stephen E. Palmer, 2002
Cortical Receptive Fields Hypercomplex Cells “End-stopped” Cells © Stephen E. Palmer, 2002
Cortical Receptive Fields “End-stopped” Simple Cells © Stephen E. Palmer, 2002
Cortical Receptive Fields Constructing a Hypercomplex Cell © Stephen E. Palmer, 2002
Mapping from Retina to V1
Why edges?
Why edges?