Download presentation
Presentation is loading. Please wait.
1
Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 1 Vision in Humans and Machines September 10, 2009
2
Visible light is just a part of the electromagnetic spectrum September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 2
3
Cross Section of the Human Eye September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 3
4
Anatomy of the Visual System The Eyes Cornea: Transparent outer covering of the eye that admits light Pupil: Adjustable opening in the iris that regulates the amount of light that enters the eye Iris: Pigmented ring of muscles situated behind the cornea September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 4
5
Anatomy of the Visual System Photoreceptors Retina: The neural tissue and photoreceptive cells located on the inner surface of the posterior portion of the eye. Rod: Photoreceptor cells of the retina, sensitive to light of low intensity. Cone: Photoreceptor cells of the retina; maximally sensitive to one of three different wavelengths of light and hence encodes color vision.
6
6
7
7 Anatomy of the Visual System The Eyes Lens: Consists of a series of transparent, onion-like layers. Its shape can be changed by contraction of ciliary muscles. Accommodation: Changes in the thickness of the lens, accomplished by the ciliary muscles, that focus images of near or distant objects on the retina
8
8
9
9 Anatomy of the Visual System The Eyes Fovea: Area of retina that mediates the most acute vision. Contains only color-sensitive cones. Optic Disk: Location on retina where fibers of ganglion cells exit the eye. Responsible for the blind spot.
10
10 Coding of Visual Information in the Retina Coding of Light and Dark Receptive field: That portion of the visual field in which the presentation of visual stimuli will produce an alteration in the firing rate of a particular neuron.
11
11 PhotoreceptorBipolarGanglion
12
12 Major cell types of the retina
13
13 Receptive fields
14
14 Color Mixing
15
15 Coding of Visual Information in the Retina Photoreceptors: Trichromatic Coding Peak wavelength sensitivities of the three cones: Blue cone:Short-Blue-violet (420 nm) Green cone:Medium-Green (530 nm) Red Cone:Long-Yellow-green (560nm)
16
16
17
17 Coding of Visual Information in the Retina Retinal Ganglion Cells: Opponent-Process Coding Negative afterimage: The image seen after a portion of the retina is exposed to an intense visual stimulus; consists of colors complimentary to those of the physical stimulus. Complimentary colors: Colors that make white or gray when mixed together.
18
18
19
19 Analysis of Visual Information Anatomy of the Striate cortex David Hubel and Torsten Wiesel 1960’s at Harvard University Discovered that neurons in the visual cortex did not simply respond to light; they selectively responded to specific features of the visual world.
20
20
21
21
22
22 Stimuli in receptive field of neuron
23
23 Cat V1 (striate cortex) Orientation preference map Ocular dominance map
24
24
25
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 25 “Data Flow Diagram” of Visual Areas in Macaque Brain Blue: motion perception pathway Green: object recognition pathway
26
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 26 Computer Vision A typical computer vision applications are complex and consist of different levels of processing, from the low-level pixel-by-pixel analysis to the high-level creation of scene descriptions. Generally, computer vision systems consist of an image processing stage, followed by a scene analysis stage. The following slide outlines the structure of a computer vision system.
27
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 27 Computer Vision A simple two-stage model of computer vision: Image processing Scene analysis Bitmap image Scene description feedback (tuning) Prepare image for scene analysis Build an iconic model of the world
28
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 28 Computer Vision The image processing stage prepares the input image for the subsequent scene analysis. Usually, image processing results in one or more new images that contain specific information on relevant features of the input image. The information in the output images is arranged in the same way as in the input image. For example, in the upper left corner in the output images we find information about the upper left corner in the input image.
29
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 29 Computer Vision The scene analysis stage interprets the results from the image processing stage. Its output completely depends on the problem that the computer vision system is supposed to solve. For example, it could be the number of bacteria in a microscopic image, or the identity of a person whose retinal scan was input to the system.
30
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 30 Digitizing Visual Scenes With regard to spatial resolution, we will map the intensity in our image onto a two-dimensional finite array: [0, 0][0, 1][0, 2][0, 3] [1, 0][1, 1][1, 2][1, 3] [2, 0][2, 1][2, 2][2, 3] y’ x’
31
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 31Thresholding Here, the right image is created from the left image by thresholding, assuming that object pixels are darker than background pixels. As you can see, the result is slightly imperfect (dark background pixels).
32
September 4, 2007Computer Vision Lecture 1: Digital Images/Binary Image Processing 32 Geometric Properties
33
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 33 Geometric Properties We could teach our program what the objects look like at different sizes and orientations, and let the program search all possible positions in the input. However, that would be a very inefficient and inflexible approach. Instead, it is much simpler and more efficient to standardize the input before performing object recognition. We can scale the input object to a given size, center it in the image, and rotate it towards a specific orientation.
34
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 34 Noise Reduction Here, a size filter perfectly removes all noise in the input image.
35
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 35 Noise Reduction However, if our threshold is too high, “accidents” may happen.
36
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 36 Edge Detection Calculating the magnitude of the brightness gradient with a Sobel filter. Left: original image; right: filtered image.
37
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 37Texture
38
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 38Texture Texture is an important cue for biological vision systems to estimate the boundaries of objects. Also, texture gradient is used to estimate the orientation of surfaces. For example, on a perfect lawn the grass texture is the same everywhere. However, the further away we look, the finer this texture becomes – this change is called texture gradient. For the same reasons, texture is also a useful feature for computer vision systems.
39
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 39 Texture Gradient
40
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 40Texture The most fundamental question is: How can we “measure” texture, i.e., how can we quantitatively distinguish between different textures? Of course it is not enough to look at the intensity of individual pixels. Since the repetitive local arrangement of intensity determines the texture, we have to analyze neighborhoods of pixels to measure texture properties.
41
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 41 Stereo Vision Geometry of binocular stereo vision
42
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 42 Statistical Pattern Recognition
43
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 43 Object Recognition This algorithm learns to recognize 25 different chairs: It is shown each chair from 25 different viewing angles.
44
September 10, 2009Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 44 The Algorithm
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.