Visual Cognition August 30, 2007.

Slides:



Advertisements
Similar presentations
Chapter 3: Visual Perception
Advertisements

Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Perception Chapter 4.
Introductory Psychology Concepts
Announcement MIDTERM When: 2/ PM Where: 128 Dennison.
Perception Putting it together. Sensation vs. Perception A somewhat artificial distinction Sensation: Analysis –Extraction of basic perceptual features.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
Object Perception. Perceptual Grouping and Gestalt Laws Law of Good continuation. This is perceived as a square and triangle, not as a combination of.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Part 1. Part 1: Sensory Processing Vision Part 2: Other Sensory Systems.
PY202 Overview. Meta issue How do we internalise the world to enable recognition judgements to be made, visual thinking, and actions to be executed.
Basic Processes in Visual Perception
Cognitive Processes PSY 334 Chapter 2 – Perception.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Visual Cognition September 11, Overview of Visual Cognition Visual system: mission critical Multivariate inputs, unitary experience Multiple types.
1B50 – Percepts and Concepts Daniel J Hulme. Outline Cognitive Vision –Why do we want computers to see? –Why can’t computers see? –Introducing percepts.
Module 6 Perception.
Perception Illusion A false representation of the environment
MIND: The Cognitive Side of Mind and Brain  “… the mind is not the brain, but what the brain does…” (Pinker, 1997)
With respect to STM, grouping several items together to form a single larger item is called: A.BlockingB.Lumping C.ChunkingD.Grouping Electrochemical.
Module 6 Perception.
Perception The process of organizing and interpreting information, enabling us to recognize meaningful objects and events.
VIEWING THE WORLD IN COLOR. COLOR A psychological interpretation Based on wavelength, amplitude, and purity Humans can discriminate among c. 10 million.
Perception Chapter 5.
Perception Is… The process of recognizing, organizing, and interpreting sensory information.
Korea University Dept.of Industrial System & Information Engineering User Interface Lab Chapter 3 _ Object Recognition + 이병용.
Perception.
Perception The process of organizing and interpreting information, enabling us to recognize meaningful objects and events.
Fundamentals of Sensation and Perception RECOGNIZING VISUAL OBJECTS ERIK CHEVRIER NOVEMBER 23, 2015.
Introduction to Psychology Sensation and Perception Prof. Jan Lauwereyns
Perception  How do we define it? How we recognize and interpret stimuli How we recognize and interpret stimuli Top down processing… Top down processing…
Perception. The means by which information acquired from the environment via the sense organs is transformed into experiences of objects, events, sounds,
Sight Our Visual Perception
Thinking About Psychology: The Science of Mind and Behavior
Chapter 3 Perception.
Vision.
Sensation and Perception
Unit 4: Perceptual Organization and Interpretation
KA 2: Perception and Memory
Myers’ PSYCHOLOGY (6th Ed)
Cognitive Processes PSY 334
Perception The process of organizing and interpreting information, enabling us to recognize meaningful objects and events.
Prof. Riyadh Al_Azzawi F.R.C.Psych
Dr.safeyya Adeeb Alchalabi
Perception.
Section 2: The Visual System
THE VISUAL SYSTEM: PERCEPTUAL PROCESSES
Visual Perception Chapter 3.
Visual Perception Principles
Module 6 Perception.
VIEWING THE WORLD IN COLOR
Perceiving and Recognizing Objects
Review Session 3: Sensation and Perception
Neuropsychology of Vision Anthony Cate April 19, 2001
The previous picture “moves” because of tiny muscular movements of your eyes.
Prof. Riyadh Al_Azzawi F.R.C.Psych
How you perceive your surroundings
Perception We have previously examined the sensory processes by which stimuli are encoded. Now we will examine the ultimate purpose of sensory information.
Mind, Brain & Behavior Wednesday February 12, 2003.
Pattern recognition (…and object perception).
Cognitive Processes PSY 334
Unit 4: Perceptual Organization & Interpretation
Cognitive Processes PSY 334
Perception.
Module 6 Perception.
Introduction to Perception: Visual Perception
Prof. Riyadh Al_Azzawi F.R.C.Psych
Introduction to Visual Perception
PERCEPTION Def: the mental process of organizing sensory input into meaningful patterns.
Perception.
Presentation transcript:

Visual Cognition August 30, 2007

Overview of Visual Cognition Visual system: mission critical Multivariate inputs, unitary experience Multiple types of vision means distributed visual network Segregating vs. parsing Visual experience reveals nature of underlying representations Theory >> data

Who’s Coming to Dinner? Gestalt Psychology (interested in basic dimensions of perceptual organization) Cognitive Psychology (interested primarily in effects of cognitive operations on visual experience) Cognitive Neuropsychology (interested in effects of selective lesions of visual cortex on visual behavior) Electrophysiology (interested in single- and multiple-unit activity of cells in visual cortex) Cognitive science (interested in developing overall theories of visual experience and object recognition)

Sensation and Perception Sensation: Conscious outcome of sense organs and projection regions. (“I detect something”, not necessarily conscious, and not necessarily meaningful) Perception: means by which information acquired from the environment via the sense organs is transformed (organized) into conscious experiences of objects, events, sounds, tastes, etc. (“I know, recognize, appreciate what I am sensing, and it means something to me”)

Three Stages of Visual Processing Reception: absorption of physical energy by receptors Transduction: translation of physical energy into electrochemical activity Coding: relationship between aspects of the physical stimulus and resultant nervous system activity

Basic Perceptual Phenomena You Already Know About Size and Shape Constancies: reflect mechanisms whereby perception of a distal object stays the same despite proximal changes in the perception – implies storage of a “structural representation” Depth Cues: monocular or binocular sources of information that convey information about relative distance of objects from viewer – involve cues about the structure, movement, and proximity of objects

2

Object vs. Spatial Vision (Mishkin & Ungerleider) General principle: inferior lesions produce perceptual impairments; superior lesions produce syndromes dominated by spatial impairment

Milner & Goodale: Perception-Action Model Alternative to the Mishkin-Ungerleider model Dorsal: “How do I interact with that object”? Ventral: “What is that object”? Evidence: distinctions between object recognition and visual control of action; Balint’s syndrome; optic ataxia

Perceptual Organization Two complementary issues: Organizing coherent objects and events out of segregated sensory/perceptual inputs (“binding”) “Parsing” the perceptual world; understanding which inputs belong together and which come from separate objects

Perceptual Organization Gestalt Psychology Law of Prägnanz: perceptual system organizes to the simplest and most stable shape possible from the array other laws describe how disparate perceptual elements are grouped although their “laws” are probably incorrect, the elementary concept of perceptual grouping is critical Inferring processes from performance

Examples of some of the Gestalt laws of perceptual organization: (a) the law of proximity; (b) the law of similarity; (c) the law of good continuation; and (d) the law of closure.

Figure–Ground Segregation An ambiguous drawing which can be seen either as two faces or as a goblet

Principle of Good Continuation Infants do not use the principle of good continuation, suggesting that it is a learned phenomenon Schematic depiction of two types of displays: (a) homogeneous displays and (b) heterogeneous displays. From Spelke et al. (1993) with permission from Elsevier.

Top-Down Influences in Grouping and Segmentation Is the “x” on the same or different object? Answering this question takes longer with the display on the right. Vecera & Farah, 1997

The study by Quinlan and Wilton (1998) shows that the Gestaltists de-emphasized the complexities involved when laws of grouping are in conflict. The nature of the psychological processes that underlie the Gestalt principles of grouping by proximity and grouping by similarity is examined. Similarity was defined relative to the principles of grouping by common colour and grouping by common shape. Subjects were presented with displays comprising a row of seven coloured shapes and were asked to rate the degree to which the central target shape grouped with either the right or the left flanking shapes. Across the displays the proximal and featural relationships between the target and flankers were varied. These ratings reflected persuasive effects of grouping by proximity and common colour; there was only weak evidence for grouping by common shape. Nevertheless, both common colour and common shape were shown to override grouping by proximity, under certain conditions. The data also show that to understand how the Gestalt principles operate it appears necessary to consider processes that operate within and between groups of elements that are initially identified on the basis of proximity. Whether such groups survive further analysis depends critically on the featural content of the constituent elements. (a) Display involving a conflict between proximity and similarity; (b) display with a conflict between shape and colour; (c) a different display with a conflict between shape and colour. All adapted from Quinlan and Wilton (1998).

Gestalt Theory in the 21st Century Basic perceptual principles have survived Contentions that such processes are necessarily very “early” in perception have not Proximity computation occurs after depth perception Segmentation is not just “bottom-up” (Vecera &Farah, 1997)

Illusions are fun…. But their real value is that they reveal shed light on what is stored about object structure

Relative Size Texture Gradient

Illustrates how visual illusions may be based on stored information about visual structure

Shape Constancy

Depth Perception Monocular depth cues – cues that provide depth information with one eye closed pictorial cues (e.g., linear perspective) aerial perspective (differing contrast) texture gradients (decreased detail from front to back) interposition shading (production of shadows by 3-D objects) familiar size as anchor motion parallax

Monocular Depth Cues -texture gradients -relative size -interposition Crivelli: The Annunciation

When Worlds Collide: Ingenious use of monocular depth cues to produce paradoxical perceptions M.C. Escher: Waterfall

Ames Room – importance of perceived distance

Ames Room: Importance of perceived distance

Kanizsa (1976) – interposition as cue

Kanizsa (1976)

Shading as a Depth Cue to Concavity or Convexity Where is our major light source?

Depth Perception (cont’d) Binocular Cues to Depth Binocular disparity: different images to each eye as a function of object closeness; basis of stereoscopic vision Binocular convergence: rotation of eyes depending on object closeness; brain receives efferent motor signals from ocular muscles and interprets distance accordingly Motion parallax: differential perception of motion (speed and direction) as a function of distance from perceiver

How are depth cues combined? Additively Preferentially Multiplicatively Most data suggests additivity, though one cue may occasionally overwhelm another in certain circumstances

Color Vision Young-Helmholtz theory: three populations of cones Wide receptivity to wavelenth (continuous, not categorical; i.e., not just RGB detectors) Opponent-Process theory: R-G, B-Y, B-W LGN cells show  activity to some wavelengths and  activity to others Synthesis: Two-stage theory (Trichromatic at the level of neurons; O-P at the level of fibers and pathways)

(“large-bodied”) (“small-bodied”) *

Multiple Visual Areas (Zeki) V1 and V2: responsive to color and form V3 and V3A: form and motion V4: color V5: motion

Color perception/cognition can be selectively impaired Achromatopsia: selective impairment of inferior visual cortex (V4?) Color anomia: a general visual-verbal disconnection (seen with other visual-verbal disturbances) Specific color aphasia: specific inability to name colors

Theories of Perception “Indirect” perception: perception is the result of interactive processes from without and from within – use of internal representations based on “top-down” and “bottom-up” processing “Direct” perception (Gibson): information from the visual world is sufficient to permit perception without the involvement of internal representations - focus in “bottom-up” processing, without the need to posit internal representations

“Direct” View (Gibson) “Optic array” contains all necessary visual information Layout of objects in space given by texture gradients, optic flow patterns, and affordances (implied meaning of objects) Perception involves “picking up” information through “resonance” Has had historical impact in restoring interest in the perceptual environment Has been criticized as being underspecified, and neglects role of knowledge in stimulus exploration

Data in Favor of “Indirect” View Context effects in perception: what is seen depends upon surround Effects of emotion (e.g., weapon focus) Visual illusions: idea is that general knowledge about objects is applied inappropriately to the perception of two-dimensional figures (e.g., Muller-Lyer)

Template Theories of Pattern Recognition Template theories: a miniature copy or template of each known pattern is stored in long-term memory straightforward template theory normalized template theory Problems not adaptable impose large storage requirements

Feature Theories of Pattern Recognition Feature theories: pat-terns consist of a set of specific features or attributes Advantages: elementary features can combine to form multiple objects Problems: context effects in perception recognition can take place when features are occluded

Prototype Theories of Pattern Recognition Individual instances are not stored; what is stored is an “exemplar” or representative element of a category Recognition based on “distance” between perceived item and prototype Nature of computation still relatively unknown

Global Processing Precedence Navon (1977): Is the large letter an H or an S? Are the small letters H’s or S’s? Decision speed for small letters greatly affected by whether large letter was the same or different; decision speed for large lettters unaffected by small letters

Theories of Object Recognition

Object Recognition: Key Questions When does one object end and another begin? (grouping/segmentation) Viewpoint independence (perception of objects as objects, regardless of view) How do we know that dogs are dogs and not cats, and how do we know the meaning of what we’re seeing? (categorization)

More Key Issues The “binding problem”: if perception of a given object is distributed throughout visual areas, how does the brain combine features to produce unitary percept? Conscious vs. unconscious perception: perception of some object or object qualities may proceed pre-consciously or without effort (a general principle)

The Building Blocks of Object Recognition: Marr’s Representations Primal Sketch: 2-D representation of light intensity changes, information about edges, contours, and blobs Raw: pure light intensity changes Full: uses information to identify shapes 2 1/2-D Sketch: depth and orientation of visible surfaces, shading, texture, motion, binocular disparity; observer-centered 3-D Sketch: three dimensional description of objects independent of view

Marr-Hildreth Algorithm Attempts to account for development of primal sketch Idea of “blurred repres-entations” “Zero-crossings” identify edges within a visual image Only ‘reliable’ zero-cross-ings are kept Four types of tokens: edge-segments, bars, terminations, and blobs Intensity change Rate of change (1st D) Zero-crossing (2nd D)

Marr & Nishihara (1978) Development of 3-D sketch based on processing of more elementary shape primitives (basic primitive is a cylinder with a major axis) Hierarchical organization of primitives Concavities important in segmenting parts This is a completely computational, not empirical, model

Biederman’s RBC Theory Objects built from primitives called “geons” (n = 36) Viewer must decide appropriate way a visual object is to be segmented into geons Two key components of decision: locating concavity deciding which edge information remains invariant across different viewing angles (invariant properties like curvature, parallelism, etc.)

Biederman’s Recognition-by-Components Theory Adapted from Biederman (1987)

Recognition-by-Components Theory Biederman (1987): five invariant properties of edges Curvature: points on a curve Parallelism: sets of points in parallel Cotermination: edges terminating at a common point Symmetry: versus asymmetry Collinearity: points sharing a common line

Biederman (1987). Participants were presented with degraded line drawings of objects. Recognition was much harder to achieve when parts of the contour containing concavity information were omitted than when other parts of the contour were deleted. This confirms the assumption that information about concavities is important for object recognition. Figures adapted from Biederman (1987). Biederman’s view is “viewpoint invariant”

Viewpoint-dependent and Viewpoint-invariant Theories Biederman (1987) The ease of object recognition is not affected by the observer’s viewpoint Tarr (1995), Tarr and Bülthoff (1995, 1998) Changes in viewpoint reduce the speed and/or accuracy of object recognition Milner and Goodale (1995) Dorsal pathway makes use of viewpoint-dependent information Ventral pathway makes use of viewpoint-invariant information

Common Elements in Object Recognition Theories Edge coding Grouping or encoding into higher-order features Matching to a stored “structural representation” Access to semantic knowledge

Face Recognition Two general theories: Neural substrate model (module) Specialized processing model Face recognition can be selectively impaired (prosopagnosia), or can it? Global (configurational) vs. local (feature-based) processing

-neural substrate (modules) Two Models: -neural substrate (modules) -specialized processing

V4 (color) FFA (face)

Neural Substrate Example: the “Fusiform Face Area”

Spiridon, Fischl, & Kanwisher, Hum Brain Mapping, 2006

Spiridon, Fischl, & Kanwisher, Hum Brain Mapping, 2006

Examples of “Greebles” Examples of “Greebles”. In the top row, four different “families” are represented. For each family, two members of different “genders” are shown (e.g., Ribu is one gender and Pila is the other). The bottom row shows a new set of Greeble figures constructed on the same logic but asymmetrical in structure. Images provided courtesy of Michael J. Tarr (Brown University, Providence, RI).

Speed of Greeble matching as a function of stage training and difference in orientation between successive Greeble stimuli. Based on data in Gauthier and Tarr (2002). Figure shows that, although speed increases with training, Greeble identification is still viewpoint dependent

Face Recognition Probably involves specialized processing/visual expertise Dedicated neural substrates aren’t the only answer Hybrid model: neural substrates dedicated to configural processing

Disorders of Object Recognition – Visual Agnosia Apperceptive agnosia Object recognition is impaired because of deficits in perceptual processing Associative agnosia Perceptual processes are essentially intact, but object recognition is impaired partly or mainly because of difficulties in accessing relevant knowledge about objects from memory

Riddoch and Humphreys (2001) A hierarchical model of object recognition and naming, specifying different component processes which, when impaired, can produce varieties of apperceptive and associative agnosia.

Structural Descriptions Structural descriptions consist of propositions which “describe the nature of the components of a configuration and make explicit the spatial arrangement of these parts” (Bruce & Green, 1990) Perceptual representation systems for faces, objects, etc. Evidence for separate systems: category-specific recognition defects

Illustrates how visual illusions may be based on stored information about visual structure