Cognitive Processes PSY 334

Slides:



Advertisements
Similar presentations
Chapter 5: Space and Form Form & Pattern Perception: Humans are second to none in processing visual form and pattern information. Our ability to see patterns.
Advertisements

Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
The Perception of Speech. Speech is for rapid communication Speech is composed of units of sound called phonemes –examples of phonemes: /ba/ in bat, /pa/
Language Comprehension Speech Perception Semantic Processing & Naming Deficits.
Cognition, 8e by Margaret W. MatlinChapter 2 Cognition, 8e Chapter 2 Perceptual Processes I: Visual and Auditory Recognition.
Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.
Last week... why object recognition is difficult, the template model the feature recognition model, word recognition as a case study Today... Recognition.
Perception Putting it together. Sensation vs. Perception A somewhat artificial distinction Sensation: Analysis –Extraction of basic perceptual features.
Cognitive Processes PSY 334 Chapter 2 – Perception April 9, 2003.
Multimedia and the World Wide Web
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
1 Pattern Recognition (cont.). 2 Auditory pattern recognition Stimuli for audition is alternating patterns of high and low air pressure called sound waves.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
PSY 369: Psycholinguistics
1 3 Processes of Pattern Recognition Sensation – you have to detect or see the pattern Perception – you have to organize the features into a whole Memory.
Instructor: Vincent Duffy, Ph.D. Associate Professor Lecture 5 – Cognition & Information Processing Tues. Jan. 30, 2007 IE 486 Work Analysis & Design II.
Pattern Recognition Pattern - complex composition of sensory stimuli that the human observer may recognize as being a member of a class of objects Issue.
SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Phonetics, day 2 Oct 3, 2008 Phonetics 1.Experimental a. production b. perception 2. Surveys/Interviews.
Language Comprehension Speech Perception Naming Deficits.
Cognitive Processes PSY 334 Chapter 2 – Perception.
An aside: peripheral drift illusion illusion of motion is strongest when reading text (such as this) while viewing the image in your periphery. Blinking.
Language Comprehension Speech Perception Meaning Representation.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Perception Illusion A false representation of the environment
Cognitive Processes PSY 334 Chapter 2 – Perception.
Speech Perception. Phoneme - a basic unit of a speech sound that distinguishes one word from another Phonemes do not have meaning on their own but they.
Speech Perception 4/6/00 Acoustic-Perceptual Invariance in Speech Perceptual Constancy or Perceptual Invariance: –Perpetual constancy is necessary, however,
Active Vision Key points: Acting to obtain information Eye movements Depth from motion parallax Extracting motion information from a spatio-temporal pattern.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Sensation and Perception - shape.ppt © 2001 Dr. Laura Snodgrass, Ph.D.1 Shape, Pattern, Form What is needed for shape (pattern, form) ? Facts a theory.
Cognitive Processes PSY 334
Information: Perception and Representation Lecture #7 Part A.
Lecture 3 - Race against Time 1 Three points for today Sensory memory (SM) contains highly transient information about the dynamic sensory array. Stabilizing.
Korea University Dept.of Industrial System & Information Engineering User Interface Lab Chapter 3 _ Object Recognition + 이병용.
Sounds and speech perception Productivity of language Speech sounds Speech perception Integration of information.
1 Perceptual Processes Introduction –Pattern Recognition –Top-down Processing & Pattern Recognition –Face Perception Attention –Divided attention –Selective.
Perception. Question of the Day Why is recognizing an object so easy for humans, but so difficult for computers?
Perception.
Perception & Pattern Recognition II
Chapter 3: Sensation and Perception Sensation: activity of receptor organs Perception: interpretation of sensory system activity Visual system organization:
3:01 PM Three points for today Sensory memory (SM) contains highly transient information about the dynamic sensory array. Stabilizing the contents of SM.
Language Perception.
High level vision.
Cognitive - perception.ppt © 2001 Laura Snodgrass, Ph.D.1 Perception The final image we are consciously aware of is a “constructed” representation of the.
How we actively interpret our environment..  Perception: The process in which we understand sensory information.  Illusions are powerful examples of.
Chapter 11 Language. Some Questions to Consider How do we understand individual words, and how are words combined to create sentences? How can we understand.
Adapted from by E.Day THE COGNITIVE APPROACH TYPES OF PROCESSING.
Sight Our Visual Perception
Recognizing Visual and Auditory Stimuli
Theories of Perception
Phonological Priming and Lexical Access in Spoken Word Recognition
Speech Perception Models
Cognitive Processes PSY 334
THE VISUAL SYSTEM: PERCEPTUAL PROCESSES
How do we realize design? What should we consider? Technical Visual Interaction Search Context of use Information Interacting/ transacting.
Perceiving and Recognizing Objects
What is Phonetics? Short answer: The study of speech sounds in all their aspects. Phonetics is about describing speech. (Note: phonetics ¹ phonics) Phonetic.
Sensation and Perception
How you perceive your surroundings
Cognitive Processes PSY 334
Pattern recognition (…and object perception).
Cognitive Processes PSY 334
Perceiving Form – Part I
Cognitive Processes PSY 334
Perception.
Introduction to Perception: Visual Perception
Perception.
Presentation transcript:

Cognitive Processes PSY 334 Chapter 2 – Perception

Visual Illusions Depend on experience. Influenced by culture. Illustrate normal perceptual processes. These are not errors but rather failures of perception in unusual situations.

Visual Pattern Recognition Bottom-up approaches: Template-matching Feature analysis Recognition by components

Template-Matching A retinal image of an object is compared directly to stored patterns (templates). The object is recognized as the template that gives the best match. Used by computers to recognize patterns. Evidence shows human recognition is more flexible than template-matching: Size, place, orientation, shape, blurred or broken (ambiguous or degraded items easily recognized by people.

Feature Analysis Stimuli are combinations of elemental features. Features are recognized and combined. Features are like output of edge detectors. Features are simpler, so problems of orientation, size, etc., can be solved. Relationships among features are specified to define the pattern.

Evidence for Feature Analysis Confusions – people make more errors when letters presented at brief intervals contain similar features: G misclassified: as C (21), as O (6), as B (1), as 9 (1) When a retinal image is held constant, the parts of the object disappear: Whole features disappear. The remaining parts form new patterns.

Object Recognition Biederman’s recognition-by-components: Parts of the larger object are recognized as subobjects. Subobjects are categorized into types of geons – geometric ions. The larger object is recognized as a pattern formed by combining geons. Only edges are needed to recognize geons.

Tests of Biederman’s Theory Object recognition should be mediated by recognition of object components. Two types of degraded figures presented for brief intervals: Components (geons) missing Line segments missing At fast intervals (65-100 ms) subjects could not recognize components when segments were missing.

Speech Recognition The physical speech signal is not broken up into parts that correspond to recognizable units of speech. Undiminished sound energy at word boundaries – gaps are illusory. Cessation of speech energy in the middle of words. Word boundaries cannot be heard in an unfamiliar language.

Phoneme Perception No one-to-one letter-to-sound correspondence. Speech is continuous – phonemes are not discrete (separate) but run together. Speakers vary in how they produce the same phoneme. Coarticulation – phonemes overlap. The sound produced depends on the sound immediately preceding it.

Feature Analysis of Speech Features of phonemes appear to be: Consonantal feature (consonant vs vowel). Voicing – do vocal cords vibrate or not. Place of articulation – where the vocal track is constricted (where is tongue placed). The phoneme heard by listeners changes as you vary these features. Sounds with similar features are confused.

Categorical Perception For speech, perception does not change continuously but abruptly at a category boundary. Categorical perception – failure to perceive gradations among stimuli within a category. Pairs of [b]’s or [p]’s sound alike despite differing in voice-onset times.

Two Views of Categorical Perception Weak view – stimuli are grouped into recognizable categories. Strong view – we cannot discriminate among items within such a category. Massaro – people can discriminate within category but have a bias to same items are the same despite differences. Category boundaries can be shifted by fatiguing the feature detectors.

Top Down Processing General knowledge (context, high-level thinking) combines with interpretation of low-level perceptual units (features). Context limits the possibilities so fewer features must be processed: Word superiority effect – D or K vs WORD or WORK – words do 10% better. To xllxstxatx, I cxn rxplxce xvexy txirx lextex of x sextexce xitx an x, anx yox stxll xan xanxge xo rxad xt wixh sxme xifxicxltx.

Context and Speech Phoneme restoration effect: It was found that the *eel was on the axle. It was found that the *eel was on the shoe. It was found that the *eel was on the orange. It was found that the *eel was on the table. The identification of the missing word depends on what happens after it.

Faces and Scenes When parts are presented in isolation, more feature information is needed to recognize them. Face parts are recognized with less detail when in the context of a face. Subjects are better able to identify objects when they are part of coherent novel scenes rather than jumbled scenes.

Models of Object Perception Two competing models explain how context and feature information are combined: Massaro’s FLMP (fuzzy logic model of perception) -- Context and detail are two independent sources of information. McClelland & Rumelhart’s PDP model – connectionist model in which both sources of information interact.

Testing the FLMP Model Four kinds of stimuli: Only an e can make a real word. Only a c can make a real word. Both letters can make a word. Neither letter can make a word. Within each group, stimuli go from e to c. Subjects saw each stimulus word briefly and had to identify the letter, e or c.

FLMP Results Observed frequencies for naming a letter e increase as it has more e features, but also as the context demands an e. Baye’s theorem gives a formula for combining the independent contributions of two sources of information. Massaro’s results conform to predictions of Baye’s theorem, suggesting that the information sources must be independent of each other.

Testing the PDP Model Activation spreads from features to excite letters and from letters to excite words (bottom up processing). Activation also spreads from words to the component letters (top-down processing). The more activation, the more likely the correct letter will be identified: TRAP vs TRIP

Comparing the Two Models Subjects heard a phoneme that varied from r to an l in two contexts: A syllable beginning with t – tr or tl. A syllable beginning with s – sl or sr. Both the FLMP and PDP models were compared to actual subject data. FLMP was close to what subjects did. PDP was too strongly affected by context.

PDP Model Describes More The PDP model suggests that information is not separately processed but each letter affects each other letter. Recognition of “a” in MAVE is almost as good as recognizing it in MADE. This occurs because MAVE is similar to many other words with an A in that position. We do not have a context but four letters that each influence the others.

Marr Depth cues (texture gradient, stereopsis) – where are edges in space? How are visual cues combined to form an image with depth? Primal sketch – extracts features. 2-1/2 D sketch – identifies where visual features are in relation to observer (depth). 3-D model – refers to the representation of the objects in a scene, combines context.

Putting it All Together The output of these stages (see Fig 2.31) is a representation of an object and its location. This output is used as input to higher-level cognitive processes. Conscious awareness (a higher-level process) involves the recognition stage, but lots of processing occurs first.