The Appeal of Parallel Distributed Processing J.L. McClelland, D.E. Rumelhart, and G.E. Hinton 인지과학 협동과정 강소영.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California USA
Advertisements

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California USA
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona A Cognitive Architecture for Integrated.
ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Knowledge Representation
PDP: Motivation, basic approach. Cognitive psychology or “How the Mind Works”
Topics in Cognition and Language: Theory, Data and Models *Perceptual scene analysis: extraction of meaning events, causality, intentionality, Theory of.
Perception Putting it together. Sensation vs. Perception A somewhat artificial distinction Sensation: Analysis –Extraction of basic perceptual features.
Language, Mind, and Brain by Ewa Dabrowska Chapter 10: The cognitive enterprise.
What is Cognitive Science? … is the interdisciplinary study of mind and intelligence, embracing philosophy, psychology, artificial intelligence, neuroscience,
Introduction to Cognitive Science Sept 2005 :: Lecture #1 :: Joe Lau :: Philosophy HKU.
Un Supervised Learning & Self Organizing Maps Learning From Examples
Introduction to Cognitive Science Lecture #1 : INTRODUCTION Joe Lau Philosophy HKU.
Knowledge Representation and Organization
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Chapter Seven The Network Approach: Mind as a Web.
Chapter Four The Cognitive Approach I: History, Vision, and Attention.
COGN1001 Introduction to Cognitive Science Sept 2006 :: Lecture #1 :: Joe Lau :: Philosophy HKU.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Physical Symbol System Hypothesis
The Cognitive Approach I: History, Vision, and Attention
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Processing and Constraint Satisfaction: Psychological Implications The Interactive-Activation (IA) Model of Word Recognition Psychology /719 January.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Lecture 1 Introduction: Linguistic Theory and Theories
PSY 369: Psycholinguistics Language Production & Comprehension: Conversation & Dialog.
Information Processing Approach Define cognition and differentiate among the stage, levels-of-processing, parallel distributed processing, and connectionist.
General Knowledge Dr. Claudia J. Stanny EXP 4507 Memory & Cognition Spring 2009.
Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach Jay McClelland Department of Psychology and Center for.
Artificial Intelligence CIS 479/579 Bruce R. Maxim UM-Dearborn.
James L. McClelland Stanford University
SLA Seminar, NSYSU 11/17/2006 Ch. 9 Cognitive accounts of SLA OUTLINE Cognitive theory of language acquisition Models of cognitive accounts Implicit vs.
The changing face of face research Vicki Bruce School of Psychology Newcastle University.
Perceptual Processes: Visual & Auditory Recognition Dr. Claudia J. Stanny EXP 4507 Memory & Cognition Spring 2009.
The PDP Approach to Understanding the Mind and Brain Jay McClelland Stanford University January 21, 2014.
The Interactive Activation Model. Ubiquity of the Constraint Satisfaction Problem In sentence processing –I saw the grand canyon flying to New York –I.
Awakening from the Cartesian Dream: The PDP Approach to Understanding the Mind and Brain Jay McClelland Stanford University February 7, 2013.
Cognition & Language Claudia Stanny PSY What is Cognition? Processes of knowing  Attending  Remembering  Reasoning Content of these processes.
Illustrations and Answers for TDT4252 exam, June
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Developing and Evaluating Theories of Behavior.
© 2008 The McGraw-Hill Companies, Inc. Chapter 8: Cognition and Language.
Perception, Thought and Language as Graded Constraint Satisfaction Processes Jay McClelland SymSys 100 April 12, 2011.
PSY 369: Psycholinguistics Cognitive Psychology Day 2.
The Past Tense Model Psych /719 Feb 13, 2001.
PSY 369: Psycholinguistics Conversation & Dialog: Language Production and Comprehension in conjoined action.
Similarity and Attribution Contrasting Approaches To Semantic Knowledge Representation and Inference Jay McClelland Stanford University.
Cognitive Processes Chapter 8. Studying CognitionLanguage UseVisual CognitionProblem Solving and ReasoningJudgment and Decision MakingRecapping Main Points.
Rapid integration of new schema- consistent information in the Complementary Learning Systems Theory Jay McClelland, Stanford University.
PSY270 Michaela Porubanova. Language  a system of communication using sounds or symbols that enables us to express our feelings, thoughts, ideas, and.
PDP Class Stanford University Jan 4, 2010
Copyright © 2010, Pearson Education Inc., All rights reserved.  Prepared by Katherine E. L. Norris, Ed.D.  West Chester University of Pennsylvania This.
Semantic Cognition: A Parallel Distributed Processing Approach James L. McClelland Center for the Neural Basis of Cognition and Departments of Psychology.
Cognitive Processes PSY 334 Chapter 5 – Meaning-Based Knowledge Representation.
Neural Network Basics Anns are analytical systems that address problems whose solutions have not been explicitly formulated Structure in which multiple.
Knowledge Representation
The PDP Approach to Understanding the Mind and Brain Jay McClelland Stanford University January 21, 2014.
The Emergent Structure of Semantic Knowledge
Cognitive Architectures and General Intelligent Systems Pay Langley 2006 Presentation : Suwang Jang.
Minds and Computers Discovering the nature of intelligence by studying intelligence in all its forms: human and machine Artificial intelligence (A.I.)
NEURONAL NETWORKS AND CONNECTIONIST (PDP) MODELS Thorndike’s “Law of Effect” (1920’s) –Reward strengthens connections for operant response Hebb’s “reverberatory.
Information Processing
What is cognitive psychology?
James L. McClelland SS 100, May 31, 2011
Today Review: “Knowing a Language” Complete chapter 1
Simple learning in connectionist networks
Pattern recognition (…and object perception).
Simple learning in connectionist networks
The Network Approach: Mind as a Web
Presentation transcript:

The Appeal of Parallel Distributed Processing J.L. McClelland, D.E. Rumelhart, and G.E. Hinton 인지과학 협동과정 강소영

Contents 1. Introduction 2. Parallel Distributed Processing 3. Examples of PDP Models 4. Representation and Learning In PDP Models 5. Origins of Parallel Distributed Processing

1. Introduction Multiple Simultaneous Constraints  Reaching and Grasping  The Mutual Influence of Syntax and Semantics  Simultaneous Mutual Constraints in Word Recognition  Understanding the Interplay of Multiple Sources of Knowledge

The Mutual Influence of Syntax and Semantics Syntactic constraint  The boy the man chased kissed the girl Semantic constraint  I saw the grand canyon flying to New York  I saw the sheep grazing in the field Mutual constraint within each of these domains  I like the joke  I like the drive  I like to joke  I like to drive

Simultaneous Mutual Constraints in Word Recognition Selfridge’s example Paradox:  How can we get the process started? Solution:  our perceptual system is capable of exploring all these possibilities without committing itself to one until all of the constraints are taken into account

Understanding the Interplay of Multiple Sources of Knowledge Knowledge Structure  scripts (Schank 1976)  frames (Minsky 1975)  schemata (Norman and Bobrow 1976; Rumelhart 1975) Most everyday situations cannot be rigidly assigned to just a single script  Interplay between a number of different sources of information  ex) birthday party at a restaurant The generative capacity of human understanding in novel situations --> interact with each other

2. Parallel Distributed Processing Properties of the tasks that people are good at.  A number of different pieces of information must be kept in mind at once.  Each plays a part, constraining others and being constrained by them Assumption of PDP model:  interactions of a large number of simple processing elements  each sending excitatory and inhibitory signals to other units. Elements of model  unit, activation, interaction among units

PDP Models: Cognitive Science or Neuroscience The appeal of PDP :  Computationally sufficient and psychologically accurate mechanistic accounts of the phenomena of human cognition PDP models have radically altered the way we think about  the time course of processing  the nature of representation  the mechanisms of learning

Microstructure of Cognition Parallel Distributed Model offer alternatives to serial models of the microstructure of cognition.  They do not deny that there is a macrostructure Objects referred to in macrostructural models of cognitive processing are seen as approximate descriptions of emergent properties of the microstructure

3. Examples of PDP Models Recent application of PDP  Motor control, perception, memory, language PDP mechanisms are used to provide natural accounts of the exploitation of multiple, simultaneous, mutual constraint

3.1 Motor Control Hinton’s stick person Two constraints on the task  the tip of the forearm must touch the object  center of gravity over the foot Each processor receives two information  how far the tip of the hand was from the target  where the center of gravity was with respect to the foot Combination of joint angles

3.2 Perception Stereoscopic Vision  Random Dot Stereogram --> Depth Perception Perceptual Completion of Familiar Patterns Completion of Novel Patterns

Marr and Poggio (1976)  explain the perception of depth in random-dot stereograms Two general principles about the visual world Stereoscopic Vision

Perceptual Completion of Familiar Patterns Perception is influenced by familiarity  Less time  ambiguous lower level information  to fill in missing lower-level information  phonemic restoration effect visual perception of words (McClelland and Rumelhart 1981) Assumption of model  detectors for the visual features

Two hypotheses or activation  mutually consistant  support each other  mutually inconsistant  weaken each other two kinds of inconsistency  between-level inconsistency   between-level inhibition  mutual exclusion   competitive inhibition

Completion of Novel Patterns Result of word perception model  exhibits perceptual facilitation to pronounceable nonwords as well as words  general principles or rules can emerge from the interactions of simple processing elements.  does not implement exactly any of the systems of orthographic rules that have been proposed by linguists or psychologists PDP models may provide more accurate accounts of the details of human performance than models based on a set of rules representing human competence

3.3 Retrieving Information From Memory Content Addressability Graceful Degradation Default Assignment Spontaneous Generalization

Jets and Sharks Model

4. Representation and Learning In PDP Models What is the stored knowledge that gives rise to that pattern of activation? The difference between PDP models and other models of cognitive processes  others: knowledge is stored as a static copy of a pattern  PDP:  the patterns themselves are not stored  what is stored is the connection strengths between units that allow these patterns to be re-created

Local Versus Distributed Representation Distributed Representation  The knowledge about any individual pattern is not stored in the connections of a special unit reserved for that pattern, but is distributed over the connections among a large number of processing units.  Units are conceptual primitives  Units have no particular meaning as individuals Pattern Associator --> Hebbian Rule

Attractive Properties of Pattern Associator Models Uncorrelated patterns do not interact with each other, but more similar ones do if we present the same pair of patterns over and over, but each time we add a little random noise to each element of each member of the pair, the system will automatically learn to associate the central tendency of the two patterns and will learn to ignore the noise What will be stored will be an average of the similar patterns with the slight variations removed.

Extracting the Structure of an Ensemble of Patterns Distributed Model  if there are regularities in the correspondences between pairs of patterns, the model will naturally extract these regularities. Language Learning Model - learning past tense  creation of regular past tenses of new verbs  overregularization of the irregular verbs  same phenomena as what is shown in children’s past tense acquisition  we can see how the acquisition of performance that conforms to linguistic rules can emerge from a simple, local, connection strength modulation process

5. Origins of Parallel Distributed Processing  Jackson(1869/1958) and Luria(1966)  distributed, multilevel conceptions of processing systems  dynamic functional system  Hebb(1949) and Lashley(1950)  “there are no special cells reserved for special memories”  Rosenblatt(1959, 1962) and Selfridge(1955)  perceptron  Pandemonium : importance of interactive processing  Anderson, Grossberg, Longuet-Higgins (60’s, 70’s)  concept learning  competitive learning mechanism  distributed memory models

 Marr and Poggio(1976)  Morton’s logogen model(1969)  one of the first models to capture concretely the principle of interaction of different sources of information  Marseln-Wilson(1978)  empirical demonstrations of interaction between different levels of language processing  Levin’s Proteus model(1976)  virtues of activetion-competition model  Feldman and Ballard(1982)  Hofstadter(1979, 1985)  Sutton and Barto(1981) --> delta rule  Hopfield(1982)