Emergence in Cognitive Science: Semantic Cognition Jay McClelland Stanford University.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California USA
Advertisements

Cognitive Systems, ICANN panel, Q1 What is machine intelligence, as beyond pattern matching, classification and prediction. What is machine intelligence,
Summer 2011 Tuesday, 8/ No supposition seems to me more natural than that there is no process in the brain correlated with associating or with.
Naïve-Bayes Classifiers Business Intelligence for Managers.
Cognitive - knowledge.ppt © 2001 Laura Snodgrass, Ph.D.1 Knowledge Structure of semantic memory –relationships among concepts –organization of memory –memory.
Combining Inductive and Analytical Learning Ch 12. in Machine Learning Tom M. Mitchell 고려대학교 자연어처리 연구실 한 경 수
Probabilistic inference in human semantic memory Mark Steyvers, Tomas L. Griffiths, and Simon Dennis 소프트컴퓨팅연구실오근현 TRENDS in Cognitive Sciences vol. 10,
PDP: Motivation, basic approach. Cognitive psychology or “How the Mind Works”
Does the Brain Use Symbols or Distributed Representations? James L. McClelland Department of Psychology and Center for Mind, Brain, and Computation Stanford.
Emergence of Semantic Structure from Experience Jay McClelland Stanford University.
Language, Mind, and Brain by Ewa Dabrowska Chapter 10: The cognitive enterprise.
Knowing Semantic memory.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Categorization  How do we organize our knowledge?  How do we retrieve knowledge when we need it?
Cognitive Psychology, 2 nd Ed. Chapter 8 Semantic Memory.
Chapter 5 Data mining : A Closer Look.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
General Knowledge Dr. Claudia J. Stanny EXP 4507 Memory & Cognition Spring 2009.
Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach Jay McClelland Department of Psychology and Center for.
Dynamics of learning: A case study Jay McClelland Stanford University.
Using Backprop to Understand Apects of Cognitive Development PDP Class Feb 8, 2010.
Representation, Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology.
Emergence of Semantic Structure from Experience Jay McClelland Stanford University.
TEA Science Workshop #3 October 1, 2012 Kim Lott Utah State University.
The changing face of face research Vicki Bruce School of Psychology Newcastle University.
Conceptual Hierarchies Arise from the Dynamics of Learning and Processing: Insights from a Flat Attractor Network Christopher M. O’ConnorKen McRaeGeorge.
Integrating New Findings into the Complementary Learning Systems Theory of Memory Jay McClelland, Stanford University.
The PDP Approach to Understanding the Mind and Brain J. McClelland Cognitive Core Class Lecture March 7, 2011.
The PDP Approach to Understanding the Mind and Brain Jay McClelland Stanford University January 21, 2014.
Disintegration of Conceptual Knowledge In Semantic Dementia James L. McClelland Department of Psychology and Center for Mind, Brain, and Computation Stanford.
Contrasting Approaches To Semantic Knowledge Representation and Inference Psychology 209 February 15, 2013.
Shane T. Mueller, Ph.D. Indiana University Klein Associates/ARA Rich Shiffrin Indiana University and Memory, Attention & Perception Lab REM-II: A model.
Emergence of Semantic Knowledge from Experience Jay McClelland Stanford University.
PSY 323 – COGNITION Chapter 9: Knowledge.  Categorization ◦ Process by which things are placed into groups  Concept ◦ Mental groupings of similar objects,
The Influence of Feature Type, Feature Structure and Psycholinguistic Parameters on the Naming Performance of Semantic Dementia and Alzheimer’s Patients.
Development, Disintegration, and Neural Basis of Semantic Cognition: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
Emergence of Semantic Structure from Experience Jay McClelland Stanford University.
Similarity and Attribution Contrasting Approaches To Semantic Knowledge Representation and Inference Jay McClelland Stanford University.
Rapid integration of new schema- consistent information in the Complementary Learning Systems Theory Jay McClelland, Stanford University.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Semantic Cognition: A Parallel Distributed Processing Approach James L. McClelland Center for the Neural Basis of Cognition and Departments of Psychology.
Cognitive Processes PSY 334 Chapter 5 – Meaning-Based Knowledge Representation.
Cognitive Science and Biomedical Informatics Department of Computer Sciences ALMAAREFA COLLEGES.
1 How is knowledge stored? Human knowledge comes in 2 varieties: Concepts Concepts Relations among concepts Relations among concepts So any theory of how.
The Origins of Knowledge Debate How do people gain knowledge about the world around them? Are we born with some fundamental knowledge about concepts like.
Origins of Cognitive Abilities Jay McClelland Stanford University.
The Emergent Structure of Semantic Knowledge
Verbal Representation of Knowledge
Emergent Semantics: Meaning and Metaphor Jay McClelland Department of Psychology and Center for Mind, Brain, and Computation Stanford University.
From NARS to a Thinking Machine Pei Wang Temple University.
Semantic Knowledge: Its Nature, its Development, and its Neural Basis James L. McClelland Department of Psychology and Center for Mind, Brain, and Computation.
Organization and Emergence of Semantic Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology and Center for.
Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology and Center.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Chapter 9 Knowledge. Some Questions to Consider Why is it difficult to decide if a particular object belongs to a particular category, such as “chair,”
Big data classification using neural network
Psychology 209 – Winter 2017 January 31, 2017
What is cognitive psychology?
Machine Learning for Computer Security
Psychology 209 – Winter 2017 Feb 28, 2017
Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology and Center.
Ontology From Wikipedia, the free encyclopedia
James L. McClelland SS 100, May 31, 2011
Does the Brain Use Symbols or Distributed Representations?
Emergence of Semantic Structure from Experience
Emergence of Semantics from Experience
CLS, Rapid Schema Consistent Learning, and Similarity-weighted Interleaved learning Psychology 209 Feb 26, 2019.
Presentation transcript:

Emergence in Cognitive Science: Semantic Cognition Jay McClelland Stanford University

Approaches to Understanding Intelligence Symbolic approaches – explicit symbolic structures – structure-sensitive rules – discrete computations Emergence-based approaches – Symbolic structures and processes are seen as approximate characterizations of the emergent consequences of continuous and distributed processing and representation

Some Constructs in Cognitive Science: Can They All Be Viewed as Emergents? Putative Represented Entities –Concepts, categories –Rules of language and inference –Entries in the mental lexicon –Syntactic and semantic representations of linguistic inputs Architectural Constructs –Working memory –Attention –The central executive Cognitive Processes and Their Outcomes –Choices –Decisions –Inferences –Beliefs Developmental Constructs –Stages –Sensitive periods –Developmental anomalies Cognitive Neuroscience –Modules for Face recognition Episodic memory …

Two Views of Semantic Knowledge Structure and Learning Structured Probabilistic Models –Knowledge is assigned to a mental structure of one or another specific type. –Learning is the process of selecting the best structure type, then selecting a specific instance of the type. Learning-Based PDP Models –Semantic knowledge arises as a consequence of a gradual learning process. –The structure that emerges is naturally shaped by the structure present in experience. –It may correspond (approximately) to a specific structure type, but it need not.

Emergent vs. Stipulated Structure Old Boston Midtown Manhattan

What Happens to Natural Structure in a Structured Statistical Model?

Plan for the Rest of the Talk A PDP model of semantic cognition capturing – Emergence of semantic knowledge from experience – Disintegration of semantic knowledge in neurodegenerative illness – Comparison to a structured statistical model The theory of the model – What is it learning, and why? Initial steps in a new direction – Capturing shared structure across domains

The Rumelhart Model The Quillian Model

1.Show how learning could capture the emergence of hierarchical structure 2.Show how the model could make inferences as in a symbolic model Rumelhart’s Goals for the Model

ExperienceExperience Early Later Later Still

Start with a neutral representation on the representation units. Use backprop to adjust the representation to minimize the error.

The result is a representation similar to that of the average bird…

Use the representation to infer what this new thing can do.

Phenomena in Development Progressive differentiation Overgeneralization of – Typical properties – Frequent names Emergent domain-specificity of representation Basic level advantage Expertise and frequency effects Conceptual reorganization

Disintegration in Semantic Dementia Loss of differentiation Overgeneralization

Architecture for the Organization of Semantic Memory color form motion action valance Temporal pole name Medial Temporal Lobe

Disintegration in Semantic Dementia Loss of differentiation Overgeneralization

Neural Networks and Probabilistic Models The model learns to match the conditional probability structure of the training data: P(Attribute i = 1|Item j & Context k ) for all i,j,k It does so subject to strong constraints imposed by the initial connection weights and the architecture. – These constraints produce progressive differentiation, overgeneralization, etc. Depending on the structure in the training data, it can behave as though it is learning a specific type of structured representation: – Hierarchy – Linear Ordering – Two-dimensional similarity space…

The Hierarchical Naïve Bayes Classifier Model (with R. Grosse and J. Glick) Items are organized in categories Categories may contain sub- categories Features are probabilistic and conditionally independent, given the category. We start with a one-category model, and learn p(f|C) for each feature. We differentiate as evidence builds up violating conditional independence. Sub-categories inherit feature probabilities of the parents, then learn feature probabilities from there. Living Things … Animals Plants Birds Fish Flowers Trees

PropertyOne-Class ModelPlant class in two-class model Animal class in two-class model Can Grow1.0 Is Living1.0 Has Roots Has Leaves Has Branches Has Bark Has Petals Has Gills Has Scales Can Swim Can Fly Has Feathers Has Legs Has Skin Can See A One-Class and a Two-Class Naïve Bayes Classifier Model

Accounting for the network’s feature attributions with mixtures of classes at different levels of granularity Regression Beta Weight Epochs of Training Property attribution model: P(f i |item) =  k p(f i |c k ) + (1-  k )[(  j p(f i |c j ) + (1-  j )[…])

Should we replace the PDP model with the Naïve Bayes Classifier? It explains a lot of the data, and offers a succinct abstract characterization But –It only characterizes what’s learned when the data actually has hierarchical structure So it may be a useful approximate characterization in some cases, but can’t really replace the real thing.

An exploration of these ideas in the domain of mammals What is the best representation of domain content? How do people make inferences about different kinds of object properties?

Structure Extracted by a Structured Statistical Model

Predictions Both similarity ratings and patterns of inference will violate the hierarchical structure Experiments by Jeremy Glick confirm these predictions

Applying the Rumelhart model to the mammals dataset Items Contexts Behaviors Body Parts Foods Locations Movement Visual Properties

Simulation Results The model progressively extracts more and more detailed aspects of the structure as training progresses

Network Output

Simulation Results The model progressively extracts more and more detailed aspects of the structure as training progresses Can we characterize mathematically what the model has learned?

Progressive PCs

Network Output

Learning the Structure in the Training Data Progressive sensitization to successive principal components captures learning of the mammals dataset. This subsumes the naïve Bayes classifier as a special case. So are PDP networks – and our brain’s neural networks – simply performing PCA?  No!

Extracting Cross- Domain Structure

v

Input Similarity Learned Similarity

A Newer Direction Exploiting knowledge sharing across domains –Lakoff: Abstract cognition is grounded in concrete reality –Boroditsky: Cognition about time is grounded in our conceptions of space –My Take: In general cognition in any domain borrows on knowledge in other domains. This happens automatically in a neural network that processes both domains with the same connections. Initial modeling work by Paul Thibodeau supports this view.

Thanks!

In Rogers and McClelland (2004) we also address: – Conceptual differentiation in prelinguistic infants. – Many of the phenomena addressed by classic work on semantic knowledge from the 1970’s: Basic level Typicality Frequency Expertise – Disintegration of conceptual knowledge in semantic dementia – How the model can be extended to capture causal properties of objects and explanations. – What properties a network must have to be sensitive to coherent covariation.