Cognitive Modeling February 5, 2010. Today’s Class Cognitive Modeling Assignment #3 Probing Questions Surveys.

Slides:



Advertisements
Similar presentations
Bayesian Knowledge Tracing and Discovery with Models
Advertisements

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 23, 2012.
Bayesian Knowledge Tracing Prediction Models
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 March 12, 2012.
Gallup Q12 Definitions Notes to Managers
May Ranch Science Fair 2014.
Educational data mining overview & Introduction to Exploratory Data Analysis Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction.
Knowledge Inference: Advanced BKT Week 4 Video 5.
SAL: A Hybrid Cognitive Architecture Y. Vinokurov 1, C. Lebiere 1, D. Wyatte 2, S. Herd 2, R. O’Reilly 2 1. Carnegie Mellon University, 2. University of.
Evaluating Requirements. Outline Brief Review Stakeholder Review Requirements Analysis Summary Activity 1.
Collaborative Warrior Tutoring Tom Livak Neil Heffernan 8/24/06.
Discovery with Models Week 8 Video 1. Discovery with Models: The Big Idea  A model of a phenomenon is developed  Via  Prediction  Clustering  Knowledge.
Ryan S.J.d. Baker Adam B. Goldstein Neil T. Heffernan Detecting the Moment of Learning.
Best-First Search: Agendas
Evaluation Metrics February 12, A break in the usual order of things… Today’s Probing Question will be discussed later in the class rather than.
Introduction  Bayesian methods are becoming very important in the cognitive sciences  Bayesian statistics is a framework for doing inference, in a principled.
Planning for Inquiry The Learning Cycle. What do I want the students to know and understand? Take a few minutes to observe the system to be studied. What.
CEP Welcome September 1, Matthew J. Koehler September 1, 2005CEP Cognition and Technology Who’s Who?  Team up with someone you don’t.
CSC 212 – Data Structures Prof. Matthew Hertz WTC 207D /
Models of Human Performance Dr. Chris Baber. 2 Objectives Introduce theory-based models for predicting human performance Introduce competence-based models.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 5): Outliers Fall, 2008.
Educational Data Mining Overview John Stamper PSLC Summer School /25/2011 1PSLC Summer School 2011.
Research Methods for HCI: Cognitive Modelling BCS HCI Tutorial 1 st September, 2008.
Recovering the Struggling Reader Debra K. Nicholson Hillcrest Elementary Morristown, Tennessee.
Test Preparation Strategies
Educational Data Mining Ryan S.J.d. Baker PSLC/HCII Carnegie Mellon University Richard Scheines Professor of Statistics, Machine Learning, and Human-Computer.
Effective Questioning in the classroom
Reflective practice Session 4 – Working together.
Chapter 5 Models and theories 1. Cognitive modeling If we can build a model of how a user works, then we can predict how s/he will interact with the interface.
Abstraction IS 101Y/CMSC 101 Computational Thinking and Design Tuesday, September 17, 2013 Carolyn Seaman University of Maryland, Baltimore County.
Modeling Driver Behavior in a Cognitive Architecture
SLB /04/07 Thinking and Communicating “The Spiritual Life is Thinking!” (R.B. Thieme, Jr.)
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 2, 2012.
C R E S S T / U C L A 1 Productive Collaborative Problem Solving: Noreen Webb and Ann Mastergeorge Responsibilities of Teachers and Students.
Rational/Theoretical Cognitive Task Analysis Ken Koedinger Key reading: Zhu, X., & Simon, H. A. (1987). Learning mathematics from examples and by doing.
Cognitive Architectures: A Way Forward for the Psychology of Programming Michael Hansen Ph.D Student in Comp/Cog Sci CREST / Percepts & Concepts Lab Indiana.
Overview of Cognitive Science for Teachers
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 January 28, 2013.
Office of School Improvement Differentiated Webinar Series Formative Assessment – Feedback February 28,2012 Dr. Dorothea Shannon, Thomasyne Beverly, Dr.
Advanced BKT February 11, Classic BKT Not learned Two Learning Parameters p(L 0 )Probability the skill is already known before the first opportunity.
C++ Basics C++ is a high-level, general purpose, object-oriented programming language.
Korea Univ. Division Information Management Engineering UI Lab. Korea Univ. Division Information Management Engineering UI Lab. IMS 802 Cognitive Modeling.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 4, 2013.
OSSLT Test Preparation
1 Cognitive Modeling GOMS, Keystroke Model Getting some details right!
Evaluating Requirements
Development of Expertise. Expertise We are very good (perhaps even expert) at many things: - driving - reading - writing - talking What are some other.
THE HOMEWORK... WHAT’S THE POINT?. GOT TO DO HOMEWORK? Because it’s assigned by the teacher, and will be corrected/graded Because quizzes/exams are kind.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
RULES Patty Nordstrom Hien Nguyen. "Cognitive Skills are Realized by Production Rules"
A TEACHER NEW AT MAPPING ASKS STUDENTS TO MAP HERE IS THE ASSIGNMENT AND SOME MAPS.
Marking and Feedback CPD
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 25, 2012.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
A TEACHER NEW AT MAPPING ASKS STUDENTS TO MAP HERE IS THE ASSIGNMENT AND SOME MAPS.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 6, 2012.
© 2015 albert-learning.com How to talk to your boss How to talk to your boss!!
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Cognitive Modeling Cogs 4961, Cogs 6967 Psyc 4510 CSCI 4960 Mike Schoelles
Avalon Science and Engineering Fair 2015 Let’s Get Started Science and Engineering Fair packets will go home this week. All 2 nd, 3 rd, 4 th and 5 th.
Core Methods in Educational Data Mining
Special Topics in Educational Data Mining
Detecting the Learning Value of Items In a Randomized Problem Set
Big Data, Education, and Society
A TEACHER NEW AT MAPPING ASKS STUDENTS TO MAP
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Advanced Tips and Tricks
Presentation transcript:

Cognitive Modeling February 5, 2010

Today’s Class Cognitive Modeling Assignment #3 Probing Questions Surveys

Cognitive Modeling “Cognitive modeling… deals with simulating human problem solving and mental task processes in a computerized model. Such a model can be used to simulate or predict human behavior or performance on tasks similar to the ones modeled.” (whatis.com)

Several Different Types of Cognitive Models Today, I will discuss three prominent types that are useful in the learning sciences A different set are dominant in HCI – CPM-GOMS, KLM-GOMS, ACT-SIMPLE

Cognitive Models Prominent in Learning Sciences Production-rule systems ACT-R Constraint-based models

Production-System Models Often similar in nature to early versions of ACT-R – ACT*, ACT-R 2.0

Production-System Models As seen in Koedinger & Terao (2002) Represent performance (and therefore skill) as a set of if-then rules (“productions”) – Can be written in plain English

Let’s say we want to create A cognitive model of the process of creating a scatterplot of data (cf. Baker, Corbett, & Koedinger, 2001, 2002; Baker, Corbett, Koedinger, & Schneider, 2004; Baker, Corbett, & Koedinger, 2007) A subject formerly near and dear to my heart!

Task

Draw a scatterplot of this fake data CityPopulation (in 1000) Number of Brazilian Restaurants Worcester1554 Fitchburg650 Boston6506 Providence1500 Springfield701 Manchester1302 Hartford2204 New Haven1200 New Bedford553 Arapiraca, Brazil14080

Protocol Trace OK. I like talking. Alright, so.. I’m not entirely sure I understand the problem, but we’re gonna go for it I’m gonna because I’m terrible at spelling.. alright.. So I know we’re going from 0 to 80 I kinda wanna ignore 80 but alright… 3…4…5…6….8 that symbol there is my way of saying a long time later. This time we’re going from 65 to wow.. ok that’s great Ok what’s a good scale for that [counting by 50’s to 600] Eh well. 650 Alright, um. Now let’s start plotting them… oh crap I’m writing out the thing so I have some vague idea of what I’m doing At 4 and 155 I place the first point At 0 and 65 I place the 2nd point 6 and 650 I do the next one Um I hope that doesn….

Protocol Trace next one… Let’s see uhhh Alright the next one A lot of things goin on at 0 exciting And then what…. Oh…. Yeah yeah then So um and that seems to be the problem but just for fun I’m gonna try and do a best fit line Well that’s not fair Ok I’m done.

Simulated Performance #1 (VCE) Of same task… Based on research in (Baker, Corbett, & Koedinger, 2001)

Simulated Performance #2 (NE) Of same task… Based on research in (Baker, Corbett, & Koedinger, 2002)

Simulated Performance #3 (DH) Of same task… Based on research in (Baker, Corbett, Koedinger, & Schneider, 2004)

Let’s create some production rules What are the key subgoals in this task? – (i.e. major steps)

Let’s create some production rules Split into groups for each key subgoal

Let’s create some production rules Write some production rules for your subgoal, covering both correct behavior and key errors

Put productions up on the screen For each rule – Is this a good rule? – Is it over-generalized or under-generalized?

Will this process work? Let’s put everything together and simulate it! I need a volunteer to execute the model – Try for fully correct performance that only uses these rules and no other rules

Questions? Comments?

Applicability What kinds of phenomena could production- rule models handle well? What kinds of phenomena would production- rule models handle poorly?

Applicability What kinds of phenomena could production- rule models handle well? What kinds of phenomena would production- rule models handle poorly? – Creativity and discovery (there have been attempts to do this, by Simon and Schunn, but there has been debate as to whether the resultant models have face validity)

Applicability What kinds of phenomena could production- rule models handle well? What kinds of phenomena would production- rule models handle poorly? – Analogy (handled in ACT-R by other processes – see Salvucci & Anderson, 2006)

Applicability What kinds of phenomena could production- rule models handle well? What kinds of phenomena would production- rule models handle poorly? – Strengthening of memory (handled in ACT-R by other processes – more in a few minutes)

Another use of Production-Rule Models Model Tracing (Corbett & Anderson, 1995) A production-rule model of correct and incorrect behavior is created As a student solves problems, the model is used to interpret whether the student’s behavior is correct or incorrect – This information is used to give feedback, and for knowledge tracing, which traces the probability the student knows a given skill (Corbett & Anderson, 1995) – I will discuss this in more detail, either at the end of today’s class, or on March 3 rd (depending on time)

Model-Tracing: Example Let’s go back through our four examples of attempts to create a scatterplot At each action, tell me what production rule fired – Correct production: CORRECT – Incorrect production: BUG – No production: WRONG

Questions? Comments?

Cognitive Models Prominent in Learning Sciences Production-rule systems ACT-R Constraint-based models

ACT-R

The Hunt for a Unified Theory of Cognition Alan Newell Student of Herb Simon MHP, KLM-GOMS, SOAR CMU Psychology John Anderson Hired by Herb Simon ACT, ACT-R 2, 5, 6, 7 CMU Psychology vs

The basic idea A cognitive modeling architecture is a framework for developing models of human {behavior, learning}. The architecture forces you to make your model plausible based on what we know about humans.

Cognitive Modeling Architectures SOAR was competitive until a decade ago (now it is only used by a small number of researchers) ACT-R is the dominant framework and has been for a while GOMS is heavily used in HCI

ACT-R “Adaptive Character of Thought” “Atomic Components of Thought” “Anderson’s Cool Theory”

ACT-R “Adaptive Character of Thought” “Atomic Components of Thought” “Anderson’s Cool Theory” I will be discussing ACT-R 5 (ACT-R 6 has moved towards focusing on neural architecture, and has been far less used in education research)

ACT-R’s strengths Accurate and predictive models of human performance at complex tasks. Models the cognitive processes that lead to behavior: – Decision-Making – Problem-Solving – Analogy (more with ACT, ACT*, ACT-R 2) – Memory Retrieval and Strengthening – Learning to be an Expert

The ACT-R Architecture The human mind is modeled by a set of systems. Each individual system is serial Multiple systems can be running at once Visual Perception Auditory Perception Motor Skills Productions Declarative Memory

Performance Interaction between production rules and chunks of declarative memory – Each chunk can have sub-chunks Like – Each chunk has a certain strength of activation, which predicts speed and accuracy of recall (as discussed in the Pavlik et al article)

Performance Interaction between production rules and chunks of declarative memory – Each production also has a strength of activation – When productions reach a certain strength, they become “compiled” with neighboring productions into “automatized behavior”

Automatized Behavior Those of you who have keyboards, type “kaleidoscope”

Automatized Behavior Those of you who have keyboards, type “kaleidoscope” Now, close your eyes

Automatized Behavior Those of you who have keyboards, type “kaleidoscope” Now, close your eyes Where’s the letter “k” on the keyboard?

Questions? Comments?

Behavior Governed by a set of literally dozens of complex equations

Example: Memory Activation Equation (Pavlik et al, 2008)

Uses in Education ACT-R 2 underlies Cognitive Tutors (significant divergence since then), with model tracing – Essentially, what we discussed a few minutes ago Tailor student order of practice to what we know about memory (Pavlik et al, 2008)

Questions? Comments?

Cognitive Models Prominent in Learning Sciences For the learning sciences, the most prominent types have been – Production-rule systems – ACT-R – Constraint-based models

Constraint-based models Model performance in a very different fashion With a list of conditions that must be met

For creating scatterplots… What are some conditions that must be met for a scatterplot to be correct?

One example… The space between all axis labels must be equal

One example… The space between all axis labels must be equal (What do you think, Matt? Is this an appropriate constraint?)

Now your turn… Let’s list some constraints…

Now let’s test the approach I will create some scatterplots, and you tell me if my scatterplot is right (violates no constraints) or wrong (violates 1+ constraints), and which constraint is violated

Now let’s test the approach Should we add or change any constraints?

Constraint-based modeling Stellan Ohlsson argues that CBM is a more accurate model of human cognition than ACT- R – Some excellent points, in particular in terms of how people recognize that they’ve made an error, but the field generally has stayed with production- rule or neural network models

Constraint-Based Tutoring Constraint-based modeling underpins the second most highly used tutoring system in the world, SQL-Tutor (Mitrovic, Martin, & Mayo, 2003)

Constraint-Based Tutoring Has been argued to be more effective for ill- defined domains, where student problem- solving may take a huge number of paths, but an incorrect solution can be recognized (Weerasinghe & Mitrovic, 2006)

Questions? Comments?

Cognitive Modeling Any high-level thoughts or observations?

Cognitive Modeling Any high-level thoughts or observations? What would Jean Lave say about Cognitive Modeling?

Where does cognitive modeling fit on this diagram? ENTITATIVE HOLISTIC ESSENTIALISTEXISTENTIALIST

Bayesian Knowledge-Tracing (Is there time?) (If not, we will cover this on March 3 rd )

Goal: For each knowledge component (KC), infer the student’s knowledge state from performance. Suppose a student has six opportunities to apply a KC and makes the following sequence of correct (1) and incorrect (0) responses, according to model tracing. Has the student has learned the rule? Bayesian Knowledge Tracing

Model Learning Assumptions Two-state learning model – Each skill is either learned or unlearned In problem-solving, the student can learn a skill at each opportunity to apply the skill A student does not forget a skill, once he or she knows it Only one skill per action

Model Performance Assumptions If the student knows a skill, there is still some chance the student will slip and make a mistake. If the student does not know a skill, there is still some chance the student will guess correctly.

Corbett and Anderson’s Model Not learned Two Learning Parameters p(L 0 )Probability the skill is already known before the first opportunity to use the skill in problem solving. p(T)Probability the skill will be learned at each opportunity to use the skill. Two Performance Parameters p(G)Probability the student will guess correctly if the skill is not known. p(S)Probability the student will slip (make a mistake) if the skill is known. Learned p(T) correct p(G)1-p(S) p(L 0 )

Bayesian Knowledge Tracing Whenever the student has an opportunity to use a skill, the probability that the student knows the skill is updated using formulas derived from Bayes’ Theorem.

Formulas

Knowledge Tracing How do we know if a knowledge tracing model is any good? Our primary goal is to predict knowledge

Knowledge Tracing How do we know if a knowledge tracing model is any good? Our primary goal is to predict knowledge But knowledge is a latent trait

Knowledge Tracing How do we know if a knowledge tracing model is any good? Our primary goal is to predict knowledge But knowledge is a latent trait But we can check those knowledge predictions by checking how well the model predicts performance

Fitting a Knowledge-Tracing Model In principle, any set of four parameters can be used by knowledge-tracing But parameters that predict student performance better are preferred

Knowledge Tracing So, we pick the knowledge tracing parameters that best predict performance Defined as whether a student’s action will be correct or wrong at a given time

Recent Extensions Recently, there has been work towards contextualizing the guess and slip parameters (Baker, Corbett, & Aleven, 2008a, 2008b) Do we really think the chance that an incorrect response was a slip is equal when – Student has never gotten action right; spends 78 seconds thinking; answers; gets it wrong – Student has gotten action right 3 times in a row; spends 1.2 seconds thinking; answers; gets it wrong

Recent Extensions In this work, P(G) and P(S) are determined by a model that looks at time, previous history, the type of action, etc. Significantly improves predictive power of method – Probability of distinguishing right from wrong within the tutor increases from around 66% to around 71% – Worse performance when it comes to predicting the post- test, so still more work needed, but a different use of contextual slip lead to significantly better post-test performance (Baker, Corbett, et al, under review)

Recent Extensions Extending Bayesian Knowledge Tracing with additional parameters – Splitting P(T|H), P(T|~H) to study the impact of help on learning (Beck et al, 2008 ITS best paper)

Uses Within educational data mining, there are several things you can do with these models – We’ll talk about this more on March 3rd

Today’s Class Cognitive Modeling Assignment #3 Probing Question Surveys

Assignment #3 Any questions?

Today’s Class Cognitive Modeling Assignment #3 Probing Question Surveys

Probing Question for Friday, February 12 Should state/national/international assessments of learning (like the MCAS) have Preparation for Future Learning items? Why or why not?

Today’s Class Cognitive Modeling Assignment #3 Probing Question Surveys