Bayesian Hierarchical Models of Individual Differences in Skill Acquisition. Dr Jeromy Anglim, Deakin University, 22nd May 2015.



Functional form of the learning curve
Researchers have long been interested in the functional form of the learning curve
– Power law of practice (Newell and Rosenbloom, 1981; Snoddy, 1926)
– Evidence for exponential function at individual level (Heathcote, Brown, & Mewhort, 2001)
Early example: 1024 choice-reaction time task
– Data from Seibel (1963); shown in Delaney et al. (1998)
[Figure panels: Task; Results]
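The two candidate curves can be compared side by side in a few lines of Python. This is a minimal sketch, not the models fitted in the talk: it assumes a common three-parameter form (amount of learning, rate of learning, asymptote), and the function names and all numeric values are hypothetical.

```python
import math

def power_curve(block, amount, rate, asymptote):
    """Power function: completion time falls as a power of the block number."""
    return amount * block ** (-rate) + asymptote

def exponential_curve(block, amount, rate, asymptote):
    """Exponential function: the gap above the asymptote shrinks by a
    constant proportion from each block to the next."""
    return amount * math.exp(-rate * (block - 1)) + asymptote

# Hypothetical parameter values, chosen only for illustration.
blocks = range(1, 16)
power = [power_curve(b, 40.0, 0.8, 20.0) for b in blocks]
expo = [exponential_curve(b, 40.0, 0.3, 20.0) for b in blocks]

for b, p, e in zip(blocks, power, expo):
    print(f"block {b:2d}  power {p:6.2f}  exponential {e:6.2f}")
```

Both curves start at the same value in block 1 and approach the same asymptote, so any difference between them lies in how quickly the early gains taper off.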

Relating subtask to overall task learning
– Issue of how to integrate basic findings from cognitive psychology with learning on more complex tasks
– Lee and Anderson (2001) proposed the reducibility hypothesis, suggesting that learning a complex task could be understood as the culmination of learning many component subtasks
– They also proposed that subtask learning will be consistent across subtasks and follow the power law of practice

Lee & Anderson (2001)
[Figure panels: KA Air-Traffic Controller Task; Task Analysis; Overall Task Performance; Subtask Performance]
Source: Lee, F. J., & Anderson, J. R. (2001). Does learning a complex task have to be complex?: A study in learning decomposition. Cognitive Psychology, 42(3),

Gaps / Issues
Gaps
– Reliance on group-level analysis
– Need to refine definitions and tests of subtask learning consistency
– Lack of incorporation of trial-level strategy use data
Approach
– Need for a task that facilitates measurement of strategy use and subtask performance
– A Bayesian hierarchical approach offers benefits over piecewise individual-level analysis

Wynton-Anglim Booking (WAB) Task 1. Information Gathering (I) 2. Filtering (F) 3. Timetabling (T)

Bayesian Hierarchical Models
Increased interest in application of Bayesian methods in psychology
Benefits of Bayesian approach
– Clear and direct inference
– Flexible model specification
– Range of sophisticated model comparison tools (e.g., DIC, posterior predictive checks)
– Well-suited to modelling repeated measures psychological data (i.e., observations nested within people)
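The phrase "observations nested within people" can be made concrete by simulating data with that structure. The sketch below only illustrates the nesting and is not the model from the talk. The exponential curve, the lognormal person-level distributions and every numeric value are assumptions; the defaults of 25 people and 15 blocks simply mirror the design described in the Method slide.

```python
import math
import random

def simulate_hierarchical(n_people=25, n_blocks=15, seed=0):
    """Simulate block completion times with observations nested within people."""
    rng = random.Random(seed)
    data = {}
    for person in range(n_people):
        # Person level: each person's curve parameters are drawn from
        # group-level distributions (hypothetical lognormal distributions).
        amount = math.exp(rng.gauss(math.log(40.0), 0.3))
        rate = math.exp(rng.gauss(math.log(0.3), 0.3))
        asymptote = math.exp(rng.gauss(math.log(20.0), 0.2))
        # Observation level: that person's curve plus block-to-block noise.
        data[person] = [
            amount * math.exp(-rate * (block - 1)) + asymptote + rng.gauss(0.0, 2.0)
            for block in range(1, n_blocks + 1)
        ]
    return data

sim = simulate_hierarchical()
first = sum(times[0] for times in sim.values()) / len(sim)
last = sum(times[-1] for times in sim.values()) / len(sim)
print(f"mean time in block 1: {first:.1f}, in block 15: {last:.1f}")
```

Fitting a hierarchical model runs this logic in reverse: person-level parameters and the group-level distributions they come from are estimated jointly from the observed times.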

Models of Overall Performance

Models of Subtask Performance

Aims
1. Assess support for power and exponential functions on overall and subtask performance
2. Assess degree of consistency in subtask learning
3. Estimate effect of strategy use on subtask performance
4. Assess degree to which strategy use could explain inconsistency

Method
Participants
– 25 adults (68% female)
Procedure
– Read WAB Task instructions
– Complete as many trials as possible in 50 minutes
Processing
– Extract strategy use, subtask performance and overall task performance
– Trial performance was aggregated into average block performance (15 blocks with approximately equal numbers of trials)
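One way to form 15 blocks with approximately equal numbers of trials is sketched below. The slide does not say how the blocks were actually built, so the contiguous-split rule, the function name `block_means` and the example data are all assumptions.

```python
from statistics import mean

def block_means(trial_times, n_blocks=15):
    """Split trials, in order, into n_blocks contiguous blocks whose sizes
    differ by at most one, and return the mean time of each block."""
    n = len(trial_times)
    if n < n_blocks:
        raise ValueError("need at least one trial per block")
    # Block k covers trials [k * n // n_blocks, (k + 1) * n // n_blocks).
    bounds = [k * n // n_blocks for k in range(n_blocks + 1)]
    return [mean(trial_times[lo:hi]) for lo, hi in zip(bounds, bounds[1:])]

# Hypothetical participant who completed 47 trials in the session.
times = [60.0 - 0.5 * t for t in range(47)]
means = block_means(times)
print(len(means), means[0])
```

Because participants complete different numbers of trials in 50 minutes, block sizes differ between people, but every participant ends up with the same number of data points.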

Data analytic approach
Bayesian hierarchical models were estimated with MCMC methods in JAGS, with supporting analyses performed in R
Model comparison
– Graphs overlaying model fits and data
– Deviance Information Criterion (DIC)
– Posterior predictive checks

1. Overall performance Does a power or exponential model provide a better model of the effect of practice on overall task performance?

Overall performance (group-level)

Overall task completion time by block (individual-level)

Overall performance: Parameter estimates and model comparison (DIC)
Interpretation
– Power has larger deviance but smaller penalty and smaller DIC
– Differences are small
DIC = Mean Deviance + Penalty
Rules of thumb for DIC difference
– 10+: rule out model with larger DIC
– 5-10: model with smaller DIC is better
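The DIC formula and the rules of thumb on this slide translate directly into code. This is a minimal sketch: the function names and the input values are hypothetical, and the wording for differences below 5 is an assumption, since the slide gives no rule for that range.

```python
def dic(mean_deviance, penalty):
    """DIC = mean deviance + penalty, as defined on the slide."""
    return mean_deviance + penalty

def interpret_dic_difference(dic_a, dic_b):
    """Apply the slide's rules of thumb to the absolute DIC difference."""
    diff = abs(dic_a - dic_b)
    if diff >= 10:
        return "rule out the model with the larger DIC"
    if diff >= 5:
        return "the model with the smaller DIC is better"
    # Assumption: the slide states no rule for differences below 5.
    return "difference too small to separate the models"

# Hypothetical values, for illustration only.
model_a = dic(mean_deviance=1000.0, penalty=50.0)
model_b = dic(mean_deviance=995.0, penalty=62.0)
print(model_a, model_b, interpret_dic_difference(model_a, model_b))
```

The hypothetical example reproduces the pattern noted in the interpretation: a model can have the larger mean deviance and still have the smaller DIC if its penalty is smaller.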

2. Subtask performance Does a power or exponential model provide a better model of the effect of practice on subtask performance and what is the effect of constraining subtask learning curve parameters?

Subtask performance (group-level)

Subtask performance (individual-level)

Subtask performance: Parameter estimates
Subtask abbreviations
– I = Information Gathering
– F = Filtering
– T = Timetabling
Parameters
– 1: Amount of learning
– 2: Rate of learning
– 3: Asymptotic performance

Subtask performance: Model comparison (DIC)
– Power has lower DIC (3862 vs 3885), but larger mean deviance
– Constraints substantially damage fit

Subtask performance: Model comparison (posterior predictive checks)
Interpretation
– When data are simulated from a model and statistics are calculated on the simulated data, good models generate statistics similar to those of the actual data
– Bolding reflects discrepancies
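The logic of a posterior predictive check can be sketched as follows. This is a toy illustration, not the checks reported in the talk: the observed data, the stand-in "posterior" inside `simulate`, the choice of the standard deviation as the statistic and the function names are all hypothetical.

```python
import random
from statistics import mean, stdev

def posterior_predictive_check(observed, simulate, statistic, n_sims=1000, seed=1):
    """Compare a statistic of the observed data with the same statistic
    computed on datasets simulated from the model.

    Returns the observed statistic, the mean simulated statistic, and the
    proportion of simulated statistics at least as large as the observed one.
    """
    rng = random.Random(seed)
    obs_stat = statistic(observed)
    sim_stats = [statistic(simulate(rng)) for _ in range(n_sims)]
    p_value = sum(s >= obs_stat for s in sim_stats) / n_sims
    return obs_stat, mean(sim_stats), p_value

# Hypothetical observed completion times.
observed = [52.1, 47.3, 55.0, 49.8, 50.6, 46.9, 53.2, 48.4]

def simulate(rng):
    """Draw one parameter value, then one dataset of the observed size."""
    mu = rng.gauss(50.0, 1.0)  # stand-in for a single posterior draw
    return [rng.gauss(mu, 3.0) for _ in observed]

obs_sd, sim_sd, p = posterior_predictive_check(observed, simulate, stdev)
print(f"observed SD {obs_sd:.2f}, mean simulated SD {sim_sd:.2f}, p = {p:.2f}")
```

A proportion close to 0 or 1 means the model rarely reproduces the observed statistic, which is the kind of discrepancy the bolding on the slide flags.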

3. Strategy Use on Subtask Performance What is the effect of strategy use on subtask performance?

Strategy use (group-level)

Strategy use on performance: Parameter estimates
Note: Parameter estimates (i.e., exp(lambda)) for strategy covariates on subtask performance
– exp(lambda): expected multiple to task completion time resulting from strategy use
– exp(lambda) greater than 1: strategy use increases task completion time
– exp(lambda) less than 1: strategy use decreases task completion time
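The multiplicative reading of exp(lambda) is easy to check numerically. In this sketch the lambda values, the baseline time and the function names are hypothetical; the only thing taken from the slide is that exp(lambda) multiplies expected completion time when the strategy is used.

```python
import math

def strategy_multiplier(lam):
    """exp(lambda): the factor applied to expected completion time."""
    return math.exp(lam)

def apply_strategy(baseline_time, lam):
    """Expected completion time when the strategy is used."""
    return baseline_time * strategy_multiplier(lam)

def describe(lam):
    m = strategy_multiplier(lam)
    if m > 1:
        direction = "increases"
    elif m < 1:
        direction = "decreases"
    else:
        direction = "does not change"
    return f"exp(lambda) = {m:.2f}: strategy use {direction} task completion time"

# Hypothetical coefficients, for illustration only.
for lam in (-0.2, 0.0, 0.2):
    print(describe(lam), "| 30 s baseline ->", round(apply_strategy(30.0, lam), 1), "s")
```

A negative lambda therefore corresponds to a strategy that speeds up the subtask, and a positive lambda to one that slows it down.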

4. Strategy Use and Subtask Learning Consistency To what extent does strategy use explain subtask learning inconsistency?

Strategy use explaining subtask inconsistency (group-level)

Strategy use explaining subtask inconsistency (individual-level)

Subtask performance with strategies: Model comparison (DIC)
– Strategies improve fit (e.g., 3885 - 3506 = 379)
– Damage to DIC fit of constraints is less with strategies (e.g., 3794 - 3506 = 288) than without strategies (e.g., 4497 - 3885 = 612)
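The three comparisons on this slide can be laid out as a small table of DIC values. The numbers are the ones quoted on the slide; the dictionary keys are hypothetical labels, and the slide does not state which function family the values belong to (3885 matches the exponential figure quoted on the earlier subtask DIC slide).

```python
# DIC values quoted on the slide, keyed by (constraints, strategy covariates).
dic_values = {
    ("unconstrained", "no strategies"): 3885,
    ("constrained", "no strategies"): 4497,
    ("unconstrained", "strategies"): 3506,
    ("constrained", "strategies"): 3794,
}

# Improvement in DIC from adding strategy covariates to the unconstrained model.
gain_from_strategies = (
    dic_values[("unconstrained", "no strategies")]
    - dic_values[("unconstrained", "strategies")]
)

# Cost of constraining subtask learning parameters, with and without strategies.
cost_with_strategies = (
    dic_values[("constrained", "strategies")]
    - dic_values[("unconstrained", "strategies")]
)
cost_without_strategies = (
    dic_values[("constrained", "no strategies")]
    - dic_values[("unconstrained", "no strategies")]
)

print(gain_from_strategies, cost_with_strategies, cost_without_strategies)
```

Lower DIC indicates better fit, so a smaller cost of constraint with strategies in the model suggests that strategy use accounts for part of the inconsistency in subtask learning.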

Subtask performance: Model Comparison (Posterior predictive checks)

Concluding Thoughts

Concluding thoughts
– Differences between power and exponential are fairly subtle
– Task learning may be decomposed into subtask learning, but the functional form of subtask learning can vary
– Strategy use is an expression of learning, and learning to trade off time across subtasks is itself a strategy
– More generally, the study provides a case study of Bayesian hierarchical methods

Future Work
Further Bayesian skill acquisition research
– Formal models of strategy acquisition
– Models of discontinuities in the learning curve
– Integrating traits (ability and personality) into dynamic models of performance
Extending Bayesian hierarchical methods to a range of domains
– Personality faking, longitudinal life satisfaction data, diary employee well-being data

Notes
Code and data –
Publication
– Based on work with Sarah Wynton
– Anglim, J., & Wynton, S. K. (2015). Hierarchical Bayesian Models of Subtask Learning. Journal of Experimental Psychology: Learning, Memory, and Cognition. Online First.
My contact details –

Thank you. Questions?