A Hierarchical Framework for Modeling Speed and Accuracy on Test Items Van Der Linden.

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Chapter 6 Continuous Random Variables and Probability Distributions
How Should We Assess the Fit of Rasch-Type Models? Approximating the Power of Goodness-of-fit Statistics in Categorical Data Analysis Alberto Maydeu-Olivares.
Item Response Theory in Health Measurement
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
IRT Equating Kolen & Brennan, IRT If data used fit the assumptions of the IRT model and good parameter estimates are obtained, we can estimate person.
Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions.
Chapter 10 Simple Regression.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Measurement Models and CFA; Chi-square and RMSEA Ulf H. Olsson Professor of Statistics.
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
1 Engineering Computation Part 6. 2 Probability density function.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
GRA 6020 Multivariate Statistics Confirmatory Factor Analysis Ulf H. Olsson Professor of Statistics.
REGRESSION AND CORRELATION
Multivariate Distributions. Distributions The joint distribution of two random variables is f(x 1,x 2 ) The marginal distribution of f(x 1 ) is obtained.
1Prof. Dr. Rainer Stachuletz Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Chapter 15: Model Building
© UCLES 2013 Assessing the Fit of IRT Models in Language Testing Muhammad Naveed Khalid Ardeshir Geranpayeh.
Identification of Misfit Item Using IRT Models Dr Muhammad Naveed Khalid.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 1 More details can be found in the “Course Objectives and Content”
Chapter 9 Two-Sample Tests Part II: Introduction to Hypothesis Testing Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social & Behavioral.
Conceptual Issues in Response-Time Modeling Wim J. van der Linden CTB/McGraw-Hill.
Moderation in Structural Equation Modeling: Specification, Estimation, and Interpretation Using Quadratic Structural Equations Jeffrey R. Edwards University.
Chapter 3 Basic Concepts in Statistics and Probability
Lecture 22 Dustin Lueker.  The sample mean of the difference scores is an estimator for the difference between the population means  We can now use.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
Confidential and Proprietary. Copyright © 2010 Educational Testing Service. All rights reserved. 10/7/2015 A Model for Scaling, Linking, and Reporting.
T-TEST Statistics The t test is used to compare to groups to answer the differential research questions. Its values determines the difference by comparing.
I. Logistic Growth (Chap. 11, continued) A. Patterns: Sigmoidal curve B. Equation C. What limits population growth? Density dependent factors Density independent.
Dimensionality of the latent structure and item selection via latent class multidimensional IRT models FRANCESCO BARTOLUCCI.
Ch5. Probability Densities II Dr. Deshi Ye
Section 10.5 The Kinetic Molecular Theory. The Kinetic Molecular Theory In this section… a.Gases and Gas Laws on the Molecular Scale b.Molecular speed,
1 Conceptual Issues in Observed-Score Equating Wim J. van der Linden CTB/McGraw-Hill.
Maximum Likelihood - "Frequentist" inference x 1,x 2,....,x n ~ iid N( ,  2 ) Joint pdf for the whole random sample Maximum likelihood estimates.
Chapter 16 Data Analysis: Testing for Associations.
Introduction section of article
Question paper 1997.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
Starting point for generating other distributions.
Item Factor Analysis Item Response Theory Beaujean Chapter 6.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
NATIONAL CONFERENCE ON STUDENT ASSESSMENT JUNE 22, 2011 ORLANDO, FL.
Multilevel and multifrailty models. Overview  Multifrailty versus multilevel Only one cluster, two frailties in cluster e.g., prognostic index (PI) analysis,
Research Methodology Lecture No :26 (Hypothesis Testing – Relationship)
Economics 20 - Prof. Anderson1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Introduction to Multilevel Analysis Presented by Vijay Pillai.
 Kolmogor-Smirnov test  Mann-Whitney U test  Wilcoxon test  Kruskal-Wallis  Friedman test  Cochran Q test.
Model Comparison. Assessing alternative models We don’t ask “Is the model right or wrong?” We ask “Do the data support a model more than a competing model?”
IRT Equating Kolen & Brennan, 2004 & 2014 EPSY
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Introduction to Probability - III John Rundle Econophysics PHYS 250
Inference about the slope parameter and correlation
Bayesian Semi-Parametric Multiple Shrinkage
Statistical Modelling
Vertical Scaling in Value-Added Models for Student Learning
Section 11.1 Day 2.
Parameter Estimation 主講人:虞台文.
Copyright © American Speech-Language-Hearing Association
Chapter 7: Sampling Distributions
Regression Analysis Week 4.
BA 275 Quantitative Business Methods
Sample vs Population comparing mean and standard deviations
Homoscedasticity/ Heteroscedasticity In Brief
Banburismus and the Brain
Statistics II: An Overview of Statistics
Homoscedasticity/ Heteroscedasticity In Brief
Multiple Regression Berlin Chen
Presentation transcript:

A Hierarchical Framework for Modeling Speed and Accuracy on Test Items Van Der Linden

Outline Introduction of speed-accuracy tradeoff Current Models Discussion of Models General Hierarchical Framework Priors Distributions Empirical Example Discussion

Introduction of speed-accuracy tradeoff working faster with lower accuracy or more slowly with higher accuracy

Current models M1: M2: M3: M4: Weibull distribution M5:

Discussion (1) the within-person level, at which the value of the person parameters is allowed to change over time (2) the fixed-person level, at which the parameters remain constant; (3) the level of a population of fixed persons, for which we have a distribution of parameter values across persons M1 & M2: Confound the within-person and the fixed-person level.

(4) speed parameters and time parameters should be separated & incorporated M2: directly equate speed with the response time M3 & M5: contain time parameters M1, M2 & M4: no time parameters

General Hierarchical Framework Key assumptions (a) a test taker operates at a fixed level of speed & accuracy; (b) for a fixed test taker, the response and the time on an item are random variables; (c) separate item and person parameters for the distribution of the response on the items and the time (d) given the person’s ability and speed, the responses and time are conditional independent (e) model the relations between speed and accuracy for a population of test takers separately from the impact of these parameters on the responses and times of the individual test takers

Levels of Modeling First-level models: a response model and a response-time model (3PNO) (lognormal) Joint distribution (locally dependent given a person’s ability and speed)

Second-level model: For person parameters ( ), it was assumed that the and are multivariate normal distributed.

Likewise, for item parameters ( ), it was assumed that,,,, and are multivariate normal distributed

For the full model, the sampling distribution becomes

Identifiablity Alternative models response model: use IRT models response time model: exponential model, Weibull distribution, constrain some parameters for the second model

Priors Distribution Posterion distribution

Empirical Example 1104 test takers on 96 items in the computerized CPA examination. The items were shown to have a good fit to the 3PL logistic model and the response times to the lognormal model.

Results:, indicating the more able test takers tended to work faster (tradeoff between speed and accuracy)

Discussion Response time (1) improve testing routines that are traditionally based on the response only. (2) improve item selection in CAT; (3) diagnose differential speededness and thus to adjust the problem. (4) detect aberrant behavior (5) The hierarchical framework can be used to equate the result from different experiments or results from different conditions of speed.

Questions & Future Studies Scale problem for response model and response time model Use different models for response model (e.g., 3PL logistic model) and response time model Consider situations of cheating when response time is rather short…