AN ALGORITHM FOR TESTING UNIDIMENSIONALITY AND CLUSTERING ITEMS IN RASCH MEASUREMENT Rudolf Debelak & Martin Arendasy.

Outline
1. Aims of this study
2. PCA and parallel analysis based on tetrachoric correlations
3. The proposed algorithm
   1. Procedures
   2. Statistical test
4. Simulation study
5. Empirical study
6. Discussion

Aims of Study
- Clustering items: an exploratory approach that identifies item scales under a strict criterion.
- Testing unidimensionality: a confirmatory approach that tests whether a presumed unidimensional item set yields a single cluster.

Literature Review
- Commonly used procedures for testing unidimensionality: PCA and EFA.
- Applied to binary data, both are based on tetrachoric correlations.
- The number of components/factors to retain is determined by parallel analysis.
- Cluster analysis.

Cluster analysis (from Wikipedia)
- Cluster analysis or clustering is the task of assigning a set of objects into groups (called clusters) so that the objects in the same cluster are more similar (in some sense or another) to each other than to those in other clusters.
- Hierarchical cluster analysis is based on the core idea of objects being more related to nearby objects than to objects farther away.
- Measure of similarity (distance).

The Basic Structure of the Procedure
[Flowchart. Stages: Test Item Triplets; Expand Item Set. Node labels: O3, A3, On, An+1, On+1, Ok. Decision labels: Function f, Maximum / NOT Maximum, p less than 0.5.]
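The flowchart survives only as labels, so the following is a sketch of one possible reading, not the authors' implementation: start from the best-fitting item triplet, then repeatedly add the single item that maximizes a fit function f, and stop when the fit falls below a threshold. The names `grow_item_set`, `fit_p_value` and `threshold` are hypothetical, and the stopping threshold is left as a required parameter because the slide's value ("p less than 0.5") cannot be verified from the transcript.

```python
from itertools import combinations
from typing import Callable, Hashable, List, Sequence, Tuple

def grow_item_set(
    items: Sequence[Hashable],
    fit_p_value: Callable[[Tuple[Hashable, ...]], float],
    threshold: float,
) -> List[Hashable]:
    """Greedy forward selection of an item set (illustrative only).

    fit_p_value is a user-supplied stand-in for the slide's "function f":
    it maps a tuple of items to the p-value of a global fit test.
    """
    # Stage 1: test all item triplets and keep the best-fitting one.
    best_triplet = max(combinations(items, 3), key=fit_p_value)
    if fit_p_value(best_triplet) < threshold:
        return []  # no triplet fits; nothing to expand
    current = list(best_triplet)
    remaining = [i for i in items if i not in current]

    # Stage 2: expand the item set one item at a time.
    while remaining:
        candidate = max(remaining, key=lambda i: fit_p_value(tuple(current + [i])))
        if fit_p_value(tuple(current + [candidate])) < threshold:
            break  # even the best candidate violates the fit criterion
        current.append(candidate)
        remaining.remove(candidate)
    return current


# Toy fit function (an assumption for the demo): sets of same-parity
# integers "fit", mixed sets do not.
def toy_fit(item_set: Tuple[int, ...]) -> float:
    return 0.9 if len({i % 2 for i in item_set}) == 1 else 0.01

cluster = grow_item_set(list(range(8)), toy_fit, threshold=0.05)
```

With the toy fit function the procedure recovers one of the two parity groups. To cluster all items, the same routine could be rerun on the items left over.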

Assessing the Model Fit
- The "function f" in this study is a global fit statistic.
- The test can be used to evaluate whether the set of items, as a whole, fits the model (Suarez-Falcon & Glas, 2003).
- First-order statistics (R1): sensitive to violations of the property of monotone increasing and parallel item characteristic curves.
- Second-order statistics (R2): sensitive to violations of the assumptions of unidimensionality and local independence.

R1c Statistic
The R1c statistic is asymptotically chi-square distributed with (k - 1)(k - 2) degrees of freedom.
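As a small worked example of the degrees-of-freedom formula, the sketch below turns an R1c value into an asymptotic p-value using only the standard library. It assumes that k on the slide is the number of items, which the transcript does not state. Because (k - 1)(k - 2) is a product of two consecutive integers, it is always even, so the chi-square upper tail has a closed-form finite sum. The function names are hypothetical.

```python
import math

def r1c_degrees_of_freedom(k: int) -> int:
    """Degrees of freedom (k - 1)(k - 2) as given on the slide."""
    if k < 3:
        raise ValueError("k must be at least 3 for positive degrees of freedom")
    return (k - 1) * (k - 2)

def chi2_sf_even_df(x: float, df: int) -> float:
    """Upper-tail probability P(X >= x) for a chi-square variable with even df.

    For df = 2m: P(X >= x) = exp(-x/2) * sum_{i=0}^{m-1} (x/2)^i / i!
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("df must be a positive even integer")
    if x <= 0:
        return 1.0
    half = x / 2.0
    term = 1.0   # (x/2)^0 / 0!
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total

# Example: a 10-item set (k = 10 is an illustrative value).
df_10 = r1c_degrees_of_freedom(10)
p_value = chi2_sf_even_df(95.0, df_10)  # 95.0 is a made-up R1c value
```

A small p-value would indicate misfit of the item set as a whole. In practice the statistic itself would come from dedicated Rasch software.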

Simulation Study
- Aim: to examine whether the algorithm can detect and reconstruct subsets of items that fit the Rasch model.
- Each data set contained two subsets of items that fit the Rasch model.
- Six variables were manipulated (next slide).
- 10,000 replications were carried out with the eRm package, which uses the CML estimation method.

Variables
1. The distribution of the item parameters (normal, uniform)
2. The standard deviations of the item and person parameters
3. The size of the person sample (250, 500, 1000)
4. The size of the item set (10, 30, 50)
5. The correlation between the person parameters (0.0, 0.5)

Standard deviation types (Type: Person, Item)
A: [values missing]
B: [values missing]
C: [values missing]
D: 2.5, 1.5
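The data-generating design can be illustrated with a minimal sketch. It is not the authors' eRm-based code. It simulates binary responses to two Rasch-conforming item subsets whose person parameters correlate at `rho`, with normally distributed item parameters. The function name, its arguments, and the equal split of items between the two subsets are assumptions made for the illustration.

```python
import math
import random
from typing import List, Tuple

def simulate_two_scale_rasch(
    n_persons: int,
    n_items_per_scale: int,
    rho: float,
    person_sd: float = 1.0,
    item_sd: float = 1.0,
    seed: int = 0,
) -> Tuple[List[List[int]], List[float]]:
    """Simulate 0/1 responses to two item subsets that each fit the Rasch model."""
    rng = random.Random(seed)
    n_items = 2 * n_items_per_scale
    # Item difficulties drawn from a normal distribution (one of the two
    # distributions named on the slide).
    difficulties = [rng.gauss(0.0, item_sd) for _ in range(n_items)]

    data = []
    for _ in range(n_persons):
        # Two standard normal draws combined so that corr(theta1, theta2) = rho.
        z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        theta1 = person_sd * z1
        theta2 = person_sd * (rho * z1 + math.sqrt(1.0 - rho ** 2) * z2)
        row = []
        for j, b in enumerate(difficulties):
            theta = theta1 if j < n_items_per_scale else theta2
            # Rasch model: P(X = 1) = exp(theta - b) / (1 + exp(theta - b))
            p = 1.0 / (1.0 + math.exp(-(theta - b)))
            row.append(1 if rng.random() < p else 0)
        data.append(row)
    return data, difficulties


# One illustrative condition: 250 persons, 10 items in total, rho = 0.5.
responses, item_params = simulate_two_scale_rasch(250, 5, rho=0.5)
```

Each row of `responses` is one person's answer pattern. A clustering procedure would be expected to recover the first five and the last five columns as separate scales.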

Data Analysis
- The proposed algorithm
- PCA based on tetrachoric correlations with parallel analysis (95th percentile of the eigenvalues)
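The retention rule of parallel analysis can be sketched as follows: a component is kept while its observed eigenvalue exceeds the 95th percentile of the eigenvalues obtained at the same position from random data. The sketch assumes the eigenvalues were already computed elsewhere (for example from tetrachoric correlation matrices) and are passed in sorted in descending order. The function names and the linear-interpolation percentile are choices made for the illustration, not details from the slides.

```python
from typing import List, Sequence

def percentile(values: Sequence[float], q: float) -> float:
    """q-th percentile (0-100) with linear interpolation between order statistics."""
    s = sorted(values)
    pos = (len(s) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)

def n_components_to_retain(
    observed: Sequence[float],
    simulated: Sequence[Sequence[float]],
    q: float = 95.0,
) -> int:
    """Count leading components whose eigenvalue exceeds the random-data percentile.

    observed:  eigenvalues of the real correlation matrix, descending.
    simulated: one descending eigenvalue list per random data set.
    """
    count = 0
    for j, obs in enumerate(observed):
        threshold = percentile([run[j] for run in simulated], q)
        if obs <= threshold:
            break  # stop at the first component that does not beat chance
        count += 1
    return count


# Made-up eigenvalues for illustration only.
observed_eigs: List[float] = [3.0, 1.5, 0.8]
random_eigs = [[1.4, 1.2, 1.0], [1.5, 1.1, 0.9], [1.3, 1.2, 1.0]]
n_retained = n_components_to_retain(observed_eigs, random_eigs)
```

Under this rule, a unidimensional item set would be expected to yield exactly one retained component.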

Results (Proposed Method)

Results (PCA with Parallel Analysis)

Sample Size in PCA
- Sample sizes smaller than 250 (test length = 10 items) would result in large numbers of indefinite matrices of tetrachoric correlations, making the application of PCA impossible (Parry & McArdle, 1991; Weng & Cheng, 2005).

Empirical Study
- The Basic Intelligence Functions (IBF; Blum et al., 2005)
- Subtests and items (number of subtests; number of items): verbal intelligence functions (2; 12+15), numerical intelligence functions (2), long-term memory (1; 15), visualization (1; 13).
- Between 281 and 284 persons

Data Analysis
- Raschcon was used for scale construction with the proposed algorithm.
- Andersen likelihood ratio tests (eRm), fit statistics and PCA on residuals (Winsteps) were calculated.
- PCA and parallel analysis of tetrachoric correlations were performed.
- All subtests were analyzed separately.

Results
- Proposed algorithm: the item clusters found were identical to the respective subtests and fit the Rasch model.
- Andersen tests: all subtests fit at the .01 level; 3 out of 4 subtests showed misfit at the .05 level.
- Mean-square infit/outfit: all values ranged within [0.65, 1.33].
- PCA on residuals: long-term memory (2.0), others (<1.4)
- PCA: long-term memory (2 components)

Discussion
- A new algorithm was presented and compared with another method, PCA of tetrachoric correlations.
- R1c statistic: performed well when the sample size was large and the correlation between latent traits was low; it performed better with small scales and with large variances of item and person parameters.
- Preferable to PCA of tetrachoric correlations when the sample size is small and scales are large.

Further Studies
1. Systematic comparison with PCA of tetrachoric correlations
2. Involve more tests of the model assumptions
3. Compare test statistics for the fit of the Rasch model
4. Extend the approach to other IRT models
- For conditions with a higher correlation and a small sample size, is it possible to find a cut-point (correction) that improves the use of this method?