Reporting item response theory results Jeffrey B. Brookings Wittenberg University Presented at the SAMR/SWPA Symposium: Handy tips for communicating and.

Slides:



Advertisements
Similar presentations
Psychometrics to Support RtI Assessment Design Michael C. Rodriguez University of Minnesota February 2010.
Advertisements

Test Development.
Item Response Theory in a Multi-level Framework Saralyn Miller Meg Oliphint EDU 7309.
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
Item Response Theory in Health Measurement
Rasch analysis of the Roland-Morris Disability Questionnaire Megan Davidson, PhD School of Physiotherapy, La Trobe University, Melbourne.
Basic Issues in Measurements P M V Subbarao Professor Mechanical Engineering Department How to generate reliable Numbers?????
Introduction to Item Response Theory
AN OVERVIEW OF THE FAMILY OF RASCH MODELS Elena Kardanova
Models for Measuring. What do the models have in common? They are all cases of a general model. How are people responding? What are your intentions in.
UNIDIMENSIONALITY – MULTIDIMENSIONALITY (An example) Panayiotis Panayides.
Overview of field trial analysis procedures National Research Coordinators Meeting Windsor, June 2008.
Item Response Theory. Shortcomings of Classical True Score Model Sample dependence Limitation to the specific test situation. Dependence on the parallel.
A Different Way to Think About Measurement Development: An Introduction to Item Response Theory (IRT) Joseph Olsen, Dean Busby, & Lena Chiu Jan 23, 2015.
AN ALGORITHM FOR TESTING UNIDIMENSIONALITY AND CLUSTERING ITEMS IN RASCH MEASUREMENT Rudolf Debelak & Martin Arendasy.
© UCLES 2013 Assessing the Fit of IRT Models in Language Testing Muhammad Naveed Khalid Ardeshir Geranpayeh.
Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.
Why Scale -- 1 Summarising data –Allows description of developing competence Construct validation –Dealing with many items rotated test forms –check how.
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Measurement Problems within Assessment: Can Rasch Analysis help us? Mike Horton Bipin Bhakta Alan Tennant.
        Analysis of Preschool Assessment Data Desired Results Development Profile Preschool © DRDP – PS (2010)       Ifthika “Shine” Nissar, M.A.
Item Response Theory Psych 818 DeShon. IRT ● Typically used for 0,1 data (yes, no; correct, incorrect) – Set of probabilistic models that… – Describes.
Item Response Theory for Survey Data Analysis EPSY 5245 Michael C. Rodriguez.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Factor Analysis PowerPoint Prepared by Alfred.
DIFFERENCES BETWEEN WOMEN IN SHORT- AND LONG-TERM RELATIONSHIPS IN CUES FOR SEXUAL DESIRE Ana Carvalheira, PhD 1, Lori Brotto, Ph.D 2 & Isabel Leal, PhD.
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Prototypical Level 4 Performances Students use a compensation strategy, recognizing the fact that 87 is two less than 89, which means that the addend coupled.
Modern Test Theory Item Response Theory (IRT). Limitations of classical test theory An examinee’s ability is defined in terms of a particular test The.
Validation of the Assessment and Comparability to the PISA Framework Hao Ren and Joanna Tomkowicz McGraw-Hill Education CTB.
Introduction Neuropsychological Symptoms Scale The Neuropsychological Symptoms Scale (NSS; Dean, 2010) was designed for use in the clinical interview to.
The ABC’s of Pattern Scoring Dr. Cornelia Orr. Slide 2 Vocabulary Measurement – Psychometrics is a type of measurement Classical test theory Item Response.
Mearns (1996, 1997) - an extension of Rogers’ (1957) facilitative conditions of therapeutic change. Mearns (2003) - serves as a distinctive hallmark of.
Variables and their Operational Definitions
Item Response Theory (IRT) Models for Questionnaire Evaluation: Response to Reeve Ron D. Hays October 22, 2009, ~3:45-4:05pm
SOCW 671: #5 Measurement Levels, Reliability, Validity, & Classic Measurement Theory.
Examining Data. Constructing a variable 1. Assemble a set of items that might work together to define a construct/ variable. 2. Hypothesize the hierarchy.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
Estimation. The Model Probability The Model for N Items — 1 The vector probability takes this form if we assume independence.
Reliability, Validity and Fit. Functional Independence Measure (FIM): Example 17s  In Example 17, 35 arthritis patients have been through rehabilitation.
Item Factor Analysis Item Response Theory Beaujean Chapter 6.
Psychometric Evaluation of Questionnaire Design and Testing Workshop December , 10:00-11:30 am Wilshire Suite 710 DATA.
Item Response Theory in Health Measurement
FIT ANALYSIS IN RASCH MODEL University of Ostrava Czech republic 26-31, March, 2012.
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Heriot Watt University 12th February 2003.
 Youth Teasing and Bullying are a major public health problem  ~20% of youths report being bullied or bullying at school in a given year  160,000.
Essentials for Measurement. Basic requirements for measuring 1) The reduction of experience to a one dimensional abstraction. 2) More or less comparisons.
Unraveling the Mysteries of Setting Standards and Scaled Scores Julie Miles PhD,
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire.
Overview of Item Response Theory Ron D. Hays November 14, 2012 (8:10-8:30am) Geriatrics Society of America (GSA) Pre-Conference Workshop on Patient- Reported.
Using Rasch modeling to investigate the psychometric properties of the OSCE = 51.86* * *0.2 Aim To present a prototype of a validated.
The Invariance of the easyCBM® Mathematics Measures Across Educational Setting, Language, and Ethnic Groups Joseph F. Nese, Daniel Anderson, and Gerald.
OFFICE OF EDUCATION Consultations – 13 August 2014 Work to date related to the ELP loading.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 25 Critiquing Assessments Sherrilene Classen, Craig A. Velozo.
Measurement: A Rasch Analysis of Malaysian Automotive Quality Management-Cost of Quality Scale (MAQM-CoQ Scale) Muhammad Shahar Hj Jusoh , PhD Rushami.
A Different Way to Think About Measurement Development:
UCLA Department of Medicine
Evaluating Multi-Item Scales
Adopting The Item Response Theory in Operations Management Research
Item Analysis: Classical and Beyond
Showcasing the use of Factor Analysis in data reduction: Research on learner support for In-service teachers Richard Ouma University of York SPSS Users.
Validation of the NHS Scotland Employee Engagement Index
Personality An individual’s characteristic pattern of thinking, feeling, and acting.
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Can We Rely on the Dermatology Life Quality Index as a Measure of the Impact of Psoriasis or Atopic Dermatitis?  James Twiss, David M. Meads, Elizabeth.
Examining Data.
Item Analysis: Classical and Beyond
Reliability, Validity and Fit
Item Analysis: Classical and Beyond
Presentation transcript:

Reporting item response theory results Jeffrey B. Brookings Wittenberg University Presented at the SAMR/SWPA Symposium: Handy tips for communicating and reporting your findings April 5, 2013

Ph.D. Comics, 2013

Item Response Theory 1.Mathematical models that probabilistically describe the relation between a person’s response to an item and his/her standing on a latent trait. 2.The Rasch model—a “one-parameter” model (difficulty)— locates person ability and item difficulty on the same scale (logits or log odds). 3.“…a person having a greater ability than another person should have the greater probability of solving any item of the type in question, and similarly, one item being more difficult than another means that for any one person the probability of solving the second item is the greater one.” (Rasch, 1960, p. 117) 4.The purpose of Rasch analysis is to produce unidimensional measures that cover a wide range of the latent trait.

Reporting results from a Rasch analysis 1.Item and scale descriptive statistics 2.PCA of standardized residuals following extraction of the Rasch component (test for unidimensionality) 3.Item “difficulty” estimates (in logits) 4.Item fit statistics 5.Item characteristic curves (ICCs) 6.Category response curves (CRRs) 7.Person/item map 8.Person/item separation reliability

The Psychosocial Risk Factor Survey (Eichenauer, Feltz, Wilson, & Brookings, 2010) Assesses psychosocial risk factors for cardiac disease 70 items, 5-point response scale: 0 - “Strongly Agree” to 4 - “Strongly Disagree” Scales: Depression, Anxiety, Hostility, Social Isolation, and Emotional Guardedness. Analyses: Responses to the 14 Depression Scale items (340 patients from five cardiac rehabilitation programs in the Midwest)

Rasch Item Statistics

PCA of Standardized Residuals Total raw variance in observations % Raw variance explained by measures % Raw variance explained by persons % Raw variance explained by items % Raw unexplained variance (total) % Unexplained variance in 1st contrast % Unexplained variance in 2nd contrast % Unexplained variance in 3rd contrast % Unexplained variance in 4th contrast % Unexplained variance in 5th contrast %

Item characteristic curve—with 95% CI—for item 27: “My thoughts feel so scattered lately”

Item characteristic curve—with 95% CI—for item 12: “I think more about ending my life lately”

Rasch Category Responses

Person/Item Map Mean person measure = Mean item measure =.00

Reliability Person separation reliability – Analogous to Cronbach’s α; degree to which the scale differentiates persons; range 0 – 1 –For PRFS Depression:.88 Item separation reliability – Degree to which item difficulties are differentiated; range 0 – 1 –For PRFS Depression:.99

Summary of Rasch Analysis for the PRFS Depression Scale Good evidence for unidimensionality Mean point-measure r =.626 Acceptable person and item separation reliabilities (.88 and.99, respectively) Some misalignment of persons and items One mis-fitting item: #12 (“I think more about ending my life lately”)

Recommended Reading Bond, T.G., & Fox, C.M. (2007). Appling the Rasch model: Fundamental measurement in the human sciences (2 nd ed.). Mahwah, NJ: Erlbaum.