Remaining Challenges and What to Do Next: Undiscovered Areas Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research

Slides:



Advertisements
Similar presentations
Item Response Theory in Health Measurement
Advertisements

Models for Measuring. What do the models have in common? They are all cases of a general model. How are people responding? What are your intentions in.
15-minute Introduction to PROMIS Ron D. Hays, Ph.D UCLA Division of General Internal Medicine & Health Services Research Roundtable Meeting on Measuring.
Item Response Theory in Patient-Reported Outcomes Research Ron D. Hays, Ph.D., UCLA Society of General Internal Medicine California-Hawaii Regional Meeting.
ITEM RESPONSE MODELING OF PRESENCE-SEVERITY ITEMS: APPLICATION TO MEASUREMENT OF PATIENT-REPORTED OUTCOMES Ying Liu and Jay Verkuilen.
1 BA 275 Quantitative Business Methods Statistical Inference: Confidence Interval Estimation Estimating the population mean  Margin of Error Sample Size.
IRT Models to Assess Change Across Repeated Measurements James S. Roberts Georgia Institute of Technology Qianli Ma University of Maryland University of.
A Hierarchical Framework for Modeling Speed and Accuracy on Test Items Van Der Linden.
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.
Identification of Misfit Item Using IRT Models Dr Muhammad Naveed Khalid.
Item Response Theory for Survey Data Analysis EPSY 5245 Michael C. Rodriguez.
Primer on Evaluating Reliability and Validity of Multi-Item Scales Questionnaire Design and Testing Workshop October 25, 2013, 3:30-5:00pm Wilshire.
Psychometric Properties of the Consumer Assessment of Healthcare Providers and Systems (CAHPS) Clinician and Group Adult Visit Survey September 11, 2012.
Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. RCMAR/EXPORT September 15, 2008, 3-4pm
The emotional distress of children with cancer in China: An Item Response Analysis of C-Ped-PROMIS Anxiety and Depression Short Forms Yanyan Liu 1, Changrong.
2 nd Order CFA Byrne Chapter 5. 2 nd Order Models The idea of a 2 nd order model (sometimes called a bi-factor model) is: – You have some latent variables.
Use of Health-Related Quality of Life Measures to Assess Individual Patients July 24, 2014 (1:00 – 2:00 PDT) Kaiser Permanente Methods Webinar Series Ron.
SAS PROC IRT July 20, 2015 RCMAR/EXPORT Methods Seminar 3-4pm Acknowledgements: - Karen L. Spritzer - NCI (1U2-CCA )
Maryland Department of Health and Mental Hygiene WB&A Market Research Executive Summary THE 2003 MARYLAND MEDICAID MANAGED CARE CUSTOMER SATISFACTION SURVEY.
Multitrait Scaling and IRT: Part II Ron D. Hays, Ph.D. Questionnaire.
1 Assessing the Minimally Important Difference in Health-Related Quality of Life Scores Ron D. Hays, Ph.D. UCLA Department of Medicine October 25, 2006,
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Are Positive Experiences with Health Care Bad for Health? June 14, 2015 (AcademyHealth) Minneapolis Convention Center 1301 S. 2nd Avenue, Minneapolis,
Measurement Issues by Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research (June 30, 2008, 4:55-5:35pm) Diabetes & Obesity.
1 Health-Related Quality of Life Assessment as an Indicator of Quality of Care (HPM 216) Ron D. Hays April 11, 2013(8:30-11:30 am) Wilshire Blvd.
CAHPS Clinician & Group Survey 2.0 Update Ron D. Hays RAND, Santa Monica, CA UCLA, Los Angeles, CA.
1 11/17/2015 Psychometric Modeling and Calibration Ron D. Hays, Ph.D. September 11, 2006 PROMIS development process session 10:45am-12:30 pm.
1 Differential Item Functioning in Mplus Summer School Week 2.
Item Response Theory (IRT) Models for Questionnaire Evaluation: Response to Reeve Ron D. Hays October 22, 2009, ~3:45-4:05pm
Assessing Patient Satisfaction Ron D. Hays UCLA Division of General Internal Medicine and Health Services Research RAND Health Program AUA Foundation Summer.
Multi-item Scale Evaluation Ron D. Hays, Ph.D. UCLA Division of General Internal Medicine/Health Services Research
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
Item Factor Analysis Item Response Theory Beaujean Chapter 6.
NATIONAL CONFERENCE ON STUDENT ASSESSMENT JUNE 22, 2011 ORLANDO, FL.
Psychometric Evaluation of Questionnaire Design and Testing Workshop December , 10:00-11:30 am Wilshire Suite 710 DATA.
Item Response Theory in Health Measurement
Ron D. Hays, Ph.D. Joshua S. Malett, Sarah Gaillot, and Marc N. Elliott Physical Functioning Among Medicare Beneficiaries International Society for Quality.
Approaches for Estimating Minimally Important Differences Ron D. Hays, Ph.D. January 12, 2004 (8:50-9:10am) Minimal Clinically Important Differences in.
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Heriot Watt University 12th February 2003.
Can patients be satisfied to death? What was Joshua J. Fenton thinking? Ron D. Hays, Ph.D. July 21, 2015 UCLA Center for Maximizing Outcomes and Research.
Performance of CAHPS® Health IT items September 20, 2011 Ron D. Hays, Ph.D. RAND, Santa Monica, Ca.
Considerations in Comparing Groups of People with PROs Ron D. Hays, Ph.D. UCLA Department of Medicine May 6, 2008, 3:45-5:00pm ISPOR, Toronto, Canada.
Item Response Theory Dan Mungas, Ph.D. Department of Neurology University of California, Davis.
Patient-Reported Physical Functioning Ron D. Hays November 27, 2012 (11:15-11:30) UCLA Department of Medicine MCID for Orthopaedic Devices Silver Springs,
Performance of CAHPS® Health IT items September 20, 2011 Ron D. Hays, Ph.D. RAND, Santa Monica, Ca.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire.
Two Approaches to Estimation of Classification Accuracy Rate Under Item Response Theory Quinn N. Lathrop and Ying Cheng Assistant Professor Ph.D., University.
Overview of Item Response Theory Ron D. Hays November 14, 2012 (8:10-8:30am) Geriatrics Society of America (GSA) Pre-Conference Workshop on Patient- Reported.
Evaluating Multi-item Scales Ron D. Hays, Ph.D. UCLA Division of General Internal Medicine/Health Services Research HS237A 11/10/09, 2-4pm,
Health-Related Quality of Life (HRQOL) Assessment in Outcome Studies Ron D. Hays, Ph.D. UCLA/RAND GCRC Summer Course “The.
Plausible Values of Latent Variables: A Useful Approach of Data Reduction for Outcome Measures in Pediatric Studies Jichuan Wang, Ph.D. Children’s National.
Evaluating Patient-Reports about Health
Psychometric Evaluation of Items Ron D. Hays
QDET2, Miami, FL, Hibiscus A
UCLA Department of Medicine
Evaluating Patient-Reports about Health
UCLA Department of Medicine
Evaluating Multi-Item Scales
The Impact of Item Response Theory in Educational Assessment: A Practical Point of View Cees A.W. Glas University of Twente, The Netherlands University.
The Impact of Item Response Theory in Educational Assessment: A Practical Point of View Cees A.W. Glas University of Twente, The Netherlands University.
Item Analysis: Classical and Beyond
Evaluating IRT Assumptions
Item Analysis: Classical and Beyond
Evaluating the Psychometric Properties of Multi-Item Scales
Evaluating Multi-item Scales
Multitrait Scaling and IRT: Part I
Three Issues in Modern Test Theory for the Next 10 Years
Item Analysis: Classical and Beyond
GIM & HSR Research Seminar: October 5, 2018
Patient-reported Outcome Measures
Presentation transcript:

Remaining Challenges and What to Do Next: Undiscovered Areas Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research Item Response Theory in Health Measurement University of Twente Symposium Van der Valk Hotel Schiphol, Netherlands September 17, 2013

This Presentation is Dedicated to Last Night’s Hero (Santa Claus) 2

Five Future Areas of Opportunity Person fit Distributional Issues Presence/severity Group-level Models Multidimensional IRT/Hierarchical Models 3

Person Fit Large negative Z L values indicate misfit. – one person who responded to 14 of the PROMIS physical functioning items had a Z L = – For 13 items the person could do the activity (including running 5 miles) without any difficulty. But this person reported a little difficulty being out of bed for most of the day.

Person Fit Item misfit significantly associated with – Less than high school education – More chronic conditions – Non-white Including response time in the model lead to significant associations for: – Longer response time – More chronic conditions – Younger age

Distributional Issues Normal distribution for latent trait assumed in estimating item parameters using marginal ML Degree to which non-normality of latent distribution is consequential for IRT modeling – Extent to which violating normality assumption distorts item slope and threshold parameters Ramsay Curve IRT (Carol Woods) – Detects and adjusts for non-normal latent variables – Need “enough” items and respondents 6

Memorial Symptom Assessment Scale Presence/Severity Example (Liu & Verkuilen, 2013, Applied Psych Measurement) Did you have diarrhea in the past 2 weeks? – No (0) – Yes How much did diarrhea bother or distress you? – Not at all (1) – A little bit (2) – Somewhat (3) – Quite a bit (4) – Very much (5) 7

Presence/Severity Ordinal Scoring 0 = No diarrhea in the past 2 weeks. 1 = Diarrhea and bothered/distressed not at all. 2 = Diarrhea and bothered/distressed a little bit. 3 = Diarrhea and bothered/distressed somewhat. 4 = Diarrhea and bothered/distressed quite a bit. 5 = Diarrhea and bothered/distressed very much. 8

δ- plot for Diarrhea item (Nominal Response Model) 9 x axis = 0 -5 categories; y axis = a k – a 0 category severity

δ- plot for Pain Item 10 x axis = 0 -5 categories; y axis = a k – a 0 category severity

Alternatives to Logistic/Normal-Ogive for Item Response Curves “Positive trait” item response models – Alcohol dependency (below average level of alcohol dependency is not meaningful) Latent trait scale begins at theta = 0 – Log-logistic (with B = 1 is Rasch’s original item response model) – Lognormal (similar to Steven’s psychophysical stimulus-response function) – Weibull (frequently used in biostatistics) Markov Chain Monte Carlo 11 Lucke, J. F.

CAHPS® Health Plan Survey 35,572 adults in 131 plans (n = 271 per plan) Never/Sometimes/Usually/Always items – Got help/advice needed when phoned – Got appointment for routine care as soon as wanted – Got appointment when ill or injured as soon wanted – Doctors listed carefully to you – Doctors explained things so you could understand – Doctors respect what you had to say – Doctors spent enough time with you – Office staff treated you with courtesy and respect – Office staff helpful 12

Group-Level Modeling Analogous to person-based IRT Estimate relationships between group trait level and proportion of group members (patients) that endorse each item Reise, Meijer et al., 2006, Multi Beh Res 13

Items Do Not Discriminate Between Groups As Well as Among Persons ItemPerson-Level Discrimination Group (Plan) Level Discrimination C C C C C C C C C

Health Plan-level Scale Information and SE for 9-Item Scale 15

Standard Error for Each Plan Depends Upon Plan’s estimated theta Number of respondents 16

Multidimensional and Other Hierarchical Item Factor Models Multidimensional nominal models – Historically not easy to do, but IRTPRO makes it easier Cai, L. (2010). A two-tier full-information item factor analysis model with applications. Psychometrika, 75, – Includes correlated traits multidimensional IRT, bifactor model, and testlet response theory models 17

18 Questions?