Evaluating Patient-Reports about Health

Slides:



Advertisements
Similar presentations
Introduction to IRT/Rasch Measurement with Winsteps Ken Conrad, University of Illinois at Chicago Barth Riley and Michael Dennis, Chestnut Health Systems.
Advertisements

Ron D. Hays, Ph.D. Alex Y. Chen, M.D. UCLA Children’s Hospital LA
Item Response Theory in a Multi-level Framework Saralyn Miller Meg Oliphint EDU 7309.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Reliability Definition: The stability or consistency of a test. Assumption: True score = obtained score +/- error Domain Sampling Model Item Domain Test.
Item Response Theory in Health Measurement
PROMIS DEVELOPMENT METHODS, ANALYSES AND APPLICATIONS Presented at the Patient-Reported Outcomes Measurement Information System (PROMIS): A Resource for.
1 A Parsimonious Patient-Reported Measure of Care Coordination Ron D. Hays, Ph.D. UCLA Department of Medicine September 27, 2013 Los Angeles, CA
Methods for Assessing Safety Culture: A View from the Outside October 2, 2014 (9:30 – 10:30 EDT) Safety Culture Conference (AHRQ Watts Branch Conference.
1 Evaluating Multi-Item Scales Using Item-Scale Correlations and Confirmatory Factor Analysis Ron D. Hays, Ph.D. RCMAR.
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.
CAHPS Overview Clinician & Group Surveys: Practical Options for Implementation and Use AHRQ ANNUAL MEETING SEPTEMBER 18, 2011 Christine Crofton, PhD CAHPS.
Primer on Evaluating Reliability and Validity of Multi-Item Scales Questionnaire Design and Testing Workshop October 25, 2013, 3:30-5:00pm Wilshire.
Psychometric Properties of the Consumer Assessment of Healthcare Providers and Systems (CAHPS) Clinician and Group Adult Visit Survey September 11, 2012.
CAHPS® Survey to Evaluate the Patient’s Experience with the Medical Home Ron D. Hays, Ph.D. UCLA Department of Medicine RAND Health Program November 18,
LECTURE 06B BEGINS HERE THIS IS WHERE MATERIAL FOR EXAM 3 BEGINS.
SAS PROC IRT July 20, 2015 RCMAR/EXPORT Methods Seminar 3-4pm Acknowledgements: - Karen L. Spritzer - NCI (1U2-CCA )
Reliability & Agreement DeShon Internal Consistency Reliability Parallel forms reliability Parallel forms reliability Split-Half reliability Split-Half.
MEASUREMENT: SCALE DEVELOPMENT Lu Ann Aday, Ph.D. The University of Texas School of Public Health.
Measurement Issues by Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research (June 30, 2008, 4:55-5:35pm) Diabetes & Obesity.
1 Should We Care about What Patients Say About Coordination of Care? Ron D. Hays, Ph.D. UCLA Department of Medicine RAND.
Item Response Theory (IRT) Models for Questionnaire Evaluation: Response to Reeve Ron D. Hays October 22, 2009, ~3:45-4:05pm
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 6, 2008 (3:30-6:30pm) HS 249F.
Assessing Patient Satisfaction Ron D. Hays UCLA Division of General Internal Medicine and Health Services Research RAND Health Program AUA Foundation Summer.
CAHPS PATIENT EXPERIENCE SURVEYS AHRQ ANNUAL MEETING SEPTEMBER 2012 Christine Crofton, PhD CAHPS Project Officer.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
CAHPS® Consumer Assessment of Healthcare Providers and Systems Ernest Moy Center for Quality Improvement & Patient Safety.
Psychometric Evaluation of Questionnaire Design and Testing Workshop December , 10:00-11:30 am Wilshire Suite 710 DATA.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 8, 2006 (3:00-6:00pm) HS 249F.
Item Response Theory in Health Measurement
Ron D. Hays, Ph.D. Joshua S. Malett, Sarah Gaillot, and Marc N. Elliott Physical Functioning Among Medicare Beneficiaries International Society for Quality.
Item Response Theory Dan Mungas, Ph.D. Department of Neurology
Item Response Theory Dan Mungas, Ph.D. Department of Neurology University of California, Davis.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire.
Overview of Item Response Theory Ron D. Hays November 14, 2012 (8:10-8:30am) Geriatrics Society of America (GSA) Pre-Conference Workshop on Patient- Reported.
Evaluating Multi-item Scales Ron D. Hays, Ph.D. UCLA Division of General Internal Medicine/Health Services Research HS237A 11/10/09, 2-4pm,
Measuring Research Variables
Chapter 2 Norms and Reliability. The essential objective of test standardization is to determine the distribution of raw scores in the norm group so that.
Test-Retest Reliability of the Work Disability Functional Assessment Battery (WD-FAB) Dr. Leighton Chan, MD, MPH Chief, Rehabilitation Medicine Department.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 25 Critiquing Assessments Sherrilene Classen, Craig A. Velozo.
Patient Experience of Care Surveys
Ron D. Hays September 12, 2006 Network Testing ~ 5 or 6pm
Psychometric Evaluation of Items Ron D. Hays
Further Validation of the Personal Growth Initiative Scale – II: Gender Measurement Invariance Harmon, K. A., Shigemoto, Y., Borowa, D., Robitschek, C.,
UCLA CMORE Lecture Series
Background and Context Research Question and Proposition
UCLA Department of Medicine
Evaluating Patient-Reports about Health
UCLA Department of Medicine
Evaluating Multi-Item Scales
Ron D. Hays GIM-HSR Friday Noon Seminar Series November 4, 2016
METHODS AND PARTICIPANTS ANALYSES AND STUDY QUESTIONS
Item Analysis: Classical and Beyond
Final Session of Summer: Psychometric Evaluation
First study published in JOGS.
Item Analysis: Classical and Beyond
Evaluating the Psychometric Properties of Multi-Item Scales
Evaluating Multi-item Scales
Multitrait Scaling and IRT: Part I
Three Issues in Modern Test Theory for the Next 10 Years
Evaluating Multi-item Scales
Item Analysis: Classical and Beyond
Item Response Theory Applications in Health Ron D. Hays, Discussant
Evaluating Multi-Item Scales
Software Alternatives for IRT Analyses Ron D. Hays
Psychometric testing and validation (Multi-trait scaling and IRT)
Introduction to Psychometric Analysis of Survey Data
Evaluating the Significance of Individual Change
UCLA Department of Medicine
Patient-reported Outcome Measures
Presentation transcript:

Evaluating Patient-Reports about Health Ron D. Hays, Ph.D. UCLA Department of Medicine http://gim.med.ucla.edu/FacultyPages/Hays/ May 19, 2016 12 noon – 1pm University of Utah Learning Health System Seminar (HSEB 2600)

Physical Functioning Able to do a range of activities from basic (e.g., self-care) to advanced (e.g., running) Six physical functioning items included in the 2010 Consumer Assessment of Healthcare Providers and Systems (CAHPS®) Medicare Survey

Summary of CAHPS Project 1995 2017 CAHPS I (1995–2001) CAHPS II (2002–2007) CAHPS III (2007–2012) CAHPS IV (2012–2017) Develop surveys Enhance reporting guidelines and advance science of reporting Evaluating quality improvement efforts

CAHPS Now Has a Family of Surveys Ambulatory Care Health Plan Survey Clinician & Group Survey Home Health Care Survey Surgical Care Survey ECHO® Survey Dental Plan Survey American Indian Survey Facility Hospital Survey In-Center Hemodialysis Survey Nursing Home Surveys CAHPS undisputed leader in measuring patient experience

Because of a health or physical problem are you unable to do or have any difficulty doing the following activities? Walking? Getting in or out of chairs? Bathing? Dressing? Using the toilet? Eating? I am unable to do this activity (0) Yes, I have difficulty (1) No, I do not have difficulty (2)

Simple-summated Scoring of Physical Functioning Scale I am unable to do this activity (0) Yes, I have difficulty (1) No, I do not have difficulty (2) Possible 6-item scale range: 0-12 Mean = 11 (2% floor, 65% ceiling)

Medicare Beneficiary Sample (n = 366,701) 58% female 57% high school education or less 14% 18-64; 48% 65-74, 29% 75-84, 9% 85+

(Some difficulty) (1/3) (1/5) (1/7) (1/9) (1/10) (1/16)

Item-Scale Correlations Walking (0, 1, 2) 0.71 Chairs (0, 1, 2) 0.80 Bathing (0, 1, 2) 0.83 Dressing (0, 1, 2) 0.86 Toileting (0, 1, 2) 0.84 Eating (0, 1, 2) 0.75

Reliability Formulas Model Reliability Intraclass Correlation Two-way random Two-way mixed One-way BMS = Between Ratee Mean Square N = n of ratees WMS = Within Mean Square k = n of items or raters JMS = Item or Rater Mean Square EMS = Ratee x Item (Rater) Mean Square 11

Internal Consistency Reliability (Coefficient Alpha) (MSbms – MSems)/MSbms Ordinal alpha = 0.98 http://support.sas.com/resources/papers/proceedings14/2042-2014.pdf http://gim.med.ucla.edu/FacultyPages/Hays/utils/

Confirmatory Factor Analysis (Polychoric* Correlations) Dressing Eating Bathing Walking Chairs *Estimated correlation between two underlying normally distributed continuous variables Toileting Residual correlations <= 0.04

Item Response Theory (IRT) IRT graded response model estimates relationship between a person’s response Yi to the question (i) and his or her level on the latent construct (): bik estimates how difficult it is to have a score of k or more on item (i). ai estimates item discrimination.

Item Characteristic Curves Unable to do I do not have difficulty Unable to do Have difficulty Have difficulty I do not have difficulty Unable to do I do not have difficulty Unable to do I do not have difficulty Have difficulty Have difficulty Unable to do I do not have difficulty I do not have difficulty Unable to do Have difficulty Have difficulty

Item Characteristic Curves for Emotional Health Scale Very little Very little Very little Very much Very much Very much No No No Very little Very little Very much Very much No No

Item Characteristic Curves for Recoded Items

People and Items on Same z-score metric Person 1 Person 2 Person 3 -3 3

-3 -3 3

Unable to do Have difficulty

Reliability = (Info – 1) / Info

is mainstreaming BIGSTEPS and WINSTEPS PARSCALE and MULTILOG IRTPRO and FLEXMIRT SAS and STATA

Computer Adaptive Testing (CAT) 2004 www.nihpromis.org

Reliability Target for Use of Measures with Individuals z-score (mean = 0, SD = 1) Reliability ranges from 0-1 0.90 or above is goal SE = SD (1- reliability)1/2 Reliability = 1 – SE2 Reliability = 0.90 when SE = 0.32 95% CI = true score +/- 1.96 x SE (CI = -0.63  0.63 z-score when reliability = 0.90) T = z*10 + 50

Invariance of Item Parameters “Parameter values are identical in separate subgroups or across different measurement conditions.” “It is the often misunderstood feature of parameter invariance that is frequently cited in introductory or advanced texts” (Rupp & Zumbo, 2006).

Interval-Level? “Modern day psychometric analyses such as Rasch analysis convert ordinal data to an interval scale so that response scores meet the criteria for measurement” Correlation (product-moment and ICC) between simple-summated scoring and IRT estimated score for physical functioning = 0.91

Ben Wright or Been Wrong? “Application of the Rasch model to the data set estimates a measure that can be considered valid.” The “Rasch model is the only valid approach to measurement” Bergan, 2013, Rasch versus Birnbaum: New arguments in an old debate (p. 3)

Questions? Hays, R. D., Mallett, J. S., Gaillot, S., & Elliott, M. N. (2015). Performance of the Medicare Consumer Assessment of Healthcare Providers and Systems (CAHPS®) Physical Functioning Items. Medical Care, 54, 205-209