Estimating Minimally Important Differences (MIDs)

Slides:



Advertisements
Similar presentations
An Overview of Minimally Important Difference Estimation in Health-Related Quality of Life Studies Ron D. Hays, Ph.D. October 14, 2004,
Advertisements

ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,
15-minute Introduction to PROMIS Ron D. Hays, Ph.D UCLA Division of General Internal Medicine & Health Services Research Roundtable Meeting on Measuring.
Why Patient-Reported Outcomes Are Important: Growing Implications and Applications for Rheumatologists Ron D. Hays, Ph.D. UCLA Department of Medicine RAND.
How Much Improvement in Functional Status Is Considered Important by Patients With Active Psoriatic Arthritis: Applying the Outcome Measures in Rheumatoid.
1 8/14/2015 Evaluating the Significance of Health-Related Quality of Life Change in Individual Patients Ron Hays October 8, 2004 UCLA GIM/HSR.
1 Health-Related Quality of Life Ron D. Hays, Ph.D. - UCLA Department of Medicine: Division of General Internal Medicine.
Primer on Evaluating Reliability and Validity of Multi-Item Scales Questionnaire Design and Testing Workshop October 25, 2013, 3:30-5:00pm Wilshire.
Basic Methods for Measurement of Patient-Reported Outcome Measures Ron D. Hays, Ph.D. UCLA/RAND ISOQOL Conference on.
FDA Approach to Review of Outcome Measures for Drug Approval and Labeling: Content Validity Initiative on Methods, Measurement, and Pain Assessment in.
Is the Minimally Important Difference Really 0.50 of a Standard Deviation? Ron D. Hays, Ph.D. June 18, 2004.
Use of Health-Related Quality of Life Measures to Assess Individual Patients July 24, 2014 (1:00 – 2:00 PDT) Kaiser Permanente Methods Webinar Series Ron.
Health-Related Quality of Life as an Indicator of Quality of Care May 4, 2014 (8:30 – 11:30 PDT) HPM216: Quality Assessment/ Making the Business Case for.
SAS PROC IRT July 20, 2015 RCMAR/EXPORT Methods Seminar 3-4pm Acknowledgements: - Karen L. Spritzer - NCI (1U2-CCA )
1 Assessing the Minimally Important Difference in Health-Related Quality of Life Scores Ron D. Hays, Ph.D. UCLA Department of Medicine October 25, 2006,
Evaluating Health-Related Quality of Life Measures June 12, 2014 (1:00 – 2:00 PDT) Kaiser Methods Webinar Series Ron D.Hays, Ph.D.
Measuring Health-Related Quality of Life Ron D. Hays, Ph.D. UCLA Department of Medicine RAND Health Program UCLA Fielding School of Public Health
Development of Physical and Mental Health Summary Scores from PROMIS Global Items Ron D. Hays ( ) UCLA Department of Medicine
Measuring Health-Related Quality of Life
1 Health-Related Quality of Life Assessment as an Indicator of Quality of Care (HPM 216) Ron D. Hays April 11, 2013(8:30-11:30 am) Wilshire Blvd.
1 Session 6 Minimally Important Differences Dave Cella Dennis Revicki Jeff Sloan David Feeny Ron Hays.
Item Response Theory (IRT) Models for Questionnaire Evaluation: Response to Reeve Ron D. Hays October 22, 2009, ~3:45-4:05pm
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 6, 2008 (3:30-6:30pm) HS 249F.
Assessing Responsiveness of Health Measurements Ian McDowell, INTA, Santiago, March 20, 2001.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 8, 2006 (3:00-6:00pm) HS 249F.
Approaches for Estimating Minimally Important Differences Ron D. Hays, Ph.D. January 12, 2004 (8:50-9:10am) Minimal Clinically Important Differences in.
Overlap between Subjective Well-being and Health-related Quality of Life. 3 Ron D. Hays, Ph.D. (Alina Palimaru) November 18, 2015 (11:30-12:00 noon) Geriatric.
Measurement of Outcomes Ron D. Hays Accelerating eXcellence In translational Science (AXIS) January 17, 2013 (2:00-3:00 pm) 1720 E. 120 th Street, L.A.,
Patient-Reported Physical Functioning Ron D. Hays November 27, 2012 (11:15-11:30) UCLA Department of Medicine MCID for Orthopaedic Devices Silver Springs,
Estimating Minimally Important Differences on PROMIS Domain Scores Ron D. Hays UCLA Department of Medicine/Division of General Internal Medicine & Health.
Evaluating Multi-Item Scales Health Services Research Design (HS 225B) January 26, 2015, 1:00-3:00pm CHS.
Health-Related Quality of Life in Outcome Studies Ron D. Hays, Ph.D UCLA Division of General Internal Medicine & Health Services Research GCRC Summer Session.
Health-Related Quality of Life (HRQOL) Assessment in Outcome Studies Ron D. Hays, Ph.D. UCLA/RAND GCRC Summer Course “The.
July 18, 2017, 9:00-11:00 am (Geffen Hall Seminar Room 112)
Class 5 Additional Psychometric Characteristics: Validity and Bias, Responsiveness, Sensitivity to Change October 22, 2009 Anita L. Stewart Institute.
Psychometric Evaluation of Items Ron D. Hays
NIH: Patient-Reported Outcomes Measurement Information System (PROMIS®) Ron D. Hays Functional Vision and Visual Function November 10, 2016, 8:55-9:15am.
The Minimal Important Difference for St
Quality of Life Assessment
PROMIS-29 V2.0 Physical and Mental Health Summary Scores Ron D. Hays
UCLA Department of Medicine
Health Status in Chronic Obstructive Pulmonary Disease: Measuring the Minimal Clinically Important Difference over Different Periods of Time H.J. Alma,
Evaluating IRT Assumptions
Health-Related Quality of Life Assessment in Outcome Studies
Proportion of patients reporting scores ≥age-matched and gender-matched normative PRO values at baseline and 24 weeks in the (A) AMBITION and (B) ADACTA.
Mean changes in the Short Form-36 subscales from baseline values for combined pulmonary fibrosis and emphysema (CPFE) (n=16), and chronic obstructive pulmonary.
Clinical Epidemiology and Global Health
Health-Related Quality-of-Life Measures: Evidence from Tunisian Population Using the SF-12 Health Survey  Moheddine Younsi, PhD  Value in Health Regional.
Correlations between observed patient-reported outcomes and disease activity scores at week 24. Correlations between observed patient-reported outcomes.
Clinical Epidemiology and Global Health
Patient-reported outcomes: proportion of patients with clinically meaningful improvements in (A) SF-36 PCS and MCS at Week 52 and Week 104*†‡ and (B) HAQ-DI.
May 14, :00-12:00 noon (Geffen Hall 207)
Evaluating the Psychometric Properties of Multi-Item Scales
Health-Related Quality of Life as an indicator of Quality of Care
Health-Related Quality of Life Measures (HS249T: Decision Analysis and Cost-Effectiveness Analysis) Ron D. Hays, Ph.D. UCLA Division.
Mean individual and summative SF-36 V2 scores before and 3 months after ablation. Mean individual and summative SF-36 V2 scores before and 3 months after.
Evaluating Multi-item Scales
Percentages of patients reporting improvements from baseline ≥minimum clinically important difference (MCID) and number needed to treat (NNT) in (A) patient-reported.
Patient-Reported Indicators of Quality of Care
Proportion of patients reporting improvements from baseline in patient-reported outcomes (PROs) ≥ the MCID at (A) 16 weeks in OPTION, (B) 12 weeks in BREVACTA.
Introduction to Psychometric Analysis of Survey Data
Health-Related Quality of Life as an indicator of Quality of Care
Geffen Hall 122, Los Angeles, CA, June 10, 2019
Evaluating the Significance of Individual Change
GIM & HSR Research Seminar: October 5, 2018
LPA groups display vastly different outcomes.
UCLA Department of Medicine
How to Measure Quality of Life
Patient-reported Outcome Measures
Post hoc analysis of differences from placebo in the percentage of patients reporting improvements ≥MCID at week 24. Post hoc analysis of differences from.
Presentation transcript:

Estimating Minimally Important Differences (MIDs) of Health-Related Quality of Life Measures? WORK IN PROGRESS Ron D. Hays UCLA Department of Medicine/Division of General Internal Medicine & Health Services Research and RAND http://twitter.com/rondhays UCLA GIM/HSR Seminar Series, Los Angeles, CA October 15, 2010 12:02 – 1:00 pm

Take Away Points External indicators of change (“anchors”) are needed to estimate minimally important differences (MIDs) in health-related quality of life measures MID estimate is useful only if anchor is positively correlated with the measure being evaluated Multiple anchors are desirable What the MID is may be unclear MID is a group-level average change that is not applicable to individual change (“responder”)

Minimally Important Difference (MID) Rationale Differences in group means can be statistically significant but small (with large sample size) MID is the smallest difference to “care about” Clinically important or implies non-trivial treatment benefit

“Anchor-based” Estimates of MID Anchor used to classify respondents in terms of change Self-report, provider report, clinical measure Estimate change (“delta”) on measure for subgroup that changed by a minimally important amount (via anchor) Since the start of the study, how would you describe the change (if any) in <<symptom X, severity of condition>>? Much better Moderately better A little better No change A little worse Moderately worse Much worse MID MID 4

Hypothetical Change in Physical Function (T-score units) by magnitude of intervention

Three scenarios for use of “No Change” group in MID estimation + 2 Doesn’t matter No Change on Anchor +4 Minimal Change on Anchor Change #3 MID = 4 Change #2 MID = ? Change #1 MID = ?

Despite “The truly remarkable universality of half a standard deviation” Norman, Sloan, & Wyrwich, 2004, “The truly remarkable universality of half a standard deviation” Expert Rev Pharmacoecon Outcome Res

Effect Sizes (mean = 0.34) for SF-36 Changes Linked to Minimal Change in Anchors (Kosinski et al., Arth Rheu, 2000) Scale Self-R Clin.-R Pain Swell Tender Mean PF .35 .33 .34 .26 .32 Role-P .56 .52 .29 .36 .42 .83 .70 .47 .69 .62 GH .20 .12 .09 .04 EWB .39 .25 .18 .05 .23 Role-E .41 .28 .38 .30 SF .43 EF .50 .22 PCS .49 .48 MCS .27 .19

Health Assessment Questionnaire (HAQ) 20 physical functioning questions Are you able to dress yourself, including tying shoelaces and doing buttons? 0 = Without any difficulty; 1 = With some difficulty; 2 = With much difficulty; 3 = Unable to do Mean = 1.06 and SD = 0.753 in 1,079 arthritis patients MID estimates: 0.22 (ES = 0.29) in clinical trials 0.10-0.15 (ES = 0.13-0.20) in observational studies (e.g., Kwok & Pope, J Rheumatology, 2010; 0.10 in this study)

PROMIS Physical Function Scale in Rheumatoid Arthritis” (Fries et al.) 124-item physical functioning bank/20-item short form T-score mean of 50 and SD of 10 in general U.S. pop. Liu et al., J Clin Epidemiology, 2010. www.nihpromis.org Three waves of data Baseline (n = 521) 6 months post-baseline (n = 483) One year post-baseline (n = 472)

Retrospective Ratings of Change in RA Study Change in activity (CHG_ACTIVITY) Change in fatigue (CHG_FATIGUE) Change in pain (CHG_PAIN) Got a lot better Got a little better <--- Stayed the same Got a little worse <--- Got a lot worse

SF-36 Retrospective Change Item Compared to one year ago, how would you rate your health in general now? (HT) Much better now than one year ago Somewhat better now than one year ago <-- About the same as one year ago Somewhat worse now than one year ago <-- Much worse now than one year ago

Global Rating of Physical Functioning To what extent are you able to carry out your everyday physical activities such as walking, climbing stairs, carrying groceries, or moving a chair? (GLOBAL06) Completely Mostly Moderately A little Not at all

Minutes Spent Exercising Vigorous exercise Aerobics Biking Racquet sports Swimming Other exercises

Correlations of Anchor with Change in PROMIS Physical Functioning Domain (goal: r = 0.371 or higher) PF2 - PF1 PF3-PF2 PF3-PF1 Activity 0.26 (0.29) 0.34 (0.33) -------- Fatigue 0.20 (0.21) 0.23 (0.24) --------- Pain 0.27 (0.28) 0.30 (0.30) --------- Overall health -------------- --------------- 0.21(0.23) Δ Global PF 0.17 (0.19) 0.20 (0.20) --------- Δ Min. exercise 0.10 (0.07) 0.08 (0.06) --------- =========================================== Note: Product moment correlations presented, followed by Spearman rank-order correlations (in parentheses)

Change in PROMIS Physical Functioning (W2 - W1) by Retrospective Ratings of Change (n = 463) PF1 = 40.19 (SD = 9.18); PF2 = 39.81 (SD = 9.44) F (4, 458 dfs) = 9.41, p<.0001(Activity) r = 0.26 (0.29) F (4, 457 dfs) = 4.68, p=.0010(Fatigue) r = 0.20 (0.21) F (4, 457 dfs) = 9.81, p<.0001(Pain) r = 0.27 (0.28) Activity Fatigue Pain n Got a lot better: + 0.94a 0.94a 1.25a ( 19- 21) Got a little better: + 0.65a 0.54a 0.50a ( 41- 61) Stayed the same: - 0.04a,b - 0.16a,b 0.02a,b (224-258) Got a little worse: - 1.31b - 1.06b,c - 1.16b (107-126) Got a lot worse: - 3.19c - 2.06c - 3.12c ( 28- 32)

Change in PROMIS Physical Functioning (W2 - W1) by Change in Global PF (n = 465) F (4, 460 dfs) = 3.86, p =.0043 r = 0.17 (0.19) Global PF n 2+ levels better: 0.53a 22 1 level better: 0.32a 68 Stayed the same: - 0.31a,b 273 1 level worse: - 1.52b 60 2+ levels worse: - 1.39b 42

Change in PROMIS Physical Functioning (W3 - W2) by Retrospective Rating of Change in Activity (n = 443) PF2 = 39.95 (SD = 9.24); PF3 = 40.07 (SD = 9.60) F (4, 438 dfs) = 14.98, p<.0001 (Activity) r = 0.34 (0.33) F (4, 438 dfs) = 6.32, p<.0001 (Fatigue) r = 0.23 (0.24) F (4, 437 dfs) = 11.34, p<.0001 (Pain) r = 0.30 (0.30) Activity Fatigue Pain n Got a lot better: + 3.26a 2.24a 3.37a ( 16- 20) Got a little better: + 1.96a,b 1.67a,b 1.31b ( 33- 55) Stayed the same: 0.43b,c 0.38b,c 0.40b,c (211-245) Got a little worse: - 0.82c - 0.48c,d - 0.79c,d (114-138) Got a lot worse: - 3.16d - 1.94d - 2.28d ( 29- 31)

Change in PROMIS Physical Functioning (W3 - W2) by Change in Global PF (n = 439) PF2 = 39.95 (SD = 9.24); PF3 = 40.07 (SD = 9.60) F (4, 434 dfs) = 4.70, p = 0.0010 r = 0.20 (0.20) Global PF n 2+ levels better: 1.84a 27 1 level better: 0.54a,b 74 Stayed the same: 0.25b 235 1 level worse: - 0.86b,c 77 2+ levels worse: - 1.67c 26

Change in PROMIS Physical Functioning (W3 - W1) by Retrospective Rating of Change, Overall Health (n = 451) PF1 = 40.18 (SD = 9.03); PF3 = 39.91 (SD = 9.54) F (4, 446 dfs) = 13.34, p<.0001 r = 0.21 (0.23) Much better (n = 38): 1.26a Somewhat better (n = 221): 0.29a About the same (n = 39): - 2.57b Somewhat worse (n = 34): 1.45a Much worse (n = 119): - 1.51b

Change in PROMIS Physical Functioning (W3 - W1) by Retrospective Rating of Change, Overall Health (n = 451) PF1 = 40.18 (SD = 9.03); PF3 = 39.91 (SD = 9.54) F (4, 446 dfs) = 13.34, p<.0001 r = 0.21 (0.23) Much better (n = 38): 1.26a Somewhat better (n = 221): 0.29a About the same (n = 39): - 2.57b Somewhat worse (n = 34): 1.45a Much worse (n = 119): - 1.51b X

FDA Guidance for Industry Patient-Reported Outcome Measures: Use in Medical Product Development to Support Labeling Claims “Responder” Change in score over a predetermined time period that should be interpreted as a treatment benefit. Empiric evidence for any responder definition is derived using anchor-based method. http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM193282.pdf

Standard Error of Measurement (SEM) and Reliable Change Index (RCI) SEM = SD * SQRT (1-reliability) 95% CI = Estimated true score +/- 1.96 * SEM RCI= (X2 – X1)/ (SEM * SQRT (2)) Note: The change needed to be significant for an individual is about 0.50 SD for the SEM and 0.70 SD for the RCI when reliability is 0.94. Expert Review of Pharmacoeconomics & Outcomes

Individual Significant Change for SF-36 Scales and Summary Scores % Improving % Declining Difference PF-10 13% 2% + 11% RP-4 31% + 29% BP-2 22% 7% + 15% GH-5 0% + 7% EN-4 9% SF-2 17% 4% + 13% RE-3 15% EWB-5 19% PCS 24% + 17% MCS 11%

Thank you. Farivar, S. S., Liu, H., & Hays, R. D. (2004). Half standard deviation estimate of the minimally important difference in HRQOL scores?. Expert Review of Pharmacoeconomics and Outcomes Research., 4 (5), 515-523. Hays, R. D., Brodsky, M., Johnston, M. F., Spritzer, K. L., & Hui, K. (2005). Evaluating the statistical significance of health-related quality of life change in individual patients. Evaluation and the Health Professions, 28, 160-171. Hays, R. D., Farivar, S. S., & Liu, H. (2005). Approaches and recommendations for estimating minimally important differences for health-related quality of life measures. COPD: Journal of Chronic Obstructive Pulmonary Disease, 2, 63-67. Hays, R. D., & Woolley, J. M. (2000). The concept of clinically meaningful difference in health- related quality-of-life research: How meaningful is it? PharmacoEconomics, 18, 419-423. Revicki, D. A., Cella, D., Hays, R. D., Sloan, J. A., Lenderking, W. R., & Aaronson, N. K. (2006). Responsiveness and minimal important differences for patient reported outcomes. Health and Quality of Life Outcomes, 4: 70. Revicki, D., Hays, R. D., Cella, D., & Sloan, J. (2008). Recommended methods for determining responsiveness and minimally important differences for patient reported outcomes. Journal of Clinical Epidemiology., 61, 102-109

Appendix A: “Distribution-Based” Methods Change in PROMIS domain score theta that is equal to a “prior” for the minimally important change 0.5*SDb = 5 0.2*SDb = 2 Standard error of measurement (SEM) = SDb = standard deviation at baseline rxx = reliability 26

Appendix B: Change in HAQ (W2 - W1) by Retrospective Ratings of Change (n = 461) in Rheumatoid Arthritis HAQ1 = 0.88 (SD = 0.72); HAQ2 = 0.92 (SD = 0.74) F (4, 456 dfs) = 18.72, p<.0001(Activity) r = 0.34 (0.30) F (4, 455 dfs) = 10.52, p<.0001(Fatigue) r = 0.28 (0.25) F (4, 455 dfs) = 16.94, p<.0001(Pain) r = 0.35 (0.35) Activity Fatigue Pain n Got a lot better: + 0.11a 0.10a 0.16a (19-21) Got a little better: + 0.10a 0.06a 0.08a,b (41-61) Stayed the same: 0.00a,b 0.00a,b 0.00b,c (223-257) Got a little worse: - 0.09b - 0.10b - 0.10c (107-125) Got a lot worse: - 0.37c - 0.26c - 0.31d (28-35)