1 Session 6 Minimally Important Differences Dave Cella Dennis Revicki Jeff Sloan David Feeny Ron Hays.

Slides:



Advertisements
Similar presentations
Standardized Scales.
Advertisements

Statistical vs. Practical Significance
An Overview of Minimally Important Difference Estimation in Health-Related Quality of Life Studies Ron D. Hays, Ph.D. October 14, 2004,
ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,
RELIABILITY Reliability refers to the consistency of a test or measurement. Reliability studies Test-retest reliability Equipment and/or procedures Intra-
Inference Sampling distributions Hypothesis testing.
Statistical Issues in Research Planning and Evaluation
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 9_part I ( and 9.7) Tests of Significance.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Sample Size Determination In the Context of Hypothesis Testing
BCOR 1020 Business Statistics
Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.
Sample Size Determination
Chapter 14 Inferential Data Analysis
1 8/14/2015 Evaluating the Significance of Health-Related Quality of Life Change in Individual Patients Ron Hays October 8, 2004 UCLA GIM/HSR.
1 Health-Related Quality of Life Ron D. Hays, Ph.D. - UCLA Department of Medicine: Division of General Internal Medicine.
2 Accuracy and Precision Accuracy How close a measurement is to the actual or “true value” high accuracy true value low accuracy true value 3.
Basic Methods for Measurement of Patient-Reported Outcome Measures Ron D. Hays, Ph.D. UCLA/RAND ISOQOL Conference on.
“A Critical Look at Health-Related Quality of Life Measures” SGIM Annual Meeting Ron D. Hays May 2, 2003 (12:30-1:30 pm)
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Quality Measures for Rehabilitation: Policy, Provider and Patient Perspectives Measuring Clinical Change: Quality Indicators ACRM-ASNR Pre-Conference Institute.
FDA Approach to Review of Outcome Measures for Drug Approval and Labeling: Content Validity Initiative on Methods, Measurement, and Pain Assessment in.
Is the Minimally Important Difference Really 0.50 of a Standard Deviation? Ron D. Hays, Ph.D. June 18, 2004.
Evidence Based Medicine
Use of Health-Related Quality of Life Measures to Assess Individual Patients July 24, 2014 (1:00 – 2:00 PDT) Kaiser Permanente Methods Webinar Series Ron.
Health-Related Quality of Life as an Indicator of Quality of Care May 4, 2014 (8:30 – 11:30 PDT) HPM216: Quality Assessment/ Making the Business Case for.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Lecture 6: Reliability and validity of scales (cont) 1. In relation to scales, define the following terms: - Content validity - Criterion validity (concurrent.
1 Assessing the Minimally Important Difference in Health-Related Quality of Life Scores Ron D. Hays, Ph.D. UCLA Department of Medicine October 25, 2006,
EVIDENCE ABOUT DIAGNOSTIC TESTS Min H. Huang, PT, PhD, NCS.
Evaluating Health-Related Quality of Life Measures June 12, 2014 (1:00 – 2:00 PDT) Kaiser Methods Webinar Series Ron D.Hays, Ph.D.
Measuring Health-Related Quality of Life
Sampling Error.  When we take a sample, our results will not exactly equal the correct results for the whole population. That is, our results will be.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Issues concerning the interpretation of statistical significance tests.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size for Precision or Power.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 6, 2008 (3:30-6:30pm) HS 249F.
EQ-5D and SF-36 Quality of Life Measures in Systemic Lupus Erythematosus: Comparisons with RA, Non-Inflammatory Disorders (NIRD), and Fibromyalgia (FM)
1 Study Design Issues and Considerations in HUS Trials Yan Wang, Ph.D. Statistical Reviewer Division of Biometrics IV OB/OTS/CDER/FDA April 12, 2007.
Assessing Responsiveness of Health Measurements Ian McDowell, INTA, Santiago, March 20, 2001.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 8, 2006 (3:00-6:00pm) HS 249F.
Research planning. Planning v. evaluating research To a large extent, the same thing Plan a study so that it is capable of yielding data that could possibly.
Comparing the Means of Two Dependent Populations.
Approaches for Estimating Minimally Important Differences Ron D. Hays, Ph.D. January 12, 2004 (8:50-9:10am) Minimal Clinically Important Differences in.
Measurement of Outcomes Ron D. Hays Accelerating eXcellence In translational Science (AXIS) January 17, 2013 (2:00-3:00 pm) 1720 E. 120 th Street, L.A.,
Course: Research in Biomedicine and Health III Seminar 5: Critical assessment of evidence.
Chapter 13 Understanding research results: statistical inference.
Evaluating Health-Related Quality of Life Measures Ron D. Hays, Ph.D. UCLA GIM & HSR February 9, 2015 (9:00-11:50 am) HPM 214, Los Angeles, CA.
Chapter ?? 7 Statistical Issues in Research Planning and Evaluation C H A P T E R.
Estimating Minimally Important Differences on PROMIS Domain Scores Ron D. Hays UCLA Department of Medicine/Division of General Internal Medicine & Health.
Evaluating Multi-Item Scales Health Services Research Design (HS 225B) January 26, 2015, 1:00-3:00pm CHS.
Health-Related Quality of Life in Outcome Studies Ron D. Hays, Ph.D UCLA Division of General Internal Medicine & Health Services Research GCRC Summer Session.
Health-Related Quality of Life (HRQOL) Assessment in Outcome Studies Ron D. Hays, Ph.D. UCLA/RAND GCRC Summer Course “The.
CoRPS London 26 & 27 October 2010 Center of Research on Psychology in Somatic diseases Understanding PRO in hematological disorders: Do we have a consensus?
Critical Appraisal Course for Emergency Medicine Trainees Module 2 Statistics.
Statistical Inference
Evidence-Based Medicine Appendix 1: Confidence Intervals
Confidence Intervals and p-values
Understanding Results
Critical Reading of Clinical Study Results
Health-Related Quality of Life Assessment in Outcome Studies
Interpreting Epidemiologic Results.
Estimating Minimally Important Differences (MIDs)
Introduction to Psychometric Analysis of Survey Data
Evaluating the Significance of Individual Change
GIM & HSR Research Seminar: October 5, 2018
UCLA Department of Medicine
Presentation transcript:

1 Session 6 Minimally Important Differences Dave Cella Dennis Revicki Jeff Sloan David Feeny Ron Hays

2 Minimally Important Difference (MID) u One can observe a difference between two groups or within one group over time that is statistically significance but small. u With a large enough sample size, even a tiny difference could be statistically significant. u The MID is the smallest difference that we care about.

3 FDA Draft Guidance Document u “Smallest difference that is considered clinically important, this can be a specified difference (the minimum important difference (MID)) or, in some cases, any detectable difference” (p. 17). u For many wide used measures (pain, treadmill distance, HamD), the ability to show any difference between treatment groups has been considered evidence of a relevant treatment effect. If PRO instruments are to be considered more sensitive than past measures, it can be useful to specify a minimum important difference (MID) as a benchmark for interpreting mean differences” (p. 19)

4 Distribution-Based “Estimation” of MID u Is not an estimate of the MID u Is raw score difference derived from prior information about the MID –e.g., D measure = ES * SD measure u Distribution-based formulas –Effect size (ES) = D/SD –Standardized Response Mean (SRM) = D/SD † –Responsiveness statistic (RS) = D/SD ‡ SD = baseline SD; SD † = SD of D; SD ‡ = SD of D among “unchanged”

5 Standard Error of Measurement u SEM = SD * SQRT (1-reliability) u 95% CI = Estimated true score +/ * SEM u 1 SEM = 0.50 SD when reliability is 0.75

6 Estimating the MID u External anchors –Self-report –Provider report –Clinical measure –Intervention u Anchor correlated with change on target measure –Small (r=.1), moderate (r=.3), large (r=.5) u Anchor indicates “minimal” change

7 Hypothetical Change in Physical Function (T-score units) by magnitude of intervention

8 Getting Hit By Bike is > Minimal Getting Hit by Rock is Closer to MID

9 Self-Report Anchor u People who report a “minimal” change u How is your physical health now compared to 4 weeks ago? u Much improved; Moderately Improved; u Minimally Improved; u No Change; u Minimally Worse; u Moderately Worse; Much Worse

10 Example with Multiple Anchors u 693 RA clinical trial participants evaluated at baseline and 6-weeks post-treatment. u Five anchors: –1) patient global self-report; –2) physician global report; –3) pain self-report; –4) joint swelling; –5) joint tenderness Kosinski, M. et al. (2000). Determining minimally important changes in generic and disease-specific health-related quality of life questionnaires in clinical trials of rheumatoid arthritis. Arthritis and Rheumatism, 43,

11 Patient and Physician Global Reports u How the patient is doing, considering all the ways that RA affects him/here? Very good (asymptomatic and no limitation of normal activities) Good (mild symptoms and no limitation of normal activities) Fair (moderate symptoms and limitation of normal activities) Poor (severe symptoms and inability to carry out most normal activities) Very poor (very severe symptoms that are intolerable and inability to carry out normal activities) --> Improvement of 1 level over time

12 Global Pain, Joint Swelling and Tenderness u 0 = no pain, 10 = severe pain; 10 centimeter visual analog scale u Number of swollen and tender joints -> 1-20% improvement over time

13 Effect Sizes (mean = 0.34) for SF-36 Changes Linked to Minimal Change in Anchors ScaleSelf-RClin.-RPainSwellTenderMean PF Role-P Pain GH EWB Role-E SF EF PCS MCS

14 Use of “No Change” Group in Estimating MID Change #1 MID = ? Change #2 MID = ? Change #3 MID = 4 No Change on Anchor Doesn’t matter + 2 0, +1, or + 2 Minimal Change on Anchor

15 FDA Draft Guidance Document u “When defining a meaningful change on an individual patient basis (i.e., a responder), that definition is generally larger than the minimum important difference for application to group mean comparisons” (p. 30).

16 Change in SF-36 Scores Over Time (n = 54) Effect Size   Reliable Change Index

17 Concluding Thoughts u It is easier to conclude that a difference is clearly or obviously important than it is to say it is always unimportant. u No single best way to estimate MID –Use multiple anchors –Use anchors that represent minimum change u Wide variation in estimates of MID –Report range, inter-quartile range, and confidence intervals around mean estimates.

18 The National Safety Council has determined that if installed properly this new Seat Belt will decrease traffic accidents by 45%. And will also reduce road-rage by 75%.

19 Appendix A u p. 7: iii. Assess Measurement Properties …”identify meaningful differences in scores” u p. 15: “FDA intends to review a PRO instrument for … interpretability (e.g., minimum important difference)” u p. 17: “The FDA is specifically requesting comment on appropriate review of derivation and application of an MID in the clinical trial setting.” u P. 19, “An MID is usually specific to the population under study….If an MID is to be applied to clinical study results, it is generally helpful to use a variety of methods to discover whether concordance among methods confirms the choice of an MID.”

20 Appendix B u p. 25: “In some cases, the FDA may request an a priori definition of the minimum observed difference between treatment group means (i.e., MID) that will serve as a benchmark to interpret whether study findings are conclusive.” u p. 29: “clinically meaningful in the context of the total composite and other individual component results.” u p. 30: “If the MID is truly to be the smallest effect considered meaningful, however, it would be logical to establish the null hypothesis to rule out a difference less than or equal to the MID.” u p. 31: “amount of difference or change observed in a PRO measure between treatment groups in a clinical trial that will be interpreted as a treatment benefit.”

21 Formulas for Significance of Individual Change SEM 95% CI 1.96 * SD b * (1- reliability) 1/2 SEp 90% CI 1.64* SD b * (1- reliability 2 ) 1/2 SEp 95% CI 1.96* SD b * (1- reliability 2 ) 1/2 Estimated true score Mean + reliability (score – mean) Reliable change index X 2 -X 1 / SD b = standard deviation at baseline