1 9/23/2015 Patient Reported Outcomes Measurement Information System (PROMIS) Patient Reported Outcomes Measurement Information System (PROMIS) National.

Slides:



Advertisements
Similar presentations
ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,
Advertisements

Patient-Reported Outcomes Measurement Information System.
How does PROMIS compare to other HRQOL measures? Ron D. Hays, Ph.D. UCLA, Los Angeles, CA Symposium 1077: Measurement Tools to Enhance Health- Related.
PROMIS DEVELOPMENT METHODS, ANALYSES AND APPLICATIONS Presented at the Patient-Reported Outcomes Measurement Information System (PROMIS): A Resource for.
PROMIS: The Right Place at the Right Time? David Cella, Ph.D. Department of Medical Social Sciences Northwestern University Chair, PROMIS Steering Committee.
15-minute Introduction to PROMIS Ron D. Hays, Ph.D UCLA Division of General Internal Medicine & Health Services Research Roundtable Meeting on Measuring.
Why Patient-Reported Outcomes Are Important: Growing Implications and Applications for Rheumatologists Ron D. Hays, Ph.D. UCLA Department of Medicine RAND.
RM DYNAMIC ASSESSMENT OF PATIENT-REPORTED CHRONIC DISEASE OUTCOMES Pre-Application Meeting January 26, 2004 Rockville, MD.
1 Health-Related Quality of Life Ron D. Hays, Ph.D. - UCLA Department of Medicine: Division of General Internal Medicine.
® Introduction Mental Health Predictors of Pain and Function in Patients with Chronic Low Back Pain Olivia D. Lara, K. Ashok Kumar MD FRCS Sandra Burge,
LOINC for Patient-Reported and Clinician-Administered Psychological Tests Saul Rosenberg, PhD Clinical LOINC Meeting January 28, 2011 Tucson, AZ.
Primer on Evaluating Reliability and Validity of Multi-Item Scales Questionnaire Design and Testing Workshop October 25, 2013, 3:30-5:00pm Wilshire.
Rare Diseases and PROMIS : Opportunities Natcher Conference Center March 1, 2013 James Witter MD, PhD FACR CSO PROMIS Medical Officer: Rheumatic Diseases.
Working Paper No.7 22 November 2005 STATISTICAL COMMISSION andSTATISTICAL OFFICE OF THE UN ECONOMIC COMMISSION FOREUROPEAN COMMUNITIES EUROPE (EUROSTAT)
Patient-Reported Outcomes Measurement Information System (PROMIS): Opportunities in Health Services Research Steven Clauser, Ph.D. National Cancer Institute.
Overview of Phase I Data: Approach and Findings Gary Bess Associates April 15, 2009.
Construction and Evaluation of Multi-item Scales Ron D. Hays, Ph.D. RCMAR/EXPORT September 15, 2008, 3-4pm
The emotional distress of children with cancer in China: An Item Response Analysis of C-Ped-PROMIS Anxiety and Depression Short Forms Yanyan Liu 1, Changrong.
FDA Approach to Review of Outcome Measures for Drug Approval and Labeling: Content Validity Initiative on Methods, Measurement, and Pain Assessment in.
® From Bad to Worse: Comorbidities and Chronic Lower Back Pain Margaret Cecere JD, Richard Young MD, Sandra Burge PhD The University of Texas Health Science.
Introduction Neuropsychological Symptoms Scale The Neuropsychological Symptoms Scale (NSS; Dean, 2010) was designed for use in the clinical interview to.
PROMIS ® Alcohol Use Item Banks: Construction and Calibration *Presenter, 1 University of Pittsburgh Medical Center, 2 Department of Psychiatry, University.
September 151 Screening for Disability Washington Group on Disability Statistics.
Use of Health-Related Quality of Life Measures to Assess Individual Patients July 24, 2014 (1:00 – 2:00 PDT) Kaiser Permanente Methods Webinar Series Ron.
SAS PROC IRT July 20, 2015 RCMAR/EXPORT Methods Seminar 3-4pm Acknowledgements: - Karen L. Spritzer - NCI (1U2-CCA )
#1 STATISTICS 542 Intro to Clinical Trials Quality of Life Assessment.
1 Assessing the Minimally Important Difference in Health-Related Quality of Life Scores Ron D. Hays, Ph.D. UCLA Department of Medicine October 25, 2006,
PROMIS ® : Advancing the Science of PRO measurement Common Data Elements NIH CDE Webinar September 8, 2015 Ashley Wilder Smith, PhD, MPH Chief, Outcomes.
Development of Physical and Mental Health Summary Scores from PROMIS Global Items Ron D. Hays ( ) UCLA Department of Medicine
Introduction to the Patient-Reported Outcomes Measurement Information System (PROMIS) UCLA Center for East-West Medicine 2428 Santa Monica Blvd., Suite.
1 11/17/2015 Psychometric Modeling and Calibration Ron D. Hays, Ph.D. September 11, 2006 PROMIS development process session 10:45am-12:30 pm.
Impact of using a next button in a web-based health survey on time to complete and reliability of measurement Ron D. Hays (Rita Bode, Nan Rothrock, William.
Presented By Ron D. Hays, Ph.D. April 8, 2010 (MNRS Pre-Conference Workshop) Domains of PROMIS and how they were developed Dynamic Tools to Measure Health.
Item Response Theory (IRT) Models for Questionnaire Evaluation: Response to Reeve Ron D. Hays October 22, 2009, ~3:45-4:05pm
Reliability and validity of the adapted Spanish version of the Early Onset Scoliosis-24 questionnaire María del Mar Pozo-Balado, PhD Hiroko Matsumoto PhD.
Psychometric Evaluation of Questionnaire Design and Testing Workshop December , 10:00-11:30 am Wilshire Suite 710 DATA.
Overlap between Subjective Well-being and Health-related Quality of Life. 3 Ron D. Hays, Ph.D. (Alina Palimaru) November 18, 2015 (11:30-12:00 noon) Geriatric.
Item Response Theory Dan Mungas, Ph.D. Department of Neurology
Ensuring Consistency in Assessment of Continuing Care Needs: An Application of Differential Item Functioning Analysis R. Prosser, M. Gelin, D. Papineau,
Considerations in Comparing Groups of People with PROs Ron D. Hays, Ph.D. UCLA Department of Medicine May 6, 2008, 3:45-5:00pm ISPOR, Toronto, Canada.
Item Response Theory Dan Mungas, Ph.D. Department of Neurology University of California, Davis.
Impact of using a next button in a web-based health survey on time to complete and reliability of measurement Ron D. Hays (Rita Bode, Nan Rothrock, William.
Overview of Item Response Theory Ron D. Hays November 14, 2012 (8:10-8:30am) Geriatrics Society of America (GSA) Pre-Conference Workshop on Patient- Reported.
Health-Related Quality of Life in Outcome Studies Ron D. Hays, Ph.D UCLA Division of General Internal Medicine & Health Services Research GCRC Summer Session.
Mental health workgroup UPDATE 15 TH WASHINGTON GROUP MEETING OCTOBER 2015.
Health-Related Quality of Life (HRQOL) Assessment in Outcome Studies Ron D. Hays, Ph.D. UCLA/RAND GCRC Summer Course “The.
Predictors of Functioning in Women with Fibromyalgia Syndrome (FMS) Alexa Stuifbergen, PhD, RN, FAAN Professor Dolores V.Sands Chair in Nursing Research.
Reducing Burden on Patient- Reported Outcomes Using Multidimensional Computer Adaptive Testing Scott B. MorrisMichael Bass Mirinae LeeRichard E. Neapolitan.
Quantification of dyspnea using descriptors: Development and initial testing of the Dyspnea-12 J Yorke, S H Moosavi, C Shuldham, P W Jones (Thorax
Instrument Development and Psychometric Evaluation: Scientific Standards May 2012 Dynamic Tools to Measure Health Outcomes from the Patient Perspective.
Introduction to PROMIS®
Introduction to Neuro-QoL
Introduction to ASCQ-MeSM
Psychometric Evaluation of Items Ron D. Hays
NIH: Patient-Reported Outcomes Measurement Information System (PROMIS®) Ron D. Hays Functional Vision and Visual Function November 10, 2016, 8:55-9:15am.
UCLA Department of Medicine
UCLA Department of Medicine
PROMIS-29 V2.0 Physical and Mental Health Summary Scores Ron D. Hays
Evaluating IRT Assumptions
Introduction to ASCQ-Me®
Introduction to PROMIS®
Introduction to Neuro-QoL
Introduction to PROMIS®
Introduction to Neuro-QoL
Introduction to ASCQ-Me®
Introduction to PROMIS®
International Perthes Study Group
Estimating Minimally Important Differences (MIDs)
GIM & HSR Research Seminar: October 5, 2018
Patient-reported Outcome Measures
Presentation transcript:

1 9/23/2015 Patient Reported Outcomes Measurement Information System (PROMIS) Patient Reported Outcomes Measurement Information System (PROMIS) National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS) U01 grant Ron D. Hays, Ph.D., UCLA September 29, 2006 GIM/HSR Research Seminar Series 12:02pm-12:59 pm

2 9/23/2015 NIH Director Elias A. Zerhouni, MD “There is a pressing need to better quantify clinically important symptoms and outcomes that are now difficult to measure. Clinical outcome measures, such as x-rays and lab tests, have minimally immediate relevance to the day-to-day functioning of patients with chronic diseases such as arthritis, multiple sclerosis, and asthma, as well as chronic pain conditions. Often, the best way patients can judge the effectiveness of treatments is by perceived changes in symptoms. One main goal of the PROMIS initiative is to develop a set of publicly available computerized adaptive tests for the clinical research community.”

3 9/23/2015 NIH RM04-011: Dynamic Assessment of Patient-Reported Chronic Disease Outcomes Translation arm of re-engineering clinical research enterprise Chronic diseases and their treatment affect “quality of life” Improved assessment of “subjective” clinical outcomes - Self-reported symptoms and other health-related quality of life domains

4 9/23/2015 ADL – Activities of Daily Living IADL – Instrumental Activities of Daily Living G – Global Item Satisfaction Performance G Symptoms G Activities: Instrumental Activities of Daily Living [IADL] (e.g. errands) Other Social Support G Anxiety G Anger/Aggression G Depression G Fatigue G Substance Abuse Positive Psychological Functioning Negative Impacts of Illness Subjective Well-Being (positive affect) Positive Impacts of Illness Meaning and Coherence (spirituality) G Emotional Distress Mastery and Control (self-efficacy) Cognitive Function G Central: neck and back (twisting, bending, etc) G Lower Extremities: walking, arising, etc [mobility] G Upper Extremities: grip, buttons, etc [dexterity] G Function/Disability G Physical Health G Mental Health G Social Health Satisfaction G Health G Pain Patient-Reported Outcomes (PROs) Preliminary PROMIS Domains (light shade) G Role Participation

5 9/23/2015 Initial PROMIS Domains Physical functioning (4) Pain (3) Fatigue (2) Social/role participation (2) Emotional distress Anxiety Depression Anger Alcohol abuse

6 9/23/2015 Objectives Develop and test a large bank of items measuring health- related quality of life Create a publicly available, adaptable and sustainable system allowing clinical researchers access to a * common item repository * computer adaptive testing (CAT) platform for efficient assessment across a range of chronic diseases

7 9/23/2015 What would you like to measure in your study? Domain definitions item content review item characteristics Click below for: or select a box on the right and proceed pain emotional distress physical function fatigue social role participation

8 9/23/2015 Collaborative Agreement Steering Committee 6 Primary Research Sites Statistical Coordinating Center Scientific Advisory Board

9 9/23/2015 PROMIS Network UNC –Chapel Hill ● ● Duke University ● Stanford University Stony Brook University ● Stony Brook University ● ● University of Pittsburgh ● University of Washington CORE ♥ NIH ● NIH

10 9/23/2015 Primary Research Sites Duke (Kevin Weinfurt, evaluation committee, participation and data quality committee, use in clinical trials, cancer supplement) Stanford (Jim Fries, physical function, domain hierarchy) Stony Brook (Arthur Stone, fatigue, ecological momentary assessment) UNC (Darren DeWalt, social/role participation, pediatrics, literacy) University of Pittsburgh (Paul Pilkonis, emotional distress, sleep) University of Washington (Dagmar Amtmann, pain, universal access)

11 9/23/2015 Statistical Coordinating Center Northwestern (David Cella and cast) UCLA (Hays, Liu, Reise, Spritzer, Morales) Other consultants (e.g., Dennis Revicki) Westat

12 9/23/2015 What PROMIS Promises P recision R epository O utcome tools M ethodologies I nterpretability S oftware

13 9/23/2015 Precision  Fewer items needed for equal precision  Making assessment briefer  Error is understood at the individual level  Enabling individual assessment

14 9/23/2015 Item Bank (IRT-calibrated items) Item Response Theory (IRT) Short Form Instruments CAT Items from Instrument A Item Pool Items from Instrument B Items from Instrument C New Items Secondary Data Analysis Cognitive Testing Focus Groups Content Expert Review  Questionnaire administered to large representative sample            

15 9/23/2015 Item Characteristic Curves (able to climb flight of stairs versus run a mile)

16 9/23/2015 Item Characteristic Curves (2-Parameter Model)

17 9/23/2015 Item Banks no pain mild pain moderate pain severe painextreme pain    Pain Item Bank Item 1 Item 2 Item 3 Item 4 Item 5 Item 6 Item 7 Item 8 Item 9 Item n An item bank is comprised of a large collection of items measuring a single domain (e.g., pain).

18 9/23/2015 Fatigue Measure and Standard Error Comparison by Test Length

19 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low Emotional Distress How often did you feel nervous? All of the time Most of the time Little of the time Some of the time None of the time

20 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low Emotional Distress How often did you feel nervous? Some of the time

21 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low Emotional Distress How often did you feel nervous? Some of the time

22 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low Emotional Distress How often did you feel hopeless? All of the time Most of the time Little of the time Some of the time None of the time

23 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low Emotional Distress How often did you feel hopeless? Some of the time

24 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low Emotional Distress How often did you feel worthless? All of the time Most of the time Little of the time Some of the time None of the time

25 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low Emotional Distress How often did you feel worthless? Little of the time

26 9/23/2015 Item Bank (IRT-calibrated emotional distress items) Severe high moderatelowvery low How often did you feel worthless? Little of the time Target in on emotional distress score

27 9/23/2015 Repository

28 9/23/2015 Repository : PROMIS Item Library Literature searches and investigator contributions to the PROMIS domains Relational database of more than 7,000 items Catalog characteristics of items including  Context  Stem  Response options  Time frame  Instrument of origin (if applicable)  Intellectual property status  Track modifications to items

29 9/23/2015 Outcome Tools

30 9/23/2015 Item Library CATCAT Build-a-PROBuild-a-PRO Pick-a-PROPick-a-PRO

31 9/23/2015 Methodologies

32 9/23/2015 Methodologies Qualitative Item Review  Expert item review of 6,871 items  26 focus groups; over 120 patients interviewed; over 700 surveys Analysis Plan  Classical test theory and IRT analyses Sampling Plan  11,500 people; 951 items; minimum n = 500/item

33 9/23/2015 Interpretation

34 9/23/2015 LowLow HighHigh                                                       Person Fatigue Score Interpretation                                                     Q Q Q Q Q Q Q Q Q Likely Likely “I get tired when I run a marathon” Likely Likely “I get tired when I run a marathon” Unlikely “I get tired when I get out of a chair”Unlikely “I get tired when I get out of a chair” Item Location Q Q Q Q Q Q Q Q Q Q Q Q Q Q Q    

35 9/23/2015 LowLow HighHigh                                                       PRO Bank Person Score Interpretation Aids                                                     M = 50, SD = 10 T = (z * 10) + 50    

36 9/23/2015 LowLow HighHigh                                                       Example of high fatigue           Fatigue Score=60                                               This patient’s fatigue score is 60, significantly worse than average (50). People who score 60 on fatigue tend to answer questions as follows: …”I have been too tired to climb one flight of stairs: VERY MUCH …”I have had enough energy to go out with my family: A LITTLE BIT Click here if you would like to see this patient’s individual answers

37 9/23/2015 LowLow HighHigh                                                       Example of low fatigue           Fatigue Score=40                                               This patient’s fatigue score is 40, significantly better than average (50). People who score 40 on fatigue tend to answer questions as follows: …”I have been too tired to climb one flight of stairs: SOMEWHAT …”I have had enough energy to go out with my family: VERY MUCH Click here if you would like to see this patient’s individual answers

38 9/23/2015 Software

39 9/23/2015 Software Survey Construction Report Module Clinical/PRO Data Study Protocols Coming to Item Banks Patients Clinicians Item Library Survey Module Web Laptop PDA IVR CATPick-a-PROBuild-a-PRO Domains and item banks banks

40 9/23/2015 Web-based administration: Emotional distress (Anger) item

41 9/23/2015 Four possible administration formats  Automatic advance, not allowed to go back  Automatic advance, allowed to go back  Click to continue after response, not allowed to go back  Click to continue, allowed to go back

42 9/23/2015 Evaluated administration format 806 participants in Polimetrix PollingPlace registry 56 items measuring the performance of social/role activities  Items rated on a 5-point frequency scale ranging from ”never” to “always” 56 items measuring satisfaction with social/role activities  Items rated on a 5-point extent scale ranging from “not at all” to “very much”

43 9/23/2015 Analysis Plan Examined differences in:  Time spent  Mean domain scores  Variance in scores  Reliability

44 9/23/2015 Administration format conclusions Use automatic advance rather than continue button Use back button to guard against accidental key entry  Response time cost was minimal  No effect on missing data or scores

45 9/23/2015 PROMIS Sampling Administer a large number of items to a range of population subgroups (general population and clinical) to permit the estimation of item parameters for item banks in five health-related quality of life domains.

46 9/23/2015 Number of items 1013 items 784 items: 14 item banks (56 items per bank) in 5 domains 167 legacy items 62 demographic Items items administered to any one respondent n = 11,500 overall (500 observations per item minimum)

47 9/23/2015 Target Polimetrix General Population Demographics Gender Male 50% Female 50% Age = 20% = 20% = 20% = 20% 75+ = 20% Ethnicity (match the general population) Black = 12.3% Hispanic or Latino = 12.5% Education Min 25% High school graduate or less

48 9/23/2015 Samples N General population (Polimetrix) 7,500 Cancer (Duke, Polimetrix) 1000 Heart Disease (Duke, Polimetrix) 500 Rheumatoid arthritis (Stanford) 500 Osteoarthritis (Stanford, Polimetrix) 500 Psychiatric (Pittsburgh, Polimetrix) 500 Spinal Cord Injury (Polimetrix) 500 Chronic Obstructive Pulmonary Disease (Polimetrix) 500 TOTAL 11,500

49 9/23/2015 DomainSub-Domain ItemsFormSampleN Emotional Distress Anxiety56 AGen. Pop Depression56 Anger/Aggression56 BGen. Pop Alcohol Abuse56 Physical Function Part I56 CGen. Pop Part II56 Part III56GGen. Pop Fatigue Impact56 DGen. Pop Experience56 Social Role Impact56 EGen. Pop Experience56 Pain Interference56 FGen. Pop Quality56 Behavior56GGen. Pop. 7(above) Full Bank Administration

50 9/23/2015 Item Number → Sample Size ↓ General Pop. 8250HHHHHHH General Pop IIIIIII General Pop JJJJJJJ Heart Disease 250HHHHHHH JJJJJJJ Cancer 250IIIIIII KKKKK Block Administration

51 9/23/2015 Item Number → Sample Size ↓ General Pop. 8250HHHHHHH General Pop IIIIIII General Pop JJJJJJJ Heart Disease 250HHHHHHH JJJJJJJ Cancer 250IIIIIII KKKKK Between Banks Block Administration Strategy Item 5 is administered in Block Forms H and I to General Population Sample 8 as well as Heart Disease and Cancer Samples and General Population Sample 9

52 9/23/2015 Types of Analyses Classical Test Theory StatisticsClassical Test Theory Statistics IRT Model AssumptionsIRT Model Assumptions Model FitModel Fit Differential Item FunctioningDifferential Item Functioning Item CalibrationItem Calibration

53 9/23/2015 Classical Test Theory Statistics Out of rangeOut of range Item frequencies and distributionsItem frequencies and distributions Inter-item correlationsInter-item correlations Item-scale correlationsItem-scale correlations Internal consistency reliabilityInternal consistency reliability

54 9/23/2015 IRT Model Assumptions (Uni)dimensionality(Uni)dimensionality Local independenceLocal independence MonotonicityMonotonicity

55 9/23/2015 Sufficient Unidimensionality Confirmatory factor modelsConfirmatory factor models  One factor  Bifactor (general and group factors)

56 9/23/2015 Local Independence After controlling for dominant factor(s), item pairs should not be associated.After controlling for dominant factor(s), item pairs should not be associated.  Look at residual correlations (> 0.20)

57 9/23/2015 Monotonicity Probability of selecting a response category indicative of better health should increase as underlying health increases.Probability of selecting a response category indicative of better health should increase as underlying health increases. Item response function graphs withItem response function graphs with  y-axis: proportion positive for item step  x-axis: raw scale score minus item score

58 9/23/2015

59 9/23/2015 Category Response Curves for Samejima’s Graded Response Model

60 9/23/2015 Model Fit Compare observed and expected response frequencies by item and response categoryCompare observed and expected response frequencies by item and response category Items that do not fit and less discriminating items identified and reviewed by content expertsItems that do not fit and less discriminating items identified and reviewed by content experts

61 9/23/2015 Differential Item Functioning Uniform DIFUniform DIF  Threshold parameter Non-uniform DIFNon-uniform DIF  Discrimination parameter Gender, race/ethnicity, age, education, diseaseGender, race/ethnicity, age, education, disease

62 9/23/2015 DIF – Location (Item 1) DIF – Slope (Item 2) Hispanics Whites Hispanics Dichotomous Items Showing DIF (2-Parameter Model)

63 9/23/2015 Item Calibration Item parameters (threshold, discrimination)Item parameters (threshold, discrimination) Mean differences for studied disease groupsMean differences for studied disease groups

64 9/23/2015 Questions? Public website: Peer-reviewed manuscripts, e.g.: Hays, R. D. et al. (in press). Item response theory analyses of physical functioning items in the Medical Outcomes Study. Medical Care. Reeve, B. B., et al. (submitted). Psychometric evaluation and calibration of health-related quality of life items banks: Plans for the Patient-Reported Outcome Measurement Information System (PROMIS)

65 9/23/2015 Acknowledgements “Slides” in this presentation were lifted or adapted from PROMIS presentations by David Cella, Richard Gershon, Bryce Reeve, and Jim Fries.

66 9/23/2015 Appendices Pre-Application Meeting for the RFA-RM : Dynamic Assessment of Patient-Reported Chronic Disease Outcomes Monday, January 26, 2004 Deborah N. Ader, Ph.D. and Lawrence J. Fine, M.D., Dr.PH Total Running Time: 02:40:08