QDET2, Miami, FL, Hibiscus A

Slides:



Advertisements
Similar presentations
How does PROMIS compare to other HRQOL measures? Ron D. Hays, Ph.D. UCLA, Los Angeles, CA Symposium 1077: Measurement Tools to Enhance Health- Related.
Advertisements

Remaining Challenges and What to Do Next: Undiscovered Areas Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research
15-minute Introduction to PROMIS Ron D. Hays, Ph.D UCLA Division of General Internal Medicine & Health Services Research Roundtable Meeting on Measuring.
Primer on Evaluating Reliability and Validity of Multi-Item Scales Questionnaire Design and Testing Workshop October 25, 2013, 3:30-5:00pm Wilshire.
DIFFERENTIAL ITEM FUNCTIONING AND COGNITIVE ASSESSMENT USING IRT-BASED METHODS Jeanne Teresi, Ed.D., Ph.D. Katja Ocepek-Welikson, M.Phil.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
1 Health-Related Quality of Life as an Indicator of Quality of Care Ron D. Hays, Ph.D. HS216—Quality Assessment: Making the Business Case.
The emotional distress of children with cancer in China: An Item Response Analysis of C-Ped-PROMIS Anxiety and Depression Short Forms Yanyan Liu 1, Changrong.
Introduction Neuropsychological Symptoms Scale The Neuropsychological Symptoms Scale (NSS; Dean, 2010) was designed for use in the clinical interview to.
SAS PROC IRT July 20, 2015 RCMAR/EXPORT Methods Seminar 3-4pm Acknowledgements: - Karen L. Spritzer - NCI (1U2-CCA )
Development of Physical and Mental Health Summary Scores from PROMIS Global Items Ron D. Hays ( ) UCLA Department of Medicine
1 Differential Item Functioning in Mplus Summer School Week 2.
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
Item Response Theory (IRT) Models for Questionnaire Evaluation: Response to Reeve Ron D. Hays October 22, 2009, ~3:45-4:05pm
Multi-item Scale Evaluation Ron D. Hays, Ph.D. UCLA Division of General Internal Medicine/Health Services Research
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
Overlap between Subjective Well-being and Health-related Quality of Life. 3 Ron D. Hays, Ph.D. (Alina Palimaru) November 18, 2015 (11:30-12:00 noon) Geriatric.
Item Response Theory Dan Mungas, Ph.D. Department of Neurology
Considerations in Comparing Groups of People with PROs Ron D. Hays, Ph.D. UCLA Department of Medicine May 6, 2008, 3:45-5:00pm ISPOR, Toronto, Canada.
Item Response Theory Dan Mungas, Ph.D. Department of Neurology University of California, Davis.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Latent Curve Modeling to Understand Achievement Emotions Gavin Brown Quant-DARE Methods Showcase Feb 22, 2016.
IRT Equating Kolen & Brennan, 2004 & 2014 EPSY
Study Population In , 1,154 Mexican origin youth (aged years) indicated which of 50 movies (randomly selected from a pool of 250 popular movies.
Stats Methods at IC Lecture 3: Regression.
Bootstrap and Model Validation
Evaluating Patient-Reports about Health
Theme 6. Linear regression
Nonparametric Statistics
Chapter 4 Basic Estimation Techniques
Ron D. Hays September 12, 2006 Network Testing ~ 5 or 6pm
Psychometric Evaluation of Items Ron D. Hays
Further Validation of the Personal Growth Initiative Scale – II: Gender Measurement Invariance Harmon, K. A., Shigemoto, Y., Borowa, D., Robitschek, C.,
Vertical Scaling in Value-Added Models for Student Learning
Diabetes and Hypertension Health Screening in the Fresno Sikh Population: A Cross Sectional Approach Baljit Singh Dhesi 1,2 1University of California,
Intelligence Andrea Mejia Spring 2017.
Parental Alcoholism and Adolescent Depression?
Running models and Communicating Statistics
Introduction to Regression Analysis
Computations, and the best fitting line.
Chapter 15 Confirmatory Factor Analysis
A Different Way to Think About Measurement Development:
Maternal Demographics
Psychometric Properties of an Acculturation Scale:
UCLA Department of Medicine
Evaluating Patient-Reports about Health
UCLA Department of Medicine
Evaluating Multi-Item Scales
Ron D. Hays GIM-HSR Friday Noon Seminar Series November 4, 2016
Study Limitations and Future Directions See Handout for References
CJT 765: Structural Equation Modeling
Evaluating IRT Assumptions
What is development? Domains of development
Generalized Linear Models
Multiple logistic regression
Nonparametric Statistics
His Name Shall Be Revered …
Logistic Regression.
Spanish and English Neuropsychological Assessment Scales - Guiding Principles and Evolution Friday Harbor Psychometrics Workshop 2005.
Basic Practice of Statistics - 3rd Edition Inference for Regression
Chapters Important Concepts and Terms
Evaluating the Psychometric Properties of Multi-Item Scales
Evaluating Multi-item Scales
Multitrait Scaling and IRT: Part I
Evaluating Multi-item Scales
Levine et al continued.
Psychometric testing and validation (Multi-trait scaling and IRT)
UCLA Department of Medicine
MATH 2311 Review for Exam 2 10 Multiple Choice (70 points): Test 2
Presentation transcript:

QDET2, Miami, FL, Hibiscus A Differential Item Functioning and Person Fit on the PROMIS® Physical Functioning Items among Children and Adolescents Ron D. Hays QDET2, Miami, FL, Hibiscus A November 10, 2016 (2:11-2:28pm) Using Pretesting Methods to Develop Standardized Survey Qs for Use in Cross National/Cross Cultural/Multi-Lingual Settings

Collaborators José Luis Calderón Karen L. Spritzer Steve P. Reise Sylvia H. Paz

In the past 7 days … 0 = Not able to do; 1 = With a lot of trouble; 2 =With some trouble; 3 = With a little trouble; 4 = With no trouble 29 upper extremity items I could button my shirt or pants. I could open a jar by myself. 23 mobility items I could run a mile. I could ride a bike. DeWitt et al., J Clin Epid, 2011, 64 (7), 794-804 21% of those 6-17 years old in U.S. speak language other than English at home. American Community Survey (2007)

Differential Item Functioning (DIF) Controlling for underlying upper extremity (mobility), Probability of picking a particular response (e.g.,”not able to do”) Differs when item is administered in English versus Spanish.

Samples English-language Spanish-language 5,091 children and adolescents from medical clinics in North Carolina and Texas and community schools in North Carolina. Spanish-language 605 children and adolescents of adults who are members of an internet panel. Hays, R. D., Liu, H., & Kapteyn, A. (2015). Use of internet panels to conduct surveys. Behavior Research Methods, 47 (3), 685-690. Adults completed demographic questions about child and then passed computer to child.

Spanish-Language Sample Hispanic Spanish-speaking adults Members of the Greenfield/Toluna internet panel (http://us.toluna.com) Average score on 4-item Short Acculturation Scale for Hispanics (SASH) was 2.6 for children. What language(s) do you read and speak? What language(s) do you usually speak at home? What language(s) do you usually speak with your friends? In which language(s) do you usually think? 1 = Only Spanish; 2 = Spanish better than English; 3 = Both equally; 4 = English better than Spanish; 5 = Only English

Spanish Translation Iterative process of forward translations, reconciliation, back translation, multiple reviews, and pre-test with cognitive debriefing (FACIT Translation Methodology). Each translated item pre-tested and debriefed in the US with 5 Spanish-speaking subjects from the general population to try to make sure translation is well understood and conceptually equivalent to the source.

Sample Demographics English (n = 5,091) Spanish (n = 605) % Female 52% 45% % Hispanic 18% 96% Age 8-12 years old 53% 50% 13-17 years old 47%

Confirmatory Factor Analysis Fit Indices 2  -  2 Normed fit index: Non-normed fit index: Comparative fit index: null model  2 2 2   null null - model df df null model  2 null - 1 df null 2  - df 1 - model model  - 2 df null null RMSEA = SQRT (λ2 – df)/SQRT (df (N – 1)) 9

Confirmatory Factor Analysis One-factor model fit the data well in the Spanish (n = 605) sample for upper extremity and mobility, respectively CFIs = 0.998 and 0.996 RMSEAs=0.036 and 0.054 Range of standardized factor loadings 0.824--0.962 (29 upper extremity items) 0.815—0.967 (23 mobility items) CFI>0.95; RMSEA = 0.05 (good)

Ordinal Logistic Regression http://CRAN.R-project.org/package=lordif Model 1 : logit P(ui >= k) = αk + β1 * mobility Model 2 : logit P(ui >= k) = αk + β1 * mobility + β2 * language Model 3 : logit P(ui >= k) = αk + β1 * mobility + β2 * language + β3 * mobility * language DIF assessment (log likelihood values and McFadden’s pseudo R2 >=0.02): - Overall: Model 3 versus Model 1 Non-uniform: Model 3 versus Model 2 Uniform: Model 2 versus Model 1 Note: Purified IRT score used for mobility (conditioning variable).

Upper extremity (4/29 dif) Mobility (7/23 dif) I could hold an empty cup. (harder for Spanish) I could pull open heavy doors. I could pour a drink from a full pitcher. I could open a jar by myself. Mobility I could ride a bike. I could do sports and exercise that other kids my age could do. I could run a mile. I could walk upstairs without holding on to anything. I could keep up when I played with other kids. I used a walker, cane or crutches to get around. (non-uniform) I could turn my head all the way to the side. (non-uniform)

Impact of DIF (4 items) on Test Characteristic Curves: Upper Extremity

Impact of DIF at Individual Level: Upper Extremity

CAT-based Theta Estimates Using English (x-axis) and Spanish (y-axis) Parameters for 29 Upper Extremity Items (n = 605)

Impact of DIF (7 items) on Test Characteristic Curves: Mobility

Impact of DIF at Individual Level: Mobility

CAT-based Theta Estimates Using English (x-axis) and Spanish (y-axis) Parameters for 23 Mobility Items (n = 605)

Person Fit Large negative ZL values (Drasgow et al., 2005) indicate misfit. For example, ZL = -3.13 for a person reporting they could do 13 different physical functioning items (including running 5 miles) without any difficulty, but … reported a little difficulty being out of bed for most of the day.

Questions?

Stocking-Lord Linking Constants Spanish calibrations transformed linearly so that their TCC most closely matches English TCC. a* = a/A and b* = A * b + B Optimal values of A (slope) and B (intercept) transformation constants found through multivariate search to minimize weighted sum of squared distances between TCCs of English and Spanish transformed parameters Stocking, M.L., & Lord, F.M. (1983). Developing a common metric in item response theory. Applied Psychological Measurement, 7, 201-210.