UCLA Department of Medicine

Slides:



Advertisements
Similar presentations
Standardized Scales.
Advertisements

Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
MEASUREMENT CONCEPTS © 2012 The McGraw-Hill Companies, Inc.
 A description of the ways a research will observe and measure a variable, so called because it specifies the operations that will be taken into account.
Reliability & Validity.  Limits all inferences that can be drawn from later tests  If reliable and valid scale, can have confidence in findings  If.
Part II Sigma Freud & Descriptive Statistics
PSYC512: Research Methods PSYC512: Research Methods Lecture 9 Brian P. Dyre University of Idaho.
PowerPoint Slides developed by Ms. Elizabeth Freeman
In the name of Allah. Development and psychometric Testing of a new Instrument to Measure Affecting Factors on Women’s Behaviors to Breast Cancer Prevention:
Chapter 14 Inferential Data Analysis
Power and Non-Inferiority Richard L. Amdur, Ph.D. Chief, Biostatistics & Data Management Core, DC VAMC Assistant Professor, Depts. of Psychiatry & Surgery.
AM Recitation 2/10/11.
PTP 560 Research Methods Week 3 Thomas Ruediger, PT.
Data Analysis. Quantitative data: Reliability & Validity Reliability: the degree of consistency with which it measures the attribute it is supposed to.
Is the Minimally Important Difference Really 0.50 of a Standard Deviation? Ron D. Hays, Ph.D. June 18, 2004.
Statistics Definition Methods of organizing and analyzing quantitative data Types Descriptive statistics –Central tendency, variability, etc. Inferential.
1 Assessing the Minimally Important Difference in Health-Related Quality of Life Scores Ron D. Hays, Ph.D. UCLA Department of Medicine October 25, 2006,
Tests and Measurements Intersession 2006.
Evaluating Health-Related Quality of Life Measures June 12, 2014 (1:00 – 2:00 PDT) Kaiser Methods Webinar Series Ron D.Hays, Ph.D.
Chapter 16 The Chi-Square Statistic
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Research methods in clinical psychology: An introduction for students and practitioners Chris Barker, Nancy Pistrang, and Robert Elliott CHAPTER 4 Foundations.
Measurement Issues by Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research (June 30, 2008, 4:55-5:35pm) Diabetes & Obesity.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
1 Session 6 Minimally Important Differences Dave Cella Dennis Revicki Jeff Sloan David Feeny Ron Hays.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 6, 2008 (3:30-6:30pm) HS 249F.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 8, 2006 (3:00-6:00pm) HS 249F.
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
Evaluation of structural equation models Hans Baumgartner Penn State University.
Chapter 13 Understanding research results: statistical inference.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire.
Evaluating Health-Related Quality of Life Measures Ron D. Hays, Ph.D. UCLA GIM & HSR February 9, 2015 (9:00-11:50 am) HPM 214, Los Angeles, CA.
Evaluating Multi-item Scales Ron D. Hays, Ph.D. UCLA Division of General Internal Medicine/Health Services Research HS237A 11/10/09, 2-4pm,
Evaluating Multi-Item Scales Health Services Research Design (HS 225B) January 26, 2015, 1:00-3:00pm CHS.
Chapter 2 Norms and Reliability. The essential objective of test standardization is to determine the distribution of raw scores in the norm group so that.
Descriptive Statistics Report Reliability test Validity test & Summated scale Dr. Peerayuth Charoensukmongkol, ICO NIDA Research Methods in Management.
Evaluating Patient-Reports about Health
Clinical practice involves measuring quantities for a variety of purposes, such as: aiding diagnosis, predicting future patient outcomes, serving as endpoints.
Ch. 5 Measurement Concepts.
Lecture Nine - Twelve Tests of Significance.
Measures of Agreement Dundee Epidemiology and Biostatistics Unit
Product Reliability Measuring
Research Methodology Lecture No :25 (Hypothesis Testing – Difference in Groups)
Background and Context Research Question and Proposition
UCLA Department of Medicine
Evaluating Patient-Reports about Health
UCLA Department of Medicine
Evaluating Multi-Item Scales
assessing scale reliability
Computing Reliability
Evaluating IRT Assumptions
Final Session of Summer: Psychometric Evaluation
Health-Related Quality of Life Assessment in Outcome Studies
Reliability, validity, and scaling
תוקף ומהימנות של ה- Dementia Quality of Life בארה"בMeasure
Quantitative Methods in HPELS HPELS 6210
The first test of validity
How can one measure intelligence?
Chapter 18: The Chi-Square Statistic
Evaluating the Psychometric Properties of Multi-Item Scales
Evaluating Multi-item Scales
Multitrait Scaling and IRT: Part I
Evaluating Multi-item Scales
Psychometric testing and validation (Multi-trait scaling and IRT)
Introduction to Psychometric Analysis of Survey Data
Evaluating the Significance of Individual Change
Presentation transcript:

UCLA Department of Medicine Quantitative Methods: Reliability, Validity, and Use of SEM to Assess Psychometric Equivalence Ron D. Hays, Ph.D. UCLA Department of Medicine Los Angeles, CA November 18, 2005

Measurement Range for Health Outcome Measures Nominal Ordinal Interval Ratio

Indicators of Acceptability Response rate Administration time Missing data (item, scale)

Variability All scale levels are represented Distribution approximates bell-shaped “normal”

Observed = true + systematic + random score error error Measurement Error Observed = true + systematic + random score error error (bias)

Flavors of Reliability Test-retest (administrators) Intra-rater (raters) Internal consistency (items)

Intraclass Correlation and Reliability Intraclass Correlation Model Reliability Intraclass Correlation One-Way Two-Way Fixed Random

Cronbach’s Alpha Source df SS MS 4 11.6 2.9 1 0.1 4.4 1.1 9 16.1 Respondents (BMS) 4 11.6 2.9 Items (JMS) 1 0.1 Resp. x Items (EMS) 4.4 1.1 Total 9 16.1 Alpha = 2.9 - 1.1 = 1.8 = 0.62 2.9

Reliability Minimum Standards 0.70 or above (for group comparisons) 0.90 or higher (for individual assessment) SEM = SD (1- reliability)1/2

Construct Validity Does measure relate to other measures in ways consistent with hypothesis? Responsiveness to change

Responsiveness to Change and Minimally Important Difference HRQOL measures should be responsive to interventions that changes HRQOL Evaluating responsiveness requires assessment of HRQOL pre-post intervention of known efficacy at two times in tandem with anchor HRQOL change among people who changed on anchor

Self-Report Anchor Overall has there been any change in your asthma since the beginning of the study? Much improved; Moderately improved; Minimally improved No change Much worse; Moderately worse; Minimally worse

Clinical Anchor “changed” group = seizure free (100% reduction in seizure frequency) “unchanged” group = < 50% change in seizure frequency

Responsiveness Indices (1) Effect size (ES) = D/SD (2) Standardized Response Mean (SRM) = D/SD† (3) Guyatt responsiveness statistic (RS) = D/SD‡ D = raw score change in “changed” group; SD = baseline SD; †SD = SD of D; ‡ SD = SD of D among “unchanged”

Effect Size Benchmarks Small: 0.20->0.49 Moderate: 0.50->0.79 Large: 0.80 or above

Hypothetical Multitrait/Multi-Item Correlation Matrix

Confirmatory Factor Analysis Compares observed covariances with covariances generated by hypothesized model Statistical and practical tests of fit Factor loadings Correlations between factors Regression coefficients

Fit Indices 1 - Normed fit index: Non-normed fit index: 2  -  2 Normed fit index: Non-normed fit index: Comparative fit index: null model  2 2 2   null null - model df df null model  2 null - 1 df null 2  - df 1 - model model  - 2 df null null

Acknowledgments Supported in whole by UCLA Center for Health Improvement in Minority Elders/Resource Centers for Minority Aging Research, National Institute on Aging (AG-02-004)