Factor Analysis Ron D. Hays, Ph.D. N208, Factor 5-255 (3-6pm) February 23, 2005.

Slides:



Advertisements
Similar presentations
Factor Analysis and Principal Components Removing Redundancies and Finding Hidden Variables.
Advertisements

Factor Analysis Continued
Chapter Nineteen Factor Analysis.
Structural Equation Modeling
Lecture 7: Principal component analysis (PCA)
Psychology 202b Advanced Psychological Statistics, II April 7, 2011.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Principal Components An Introduction Exploratory factoring Meaning & application of “principal components” Basic steps in a PC analysis PC extraction process.
Factor Analysis There are two main types of factor analysis:
A quick introduction to the analysis of questionnaire data John Richardson.
Education 795 Class Notes Factor Analysis II Note set 7.
1 8/14/2015 Evaluating the Significance of Health-Related Quality of Life Change in Individual Patients Ron Hays October 8, 2004 UCLA GIM/HSR.
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Factor Analysis Psy 524 Ainsworth.
Principal Components An Introduction
Basic Methods for Measurement of Patient-Reported Outcome Measures Ron D. Hays, Ph.D. UCLA/RAND ISOQOL Conference on.
1 9/14/2015 Options for Summarizing the SF-36 Health Survey in Health-Related Quality of Life Research Ron D. Hays, Ph.D. NCI, March 29, 2007 (11:00-11:59.
Some matrix stuff.
Profile Health-Related Quality of Life Measures
Use of Health-Related Quality of Life Measures to Assess Individual Patients July 24, 2014 (1:00 – 2:00 PDT) Kaiser Permanente Methods Webinar Series Ron.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Psy 427 Cal State Northridge Andrew Ainsworth PhD.
Tests and Measurements Intersession 2006.
Measuring Health-Related Quality of Life Ron D. Hays, Ph.D. UCLA Department of Medicine RAND Health Program UCLA Fielding School of Public Health
MGMT 6971 PSYCHOMETRICS © 2014, Michael Kalsher
Advanced Correlational Analyses D/RS 1013 Factor Analysis.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Factor Analysis Psy 524 Ainsworth. Assumptions Assumes reliable correlations Highly affected by missing data, outlying cases and truncated data Data screening.
1 Finding and Verifying Item Clusters in Outcomes Research Using Confirmatory and Exploratory Analyses Ron D. Hays, PhD May 13, 2003; 1-4 pm; Factor
Thursday AM  Presentation of yesterday’s results  Factor analysis  A conceptual introduction to: Structural equation models Structural equation models.
Factor Analysis ( 因素分析 ) Kaiping Grace Yao National Taiwan University
© 2007 Prentice Hall19-1 Chapter Nineteen Factor Analysis © 2007 Prentice Hall.
Overview of Health-Related Quality of Life Measures May 22, 2014 (1:00 – 2:00 PDT) Kaiser Methods Webinar Series 1 Ron D.Hays, Ph.D.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Reliability Analysis Based on the results of the PAF, a reliability analysis was run on the 16 items retained in the Task Value subscale. The Cronbach’s.
Measurement Issues by Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research (June 30, 2008, 4:55-5:35pm) Diabetes & Obesity.
Introduction to Multivariate Analysis of Variance, Factor Analysis, and Logistic Regression Rubab G. ARIM, MA University of British Columbia December 2006.
Explanatory Factor Analysis: Alpha and Omega Dominique Zephyr Applied Statistics Lab University of Kenctucky.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 6, 2008 (3:30-6:30pm) HS 249F.
Lecture 12 Factor Analysis.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Applied Quantitative Analysis and Practices
Exploratory Factor Analysis. Principal components analysis seeks linear combinations that best capture the variation in the original variables. Factor.
Education 795 Class Notes Factor Analysis Note set 6.
Exploratory Factor Analysis Principal Component Analysis Chapter 17.
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire Design and Testing.
Evaluating Self-Report Data Using Psychometric Methods Ron D. Hays, PhD February 8, 2006 (3:00-6:00pm) HS 249F.
Chapter 13.  Both Principle components analysis (PCA) and Exploratory factor analysis (EFA) are used to understand the underlying patterns in the data.
Introducing Structural Equation Modeling (SEM) to Novices Using Kevin Heslin's SF-36 Data from Opioid Users Ron D. Hays, Ph.D. May 8, 2009, 12:01-12:59pm.
Department of Cognitive Science Michael Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Factor Analysis 1 PSYC 4310 Advanced Experimental.
Principal Component Analysis
Multitrait Scaling and IRT: Part I Ron D. Hays, Ph.D. Questionnaire.
Evaluating Multi-item Scales Ron D. Hays, Ph.D. UCLA Division of General Internal Medicine/Health Services Research HS237A 11/10/09, 2-4pm,
FACTOR ANALYSIS.  The basic objective of Factor Analysis is data reduction or structure detection.  The purpose of data reduction is to remove redundant.
Health-Related Quality of Life (HRQOL) Assessment in Outcome Studies Ron D. Hays, Ph.D. UCLA/RAND GCRC Summer Course “The.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Exploratory Factor Analysis
Factor analysis Advanced Quantitative Research Methods
Health-Related Quality of Life Assessment in Outcome Studies
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Principal Component Analysis
Chapter_19 Factor Analysis
Multitrait Scaling and IRT: Part I
UCLA Department of Medicine
Presentation transcript:

Factor Analysis Ron D. Hays, Ph.D. N208, Factor (3-6pm) February 23, 2005

Issues to be discussed SF-36 Factor Analysis Equivalence by subgroup Orthogonal or Oblique model Exploratory Factor Analysis Confirmatory Analysis Factor Multitrait Scaling

Generic HRQOL: 8 SF-36 Scales Physical functioning Physical functioning Role limitations/physical Role limitations/physical Pain Pain General health perceptions General health perceptions Social functioning Social functioning Energy/fatigue Energy/fatigue Role limitations/emotional Role limitations/emotional Emotional well-being Emotional well-being

Physical Health Physical function Role function- physical Pain General Health Physical Health

Mental Health Emotional Well- Being Role function- emotional Energy Social function Mental Health

Correlations among SF-36 Scales PFRPPGHEWREE PF1.00 RP P GH EW RE E SF

Larger Correlations 0.61 Energy and General Health/Emotional well-being 0.60 Role-Physical and Pain 0.54 Physical Function and Role-Physical 0.53 General Health and Role-Physical/Pain 0.52 Energy and Pain 0.50 Energy and Role—Physical 0.49 General Health and Physical Function; Role- Physical and Social Functioning

SF-36 Factor Analysis in United States

 Phys  Ment

SF-36 Factor Analysis in US

SF-36 Factor Analysis in Singapore vs. US

 PCS  MCS  Fr = 0.00

SF-36 PCS and MCS PCS_z = (PF_z *.42402) + (RP_z *.35119) + (BP_z *.31754) + (GH_z *.24954) + (EF_z *.02877) + (SF_z * ) + (RE_z * ) + (EW_z * ) MCS_z = (PF_z * ) + (RP_z * ) + (BP_z * ) + (GH_z * ) + (EF_z *.23534) + (SF_z *.26876) + (RE_z *.43407) + (EW_z *.48581)

T-score Transformation PCS = (PCS_z*10) + 50 MCS = (MCS_z*10) + 50 Z X = (X – x-bar)/ SD x

Debate About Summary Scores Taft, C., Karlsson, J., & Sullivan, M. (2001). Do SF-36 component score accurately summarize subscale scores? Quality of Life Research, 10, Ware, J. E., & Kosinski, M. (2001). Interpreting SF-36 summary health measures: A response. Quality of Life Research, 10, Taft, C., Karlsson, J., & Sullivan, M. (2001). Reply to Drs Ware and Kosinski. Quality of Life Research, 10,

536 Primary Care Patients Initiating Antidepressant Tx  3-month improvements in physical functioning, role— physical, pain, and general health perceptions ranging from 0.28 to 0.49 SDs.  Yet SF-36 PCS did not improve.  Simon et al. (Med Care, 1998)

Physical Health Physical function Role function- physical PainPain General Health Four scales improve SD, but physical health summary score doesn’t change

n = 194 with Multiple Sclerosis  Lower scores than general population on  Emotional well-being (  0.3 SD)  Role—emotional (  0.7 SD)  Energy (  1.0 SD)  Social functioning (  1.0 SD)  Yet SF-36 MCS was only 0.2 SD lower.  RAND-36 mental health was 0.9 SD lower. Nortvedt et al. (Med Care, 2000)

Mental Health Emotional Well-Being Role function- emotional EnergyEnergy Social function Four scales SD lower, but mental health summary score only 0.2 SD lower

 PH  MH  Fr = 0.62

Alternative Weights for SF-36 PCS and MCS PCS_z = (PF_z *.20) + (RP_z *.31) + (BP_z *.23) + (GH_z *.20) + (EF_z *.13) + (SF_z *.11) + (RE_z *.03) + (EW_z * -.03) MCS_z = (PF_z * -.02) + (RP_z *.03) + (BP_z *.04) + (GH_z *.10) + (EF_z *.29) + (SF_z *.14) + (RE_z *.20) + (EW_z *.35) Farivar, S. S., & Hays, R.D. (2004, November). Constructing correlated physical and mental health summary scores for the SF-36 health survey. Paper presented at the annual meeting of the International Society for Quality of Life Research, Hong Kong. (Quality of Life Research, 13 (9), 1550).

What is Factor Analysis Doing? observed r = reproduced r = ( ) (-.2077) = = residual = = F1 F2 SELF7 SELF

Correlations for 10 Self-Esteem Items SELF11.00 SELF SELF SELF SELF SELF SELF SELF SELF SELF

Factor Loadings for 10 Self-Esteem Items SELF SELF SELF SELF SELF SELF SELF SELF SELF SELF FACTOR 1FACTOR 2

Reproducing self7-self5 correlation: EFA observed r = reproduced r = ( ) (-.2077) = = residual = = F1 F2 SELF7 SELF

Three Steps in Exploratory Factor Analysis Check correlation matrix for problems Identify number of dimensions or factors Rotate to simple structure

Checking Correlation Matrix Determinant of correlation matrix ranges between 0-1 Determinant = 0 if there is linear dependency in the matrix (singular, not positive definite, matrix has no inverse) Determinant = 1 if all off diagonal elements in matrix are zero (identity matrix)

Measurement Error observed = true score + systematic error + random error (bias)

Partitioning of Variance Among Items + observed = Common Specific + Erro r Standardize items: Z X = (X – x-bar)/ SD x

Principal Components Analysis Try to explain ALL variance in items, summarizing interrelations among items by smaller set of orthogonal principal components that are linear combinations of the items. * First component is linear combination that explains maximum amount of variance in correlation matrix. * Second component explains maximum amount of variance in residual correlation matrix. Factor loadings represent correlation of each item with the component. Eigenvalue (max = number of items) is sum of squared factor loadings for the component (column) and represents amount of variance in items explained by it.

Principal Components Analysis Use 1.0 as initial estimate of communality (variance in item explained by the factors) for each item Component is linear combination of items First component accounts for as much of the total item variance as possible Second component accounts for as much variance as possible, but uncorrelated with the first component  C 1 = a 1 *x 1 + b 1 *x 2  C 2 = a 2 *x 1 + b 2 *x 2  Mean of C 1 & C 2 = 0

Common Factor Analysis Factors are not linear combinations of items but are hypothetical constructs estimated from the items. These factors are estimated from the common variance (not total) of the items; diagonal elements (communality estimates) of the correlation matrix estimated as less than 1.0.

Each item represented as a linear combination of unobserved common and unique factors F 1 and F 2 are standardized common factors a's and b's are factor loadings; e's are unique factors Factors are independent variables (components are dependent variables) Common Factor Analysis X = a F + b F + e X = a F + b F + e

Hypothetical Factor Loadings, Communalities, and Specificities Factor Loadings Communality Specificity Variable F F h u X Variance explained Percentage From Afifi and Clark, Computer-Aided Multivariate Analysis, 1984, p. 338 (Principal components example) % % % %

Number of factors decision Guttman’s weakest lower bound PCA eigenvalues > 1.0 Parallel analysis Scree test ML and Tucker’s rho

Parallel Analysis PARALLEL.EXE: LATENT ROOTS OF RANDOM DATA CORRELATION MATRICES PROGRAM PROGRAMMER: RON HAYS, RAND CORPORATION FOR 3000 SUBJECTS AND 15 VARIABLES ****************************************************************** Hays, R. D. (1987). PARALLEL: A program for performing parallel analysis. Applied Psychological Measurement, 11, 58. ****************************************************************** EIGENVALUES FOR FACTOR ANALYSIS SMC ESTIMATES FOLLOW: OBSERVED RANDOM SLOPE ========= ========= ========= LAMBDA 1= LAMBDA 2= *** LAMBDA 3= *** LAMBDA 4= *** LAMBDA 5= LAMBDA 6= *** LAMBDA 7= (CAN'T COMPUTE LAMBDA 8 :LOG OF ZERO OR NEGATIVE IS UNDEFINED) Results of Parallel Analysis Indicate Maximum of 6 Factors. Slopes followed by asterisks indicate discontinuity points that may be suggestive of the number of factors to retain.

Scree Test

ML and Tucker’s rho Significance Tests Based on 3000 Observations Pr > Test DF Chi-Square ChiSq H0: No common factors <.0001 HA: At least one common factor H0: 4 Factors are sufficient <.0001 HA: More factors are needed Chi-Square without Bartlett's Correction Tucker and Lewis's Reliability Coefficient

Factor Rotation Unrotated factors are complex and hard to interpret Rotation improves “simple” structure (more high and low loadings) and interpretability

Rotation Communalities unchanged by rotation Cumulative % of variance explained by common factors unchanged Varimax (orthogonal rotation) maximizes sum of squared factor loadings (after dividing each loading by the item’s communality) Promax allows factors to be correlated * Structure, pattern, and factor correlation matrix

Items/Factors and Cases/Items At least 5 - items per factor - cases per item - cases per parameter estimate

Confirmatory Factor Analysis Compares observed covariances with covariances generated by hypothesized model Statistical and practical tests of fit Factor loadings Correlations between factors Regression coefficients

 Fit Indices Normed fit index: Non-normed fit index: Comparative fit index:  -  2 null model 2  2 null  2 model 2 - df nullmodel 2 null df -1  - df 2 model  - 2 null df null 1 -

Software SAS PROC CALIS EQS LISREL MPLUS

KISS “Sometimes, very complex mathematical methods are required for the scientific problem at hand. However, most situations allow much simpler, direct, and practicable approaches” (Nunnally & Bernstein, 1994, p. 452).

Multitrait Scaling Analysis Internal consistency reliability – Item convergence Item discrimination

Hypothetical Multitrait/Multi-Item Correlation Matrix

Multitrait/Multi-Item Correlation Matrix for Patient Satisfaction Ratings Technical Interpersonal Communication Financial Technical 10.66*0.63†0.67† *0.54†0.50† * † * † *0.60†0.56† *0.58†0.57†0.23 Interpersonal *0.63† †0.58*0.61† †0.65*0.67† †0.57*0.60† *0.58† †0.48*0.46†0.24 Note – Standard error of correlation is Technical = satisfaction with technical quality. Interpersonal = satisfaction with the interpersonal aspects. Communication = satisfaction with communication. Financial = satisfaction with financial arrangements. *Item-scale correlations for hypothesized scales (corrected for item overlap). †Correlation within two standard errors of the correlation of the item with its hypothesized scale.

Recommended Readings Pett, M. A., Lackey, N. R., & Sullivan, J. J. (2003). Making sense of factor analysis: The use of factor analysis for instrument development in health care research. Thousand Oaks: Sage. Floyd, F. J., & Widaman, K. F. (1995). Factor analysis in the development and refinement of clinical assessment instruments. Psychological Assessment, 7, Hays, R. D., Revicki, D., & Coyne, K. (in press). Application of structural equation modeling to health outcomes research. Evaluation and the Health Professions.

is maximized C & C are uncorrelated S + S = S + S Appendix: Eigenvalues c 1 x 1 x 1 1 x, x x x S = a s + b s + 2(a )(b ) r s s S = a s + b s + 2(a )(b ) r s s c 2 x 2 x 2 2 x, x x x 1 2 S 1 2 c a b 1 2 = a b 2 2 c c x x 1 2

Appendix: Correlations Between Component and Item r = 1,c 1 a ( S ) 1 c 1 S x 1 r = 2,c 1 b ( S ) 1 c 1 S x 2