Presentation is loading. Please wait.

Presentation is loading. Please wait.

FACTOR ANALYSIS 1. What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components”

Similar presentations


Presentation on theme: "FACTOR ANALYSIS 1. What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components”"— Presentation transcript:

1 FACTOR ANALYSIS 1

2 What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components” o correlated variables are grouped together and separated from other variables with low or no correlation o seeks underlying unobservable (latent) variables that are reflected in the observed variables (manifest variables)

3 More on Factor Analysis requires a large sample size since it is based on the correlation matrix of the variables involved o 50 cases is very poor o 100 is poor o 200 is fair o 300 is good o 500 is very good, and 1000 or more is excellent. o rule of thumb – a bare minimum of 10 observations per variable is necessary to avoid computational difficulties.

4 “Good Factor” A good factor: o makes sense o easy to interpret o simple structure o lacks complex loadings

5 Problems with Factor Analysis There is no statistical criterion to compare the linear combination to as in MANOVA or Canonical Correlations It is more art than science o several extraction methods o several rotation methods o number of factors to extract o communality estimates Life (researcher) saver o Often, when nothing else can be salvaged from research a FA will be conducted.

6 Types of Factor Analysis Exploratory Factor Analysis (EFA) Confirmatory Factor Analysis (CFA)

7 Exploratory Factor Analysis (EFA) o summarizing data by grouping correlated variables o investigating sets of measured variables related to theoretical constructs o usually done near the onset of research o generate “factor scores“ which represent values of the underlying constructs for use in other analyses o often confused with Principal Component Analysis (PCA) which is a similar statistical procedure

8 FA vs. PCA EFA analyzes only the variance shared among the variables (common variance without error or unique variance) examines what are the underlying processes that could produce these correlations produces factors factors cause variables PCA analyzes all of the variance only summarizes empirical associations very data driven produces components components are aggregates of the variables

9 Confirmatory Factor Analysis (CFA) Confirmatory FA o more advanced technique o used when factor structure is known or at least theorized o testing generalization of factor structure to new data, etc. o tested through Structural Equation Modeling (SEM) methods discussed later in course

10 Application of Factor Analysis defining indicators of constructs o ideally 4 or more measures should be chosen to represent each construct of interest o choice of measures should, as much as possible, be guided by theory, previous research, and logic selecting items or scales to be included in a measure  determine what items or scales should be included and excluded from a measure  results of the analysis should not be used alone in making decisions of inclusions or exclusions  decisions should be taken in conjunction with the theory and what is known about the construct(s) that the items or scales assess

11 Assumptions Underlying Factor Analysis measured variables are linearly related to the factors + errors. o likely to be violated if items use limited response scales, i.e. too many dichotomous variables should have a bivariate normal distribution for each pair of variables observations are independent assumes variables are determined by common factors and unique factors o unique factors assumed to be uncorrelated with each other and with the common factors

12 Terminology Reproduced Correlation Matrix o correlation matrix based on the extracted factors o want the values in the reproduced matrix to be as close to the values in the original correlation matrix as possible o If reproduced matrix is very similar to the original correlation matrix, then the few factors do a good job of representing the original data Residual Correlation Matrix represents the differences between original correlations and the reproduced correlations should be close to zero

13 Terminology Eigenvalues o number of variables which the factor represents o amount of variance in the data described by the factor Communalities o proportion of the variance in the original variables that can be explained by the factors o factor solution should explain at least half of each original variable's variance, so the communality value for each variable should be 0.50 or higher

14 Terminology Rotated Factor Matrix o represents both how the variables are weighted for each factor and also the correlation between the variables and the factor these are correlations so possible values range from -1 to +1 o In SPSS, you can tell it to print any of the correlations that less than a particular value (usually use 0.3) o makes the output easier to read by removing the clutter of low correlations that are probably not meaningful anyway

15 General Steps to FA Step 1: Selecting and Measuring a set of variables in a given domain Step 2: Data screening in order to prepare the correlation matrix Step 3: Factor Extraction Step 4: Factor Rotation to increase interpretability Step 5: Interpretation Further Steps: Validation and Reliability of the measures

16 The Correlation Matrix generate a correlation matrix for all variables identify variables not related to other variables if correlation between variables are small, unlikely that they share common factors (variables must be related to each other for the factor model to be appropriate) think of correlations in absolute value. correlation coefficients > 0.3 in absolute value are indicative of acceptable correlations. examine visually the appropriateness of the factor model

17 The Correlation Matrix Bartlett Test of Sphericity o tests the null hypothesis that the correlation matrix is an identity matrix (all diagonal terms are 1 and all off- diagonal terms are 0)  want to reject this null hypothesis o If the value of the test statistic for sphericity is large and the associated significance level is small, it is unlikely that the population correlation matrix is an identity

18 The Correlation Matrix The Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy o index for comparing magnitude of observed correlation coefficients to magnitude of partial correlation coefficients. o closer KMO measure is to 1, evidence of a sizeable sampling adequacy  0.8 and higher are great  0.7 is acceptable  0.6 is mediocre  < 0.5 is unacceptable o Small KMO values indicate that a factor analysis may not be a good idea.

19 Factor Extraction primary objective is to determine the factors initial decisions can be made here about number of factors underlying a set of measured variables. several factor extraction methods o Principal Component Analysis – used for data reduction o Maximum likelihood method o Principal axis factoring o Alpha method o Unweighted lease squares method o Generalized least square method o Image factoring

20 Factor Extraction To decide on how many factors needed to represent the data, use 2 statistical criteria: o Eigenvalues o Scree Plot Determination of number of factors is usually done by considering only factors with Eigenvalues > 1. Factors with a variance less than 1 are no better than a single variable, since each variable is expected to have a variance of 1. Total Variance Explained Comp onent Initial Eigenvalues Extraction Sums of Squared Loadings Total % of Variance Cumulativ e %Total % of Variance Cumulativ e % 13.04630.465 3.04630.465 21.80118.01148.4761.80118.01148.476 31.00910.09158.5661.00910.09158.566 4.9349.33667.902 5.8408.40476.307 6.7117.10783.414 7.5745.73789.151 8.4404.39693.547 9.3373.36896.915 10.3083.085100.000 Extraction Method: Principal Component Analysis.

21 Factor Extraction Examination of Scree Plot provides visual of total variance associated with each factor. Steep slope shows large factors. Gradual trailing off (scree) shows rest of factors usually have Eigenvalue < 1. In choosing number of factors, in addition to the statistical criteria, make initial decisions based on conceptual and theoretical grounds.

22 Factor Extraction – using PCA Component Matrix a Component 123 I discussed my frustrations and feelings with person(s) in school.771-.271.121 I tried to develop a step-by-step plan of action to remedy the problems.545.530.264 I expressed my emotions to my family and close friends.580-.311.265 I read, attended workshops, or sought someother educational approach to correct the problem.398.356-.374 I tried to be emotionally honest with my self about the problems.436.441-.368 I sought advice from others on how I should solve the problems.705-.362.117 I explored the emotions caused by the problems.594.184-.537 I took direct action to try to correct the problems.074.640.443 I told someone I could trust about how I felt about the problems.752-.351.081 I put aside other activities so that I could work to solve the problems.225.576.272 Extraction Method: Principal Component Analysis. a. 3 components extracted.

23 Factor Extraction using Principal Axis Factoring

24 Factor Rotation Unrotated factors are typically not very interpretable (most factors are correlated with may variables). Factors are rotated to make them more meaningful and easier to interpret (each variable is associated with a minimal number of factors). Different rotation methods may result in the identification of somewhat different factors.

25 Factor Rotation Two types of rotation o Orthogonal – produces uncorrelated factors/components  Varimax: most popular attempts to minimize the number of variables that have high loadings on a factor. enhances the interpretability of the factors  Quartimax  Equamax o Oblique – produces correlated factors/components  used less frequently because results are more difficult to summarize  types Direct Quartimin Promax Harris-Kaiser Orthoblique

26 Factor Rotation A factor is interpreted or named by examining largest values linking the factor to the measured variables in the rotated factor matrix. Rotated Component Matrix a Component 123 I discussed my frustrations and feelings with person(s) in school.803.186.050 I tried to develop a step-by-step plan of action to remedy the problems.270.304.694 I expressed my emotions to my family and close friends.706-.036.059 I read, attended workshops, or sought someother educational approach to correct the problem.050.633.145 I tried to be emotionally honest with my self about the problems.042.685.222 I sought advice from others on how I should solve the problems.792.117-.038 I explored the emotions caused by the problems.248.782-.037 I took direct action to try to correct the problems-.120-.023.772 I told someone I could trust about how I felt about the problems.815.172-.040 I put aside other activities so that I could work to solve the problems-.014.155.657 Extraction Method: Principal Component Analysis. Rotation Method: Varimax with Kaiser Normalization. a. Rotation converged in 5 iterations.

27 Making Final Decisions Making final decisions o should base final decision on number of factors for rotated solution that is most interpretable. o identify factors by grouping variables that have large loadings for same factor o interpret factors according to meaning of the variables o decision should be guided by:  conceptual beliefs about the number of factors from past research or theory  Eigenvalues computed earlier  relative interpretability of rotated solutions computed


Download ppt "FACTOR ANALYSIS 1. What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components”"

Similar presentations


Ads by Google