FACTOR ANALYSIS 1. What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components”

Slides:



Advertisements
Similar presentations
Factor Analysis Continued
Advertisements

Exploratory Factor Analysis
Chapter Nineteen Factor Analysis.
© LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON
Lecture 7: Principal component analysis (PCA)
Psychology 202b Advanced Psychological Statistics, II April 7, 2011.
Factor Analysis Ulf H. Olsson Professor of Statistics.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Factor Analysis Research Methods and Statistics. Learning Outcomes At the end of this lecture and with additional reading you will be able to Describe.
Factor Analysis There are two main types of factor analysis:
A quick introduction to the analysis of questionnaire data John Richardson.
1 Carrying out EFA - stages Ensure that data are suitable Decide on the model - PAF or PCA Decide how many factors are required to represent you data When.
Factor Analysis Factor analysis is a method of dimension reduction.
Principal component analysis
Goals of Factor Analysis (1) (1)to reduce the number of variables and (2) to detect structure in the relationships between variables, that is to classify.
Education 795 Class Notes Factor Analysis II Note set 7.
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Factor Analysis Psy 524 Ainsworth.
Principal Components An Introduction
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Factor Analysis PowerPoint Prepared by Alfred.
Principal Components Principal components is a method of dimension reduction. Suppose that you have a dozen variables that are correlated. You might use.
Exploratory Factor Analysis
Factor Analysis © 2007 Prentice Hall. Chapter Outline 1) Overview 2) Basic Concept 3) Factor Analysis Model 4) Statistics Associated with Factor Analysis.
EXPLORATORY FACTOR ANALYSIS (EFA)
Psy 427 Cal State Northridge Andrew Ainsworth PhD.
Advanced Correlational Analyses D/RS 1013 Factor Analysis.
Applied Quantitative Analysis and Practices
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Factor Analysis Psy 524 Ainsworth. Assumptions Assumes reliable correlations Highly affected by missing data, outlying cases and truncated data Data screening.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
Thursday AM  Presentation of yesterday’s results  Factor analysis  A conceptual introduction to: Structural equation models Structural equation models.
Factor Analysis ( 因素分析 ) Kaiping Grace Yao National Taiwan University
© 2007 Prentice Hall19-1 Chapter Nineteen Factor Analysis © 2007 Prentice Hall.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Factor Analysis Ανάλυση Παραγόντων Dr. Angelos Markos Lecturer Dept. of Primary Education Democritus Univ. of Thrace 1.
Introduction to Multivariate Analysis of Variance, Factor Analysis, and Logistic Regression Rubab G. ARIM, MA University of British Columbia December 2006.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Lecture 12 Factor Analysis.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Applied Quantitative Analysis and Practices
Exploratory Factor Analysis. Principal components analysis seeks linear combinations that best capture the variation in the original variables. Factor.
Education 795 Class Notes Factor Analysis Note set 6.
Exploratory Factor Analysis Principal Component Analysis Chapter 17.
Chapter 13.  Both Principle components analysis (PCA) and Exploratory factor analysis (EFA) are used to understand the underlying patterns in the data.
Department of Cognitive Science Michael Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Factor Analysis 1 PSYC 4310 Advanced Experimental.
Factor Analysis I Principle Components Analysis. “Data Reduction” Purpose of factor analysis is to determine a minimum number of “factors” or components.
Advanced Statistics Factor Analysis, I. Introduction Factor analysis is a statistical technique about the relation between: (a)observed variables (X i.
Applied Quantitative Analysis and Practices LECTURE#19 By Dr. Osman Sadiq Paracha.
Principal Component Analysis
FACTOR ANALYSIS.  The basic objective of Factor Analysis is data reduction or structure detection.  The purpose of data reduction is to remove redundant.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
FACTOR ANALYSIS & SPSS. First, let’s check the reliability of the scale Go to Analyze, Scale and Reliability analysis.
Basic statistical concepts Variance Covariance Correlation and covariance Standardisation.
1 FACTOR ANALYSIS Kazimieras Pukėnas. 2 Factor analysis is used to uncover the latent (not observed directly) structure (dimensions) of a set of variables.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Exploratory Factor Analysis
EXPLORATORY FACTOR ANALYSIS (EFA)
Factor analysis Advanced Quantitative Research Methods
An introduction to exploratory factor analysis in IBM SPSS Statistics
Advanced Data Preparation
© LOUIS COHEN, LAWRENCE MANION AND KEITH MORRISON
Measuring latent variables
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Principal Component Analysis
Chapter_19 Factor Analysis
Lecture 8: Factor analysis (FA)
Measuring latent variables
Presentation transcript:

FACTOR ANALYSIS 1

What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components” o correlated variables are grouped together and separated from other variables with low or no correlation o seeks underlying unobservable (latent) variables that are reflected in the observed variables (manifest variables)

More on Factor Analysis requires a large sample size since it is based on the correlation matrix of the variables involved o 50 cases is very poor o 100 is poor o 200 is fair o 300 is good o 500 is very good, and 1000 or more is excellent. o rule of thumb – a bare minimum of 10 observations per variable is necessary to avoid computational difficulties.

“Good Factor” A good factor: o makes sense o easy to interpret o simple structure o lacks complex loadings

Problems with Factor Analysis There is no statistical criterion to compare the linear combination to as in MANOVA or Canonical Correlations It is more art than science o several extraction methods o several rotation methods o number of factors to extract o communality estimates Life (researcher) saver o Often, when nothing else can be salvaged from research a FA will be conducted.

Types of Factor Analysis Exploratory Factor Analysis (EFA) Confirmatory Factor Analysis (CFA)

Exploratory Factor Analysis (EFA) o summarizing data by grouping correlated variables o investigating sets of measured variables related to theoretical constructs o usually done near the onset of research o generate “factor scores“ which represent values of the underlying constructs for use in other analyses o often confused with Principal Component Analysis (PCA) which is a similar statistical procedure

FA vs. PCA EFA analyzes only the variance shared among the variables (common variance without error or unique variance) examines what are the underlying processes that could produce these correlations produces factors factors cause variables PCA analyzes all of the variance only summarizes empirical associations very data driven produces components components are aggregates of the variables

Confirmatory Factor Analysis (CFA) Confirmatory FA o more advanced technique o used when factor structure is known or at least theorized o testing generalization of factor structure to new data, etc. o tested through Structural Equation Modeling (SEM) methods discussed later in course

Application of Factor Analysis defining indicators of constructs o ideally 4 or more measures should be chosen to represent each construct of interest o choice of measures should, as much as possible, be guided by theory, previous research, and logic selecting items or scales to be included in a measure  determine what items or scales should be included and excluded from a measure  results of the analysis should not be used alone in making decisions of inclusions or exclusions  decisions should be taken in conjunction with the theory and what is known about the construct(s) that the items or scales assess

Assumptions Underlying Factor Analysis measured variables are linearly related to the factors + errors. o likely to be violated if items use limited response scales, i.e. too many dichotomous variables should have a bivariate normal distribution for each pair of variables observations are independent assumes variables are determined by common factors and unique factors o unique factors assumed to be uncorrelated with each other and with the common factors

Terminology Reproduced Correlation Matrix o correlation matrix based on the extracted factors o want the values in the reproduced matrix to be as close to the values in the original correlation matrix as possible o If reproduced matrix is very similar to the original correlation matrix, then the few factors do a good job of representing the original data Residual Correlation Matrix represents the differences between original correlations and the reproduced correlations should be close to zero

Terminology Eigenvalues o number of variables which the factor represents o amount of variance in the data described by the factor Communalities o proportion of the variance in the original variables that can be explained by the factors o factor solution should explain at least half of each original variable's variance, so the communality value for each variable should be 0.50 or higher

Terminology Rotated Factor Matrix o represents both how the variables are weighted for each factor and also the correlation between the variables and the factor these are correlations so possible values range from -1 to +1 o In SPSS, you can tell it to print any of the correlations that less than a particular value (usually use 0.3) o makes the output easier to read by removing the clutter of low correlations that are probably not meaningful anyway

General Steps to FA Step 1: Selecting and Measuring a set of variables in a given domain Step 2: Data screening in order to prepare the correlation matrix Step 3: Factor Extraction Step 4: Factor Rotation to increase interpretability Step 5: Interpretation Further Steps: Validation and Reliability of the measures

The Correlation Matrix generate a correlation matrix for all variables identify variables not related to other variables if correlation between variables are small, unlikely that they share common factors (variables must be related to each other for the factor model to be appropriate) think of correlations in absolute value. correlation coefficients > 0.3 in absolute value are indicative of acceptable correlations. examine visually the appropriateness of the factor model

The Correlation Matrix Bartlett Test of Sphericity o tests the null hypothesis that the correlation matrix is an identity matrix (all diagonal terms are 1 and all off- diagonal terms are 0)  want to reject this null hypothesis o If the value of the test statistic for sphericity is large and the associated significance level is small, it is unlikely that the population correlation matrix is an identity

The Correlation Matrix The Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy o index for comparing magnitude of observed correlation coefficients to magnitude of partial correlation coefficients. o closer KMO measure is to 1, evidence of a sizeable sampling adequacy  0.8 and higher are great  0.7 is acceptable  0.6 is mediocre  < 0.5 is unacceptable o Small KMO values indicate that a factor analysis may not be a good idea.

Factor Extraction primary objective is to determine the factors initial decisions can be made here about number of factors underlying a set of measured variables. several factor extraction methods o Principal Component Analysis – used for data reduction o Maximum likelihood method o Principal axis factoring o Alpha method o Unweighted lease squares method o Generalized least square method o Image factoring

Factor Extraction To decide on how many factors needed to represent the data, use 2 statistical criteria: o Eigenvalues o Scree Plot Determination of number of factors is usually done by considering only factors with Eigenvalues > 1. Factors with a variance less than 1 are no better than a single variable, since each variable is expected to have a variance of 1. Total Variance Explained Comp onent Initial Eigenvalues Extraction Sums of Squared Loadings Total % of Variance Cumulativ e %Total % of Variance Cumulativ e % Extraction Method: Principal Component Analysis.

Factor Extraction Examination of Scree Plot provides visual of total variance associated with each factor. Steep slope shows large factors. Gradual trailing off (scree) shows rest of factors usually have Eigenvalue < 1. In choosing number of factors, in addition to the statistical criteria, make initial decisions based on conceptual and theoretical grounds.

Factor Extraction – using PCA Component Matrix a Component 123 I discussed my frustrations and feelings with person(s) in school I tried to develop a step-by-step plan of action to remedy the problems I expressed my emotions to my family and close friends I read, attended workshops, or sought someother educational approach to correct the problem I tried to be emotionally honest with my self about the problems I sought advice from others on how I should solve the problems I explored the emotions caused by the problems I took direct action to try to correct the problems I told someone I could trust about how I felt about the problems I put aside other activities so that I could work to solve the problems Extraction Method: Principal Component Analysis. a. 3 components extracted.

Factor Extraction using Principal Axis Factoring

Factor Rotation Unrotated factors are typically not very interpretable (most factors are correlated with may variables). Factors are rotated to make them more meaningful and easier to interpret (each variable is associated with a minimal number of factors). Different rotation methods may result in the identification of somewhat different factors.

Factor Rotation Two types of rotation o Orthogonal – produces uncorrelated factors/components  Varimax: most popular attempts to minimize the number of variables that have high loadings on a factor. enhances the interpretability of the factors  Quartimax  Equamax o Oblique – produces correlated factors/components  used less frequently because results are more difficult to summarize  types Direct Quartimin Promax Harris-Kaiser Orthoblique

Factor Rotation A factor is interpreted or named by examining largest values linking the factor to the measured variables in the rotated factor matrix. Rotated Component Matrix a Component 123 I discussed my frustrations and feelings with person(s) in school I tried to develop a step-by-step plan of action to remedy the problems I expressed my emotions to my family and close friends I read, attended workshops, or sought someother educational approach to correct the problem I tried to be emotionally honest with my self about the problems I sought advice from others on how I should solve the problems I explored the emotions caused by the problems I took direct action to try to correct the problems I told someone I could trust about how I felt about the problems I put aside other activities so that I could work to solve the problems Extraction Method: Principal Component Analysis. Rotation Method: Varimax with Kaiser Normalization. a. Rotation converged in 5 iterations.

Making Final Decisions Making final decisions o should base final decision on number of factors for rotated solution that is most interpretable. o identify factors by grouping variables that have large loadings for same factor o interpret factors according to meaning of the variables o decision should be guided by:  conceptual beliefs about the number of factors from past research or theory  Eigenvalues computed earlier  relative interpretability of rotated solutions computed