FACTOR ANALYSIS.  The basic objective of Factor Analysis is data reduction or structure detection.  The purpose of data reduction is to remove redundant.

Slides:



Advertisements
Similar presentations
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Advertisements

Factor Analysis Continued
Chapter Nineteen Factor Analysis.
Lecture 7: Principal component analysis (PCA)
Psychology 202b Advanced Psychological Statistics, II April 7, 2011.
Factor Analysis Ulf H. Olsson Professor of Statistics.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Factor Analysis There are two main types of factor analysis:
A quick introduction to the analysis of questionnaire data John Richardson.
1 Carrying out EFA - stages Ensure that data are suitable Decide on the model - PAF or PCA Decide how many factors are required to represent you data When.
Factor Analysis Factor analysis is a method of dimension reduction.
GRA 6020 Multivariate Statistics Factor Analysis Ulf H. Olsson Professor of Statistics.
Factor Analysis Ulf H. Olsson Professor of Statistics.
Dr. Michael R. Hyman Factor Analysis. 2 Grouping Variables into Constructs.
Goals of Factor Analysis (1) (1)to reduce the number of variables and (2) to detect structure in the relationships between variables, that is to classify.
Education 795 Class Notes Factor Analysis II Note set 7.
Correlation and Regression Analysis
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Factor Analysis Psy 524 Ainsworth.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Factor Analysis PowerPoint Prepared by Alfred.
Principal Components Principal components is a method of dimension reduction. Suppose that you have a dozen variables that are correlated. You might use.
Factor Analysis © 2007 Prentice Hall. Chapter Outline 1) Overview 2) Basic Concept 3) Factor Analysis Model 4) Statistics Associated with Factor Analysis.
EXPLORATORY FACTOR ANALYSIS (EFA)
MGMT 6971 PSYCHOMETRICS © 2014, Michael Kalsher
Advanced Correlational Analyses D/RS 1013 Factor Analysis.
Applied Quantitative Analysis and Practices
Customer Research and Segmentation Basis for segmentation is heterogeneous markets Three imp. Definitions: Mkt. Segment, Mkt. Segmentation, Target market.
Factor Analysis Psy 524 Ainsworth. Assumptions Assumes reliable correlations Highly affected by missing data, outlying cases and truncated data Data screening.
Factor Analysis ( 因素分析 ) Kaiping Grace Yao National Taiwan University
© 2007 Prentice Hall19-1 Chapter Nineteen Factor Analysis © 2007 Prentice Hall.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Lecture 12 Factor Analysis.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Copyright © 2010 Pearson Education, Inc Chapter Nineteen Factor Analysis.
Applied Quantitative Analysis and Practices
Exploratory Factor Analysis. Principal components analysis seeks linear combinations that best capture the variation in the original variables. Factor.
Education 795 Class Notes Factor Analysis Note set 6.
Chapter 13.  Both Principle components analysis (PCA) and Exploratory factor analysis (EFA) are used to understand the underlying patterns in the data.
Department of Cognitive Science Michael Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Factor Analysis 1 PSYC 4310 Advanced Experimental.
Multivariate Data Analysis Chapter 3 – Factor Analysis.
Factor Analysis I Principle Components Analysis. “Data Reduction” Purpose of factor analysis is to determine a minimum number of “factors” or components.
Advanced Statistics Factor Analysis, I. Introduction Factor analysis is a statistical technique about the relation between: (a)observed variables (X i.
Applied Quantitative Analysis and Practices LECTURE#19 By Dr. Osman Sadiq Paracha.
Feature Extraction 主講人:虞台文. Content Principal Component Analysis (PCA) PCA Calculation — for Fewer-Sample Case Factor Analysis Fisher’s Linear Discriminant.
FACTOR ANALYSIS 1. What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components”
Factor Analysis. Introduction 1. Factor Analysis is a set of techniques used for understanding variables by grouping them into “factors” consisting of.
Principal Component Analysis
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
FACTOR ANALYSIS & SPSS. First, let’s check the reliability of the scale Go to Analyze, Scale and Reliability analysis.
1 FACTOR ANALYSIS Kazimieras Pukėnas. 2 Factor analysis is used to uncover the latent (not observed directly) structure (dimensions) of a set of variables.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Exploratory Factor Analysis
Customer Research and Segmentation
EXPLORATORY FACTOR ANALYSIS (EFA)
Analysis of Survey Results
Factor analysis Advanced Quantitative Research Methods
Measuring latent variables
Measuring latent variables
Measuring latent variables
Principal Component Analysis
Chapter_19 Factor Analysis
Measuring latent variables
Presentation transcript:

FACTOR ANALYSIS

 The basic objective of Factor Analysis is data reduction or structure detection.  The purpose of data reduction is to remove redundant (highly correlated) variables from the data file, perhaps replacing the entire data file with a smaller number of uncorrelated variables.  The purpose of structure detection is to examine the underlying (or latent) relationships between the variables.

FACTOR ANALYSIS cont.  It represents a complex array of structure-analyzing procedures used to identify interrelationships among a large set of observed variables and then, through data reduction, to group a smaller set of these variables into dimensions or factors that have common characteristics.

FACTOR ANALYSIS  Finally it is, A technique that analyses data on a relatively large set of variables and produces a smaller set of factors which are used to represent the original variables as linear combinations of the smaller set of factors; so that the set of factor captures as much information as possible from the original set of data.

Uses of Factor Analysis  Scale construction: Factor analysis could be used to develop concise multiple item scales for measuring various constructs.  Establish antecedents: This method reduces multiple input variables into grouped factors. Thus, the independent variables can be grouped into broad factors.  Psychographic profiling: Different independent variables are grouped to measure independent factors. These are then used for identifying personality types.  Segmentation analysis: Factor analysis could also be used for segmentation. For example, there could be different sets of two-wheelers-customers owning two-wheelers because of different importance they give to factors like prestige, economy consideration and functional features.

Uses of Factor Analysis  Marketing studies: The technique has extensive use in the field of marketing and can be successfully used for new product development; product acceptance research, developing of advertising copy, pricing studies and for branding studies. For example we can use it to: identify the attributes of brands that influence consumers’ choice; get an insight into the media habits of various consumers; identify the characteristics of price-sensitive customers

Requirements for Factor Analysis The following conditions must be ensured before executing the technique:  Factor analysis exercise requires metric data. This means the data should be either interval or ratio scale in nature.  The variables for factor analysis are generally identified through exploratory research which may be conducted by reviewing the literature on the subject, by informal interviews of knowledgeable persons, qualitative analysis like focus group discussions, analysis of case studies and judgment of the researcher.  If the responses to different statements are obtained through different scales, all the responses need to be standardized.

Requirements for Factor Analysis  The size of the sample respondents should be at least four to five times more than the number of variables (number of statements).  The basic principle behind the application of factor analysis is that the initial set of variables should be highly correlated. If the correlation coefficients between all the variables are small, factor analysis may not be an appropriate technique.

Bartlett's test of sphericity  It tests the hypothesis that your correlation matrix is an identity matrix, which would indicate that your variables are unrelated and therefore unsuitable for structure detection.  H 0 : correlation matrix is an identity matrix where diagonal elements are one and off diagonal elements are zero. H 1 : Correlation matrix is significant. H 0 must be rejected for data to be suited for Factor Analysis.

Kaiser-Meyer-Olkin (KMO) Test  The KMO is a statistic that indicates the proportion of variance in your variables that might be caused by underlying factors. High values (close to 1.0) generally indicate that a factor analysis may be useful with your data. If the value is less than 0.50, the results of the factor analysis probably won't be very useful.

FACTOR ANALYSIS... ( assumptions )  Variables are correlated and some are having high degree of co-variance. underlying factors  The co-variation among variables is due to some underlying factors which are smaller in number than the observed variables. linear combinations  Observed variables can be expressed as some linear combinations of the underlying factors.  Reduction in data would not effect significantly conclusions of the research

TYPES OF FACTOR ANALYSIS  There are basically two types of Factor Analysis – EXPLORATORY FACTOR ANALYSIS CONFIRMATORY FACTOR ANALYSIS

EXPLORATORY FACTOR ANALYSIS… (EFA)  Researchers use EFA when they do not know how many factors are necessary to explain the interrelationships among a set of characteristics, indicators, or items.  It means that EFA is basically used to explore the underlying dimensions of the construct of interest.

CONFIRMATORY FACTOR ANALYSIS … (CFA)  Researchers use CFA when they want to assess the extent to which the hypothesized organization of a set of identified factors fits the data..  It means that CFA is used when the researcher has some knowledge about the underlying structure of the construct under investigation.

FACTOR ANALYSIS MODEL CONTINUED...

FACTOR  A factor is an unobservable / hypothetical underlying variable or construct. It is a linear combination of related observed variables that represents a specific underlying dimension of a construct, which is as distinct as possible from other factors included in solution.

Key terms in Factor Analysis  FACTOR LOADING: It is a coefficient of a factor in the factor model equation. If variables are standardized, then factor loadings are correlation coefficients between a factor and a variable.

FACTOR LOADING  Correlation between the factors and the original variables is represented by Factor Loading. Factor loading are present in the “Component Matrix” table of SPSS output.  The squared factor loading is the percentage of variance in the variable, explained by a factor.

COMMUNALITY(h 2 )  It indicates how much of each variable is accounted for by the underlying factors taken together. In other words, it is a measure of the percentage of variable’s variation that is explained by the factors.  It is defined for each variable. It is equal to summation of squares of factor loadings of all factors for a given variable.  It represents variations due to common factors and (1-h 2 ) is called specific variation or unique variation of the variable not explained by the factors.

Initial communalities  Initial communalities are estimates of the variance in each variable accounted for by all components or factors. For principal components extraction, this is always equal to 1.0 for correlation analyses.

Extraction communalities  Extraction communalities are estimates of the variance in each variable accounted for by the components. The high value of communalities indicates that the extracted components represent the variables well. If any communalities are very low in a principal components extraction, you may need to extract another component.

Communality example  Extracted Communality for Variable 5 = (Factor loading of var5 with Factor1) 2 + (Factor loading of var5 with Factor2) 2 + …..+ (Factor loading of var5 with Factor N) 2  Where N is the total factor extracted or retained.  If we do the same computation for all the factors, then we will get the initial communality for that variable, which has to be 1 (in principal components extraction, for correlation analyses).

EIGENVALUES  The percentage of variance explained by each factor can be computed using eigenvalue. The eigenvalue of any factor is obtained by taking the sum of squares of the factor loadings for all variables.

EIGENVALUES  The eigen values are calculated for a given factor.  They reflects the variance in all the variables, which is accounted for by that factor.  A factor's eigen value may be computed as the sum of its squared factor loadings for all the variables.  They are contained in the “Total variance explained” table of SPSS output.

EIGENVALUES  Eigen value for Factor 1 = (Factor loading of var1 with Factor1) 2 + (Factor loading of var2 with Factor1) 2 + …..+ (Factor loading of varN with Factor 1) 2  Where N is the total number of variables present

Extraction methods  Principal component Analysis: It begins by finding a linear combination of variables (a component) that accounts for as much variation in the original variables as possible.  It then finds another component that accounts for as much of the remaining variation as possible and is uncorrelated with the previous component, continuing in this way until there are as many components as original variables.  Usually, a few components will account for most of the variation, and these components can be used to replace the original variables. This method is most often used to reduce the number of variables in the data file.

Principal component Analysis  The principal component methodology involves searching for those values of weights i so that the first factor explains the largest portion of total variance. This is called the first principal factor.  This explained variance is then subtracted from the original input matrix so as to yield a residual matrix.  A second principal factor is extracted from the residual matrix in a way such that the second factor takes care of most of the residual variance.  One point that has to be kept in mind is that the second principal factor has to be statistically independent of the first principal factor. The same principle is then repeated until there is little variance to be explained.

Factor Rotation  Factor analysis can generate several solutions for any data set. Each solution is termed a particular factor rotation and is generated by a particular factor rotation scheme.  Broadly there are two types of rotation:  Orthogonal & Oblique  Orthogonal rotation methods assume that the factors in the analysis are uncorrelated, where as Oblique consider otherwise.

Orthogonal METHODS OF ROTATION: Varimax. An orthogonal rotation method that minimizes the number of variables that have high loadings on each factor. It simplifies the interpretation of the factors. Quartimax. A rotation method that minimizes the number of factors needed to explain each variable. It simplifies the interpretation of the observed variables.

Orthogonal METHODS OF ROTATION  Equamax. A rotation method that is a combination of the varimax method, which simplifies the factors, and the quartimax method, which simplifies the variables. The number of variables that load highly on a factor and the number of factors needed to explain a variable are minimized.

Oblique METHODS OF ROTATION: Orthomax. Specifies families of orthogonal rotations. Gamma specifies the member of the family to use. Varying Gamma changes maximization of the variances of the loadings from columns (Varimax) to rows (Quartimax). Oblimin. Specifies families of oblique (non-orthogonal) rotations. Gamma specifies the member of the family to use. For Gamma, specify 0 for moderate correlations, positive values to allow higher correlations, and negative values to restrict correlations.

FACTOR ANALYSIS…How Many Factors?  Rule of Thumb All included factors (prior to rotation) must explain at least as much variance as an "average variable"  Eigenvalues Criteria Eigenvalue represents the amount of variance in the original variables associated with a factor Sum of the square of the factor loadings of each variable on a factor represents the eigenvalue Only factors with eigenvalues greater than 1.0 are retained  Predefined Number of Factors

Example

FACTOR LOADING: CUSTOMERS PERCEPTION ABOUT A MANUFACTURER

Example