Applied Quantitative Analysis and Practices LECTURE#16 By Dr. Osman Sadiq Paracha.

Lecture Summary
• Assumptions in testing of hypothesis:
  o Additivity and linearity
  o Normality
  o Homogeneity of variance
  o Independence
• Reduction of bias:
  o Trim the data
  o Winsorizing
  o Analyse with robust methods
  o Transform the data

Transforming Data
• Log transformation (log(X_i)): reduces positive skew.
• Square root transformation (√X_i): also reduces positive skew. Can also be useful for stabilizing variance.
• Reciprocal transformation (1/X_i): dividing 1 by each score also reduces the impact of large scores. This transformation reverses the order of the scores; you can avoid this by reversing the scores before the transformation, 1/(X_highest − X_i).
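The three transformations can be tried on a small set of scores. The sketch below is illustrative only: the scores are made up, NumPy is assumed to be available, `skewness` is a hypothetical helper, and one is added to the highest score before reversing (an assumption of this sketch, not part of the slide) so that the highest score does not give a zero denominator.

```python
import numpy as np

def skewness(x):
    """Skewness as the third standardized moment (population form)."""
    x = np.asarray(x, dtype=float)
    centered = x - x.mean()
    return (centered ** 3).mean() / (centered ** 2).mean() ** 1.5

# Hypothetical positively skewed scores, sorted, all > 0 so that
# log(x) and 1/x are defined.
scores = np.array([1, 2, 2, 3, 3, 3, 4, 5, 8, 20], dtype=float)

log_scores = np.log(scores)      # log transformation
sqrt_scores = np.sqrt(scores)    # square root transformation
reciprocal = 1.0 / scores        # reciprocal: reverses the order of scores

# Reverse the scores first so the reciprocal keeps the original ordering.
# The "+ 1" is an assumption of this sketch to avoid dividing by zero.
reversed_reciprocal = 1.0 / (scores.max() + 1 - scores)

for label, values in [("raw", scores), ("log", log_scores), ("sqrt", sqrt_scores)]:
    print(f"{label:>4}: skewness = {skewness(values):.2f}")
```

On these scores the log transformation pulls in the long right tail more strongly than the square root does, which matches the usual ordering of the two.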

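Log Transformation [Figure: distribution before and after the transformation]

Square Root Transformation [Figure: distribution before and after the transformation]

Reciprocal Transformation [Figure: distribution before and after the transformation]

But … [Figure: before and after panels]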

To Transform … Or Not
• Transforming the data helps as often as it hinders the accuracy of F.
• According to researchers:
  o The central limit theorem: the sampling distribution will be normal in samples > 40 anyway.
  o Transforming the data changes the hypothesis being tested. E.g. when using a log transformation and comparing means, you change from comparing arithmetic means to comparing geometric means.
  o In small samples it is tricky to determine normality one way or another.
  o The consequences for the statistical model of applying the ‘wrong’ transformation could be worse than the consequences of analysing the untransformed scores.
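The point about changing the hypothesis can be made concrete: averaging log scores and transforming back gives the geometric mean, not the arithmetic mean. A minimal sketch with made-up values and hypothetical helper names:

```python
import math

def arithmetic_mean(values):
    return sum(values) / len(values)

def geometric_mean(values):
    # Mean of the log scores, transformed back to the original scale.
    return math.exp(arithmetic_mean([math.log(v) for v in values]))

group = [1.0, 10.0, 100.0]  # hypothetical scores

print(arithmetic_mean(group))  # mean of the raw scores
print(geometric_mean(group))   # back-transformed mean of the log scores
```

For these values the arithmetic mean is 37 while the geometric mean is about 10, so a test on log scores compares a different quantity than a test on raw scores.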

Reliability
• Reliability: the ability of the measure to produce the same results under the same conditions.
• Test-retest reliability: the ability of a measure to produce consistent results when the same entities are tested at two different points in time.

Cronbach’s alpha assessing scale reliability

Cronbach’s alpha
• Cronbach's alpha is an index of reliability associated with the variation accounted for by the true score of the "underlying construct."
• Allows a researcher to measure the internal consistency of scale items, based on the average inter-item correlation.
• Indicates the extent to which the items in your questionnaire are related to each other.
• Is interpreted on the assumption that the scale is unidimensional; a high alpha does not by itself show whether a scale is unidimensional or multidimensional, so dimensionality should be checked with factor analysis.
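A minimal sketch of the usual variance-based formula for alpha, alpha = k/(k − 1) × (1 − sum of item variances / variance of the total score), where k is the number of items. The responses below are made up and the function name is hypothetical; NumPy is assumed to be available.

```python
import numpy as np

def cronbach_alpha(items):
    """items: 2-D array, rows = respondents, columns = scale items."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_variances = items.var(axis=0, ddof=1)       # variance of each item
    total_variance = items.sum(axis=1).var(ddof=1)   # variance of summed score
    return (k / (k - 1)) * (1 - item_variances.sum() / total_variance)

# Hypothetical answers of five respondents to a three-item scale.
responses = np.array([
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 2],
    [3, 3, 3],
])

alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

The three made-up items move together closely, so alpha comes out high (about .96); real questionnaire data will usually give lower values.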

Interpreting scale reliability
• The higher the score, the more reliable the generated scale is.
• A score of .70 or greater is generally considered to be acceptable.
• Descriptive bands, from highest to lowest: high reliability (.90 or greater), good reliability, acceptable reliability, marginal reliability.
• Lower thresholds are sometimes used in the literature.

Validity
• Validity: whether an instrument measures what it set out to measure.
• Content validity: evidence that the content of a test corresponds to the content of the construct it was designed to cover.
• Construct validity: established with more complex statistical methods that validate the constructs, which is why it is often preferred over other types of validity.

Exploratory Factor Analysis

Exploratory Factor Analysis Defined
Exploratory factor analysis is an interdependence technique whose primary purpose is to define the underlying structure among the variables in the analysis.

What is Exploratory Factor Analysis?
Exploratory factor analysis:
• Examines the interrelationships among a large number of variables and then attempts to explain them in terms of their common underlying dimensions.
• These common underlying dimensions are referred to as factors.
• Is a summarization and data reduction technique that does not have independent and dependent variables, but is an interdependence technique in which all variables are considered simultaneously.

Correlation Matrix for Store Image Elements

Correlation Matrix of Variables After Grouping Using Factor Analysis Shaded areas represent variables likely to be grouped together by factor analysis.

Application of Factor Analysis to a Fast-Food Restaurant

Factors            Variables
Service Quality    Waiting Time, Cleanliness, Friendly Employees
Food Quality       Taste, Temperature, Freshness

Factor Analysis Decision Process
Stage 1: Objectives of Factor Analysis
Stage 2: Designing a Factor Analysis
Stage 3: Assumptions in Factor Analysis
Stage 4: Deriving Factors and Assessing Overall Fit
Stage 5: Interpreting the Factors
Stage 6: Validation of Factor Analysis
Stage 7: Additional Uses of Factor Analysis Results

Stage 1: Objectives of Factor Analysis
1. Is the objective exploratory or confirmatory?
2. Specify the unit of analysis.
3. Data summarization and/or reduction?
4. Using factor analysis with other techniques.

Factor Analysis Outcomes
1. Data summarization = derives underlying dimensions that, when interpreted and understood, describe the data in a much smaller number of concepts than the original individual variables.
2. Data reduction = extends the process of data summarization by deriving an empirical value (factor score or summated scale) for each dimension (factor) and then substituting this value for the original values.

Types of Factor Analysis
1. Exploratory Factor Analysis (EFA) is used to discover the factor structure of a construct and examine its reliability. It is data driven.
2. Confirmatory Factor Analysis (CFA) is used to confirm the fit of the hypothesized factor structure to the observed (sample) data. It is theory driven.

Stage 2: Designing a Factor Analysis
Two Basic Decisions:
1. Design of study in terms of number of variables, measurement properties of variables, and the type of variables.
2. Sample size necessary.

Rules of Thumb: Factor Analysis Design
• Factor analysis is performed most often only on metric variables, although specialized methods exist for the use of dummy variables. A small number of “dummy variables” can be included in a set of metric variables that are factor analyzed.
• If a study is being designed to reveal factor structure, strive to have at least five variables for each proposed factor.
• For sample size:
  o the sample must have more observations than variables.
  o the minimum absolute sample size should be 50 observations.
• Maximize the number of observations per variable, with a minimum of five and hopefully at least ten observations per variable.
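The sample-size rules of thumb reduce to simple arithmetic. The helper below is a hypothetical sketch that just restates them; the function name and the example numbers are made up.

```python
def check_efa_design(n_observations, n_variables):
    """Check a planned design against the sample-size rules of thumb."""
    ratio = n_observations / n_variables
    return {
        "more_observations_than_variables": n_observations > n_variables,
        "at_least_50_observations": n_observations >= 50,
        "observations_per_variable": ratio,
        "meets_minimum_ratio_5": ratio >= 5,
        "meets_preferred_ratio_10": ratio >= 10,
    }

# Hypothetical study: 100 respondents rating 13 metric variables.
design = check_efa_design(100, 13)
for rule, value in design.items():
    print(rule, value)
```

With 100 observations and 13 variables the design clears the 5:1 minimum but not the preferred 10:1 ratio.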

Stage 3: Assumptions in Factor Analysis

Assumptions
• Multicollinearity: assessed using the MSA (measure of sampling adequacy).
• Homogeneity of sample factor solutions.
• The MSA is measured by the Kaiser-Meyer-Olkin (KMO) statistic. As a measure of sampling adequacy, the KMO predicts whether data are likely to factor well, based on correlations and partial correlations.
• KMO can be used to identify which variables to drop from the factor analysis because they lack multicollinearity.
• There is a KMO statistic for each individual variable, and an overall KMO statistic computed from the correlations and partial correlations of all variables together (it is not the sum of the individual values).
• KMO varies from 0 to 1.0. Overall KMO should be .50 or higher to proceed with factor analysis.
• If it is not, remove the variable with the lowest individual KMO value, one variable at a time, until the overall KMO rises above .50 and each individual variable's KMO is above .50.
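A minimal sketch of how the KMO statistic can be computed from a correlation matrix: squared correlations are compared with squared partial correlations, which are obtained here from the inverse of the correlation matrix. The function name and the correlation matrix are made up for illustration, and NumPy is assumed to be available.

```python
import numpy as np

def kmo(corr):
    """Return (per-variable KMO values, overall KMO) for a correlation matrix."""
    corr = np.asarray(corr, dtype=float)
    inv = np.linalg.inv(corr)
    # Partial correlation of each pair, controlling for all other variables.
    scale = np.sqrt(np.outer(np.diag(inv), np.diag(inv)))
    partial = -inv / scale
    # Keep only the off-diagonal entries.
    off_diagonal = ~np.eye(corr.shape[0], dtype=bool)
    r2 = np.where(off_diagonal, corr ** 2, 0.0)
    p2 = np.where(off_diagonal, partial ** 2, 0.0)
    per_variable = r2.sum(axis=0) / (r2.sum(axis=0) + p2.sum(axis=0))
    overall = r2.sum() / (r2.sum() + p2.sum())
    return per_variable, overall

# Hypothetical correlation matrix: three variables, all correlated at .50.
corr = np.array([
    [1.0, 0.5, 0.5],
    [0.5, 1.0, 0.5],
    [0.5, 0.5, 1.0],
])

per_variable, overall = kmo(corr)
print(per_variable.round(3), round(overall, 3))
```

Here every individual value and the overall value are about .69, above the .50 threshold. The overall statistic pools all pairs of variables; it is not obtained by adding the individual values.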

Rules of Thumb: Testing Assumptions of Factor Analysis
• There must be a strong conceptual foundation to support the assumption that a structure does exist before the factor analysis is performed.
• A statistically significant Bartlett’s test of sphericity (sig. < .05) indicates that sufficient correlations exist among the variables to proceed.
• Measure of Sampling Adequacy (MSA) values must exceed .50 for both the overall test and each individual variable. Variables with values less than .50 should be omitted from the factor analysis one at a time, with the smallest one being omitted each time.
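Bartlett’s test of sphericity is commonly computed as chi-square = −(n − 1 − (2p + 5)/6) × ln|R| with p(p − 1)/2 degrees of freedom, where R is the correlation matrix, p the number of variables and n the sample size. The sketch below assumes that form; the function name, the correlation matrix and the sample size are made up, and only the statistic and degrees of freedom are returned (the p-value would come from a chi-square table or a statistics library).

```python
import numpy as np

def bartlett_sphericity(corr, n_observations):
    """Return (chi-square statistic, degrees of freedom)."""
    corr = np.asarray(corr, dtype=float)
    p = corr.shape[0]
    # Log of the determinant of R, computed in a numerically stable way.
    _, log_det = np.linalg.slogdet(corr)
    statistic = -(n_observations - 1 - (2 * p + 5) / 6) * log_det
    df = p * (p - 1) // 2
    return statistic, df

# Hypothetical data: three variables correlated at .50, 100 observations.
corr = np.array([
    [1.0, 0.5, 0.5],
    [0.5, 1.0, 0.5],
    [0.5, 0.5, 1.0],
])
statistic, df = bartlett_sphericity(corr, 100)

# 7.815 is the .05 critical value of chi-square with 3 degrees of freedom.
sufficient_correlation = statistic > 7.815
print(round(statistic, 2), df, sufficient_correlation)
```

If the variables were uncorrelated, R would be an identity matrix, its determinant would be 1, and the statistic would be 0.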

Stage 4: Deriving Factors and Assessing Overall Fit
• Selecting the factor extraction method: common vs. component analysis.
• Determining the number of factors to represent the data.

Extraction Decisions
• Which method?
  o Principal Components Analysis
  o Common Factor Analysis
• How to rotate?
  o Orthogonal or Oblique rotation

Extraction Method Determines the Types of Variance Carried into the Factor Matrix

Diagonal Value    Variance
Unity (1)         Total variance (common, specific and error) is extracted
Communality       Common variance is extracted; specific and error variance is not used

Principal Components vs. Common?
Two Criteria:
• Objectives of the factor analysis.
• Amount of prior knowledge about the variance in the variables.

Number of Factors?
• A Priori Criterion
• Latent Root Criterion
• Percentage of Variance
• Scree Test Criterion

Eigenvalue Plot for Scree Test Criterion

Rules of Thumb: Choosing Factor Models and Number of Factors
• Although both component and common factor analysis models yield similar results in common research settings (30 or more variables or communalities of .60 for most variables):
  o the component analysis model is most appropriate when data reduction is paramount.
  o the common factor model is best in well-specified theoretical applications.
• Any decision on the number of factors to be retained should be based on several considerations:
  o Use of several stopping criteria to determine the initial number of factors to retain:
    - Factors with eigenvalues greater than 1.0.
    - A pre-determined number of factors based on research objectives and/or prior research.
    - Enough factors to meet a specified percentage of variance explained, usually 60% or higher.
    - Factors shown by the scree test to have substantial amounts of common variance (i.e., factors before the inflection point).
    - More factors when there is heterogeneity among sample subgroups.
  o Consideration of several alternative solutions (one more and one less factor than the initial solution) to ensure the best structure is identified.
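Two of the stopping criteria above, eigenvalues greater than 1.0 and a 60% variance target, can be read directly off the eigenvalues of the correlation matrix. This is a minimal sketch on a made-up correlation matrix; the function name is hypothetical and NumPy is assumed to be available. The scree test is a visual judgment on the plotted eigenvalues and is not automated here.

```python
import numpy as np

def retention_summary(corr, variance_target=0.60):
    """Eigenvalues, cumulative variance, and two suggested factor counts."""
    eigenvalues = np.sort(np.linalg.eigvalsh(np.asarray(corr, dtype=float)))[::-1]
    cumulative = np.cumsum(eigenvalues / eigenvalues.sum())
    # Latent root criterion: keep factors with eigenvalue > 1.0.
    n_latent_root = int((eigenvalues > 1.0).sum())
    # Percentage of variance: first count whose cumulative share meets the target.
    n_variance = int(np.searchsorted(cumulative, variance_target) + 1)
    return eigenvalues, cumulative, n_latent_root, n_variance

# Hypothetical correlation matrix: two pairs of variables, correlated .80
# within each pair and uncorrelated across pairs.
corr = np.array([
    [1.0, 0.8, 0.0, 0.0],
    [0.8, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.8],
    [0.0, 0.0, 0.8, 1.0],
])

eigenvalues, cumulative, n_latent_root, n_variance = retention_summary(corr)
print(eigenvalues.round(2), cumulative.round(2), n_latent_root, n_variance)
```

Both criteria point to two factors for this matrix; on real data they often disagree, which is why several criteria and alternative solutions are compared.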

Processes of Factor Interpretation
• Estimate the Factor Matrix
• Factor Rotation
• Factor Interpretation
• Respecification of factor model, if needed, may involve:
  o Deletion of variables from analysis
  o Desire to use a different rotational approach
  o Need to extract a different number of factors
  o Desire to change method of extraction

Rotation of Factors
• Factor rotation = the reference axes of the factors are turned about the origin until some other position has been reached.
• Unrotated factor solutions extract factors in order of how much variance they account for, with each subsequent factor accounting for less variance. The ultimate effect of rotating the factor matrix is therefore to redistribute the variance from earlier factors to later ones to achieve a simpler, theoretically more meaningful factor pattern.

Two Rotational Approaches
1. Orthogonal = axes are maintained at 90 degrees.
2. Oblique = axes are not maintained at 90 degrees.

Orthogonal Factor Rotation [Figure: variables V1 to V5 plotted against the axes Unrotated Factor I and Unrotated Factor II, with the axes Rotated Factor I and Rotated Factor II overlaid]

Oblique Factor Rotation [Figure: variables V1 to V5 plotted against the axes Unrotated Factor I and Unrotated Factor II, with the orthogonal rotation axes (Factor I, Factor II) and the oblique rotation axes (Factor I, Factor II) overlaid]

Orthogonal Rotation Methods
• Quartimax (simplify rows)
• Varimax (simplify columns)
• Equimax (combination)
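As an illustration of an orthogonal rotation, the sketch below uses a common SVD-based iteration for the varimax criterion. It is one possible implementation, not necessarily what any particular statistics package does; the function names and the loading matrix are made up, and NumPy is assumed to be available.

```python
import numpy as np

def varimax(loadings, max_iter=500, tol=1e-8):
    """Orthogonally rotate a (variables x factors) loading matrix."""
    loadings = np.asarray(loadings, dtype=float)
    n_vars, n_factors = loadings.shape
    rotation = np.eye(n_factors)
    previous = 0.0
    for _ in range(max_iter):
        rotated = loadings @ rotation
        # Gradient-like matrix of the varimax criterion.
        target = rotated ** 3 - rotated * (rotated ** 2).sum(axis=0) / n_vars
        u, s, vt = np.linalg.svd(loadings.T @ target)
        rotation = u @ vt  # closest orthogonal matrix, so axes stay at 90 degrees
        if s.sum() < previous * (1 + tol):
            break
        previous = s.sum()
    return loadings @ rotation, rotation

def varimax_criterion(loadings):
    """Sum over factors (columns) of the variance of the squared loadings."""
    return float((np.asarray(loadings) ** 2).var(axis=0).sum())

# Hypothetical simple structure, deliberately turned by 30 degrees so that
# every variable loads on both factors before rotation.
simple = np.array([[0.9, 0.0], [0.8, 0.0], [0.3, 0.0],
                   [0.0, 0.9], [0.0, 0.8], [0.0, 0.3]])
angle = np.deg2rad(30)
turn = np.array([[np.cos(angle), np.sin(angle)],
                 [-np.sin(angle), np.cos(angle)]])
unrotated = simple @ turn

rotated, rotation = varimax(unrotated)
print(np.round(rotated, 2))
```

Because the rotation matrix is orthogonal, each variable's communality (its row sum of squared loadings) is the same before and after rotation; only the distribution of variance across factors changes.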

Rules of Thumb: Choosing Factor Rotation Methods
• Orthogonal rotation methods:
  o are the most widely used rotational methods.
  o are the preferred method when the research goal is data reduction to either a smaller number of variables or a set of uncorrelated measures for subsequent use in other multivariate techniques.
• Oblique rotation methods:
  o are best suited to the goal of obtaining several theoretically meaningful factors or constructs because, realistically, very few constructs in the “real world” are uncorrelated.

Which Factor Loadings Are Significant?
• Customary criteria = practical significance.
• Sample size and statistical significance.
• Number of factors (more factors: larger loadings needed) and/or variables (more variables: smaller loadings suffice).

Guidelines for Identifying Significant Factor Loadings Based on Sample Size
[Table: Factor Loading | Sample Size Needed for Significance*]
* Significance is based on a .05 significance level (α), a power level of 80 percent, and standard errors assumed to be twice those of conventional correlation coefficients.

Rules of Thumb 3–5: Assessing Factor Loadings
• While factor loadings of ±.30 to ±.40 are minimally acceptable, values greater than ±.50 are considered necessary for practical significance.
• To be considered significant:
  o A smaller loading is needed given either a larger sample size, or a larger number of variables being analyzed.
  o A larger loading is needed given a factor solution with a larger number of factors, especially in evaluating the loadings on later factors.
• Statistical tests of significance for factor loadings are generally very conservative and should be considered only as starting points needed for including a variable for further consideration.

Stage 5: Interpreting the Factors

Interpreting a Factor Matrix:
1. Examine the factor matrix of loadings.
2. Identify the highest loading across all factors for each variable.
3. Assess communalities of the variables.
4. Label the factors.

Rules of Thumb 3–6: Interpreting the Factors
• An optimal structure exists when all variables have high loadings only on a single factor.
• Variables that cross-load (load highly on two or more factors) are usually deleted unless theoretically justified or the objective is strictly data reduction.
• Variables should generally have communalities of greater than .50 to be retained in the analysis.
• Respecification of a factor analysis can include options such as:
  o deleting a variable(s),
  o changing rotation methods, and/or
  o increasing or decreasing the number of factors.
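These checks can be applied mechanically to a loading matrix. The sketch below is a hypothetical helper: the variable names and loadings are made up, the .50 cut-off for a "high" loading is an assumption taken from the practical-significance guideline, and communalities are computed as row sums of squared loadings, which holds for an orthogonal solution. NumPy is assumed to be available.

```python
import numpy as np

def summarize_loadings(loadings, names, high=0.50, communality_min=0.50):
    """Flag cross-loading and low-communality variables."""
    loadings = np.asarray(loadings, dtype=float)
    communalities = (loadings ** 2).sum(axis=1)
    report = []
    for name, row, h2 in zip(names, loadings, communalities):
        n_high = int((np.abs(row) >= high).sum())
        report.append({
            "variable": name,
            "assigned_factor": int(np.abs(row).argmax()) + 1,  # highest loading
            "communality": float(h2),
            "cross_loads": n_high >= 2,
            "low_communality": bool(h2 < communality_min),
        })
    return report

# Hypothetical rotated loadings on two factors.
names = ["A", "B", "C"]
loadings = [
    [0.85, 0.10],   # loads cleanly on factor 1
    [0.60, 0.55],   # loads highly on both factors
    [0.20, 0.40],   # weak on both factors
]

report = summarize_loadings(loadings, names)
for entry in report:
    print(entry)
```

In this made-up example variable B would be a candidate for deletion because it cross-loads, and variable C because its communality is below .50.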

Stage 6: Validation of Factor Analysis
• Confirmatory Perspective.
• Assessing Factor Structure Stability.
• Detecting Influential Observations.

Stage 7: Additional Uses of Factor Analysis Results
• Selecting Surrogate Variables
• Creating Summated Scales
• Computing Factor Scores

Rules of Thumb: Summated Scales
• A summated scale is only as good as the items used to represent the construct. While it may pass all empirical tests, it is useless without theoretical justification.
• Never create a summated scale without first assessing its unidimensionality with exploratory or confirmatory factor analysis.
• Once a scale is deemed unidimensional, its reliability score, as measured by Cronbach’s alpha:
  o should exceed a threshold of .70, although a .60 level can be used in exploratory research.
  o the threshold should be raised as the number of items increases, especially as the number of items approaches 10 or more.
• With reliability established, validity should be assessed in terms of:
  o convergent validity = scale correlates with other like scales.
  o discriminant validity = scale is sufficiently different from other related scales.
  o nomological validity = scale “predicts” as theoretically suggested.

Rules of Thumb: Representing Factor Analysis in Other Analyses
• The single surrogate variable:
  o Advantages: simple to administer and interpret.
  o Disadvantages:
    1) does not represent all “facets” of a factor.
    2) prone to measurement error.
• Factor scores:
  o Advantages:
    1) represent all variables loading on the factor.
    2) best method for complete data reduction.
    3) are by default orthogonal and can avoid complications caused by multicollinearity.
  o Disadvantages:
    1) interpretation more difficult since all variables contribute through loadings.
    2) difficult to replicate across studies.

Rules of Thumb (continued): Representing Factor Analysis in Other Analyses
• Summated scales:
  o Advantages:
    1) compromise between the surrogate variable and factor score options.
    2) reduce measurement error.
    3) represent multiple facets of a concept.
    4) easily replicated across studies.
  o Disadvantages:
    1) include only the variables that load highly on the factor and exclude those having little or marginal impact.
    2) not necessarily orthogonal.
    3) require extensive analysis of reliability and validity issues.

Description of HBAT Primary Database Variables

Variable   Description                                          Variable Type
Data Warehouse Classification Variables
X1         Customer Type                                        nonmetric
X2         Industry Type                                        nonmetric
X3         Firm Size                                            nonmetric
X4         Region                                               nonmetric
X5         Distribution System                                  nonmetric
Performance Perceptions Variables
X6         Product Quality                                      metric
X7         E-Commerce Activities/Website                        metric
X8         Technical Support                                    metric
X9         Complaint Resolution                                 metric
X10        Advertising                                          metric
X11        Product Line                                         metric
X12        Salesforce Image                                     metric
X13        Competitive Pricing                                  metric
X14        Warranty & Claims                                    metric
X15        New Products                                         metric
X16        Ordering & Billing                                   metric
X17        Price Flexibility                                    metric
X18        Delivery Speed                                       metric
Outcome/Relationship Measures
X19        Satisfaction                                         metric
X20        Likelihood of Recommendation                         metric
X21        Likelihood of Future Purchase                        metric
X22        Current Purchase/Usage Level                         metric
X23        Consider Strategic Alliance/Partnership in Future    nonmetric

Rotated Component Matrix: “Reduced Set” of HBAT Perceptions Variables
[Table columns: Component 1, 2, 3, 4 | Communality]
Rows:
X9 – Complaint Resolution
X18 – Delivery Speed
X16 – Order & Billing
X12 – Salesforce Image
X7 – E-Commerce Activities
X10 – Advertising
X8 – Technical Support
X14 – Warranty & Claims
X6 – Product Quality
X13 – Competitive Pricing
Sum of Squares
Percentage of Trace
Extraction Method: Principal Component Analysis. Rotation Method: Varimax.

Scree Test for HBAT Component Analysis

Factor Analysis Learning Checkpoint
1. What are the major uses of factor analysis?
2. What is the difference between component analysis and common factor analysis?
3. Is rotation of factors necessary?
4. How do you decide how many factors to extract?
5. What is a significant factor loading?
6. How and why do you name a factor?
7. Should you use factor scores or summated ratings in follow-up analyses?