Scaling and Factor Analysis. Scaling: lack of a perfect question Often we cannot find one question that can measure exactly what we want to measure Often.

Slides:



Advertisements
Similar presentations
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 March 12, 2012.
Advertisements

What are the causes of age discrimination in employment?
Some (Simplified) Steps for Creating a Personality Questionnaire Generate an item pool Administer the items to a sample of people Assess the uni-dimensionality.
1 Reliability in Scales Reliability is a question of consistency do we get the same numbers on repeated measurements? Low reliability: reaction time High.
Welfare Attitudes. Outline Review of Welfare Policies Review of Welfare Policies Discussion of Svallfors’ methodology and the relationship between attitudes.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Measures of Central Tendency.  Parentheses  Exponents  Multiplication or division  Addition or subtraction  *remember that signs form the skeleton.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
The financial practices and perceptions behind separate systems of household financial management Dr Katherine Ashby, Faculty of Law and Social Sciences,
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Public policy and European society University of Castellanza Session 3(a) Inequality and poverty in Europe and the USA November
Lecture 6: Multiple Regression
1 Sociology 601, Class 4: September 10, 2009 Chapter 4: Distributions Probability distributions (4.1) The normal probability distribution (4.2) Sampling.
Multicollinearity Omitted Variables Bias is a problem when the omitted variable is an explanator of Y and correlated with X1 Including the omitted variable.
Goals of Factor Analysis (1) (1)to reduce the number of variables and (2) to detect structure in the relationships between variables, that is to classify.
Education 795 Class Notes Factor Analysis II Note set 7.
Social Welfare Policymaking Chapter 18
Social Welfare Policymaking
Economic Systems.
Measuring Social Life Ch. 5, pp
Measures of Central Tendency
Chapter 7 The Government Sector. Introduction: The Growing Economic Role of Government Most of the growth over the past seven decades was due to the Depression.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Presentation on „Steeple“ analyzis in switzerland
Hypothesis Testing II The Two-Sample Case.
PERSONAL FINANCE Unit 3: Financial Responsibility and Money Management.
Essentials of Contemporary Management, 1Ce. Copyright (c) 2005 The McGraw-Hill Companies, Inc. All rights reserved. 2-1 The Environment of Management 2.
Regression. Regression  y= output, such as income, support for welfare  c= constant, does not change  b= coefficient (an increase of one year of education.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
1 Cronbach’s Alpha It is very common in psychological research to collect multiple measures of the same construct. For example, in a questionnaire designed.
Chapter 08 Risk and Rate of Return
Factor Analysis in Individual Differences Research: The Basics Psych 437.
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Advanced Correlational Analyses D/RS 1013 Factor Analysis.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Multiple Regression. Multiple Regression  Usually several variables influence the dependent variable  Example: income is influenced by years of education.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
Measures of Central Tendency: The Mean, Median, and Mode
Using Credit SSEPF4.a, SSEPF4.b, SSEPF4.c. Loans and Credit Cards: Buy Now, Pay Later The U.S. economy runs on credit. Credit – The ability to obtain.
The Commutative Property Using Tiles © Math As A Second Language All Rights Reserved next #4 Taking the Fear out of Math.
Factor analysis (PCA) in action Thought for the day: “Does one learn better by understanding the abstract definition or by actually doing the activity?”
1 Psych 5500/6500 Measures of Variability Fall, 2008.
AP Statistics Section 11.1 B More on Significance Tests.
Examining Data. Constructing a variable 1. Assemble a set of items that might work together to define a construct/ variable. 2. Hypothesize the hierarchy.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 12 Multiple.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Education 795 Class Notes Factor Analysis Note set 6.
Exploratory Factor Analysis Principal Component Analysis Chapter 17.
Indexes Anthony Sealey University of Toronto This material is distributed under an Attribution-NonCommercial-ShareAlike 3.0 Unported Creative Commons License,
Political Parties #1: Ideology. Which political party are you? Answer true or false according to your opinion. 1.Getting rid of or reducing the restrictions.
Regression Lab Exercises. Crosstabs Good to give a general picture Good to give a general picture Good to compare dependent varialbes between groups (countries,
Farid Abolhassani Equity 12. Learning Objectives After working through this chapter, you will be able to: Describe the relationship between equality and.
Copyright (c) Bani Mallick1 STAT 651 Lecture 8. Copyright (c) Bani Mallick2 Topics in Lecture #8 Sign test for paired comparisons Wilcoxon signed rank.
STANDARDS 1.4.3, Local and State Governments.
Economy (Jobs, Workers, and Taxes) Economics- of or relating to the production, development, and management of material wealth, as of a country, household,
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
FACTOR ANALYSIS & SPSS. First, let’s check the reliability of the scale Go to Analyze, Scale and Reliability analysis.
Do Now What are some strengths/weaknesses of your city? Think of its government if you can…
FACTOR ANALYSIS & SPSS.
Regression Analysis.
Regression Analysis.
Dimension Reduction via PCA (Principal Component Analysis)
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Introduction to bivariate data
Descriptive Statistics vs. Factor Analysis
Political Parties: Ideology.
Examining Data.
Presentation transcript:

Scaling and Factor Analysis

Scaling: lack of a perfect question Often we cannot find one question that can measure exactly what we want to measure Often we cannot find one question that can measure exactly what we want to measure But a collection of questions can measure it more clearly But a collection of questions can measure it more clearly An example is support for welfare An example is support for welfare If we would ask: do you support welfare, it would be difficult to interpret that question. If we would ask: do you support welfare, it would be difficult to interpret that question. Does it mean in general you support welfare policies? Does it mean in general you support welfare policies? Does it mean you suppot the policies that actually exist in your own country? Does it mean you suppot the policies that actually exist in your own country? Does it mean you do not support the policies that exist in your country, but exist in another county, like Sweden? Does it mean you do not support the policies that exist in your country, but exist in another county, like Sweden?

Questions from ISSP 2006 that deal with Welfare Q6a: Government should spend money: Environment Q6a: Government should spend money: Environment Q6b: Government should spend money: Health Q6b: Government should spend money: Health Q6c: Government should spend money: Law enforcement Q6c: Government should spend money: Law enforcement Q6d: Government should spend money: Education Q6d: Government should spend money: Education Q6e: Government should spend money: Defence Q6e: Government should spend money: Defence Q6f: Government should spend money: Retirement Q6f: Government should spend money: Retirement Q6g: Government should spend money: Unempl. benefits Q6g: Government should spend money: Unempl. benefits Q6h: Government should spend money: Culture and arts Q6h: Government should spend money: Culture and arts Q7a: Gov. responsibility: Provide job for everyone Q7a: Gov. responsibility: Provide job for everyone Q7b: Gov. responsibility: Control prices Q7b: Gov. responsibility: Control prices Q7c: Gov. responsibility: Provide health care for sick Q7c: Gov. responsibility: Provide health care for sick Q7d: Gov. responsibility: Provide living standard for the old Q7d: Gov. responsibility: Provide living standard for the old Q7e: Gov. responsibility: Help industry grow Q7e: Gov. responsibility: Help industry grow Q7f: Gov. responsibility: Provide living standard for unemployed Q7f: Gov. responsibility: Provide living standard for unemployed Q7g: Gov. responsibility: Reduce income differences betw. rich/ poor Q7g: Gov. responsibility: Reduce income differences betw. rich/ poor Q7h: Gov. responsibility: Financial help to students Q7h: Gov. responsibility: Financial help to students Q7i: Gov. responsibility: Provide decent housing Q7i: Gov. responsibility: Provide decent housing Q7j: Gov. responsibility: Laws to protect environment Q7j: Gov. responsibility: Laws to protect environment Q8a: Gov. successful: Provide health care for sick Q8a: Gov. successful: Provide health care for sick Q8b: Gov. successful: Provide living standard for old Q8b: Gov. successful: Provide living standard for old Q8c: Gov. successful: Dealing with threats to security Q8c: Gov. successful: Dealing with threats to security Q8d: Gov. successful: Controlling crime Q8d: Gov. successful: Controlling crime Q8e: Gov. successful: Fighting unemployment Q8e: Gov. successful: Fighting unemployment

The darker questions we could eliminate, because they do not deal with social policies Q6a: Government should spend money: Environment Q6a: Government should spend money: Environment Q6b: Government should spend money: Health Q6b: Government should spend money: Health Q6c: Government should spend money: Law enforcement Q6c: Government should spend money: Law enforcement Q6d: Government should spend money: Education Q6d: Government should spend money: Education Q6e: Government should spend money: Defence Q6e: Government should spend money: Defence Q6f: Government should spend money: Retirement Q6f: Government should spend money: Retirement Q6g: Government should spend money: Unempl. benefits Q6g: Government should spend money: Unempl. benefits Q6h: Government should spend money: Culture and arts Q6h: Government should spend money: Culture and arts Q7a: Gov. responsibility: Provide job for everyone Q7a: Gov. responsibility: Provide job for everyone Q7b: Gov. responsibility: Control prices Q7b: Gov. responsibility: Control prices Q7c: Gov. responsibility: Provide health care for sick Q7c: Gov. responsibility: Provide health care for sick Q7d: Gov. responsibility: Provide living standard for the old Q7d: Gov. responsibility: Provide living standard for the old Q7e: Gov. responsibility: Help industry grow Q7e: Gov. responsibility: Help industry grow Q7f: Gov. responsibility: Provide living standard for unemployed Q7f: Gov. responsibility: Provide living standard for unemployed Q7g: Gov. responsibility: Reduce income differences betw. rich/ poor Q7g: Gov. responsibility: Reduce income differences betw. rich/ poor Q7h: Gov. responsibility: Financial help to students Q7h: Gov. responsibility: Financial help to students Q7i: Gov. responsibility: Provide decent housing Q7i: Gov. responsibility: Provide decent housing Q7j: Gov. responsibility: Laws to protect environment Q7j: Gov. responsibility: Laws to protect environment Q8a: Gov. successful: Provide health care for sick Q8a: Gov. successful: Provide health care for sick Q8b: Gov. successful: Provide living standard for old Q8b: Gov. successful: Provide living standard for old Q8c: Gov. successful: Dealing with threats to security Q8c: Gov. successful: Dealing with threats to security Q8d: Gov. successful: Controlling crime Q8d: Gov. successful: Controlling crime Q8e: Gov. successful: Fighting unemployment Q8e: Gov. successful: Fighting unemployment

After Eliminating the Darker ones, we see 3 groups: 1)spending money, 2) responsibility, 3) successful Q6b: Government should spend money: Health Q6b: Government should spend money: Health Q6d: Government should spend money: Education Q6d: Government should spend money: Education Q6f: Government should spend money: Retirement Q6f: Government should spend money: Retirement Q6g: Government should spend money: Unempl. benefits Q6g: Government should spend money: Unempl. benefits Q6h: Government should spend money: Culture and arts Q6h: Government should spend money: Culture and arts Q7a: Gov. responsibility: Provide job for everyone Q7a: Gov. responsibility: Provide job for everyone Q7b: Gov. responsibility: Control prices Q7b: Gov. responsibility: Control prices Q7c: Gov. responsibility: Provide health care for sick Q7c: Gov. responsibility: Provide health care for sick Q7d: Gov. responsibility: Provide living standard for the old Q7d: Gov. responsibility: Provide living standard for the old Q7f: Gov. responsibility: Provide living standard for unemployed Q7f: Gov. responsibility: Provide living standard for unemployed Q7g: Gov. responsibility: Reduce income differences betw. rich/ poor Q7g: Gov. responsibility: Reduce income differences betw. rich/ poor Q7h: Gov. responsibility: Financial help to students Q7h: Gov. responsibility: Financial help to students Q7i: Gov. responsibility: Provide decent housing Q7i: Gov. responsibility: Provide decent housing Q8a: Gov. successful: Provide health care for sick Q8a: Gov. successful: Provide health care for sick Q8b: Gov. successful: Provide living standard for old Q8b: Gov. successful: Provide living standard for old Q8e: Gov. successful: Fighting unemployment Q8e: Gov. successful: Fighting unemployment

Implications The ones about success are new and to make it simple, I will ignore them for now The ones about success are new and to make it simple, I will ignore them for now Spending: problem is that one’s answer depends on one’s starting point. A Swede who thinks the government should spend less might be more favorable to the welfare state than an American who thinks the government should spend more, because they have different starting points Spending: problem is that one’s answer depends on one’s starting point. A Swede who thinks the government should spend less might be more favorable to the welfare state than an American who thinks the government should spend more, because they have different starting points Responsibility: one can think the government should be resonsible for healthcare AND provide it, or simply that it should be responsible for healthcare by REGULATING it and keeping it private Responsibility: one can think the government should be resonsible for healthcare AND provide it, or simply that it should be responsible for healthcare by REGULATING it and keeping it private Since no perfect questions exist, we get a better idea by combining imperfect questions Since no perfect questions exist, we get a better idea by combining imperfect questions

The advantage in have a larger number of possible outcomes We also get a more exact answer if we can create a scale from 0-50 than from 0-4 We also get a more exact answer if we can create a scale from 0-50 than from 0-4 We can also use simpler statistical methods if we have a “continuous” dependent variable We can also use simpler statistical methods if we have a “continuous” dependent variable This allows us to use the simple multiple regression method that we have learned so far This allows us to use the simple multiple regression method that we have learned so far If the dependent variable is 0-1 we need to use things like “logit” and “probit” If the dependent variable is 0-1 we need to use things like “logit” and “probit” If it is 0-4 we should use things like “ordinal logit” or “ordinal probit” If it is 0-4 we should use things like “ordinal logit” or “ordinal probit”

One Dimensional Scales Cronbach alpha Cronbach alpha Test to see if all the questions are consistent Test to see if all the questions are consistent We might think a group of questions belong together, but the respondents could interpret them differently We might think a group of questions belong together, but the respondents could interpret them differently Cronbach’s alfa expects all the questions to measure approximately the same thing Cronbach’s alfa expects all the questions to measure approximately the same thing But let´s say the scale is 0-4. Perhaps my ”true” score is 2.8 and your true score is 2.7. If we only have one question, then we will both answer 3, so more questions makes it more accurate, as my average then would become 2.8 and yours 2.7. But let´s say the scale is 0-4. Perhaps my ”true” score is 2.8 and your true score is 2.7. If we only have one question, then we will both answer 3, so more questions makes it more accurate, as my average then would become 2.8 and yours 2.7. It is acceptable if alpha >.6 but best if >.8 It is acceptable if alpha >.6 but best if >.8

Alpha Score in SPSS. Not so bad since in this example I did not recode the variables, so they all go in the same direction

We look at the last row and see which items would increase the alpha score if eliminated

What to do after conducting reliability analysis? First eliminate values if the alpha level is low First eliminate values if the alpha level is low Second when a good enough alpha score is found, create a new variable Second when a good enough alpha score is found, create a new variable You again go to TRANSFORM -> COMPUTE VARIABLE and then give the variable a new name, like WELFSUP and add all the questions together that comprise the scale. You again go to TRANSFORM -> COMPUTE VARIABLE and then give the variable a new name, like WELFSUP and add all the questions together that comprise the scale. Then you can conduct regression analysis without the previous problems as you now have a continuous dependent variable! Then you can conduct regression analysis without the previous problems as you now have a continuous dependent variable!

Mokken Scale Is not used much, but it is also valuable Is not used much, but it is also valuable It allows us to test the consistency for questions that we can rank It allows us to test the consistency for questions that we can rank For example: do you support the women’s right to abortion anytime she wants it? For example: do you support the women’s right to abortion anytime she wants it? Do you support the right for women to have an abortion if they are too poor to take care of the child? Do you support the right for women to have an abortion if they are too poor to take care of the child? Do you support the right for women to have an abortion if they were raped? Do you support the right for women to have an abortion if they were raped? Do you support the right for women to have an abortion if their own life is threatened? Do you support the right for women to have an abortion if their own life is threatened? Unfortunately, not in SPSS. Can import it to STATA or buy as separate program. Unfortunately, not in SPSS. Can import it to STATA or buy as separate program.

Multidimensional scaling Factor analysis Factor analysis There can be several dimensions to an issue There can be several dimensions to an issue For example: “support for big government” For example: “support for big government” One dimension could be support for welfare programs One dimension could be support for welfare programs Another dimension could be support for the military and police Another dimension could be support for the military and police A third dimension could be support for education A third dimension could be support for education We use statistical programs to see which questions scale well together. We use statistical programs to see which questions scale well together. The most common is called “principle components analysis” The most common is called “principle components analysis”

Rotation: we don´t want the items in the two factors to be related, so we rotate the axis: before rotation

After Rotation (REPUT should be revmoved as it is related strongly to both factors)

How Many Factors are There? It would be best to start with theory and test it. We do this in structural equation modeling. It would be best to start with theory and test it. We do this in structural equation modeling. In SPSS we work inductively with exploratory factor analysis and make are decisions mostly on data. In SPSS we work inductively with exploratory factor analysis and make are decisions mostly on data. But we still need theory then to interpret the data and in some cases we might not want to include an item in a factor if it does not make theoretical sense, even if it scales well. But we still need theory then to interpret the data and in some cases we might not want to include an item in a factor if it does not make theoretical sense, even if it scales well.

Explained variance As is usual, we want to explain as much variance as possible, with as few variables as possible, which in this case means as few factors as possible. As is usual, we want to explain as much variance as possible, with as few variables as possible, which in this case means as few factors as possible. Each eigenvalue represents the amount of variance that has been captured by one component. Each eigenvalue represents the amount of variance that has been captured by one component. Normally, we want the eigenvalue to be greater than 1. Normally, we want the eigenvalue to be greater than 1.

Table of Eigenvalues The first two components have eigenvalues greater than 1. Their account for almost 85% of the explained variance. Addint a third factor would only increase the explained variance by a little more than 8%.

Scree Plot We look where the biggest drop takes place. We look where the biggest drop takes place. We see that after the second factor, there is a huge drop in Eigenvalue. We see that after the second factor, there is a huge drop in Eigenvalue. So again, it seems there are only two factors. So again, it seems there are only two factors.

Testing Kaiser has described MSAs above.9 as marvelous, above.8 as meritorious, above.7 as middling, above.6 as mediocre, above.5 as miserable, and below.5 as unacceptable. Kaiser has described MSAs above.9 as marvelous, above.8 as meritorious, above.7 as middling, above.6 as mediocre, above.5 as miserable, and below.5 as unacceptable.

After creating factors SPSS lets you create new variables for each factor. SPSS lets you create new variables for each factor. This is fine if you only want to regressions on each factor. This is fine if you only want to regressions on each factor. If you want to compare outcomes for each factor and be able to say, for example, that Swedes score higher than Czechs on Factor 1 (support for equality) then it is good to make your own new variables by adding together all the items (i.e. questions) for each factor. If you want to compare outcomes for each factor and be able to say, for example, that Swedes score higher than Czechs on Factor 1 (support for equality) then it is good to make your own new variables by adding together all the items (i.e. questions) for each factor. In other words, you do just as with Cronbach´s alpha, but this time you create several scales, not just one. In other words, you do just as with Cronbach´s alpha, but this time you create several scales, not just one. Afterwards, you can run regressions on each factor separately just as with Cronbach´s alpha. Afterwards, you can run regressions on each factor separately just as with Cronbach´s alpha. In advanced statistics, such as Structural Equation Modelling, it is also possible to include sevaral factors in one model and run several regressions simultaneously, but this is not so common even if I usually do it this way. In advanced statistics, such as Structural Equation Modelling, it is also possible to include sevaral factors in one model and run several regressions simultaneously, but this is not so common even if I usually do it this way.