Correlations Renan Levine POL 242 July 12, 2006. Association : Crosstabulation Category Specifics Symmetry Specification Measure of Association Indication.

Slides:



Advertisements
Similar presentations
POL242 October 9 and 11, 2012 Jennifer Hove. Questions of Causality Recall: Most causal thinking in social sciences is probabilistic, not deterministic:
Advertisements

Chapter 16: Correlation.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Statistics and Quantitative Analysis U4320
Measures of Association for contingency tables 4 Figure 8.2 : lambda – association; +-1: strong; near 0: weak Positive association: as value of the independent.
Correlation and Linear Regression.
Bivariate Analyses.
Bivariate Analysis Cross-tabulation and chi-square.
Sociology 601 Class 13: October 13, 2009 Measures of association for tables (8.4) –Difference of proportions –Ratios of proportions –the odds ratio Measures.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Correlation Chapter 9.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
The Simple Regression Model
Session 7.1 Bivariate Data Analysis
PPA 415 – Research Methods in Public Administration Lecture 9 – Bivariate Association.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Social Research Methods
Multiple Regression – Basic Relationships
Chapter 7 Correlational Research Gay, Mills, and Airasian
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
Review Regression and Pearson’s R SPSS Demo
Correlation Coefficient Correlation coefficient refers to the type of relationship between variables that allows one to make predications from one variable.
Correlations 11/5/2013. BSS Career Fair Wednesday 11/6/2013- Mabee A & B 12:30-2:30P.
Week 11 Chapter 12 – Association between variables measured at the nominal level.
Week 9: QUANTITATIVE RESEARCH (3)
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
Association between Variables Measured at the Nominal Level.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
Correlation.
Regression and Correlation. Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends.
Agenda Review Association for Nominal/Ordinal Data –  2 Based Measures, PRE measures Introduce Association Measures for I-R data –Regression, Pearson’s.
Chi-Square Testing 10/23/2012. Readings Chapter 7 Tests of Significance and Measures of Association (Pollock) (pp ) Chapter 5 Making Controlled.
In the Lab: Working With Crosstab Tables Lab: Association and the Chi-square Test Chapters 7, 8 and 9 1.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Statistics in Applied Science and Technology Chapter 13, Correlation and Regression Part I, Correlation (Measure of Association)
Experimental Research Methods in Language Learning Chapter 11 Correlational Analysis.
Dr. G. Johnson, Exploring Relationships: Measures of Association Research Methods for Public Administrators Dr. Gail Johnson.
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Examining Relationships in Quantitative Research
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Logistic Regression July 28, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Regression MBA/510 Week 5. Objectives Describe the use of correlation in making business decisions Apply linear regression and correlation analysis. Interpret.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
Chapter 16 Data Analysis: Testing for Associations.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Correlation. Correlation Analysis Correlations tell us to the degree that two variables are similar or associated with each other. It is a measure of.
Dummy Variables; Multiple Regression July 21, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Chapter 16: Correlation. So far… We’ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e.
Statistics. Descriptive Statistics Organize & summarize data (ex: central tendency & variability.
Copyright © 2014 by Nelson Education Limited Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
Midterm Review Renan Levine POL 242 June 14, 2006.
Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.
Regression Chapter 5 January 24 – Part II.
Scatterplots & Correlations Chapter 4. What we are going to cover Explanatory (Independent) and Response (Dependent) variables Displaying relationships.
Copyright © 2012 by Nelson Education Limited. Chapter 12 Association Between Variables Measured at the Ordinal Level 12-1.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
Theme 5. Association 1. Introduction. 2. Bivariate tables and graphs.
Association Between Variables Measured at the Ordinal Level
T-test calculations.
Social Research Methods
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Regression Part II.
Presentation transcript:

Correlations Renan Levine POL 242 July 12, 2006

Association : Crosstabulation Category Specifics Symmetry Specification Measure of Association Indication of Direction Nominal X NominalOnly (2 X 2)Symmetrical Phi Yes Nominal X NominalGreater than (2X2)- Cramer's V No Nominal X OrdinalAt least (2 X 3)- Cramer's V No Ordinal X OrdinalSquare (e.g., 3 X 3)Symmetrical Kendall's Tau- b Yes Ordinal X OrdinalRectangle (e.g., 3 X 4)Asymmetrical Kendall's Tau- c Yes Interval X Interval (not taught yet) -- Pearson's R (not yet taught) Yes

Today: Correlations Correlation is a measure of a relationship between variables. Measured with a coefficient [Pearson’s r] that ranges from -1 to 1. Measure strength of relationship of interval or ratio variables r = Σ(Zx * Zy)/n – 1  Zx=Z scores for X variable and Z scores for Y variable. Sum the products and divide by number of paired cases minus one.  How to calculate Z scores can be found on-line.

Correlation r Absolute values closer to 0 indicate that there is little or no linear relationship.  Generally, is weak, is okay, 0.6 or higher is strong.  If correlation is very high, then its probably something related that you might considering indexing or choosing just one variable. The closer the coefficient is to the absolute value of 1 the stronger the relationship between the variables being correlated.

Positive Relationship If two variables are related positively or directly  r > 0 Variables “track together” – high values on Variable X are associated with high values on Variable Y.  Low values on X associated with low values.

Example Robert D. Putnam; Robert Leonardi; Raffaella Y. Nanetti; Franco Pavoncello. “Explaining Institutional Success: The Case of Italian Regional Government.” The American Political Science Review 77:1 (Mar. 1983), pp More fun examples:

Example II r = 0.84

Negative or Inverse Relationship Variables can be inversely or negatively related  High values of X are associated with low values of Y.

Example – Negative / Inverse r = red= Republicans, blue=Democrats, grey diamonds=Independents Time/SRBI: Oct 3-6, ‘08

Data You need interval-level data. You will find many interval-level variables in:  Countries / World  Provinces  Election studies (feeling thermometers, odds of party entering government, etc) You can often use the index you created as an interval-level variable.

Compare Lots more noise here. Typical of public opinion data. Most points close to a line.

Differences between Public Opinion and Aggregate Data Although it is not uncommon to have one/some outliers in aggregate data, public opinion data tends to be “noisy”.  Feeling thermometer example: Many respondents gave both candidates a 50; Quite a few respondents liked both candidates Even though most who liked McCain disliked Obama A high Pearson’s r for public opinion data may be low for an association in aggregate data.

Guidelines for Public Opinion Data MAGNITUDE OF ASSOCIATION QUALIFICATIONCOMMENTS 0.00No Relationship Knowing the independent variable does not reduce the number of errors in predicting the dependent variable at all..00 to.15Not Useful Not Acceptable.15 to.20Very Weak Minimally acceptable.20 to.25Moderately Strong Acceptable.25 to.30Fairly Strong Good Work.30 to.40Strong Great Work.40 to.70Very Strong/Worrisomely Strong EITHER an excellent relationship OR the two variables are measuring the same thing.70 to.99Redundant (?) Proceed with caution: are the two variables testing the same thing? 1.00Perfect Relationship. If we the know the independent variable, we can predict the dependent variable with absolute success.

Rough Guidelines for Aggregate Data MAGNITUDE OF ASSOCIATION QUALIFICATIONCOMMENTS 0.00No Relationship Knowing the independent variable does not reduce the number of errors in predicting the dependent variable at all..00 to.30Not useful, very weak Not Acceptable.30 to.50Weak Minimally acceptable.50 to.70Fairly Strong Acceptable.70 to.85Strong Good Work.80 to.90Very Strong/Worrisomely Strong EITHER an excellent relationship OR the two variables are measuring the same thing.90 to.99Redundant (?) Proceed with caution: are the two variables testing the same thing? 1.00Perfect Relationship. If we the know the independent variable, we can predict the dependent variable with absolute success.

Very Strong or Worrisome?? Public Opinion: above |0.40| Aggregate: above |0.80| But these are just guidelines. It depends on how good the data is:  Lots of variation in data  Large scale (10, 20, 100 pts – like prediction odds, physicians per 100,000 people, feeling thermometer scales)  Number of observations (N) Provinces dataset is small

Outstanding or the same? You either have an outstanding relationship OR the variables may be measuring the same idea.  Ex. unemployment and GDP both measure economic health  Ex. Feeling thermometer Barack Obama and feeling thermometer for Joe Biden both measure attitudes towards the Democratic ticket Also inverse relationship  Example above: Obama and McCain feeling thermometers – different sides of the same coin, as both seem to measure partisanship.

Use Yo’ Brain Computer cannot tell you if it’s a good, strong relationship or two measures looking at the same thing. Need to understand what each variable is measuring  Same thought process about the index creation. Use your knowledge of world and theory to decide whether two variables measure the same thing or two different things. Example (above): Putnam’s relationship between civic culture and government performance. Failed states survey - appears that the higher an indicator value, the worse off the country in that particular field. sk=view&id=99&Itemid=140 sk=view&id=99&Itemid=140

Flip side Relationship you expect is strong is surprisingly not ?!?!? Make certain both variables are interval Double check that you cleaned up data  Missing values are missing  Next week: there may be the need to qualify the relationship as some sub-group of the data is not like the others and those need to be identified. Think about relationship – maybe its not linear, so that relationship is only present for part of range.

Usefulness Quick, easy way to look at several variables to see if they are related. With strong association, you can begin to think about predicting values of Y based on a value of X.  Ex. Positive correlation – you know a high value of X is associated with a high value of Y!

Webstats Output - - Correlation Coefficients - - Q375A1 Q305 Q375A3 Q1005 Q375A ( 686) ( 666) ( 667) ( 672) P=. P=.000 P=.000 P=.000 Q ( 666) ( 2776) ( 660) ( 2721) P=.000 P=. P=.000 P=.000 Q375A ( 667) ( 660) ( 682) ( 666) P=.000 P=.000 P=. P=.000 Q ( 672) ( 2721) ( 666) ( 3181) P=.000 P=.000 P=.000 P=. Coefficients (Pearson’s r) N

Significance? Webstats will tell you whether or not the correlation coefficient is significant. Remember that this is just telling you whether the relationship may be due to chance.  Not the strength of the relationship Almost unheard of to have a strong relationship that is insignificant when using survey data. So, don’t spend any time discussing significance.

What if non-interval/non-ratio? Usually more appropriate to use the other measures of association. Webstats will perform a correlation. Be ready for results to be less strong Program may report (instead of Pearson’s r):  Spearman: ordinal x ordinal  Point-biserial: one interval/ratio, one dichotomous  Phi: two dichotomous variables All interpreted the same way