Covariance and Correlation Questions: What does it mean to say that two variables are associated with one another? How can we mathematically formalize.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Covariance and Correlation
Chapter 3 Examining Relationships
Wednesday AM  Presentation of yesterday’s results  Associations  Correlation  Linear regression  Applications: reliability.
Covariance and Correlation: Estimator/Sample Statistic: Population Parameter: Covariance and correlation measure linear association between two variables,
Linear Statistical Model
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Designing Experiments In designing experiments we: Manipulate the independent.
Calculating & Reporting Healthcare Statistics
Lecture 11 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Basic Statistical Concepts
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Perfect Negative Correlation Perfect Positive Correlation Non-Existent Correlation Imperfect Negative Correlation Imperfect Positive Correlation.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Chapter 8: Bivariate Regression and Correlation
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 3 Correlation and Prediction.
Lecture 3-2 Summarizing Relationships among variables ©
Answering Descriptive Questions in Multivariate Research When we are studying more than one variable, we are typically asking one (or more) of the following.
Copyright © Cengage Learning. All rights reserved. 1 Functions and Their Graphs.
Chapter 5 Z-Scores. Review ► We have finished the basic elements of descriptive statistics. ► Now we will begin to develop the concepts and skills that.
Correlation and regression 1: Correlation Coefficient
Correlation. Correlation  Is a statistical procedure that estimates the linear relationship between two or more variables.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Basic linear regression and multiple regression Psych Fraley.
Descriptive Statistics Descriptive Statistics describe a set of data.
Linear Functions 2 Sociology 5811 Lecture 18 Copyright © 2004 by Evan Schofer Do not copy or distribute without permission.
Wednesday, October 12 Correlation and Linear Regression.
Stats/Methods I JEOPARDY. Jeopardy CorrelationRegressionZ-ScoresProbabilitySurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Correlation and Regression PS397 Testing and Measurement January 16, 2007 Thanh-Thanh Tieu.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Research & Statistics Looking for Conclusions. Statistics Mathematics is used to organize, summarize, and interpret mathematical data 2 types of statistics.
MEASURES of CORRELATION. CORRELATION basically the test of measurement. Means that two variables tend to vary together The presence of one indicates the.
POSC 202A: Lecture 12/10 Announcements: “Lab” Tomorrow; Final ed out tomorrow or Friday. I will make it due Wed, 5pm. Aren’t I tender? Lecture: Substantive.
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Chapter 20 Linear Regression. What if… We believe that an important relation between two measures exists? For example, we ask 5 people about their salary.
Correlation Analysis. Correlation Analysis: Introduction Management questions frequently revolve around the study of relationships between two or more.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
“Life is a series of samples. You can infer the truth from the samples, but you never see the truth.” --Kenji, 2010 Educ 200C Friday, October 5, 2012.
Descriptive Statistics Descriptive Statistics describe a set of data.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Individual Differences & Correlations Psy 425 Tests & Measurements Furr & Bacharach Ch 3, Part 1.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 22.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Lecture 29 Dr. MUMTAZ AHMED MTH 161: Introduction To Statistics.
You can calculate: Central tendency Variability You could graph the data.
Psychology 202a Advanced Psychological Statistics October 22, 2015.
Outline of Today’s Discussion 1.Introduction to Correlation 2.An Alternative Formula for the Correlation Coefficient 3.Coefficient of Determination.
Correlations in Personality Research Many research questions that are addressed in personality psychology are concerned with the relationship between two.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Psychology’s Statistics Appendix. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Linear Regression 1 Sociology 5811 Lecture 19 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Theme 5. Association 1. Introduction. 2. Bivariate tables and graphs.
Statistical analysis.
Covariance and Correlation
Statistical analysis.
Remember No Class on Wednesday No Class on Friday.
Covariance and Correlation
You can calculate: Central tendency Variability You could graph the data.
Introduction to bivariate data
The Weather Turbulence
Presentation transcript:

Covariance and Correlation Questions: What does it mean to say that two variables are associated with one another? How can we mathematically formalize the concept of association?

The Concept of Bivariate Association Up to this point, we have focused on single variables, and describing their shape, their central tendency, and their dispersion. –What is the infant homicide rate? Now that we’ve covered some of these basics, we’re ready to discuss one of the fundamental kinds of questions asked in psychology: How do two variables relate to one another? –Is there an association between the infant homicide rate of a nation and the degree to which teachers of that nation endorse corporal punishment?

The Concept of Bivariate Association The more a nation’s teachers approve of corporal punishment, the higher its infant homicide rate from Straus, M. A. (1994). Beating the devil out of them: Corporal punishment in American families. San Francisco, CA: Jossey-Bass. scatterplot

Other possibilities could have existed...

The Concept of Bivariate Association Question: How can we quantify the association between two variables?

x y [A] [B] [C] [D] [E] [F] How do people’s scores on one variable vary as a function of another variable?

x y [A] [B] [C] [D] [E] [F] People with high scores on x seem to have high scores on y Can we define what we mean by “high scores” more precisely?

xd yd [A] [B] [C] [D] [E] [F] yes. we can study deviations from the mean (X – M x ) and (Y – M y ) now we can ask whether people who are above the mean (i.e., “high” on x) are above the mean on y

xd yd [A] both below [B] both below [C] both above [D] both below [E] both above [F] both above One way to do this is to tally the matches. People who are above the mean on X should be above the mean on Y. People who are below the mean on X should be below the mean on Y. 100% match

If we resort some of the numbers, note what happens. Now E, C, B, & D show the same pattern on the two variables, but persons A & F do not. 4/6 (66%) show the matching pattern.

One limitation of counting the number of matches is that there are clearly different magnitudes of association that would count as perfect matches.

xd yd (xd*yd) [A] [B] [C] [D] [E] [E] A more precise way to study the association is to multiply each person’s deviations together. Advantage: when there is a match (both + or both -), the product will be +. When there is a mismatch (one + and other -), the product will be -.

xd yd (xd*yd) [A] [B] [C] [D] [E] [E] Further, we can now inquire about the average product of deviation scores. The average of these products will tell us whether the typical person has the same signed deviation score on the two variables.

Covariance This particular way of quantifying the association is called the covariance. In short, we are seeking to determine the correspondence between the average person’s deviation scores on two variables—the extent to which those deviation scores vary together (i.e., covary).

Covariance When this average product is positive, we say the two variables covary positively: people who are high on one variable tend to be high on the other When this average product is negative, we say the two variables negatively covary together: people who are high on one variable tend to be low on the other When this average product is zero, we say the two variables do not covary together. People who are high on one variable are just as likely to be high on the other as they are to be low on the other.

These two variables positively covary People who drink a lot of coffee tend to be happy, and people who do not tend to be unhappy Preview: The line is called a regression line, and represents the estimated linear relationship between the two variables. Notice that the slope of the line is positive in this example.

In this example, the two variables covary negatively People high on x tend to be low on y The regression line has a negative slope

In this example, there is no covariance between the two variables People who are high on x are just as likely to be high on y as they are low on y The regression line is flat