Correlation: A statistic to describe the relationship between variables Hours Worked Pay Hours Worked Pay Hours Worked Pay.

Slides:



Advertisements
Similar presentations
Rubric Unit Plan Univariate Bivariate Examples Resources Curriculum Statistically Thinking A study of univariate and bivariate statistics.
Advertisements

Wednesday AM  Presentation of yesterday’s results  Associations  Correlation  Linear regression  Applications: reliability.
Correlation and Linear Regression.
Describing Relationships Using Correlation and Regression
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Correlation Correlation is the relationship between two quantitative variables. Correlation coefficient (r) measures the strength of the linear relationship.
Elementary Statistics Larson Farber 9 Correlation and Regression.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
CJ 526 Statistical Analysis in Criminal Justice
Chapter 13 Conducting & Reading Research Baumgartner et al Data Analysis.
Lecture 11 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Lecture 4: Correlation and Regression Laura McAvinue School of Psychology Trinity College Dublin.
Chapter Eighteen MEASURES OF ASSOCIATION
PSY 307 – Statistics for the Behavioral Sciences
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Measures of Association Deepak Khazanchi Chapter 18.
Lecture 17: Correlations – Describing Relationships Between Two Variables 2011, 11, 22.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Aim: How do we calculate and interpret correlation coefficients with SPSS? SPSS Assignment Due Friday 2/12/10.
Statistics for the Behavioral Sciences (5th ed.) Gravetter & Wallnau
Correlation & Regression Math 137 Fresno State Burger.
Chapter 21 Correlation. Correlation A measure of the strength of a linear relationship Although there are at least 6 methods for measuring correlation,
Correlation vs. Causation What is the difference?.
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Analyzing Data: Bivariate Relationships Chapter 7.
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Chapter 3 Statistical Concepts.
Correlation – PMCC Monday 18 th March 2013 Learning objective: To be confident finding the Product Moment Correlation Coefficient and using it to interpret.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Correlational Problems and Fallacies James H. Steiger.
Scatterplots October 14, Warm-Up Given the following domain and range in set notation, write the equivalent domain and range in algebraic notation.
Regression and Correlation. Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends.
M22- Regression & Correlation 1  Department of ISM, University of Alabama, Lesson Objectives  Know what the equation of a straight line is,
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Basic Statistics Correlation Var Relationships Associations.
B AD 6243: Applied Univariate Statistics Correlation Professor Laku Chidambaram Price College of Business University of Oklahoma.
When trying to explain some of the patterns you have observed in your species and community data, it sometimes helps to have a look at relationships between.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Chapter Bivariate Data (x,y) data pairs Plotted with Scatter plots x = explanatory variable; y = response Bivariate Normal Distribution – for.
Correlation. Correlation Analysis Correlations tell us to the degree that two variables are similar or associated with each other. It is a measure of.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Section 5.1: Correlation. Correlation Coefficient A quantitative assessment of the strength of a relationship between the x and y values in a set of (x,y)
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Math 4030 – 13a Correlation & Regression. Correlation (Sec. 11.6):  Two random variables, X and Y, both continuous numerical;  Correlation exists when.
Appendix B: Statistical Methods. Statistical Methods: Graphing Data Frequency distribution Histogram Frequency polygon.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
You can calculate: Central tendency Variability You could graph the data.
Scatter Plot A scatter plot is a graph of a collection of ordered pairs (x,y). The ordered pairs are not connected The graph looks like a bunch of dots,
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Correlation & Linear Regression Using a TI-Nspire.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Chapter 2 Bivariate Data Scatterplots.   A scatterplot, which gives a visual display of the relationship between two variables.   In analysing the.
Math 4030 – 12a Correlation.
Understanding Research Results: Description and Correlation
Ch. 11: Quantifying and Interpreting Relationships Among Variables
You can calculate: Central tendency Variability You could graph the data.
Correlation vs. Causation
Introduction to Probability and Statistics Thirteenth Edition
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Correlation and Prediction
Presentation transcript:

Correlation: A statistic to describe the relationship between variables Hours Worked Pay Hours Worked Pay Hours Worked Pay

Univariate vs. Bivariate Statistics Bivariate analyses/graphical representations Scatterplots Correlation: Univariate analyses/ graphical representations: Frequency histograms Measures of central tendency and variability Z-scores linear pattern of relationship between one variable (x) and another variable (y) – an association between two variables relative position of one variable correlates with relative distribution of another variable How can we define correlation?

Correlations allow us to look for evidence of a relationship between variables.

Correlations can vary in strength

Correlations can vary in direction So, how do we QUANTIFY a correlation? We need to come up with a NUMBER that reflects both the strength and direction of the correlation. flu shots given

Correlation finds the strength and direction of the best fitting line to the data.  XY - (  X) (  Y) n r =  X 2 - (  X) 2  Y 2 - (  Y) 2 n n [ [ ] ] The number we calculate in Statistics is called the correlation coefficient. Developed by Karl Pearson, it is also sometimes referred to as Pearson’s r.

Example Calculation: the following data represent the number of emergency room visits per year (x) and cigarettes smoked a day (y) by three individuals recruited from New York Methodist Hospital.  = 0.94 x237x237 y456y456 x y xy (12) (15) 3 62 (12) (15) √[(14)(2)] =

Another way to think of the correlation The product of the Z-scores for each pair of scores r = (  Z x Z y ) /( n-1) x237x237 y456y456 Zx Zy 0 1 If x=2, (2-4)/2.65 = -.76 … If y=4, (4-5)/1 = -1

Another way to think of the correlation The product of the Z-scores for each pair of scores r = (  Z x Z y ) /( n-1) x237x237 y456y456 Zx Zy 0 1 ZxZy  =.945 =.95

Interpreting the Pearson r * Range of values: Interpreting the value of r -1.0 to +1.0 * Direction from the sign negative => anticorrelated As one variable goes up the other goes down in value. positive => correlatedAs on variable goes up so does the other. * Strength from the magnitude | r | = 1.0perfect relationship | r | = 0.0no evidence of relationship 0.0 < | r | < 1.0intermediate strength relationship

When NOT to use a correlation: Extreme scores r =.97 Non-linear relationships r =.20

Some Issues with Correlation NO CAUSATION! Spurious correlation

Number of people who drowned in a swimming pool & number of Nicholas Cage films in a given year =.67 Per capita consumption of cheese & number of deaths by becoming tangled in bed sheets =.95 Divorce rate in Maine & consumption of margarine in the United States =.99

Preview of Next Lecture: Regression finding the best fitting line to a data set.