Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Bivariate Analyses.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Chapter 4 The Relation between Two Variables
IB Math Studies – Topic 6 Statistics.
Describing Relationships Using Correlation and Regression
Overview Correlation Regression -Definition
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Unobtrusive Research 1.Content analysis - examine written documents such as editorials. 2.Analyses of existing statistics. 3.Historical/comparative analysis.
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
Linear Regression and Correlation
Statistics Psych 231: Research Methods in Psychology.
SIMPLE LINEAR REGRESSION
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
SIMPLE LINEAR REGRESSION
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
Crash Course in Correlation and Regression MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central.
Correlation and Regression Analysis
Relationships Among Variables
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
T-tests and ANOVA Statistical analysis of group differences.
Correlation & Regression
Linear Regression Modeling with Data. The BIG Question Did you prepare for today? If you did, mark yes and estimate the amount of time you spent preparing.
Lecture 16 Correlation and Coefficient of Correlation
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Correlation.
Chapter 15 Correlation and Regression
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Basic Statistics Correlation Var Relationships Associations.
Chapter 10 Correlation and Regression
When trying to explain some of the patterns you have observed in your species and community data, it sometimes helps to have a look at relationships between.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Chapter 14 Correlation and Regression
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
CORRELATION ANALYSIS.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Chapter 15 Association Between Variables Measured at the Interval-Ratio Level.
Pearson’s Correlation The Pearson correlation coefficient is the most widely used for summarizing the relation ship between two variables that have a straight.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Statistical analysis.
Regression and Correlation
Statistical analysis.
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Regression
CORRELATION ANALYSIS.
BA 275 Quantitative Business Methods
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Warsaw Summer School 2017, OSU Study Abroad Program
Presentation transcript:

Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise. Scientists spend most of their time figuring out how one thing relates to another and structuring these relationships into explanatory theories.

Scatterplots A. scatter diagram A list of 1,078 pairs of numbers would be impossible to grasp. [so we need some method that can examine this data and convert it into a more conceivable format]. One method is plotting the data for the two variables (e.g., father's height and son's height; father’s years of education and son’s years) in a graph called a scatter diagram.

B. The Correlation Coefficient This scatter plot looks like a cloud of points which visually can give us a nice representation and a gut feeling on the strength of the relationship, and is especially useful for examining outliners or data anomalies, but statistics isn't too fond of simply providing a gut feeling. Statistics is interested in the summary and interpretation of masses of numerical data - so we need to summarize this relationship numerically. How do we do that - yes, with a correlation coefficient. The correlation coefficient ranges from +1 to -1

r = 1.0

r =.85

r =.42

R =.17

R = -.94

R = -.54

R = -.33

Computing the Pearson's r correlation coefficient Definitional formula is: Convert each variable to standard units (zscores). The average of the products give the correlation coefficient. But this formula requires you to calculate z-scores for each observation, which means you have to calculate the standard deviation of X and Y before you can get started. For example, look what you have to do for only 5 cases.

Dividing the Sum of ZxZy (2.50) by N (5) get you the correlation coefficient =.50

Therefore through some algebraic magic we get the computational formula, which is a bit more manageable.

Interpreting correlation coefficients Strong Association versus Weak Association: strong: knowing one helps a lot in predicting the other. Weak, information about one variables does not help much in guessing the other. 0 = none;.25 weak;.5 moderate;.75 < strong Index of Association R-squared defined as the proportion of the variance of one variable accounted for by another variable a.k.a PRE STATISTIC (Proportionate Reduction of Error))

Significance of the correlation Null hypothesis? Formula: Then look to Table C in Appendix B Or just look at Table F in Appendix B

Limitations of Pearson's r 1) at best, one must speak of "strong" and "weak," "some" and "none"-- precisely the vagueness statistical work is meant to cure. 2) Assumes Interval level data: Variables measured at different levels require that different statistics be used to test for association.

3) Outliers and nonlinearity The correlation coefficient does not always give a true indication of the clustering. There are two main exceptional cases: Outliers and nonlinearity. r =.457r =.336

4. Assumes a linear relationship

4) Christopher Achen in 1977 argues (and shows empirically) that two correlations can differ because the variance in the samples differ, not because the underlying relationship has changed. Solution? Regression analysis!!!!!!!!