Download presentation
Presentation is loading. Please wait.
Published byRobyn Freeman Modified over 9 years ago
1
Correlation with a Non - Linear Emphasis Day 2
2
Correlation measures the strength of the linear association between 2 quantitative variables. Before you use correlation, you must check several conditions:
3
Quantitative Variables Condition: Are both variables quantitative? Straight Enough Condition: Is the form of the scatterplot straight enough that a linear relationship makes sense? If the relationship is not linear, the correlation will be misleading. Outlier Condition: Outliers can distort the correlation dramatically. If an outlier is present it is often good to report the correlation with and without that point.
4
A hidden variable that stands behind a relationship and determines it by simultaneously affecting the other two variables is called a lurking (confounding) variable. Scatterplots and correlation coefficients NEVER prove causation.
5
Don’t ever assume the relationship is linear just because the correlation coefficient is high. In order to determine whether a relationship is linear or not linear, we must always look at the residual plot.
6
Residuals A residual is the vertical distance between a data point and the graph of a regression equation.
7
The Residual is positive if the data point is above the graph. negative if the data point is below the graph. Is 0 only when the graph passes through the data point.
8
What should you look for to tell if it is not linear?...... Sometimes a high “r” value for linear regression is deceptive. You must look at the scatter plot AND you must look at the residual pattern it makes. If the residuals have a curved pattern then it is NOT linear.
9
To prove linearity A scatterplot of the residuals vs. the x- values should be the most boring scatterplot you’ve ever seen. It shouldn’t have any interesting features, like a direction or a shape. It should stretch horizontally, with about the same amount of scatter throughout. It should show no bends. It should show no outliers.
10
Some Non Linear Regression Shapes…… Positive Quadratic Regression: Negative Quadratic Regression:
11
More Non Linear Regression Shapes…… Positive Exponential Regression: Negative Exponential Regression:
12
Quadratic and Exponential on GDC…… Quadratic: Exponential:
13
Example……The scatter plot could possibly be linear. You must check the residual pattern. xy 516.3 109.7 158.1 204.2 451.9 253.4 601.3
14
Change y-list to resid after running a linear correlation regression – 2 nd stat resid: Notice the curved pattern in the residuals.
15
NOTE!!!!!! Just because the curved pattern on the residuals looks like a quadratic we cannot determine that until we check the “r” value of other curved functions and see how well the data fits. You should also consider “real-life” implications when deciding.
16
When you see that the residuals are curved you must check the correlation coefficient for the exponential and the quadratic to choose the stronger correlation. A check on the exponential regression yield an r – value of -0.956. (Strong Negative but check out the quadratic….)
17
This is a quadratic regression….. Equation: y=.00946x² - 0.839x+18.5 r = 0.966 This value is even stronger than the exponential.
18
Example 2……Is it linear? xy 01 -30.125 -40.0625 38 416 532
19
Look at the residuals…… There is a curved pattern in the residuals. It is NOT linear – it is either quadratic or exponential. (Positive) Use the “r” value to help you decide.
20
And the Winner is….. Here is the equation you should use for predictions: y = 1(2) x
21
Homework Follow the flowchart.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.