Download presentation
Presentation is loading. Please wait.
1
CHAPTER 10 Correlation and Regression (Objectives)
Draw a scatter plot for a set of ordered pairs. Compute the correlation coefficient. Test the hypothesis H0: 0. (Will be done later) Compute the equation of the regression line. 11/10/2018 © Kasturiarachi
2
Statistical Methods Correlation is a statistical method used to determine whether a linear relationship between variables exists. Regression is a statistical method used to describe the nature of the relationship between variables—that is, positive or negative, linear or nonlinear. 11/10/2018 © Kasturiarachi
3
Statistical Questions
Are two or more variables related? If so, what is the strength of the relationship? What type or relationship exists? What kind of predictions can be made from the relationship? A correlation coefficient is a measure of how variables are related. In a simple relationship, there are only two types of variables under study. 11/10/2018 © Kasturiarachi
4
Scatter Plots A scatter plot is a graph of the ordered pairs (x,y) of numbers consisting of the independent variable, x, and the dependent variable, y. A scatter plot is a visual way to describe the nature of the relationship between the independent and dependent variables. 11/10/2018 © Kasturiarachi
5
(10.2): Correlation Coefficient
The correlation coefficient computed from the sample data measures the strength and direction of a linear relationship between two variables. The symbol for the sample correlation coefficient is r. The symbol for the population correlation coefficient is . 11/10/2018 © Kasturiarachi
6
Correlation Coefficient (cont’d.)
The range of the correlation coefficient is from 1 to 1. If there is a strong positive linear relationship between the variables, the value of r will be close to 1. If there is a strong negative linear relationship between the variables, the value of r will be close to 1. 11/10/2018 © Kasturiarachi
7
Correlation Coefficient (cont’d.)
When there is no linear relationship between the variables or only a weak relationship, the value of r will be close to 0. No linear relationship 1 1 Strong negative linear relationship Strong positive linear relationship 11/10/2018 © Kasturiarachi
8
Formula for the Correlation Coefficient r
where n is the number of data pairs. 11/10/2018 © Kasturiarachi
9
Possible Relationships Between Variables
There is a direct cause-and-effect relationship between the variables: that is, x causes y. There is a reverse cause-and-effect relationship between the variables: that is, y causes x. The relationship between the variable may be caused by a third variable: that is, y may appear to cause x but in reality z causes x. 11/10/2018 © Kasturiarachi
10
(10.3): Regression Line If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is the data’s line of best fit. Best fit means that the sum of the squares of the vertical distance from each point to the line is at a minimum. 11/10/2018 © Kasturiarachi
11
Scatter Plot with Three Lines
11/10/2018 © Kasturiarachi
12
A Linear Relation 11/10/2018 © Kasturiarachi
13
Equation of a Line In algebra, the equation of a line is usually given as , where m is the slope of the line and b is the y intercept. In statistics, the equation of the regression line is written as , where b is the slope of the line and a is the y' intercept. 11/10/2018 © Kasturiarachi
14
Regression Line Formulas for the regression line :
where a is the y' intercept and b is the slope of the line. 11/10/2018 © Kasturiarachi
15
Regression Line (Easy Formula)
The formula for the regression line: where slope and intercept 11/10/2018 © Kasturiarachi
16
Rounding Rule When calculating the values of a and b, round to three decimal places. 11/10/2018 © Kasturiarachi
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.