Correlation and Regression-II QSCI 381 – Lecture 37 (Larson and Farber, Sects 9.2+9.4)
Introduction regression line Having determined that the correlation between two variables is significant, we now need to quantify the relationship by the line that best fits the data. This line is called the and it can be used to predict the value of y for a given value of x. regression line
Fitting a Regression Line-I We refer to the difference between each value of y and the value predicted by the regression line as a . The regression line (or the line of best fit) is chosen so that the sum of squared residuals is minimized. residual
Fitting a Regression Line-II d3 d2 d1 Minimize
Equation of a Regression Line-I Slope = m Intercept=b The equation of a regression line for an independent variable x and a dependent variable y is: where is the predicted y-value for a given x-value.
Equation of a Regression Line-II The slope m and the y-intercept b are estimated using the equations: where is the mean of the y-values and is the mean of the x-values. The regression line always passes through the point .
Example-I Length and weight are related by a power curve: This equation can be transformed to a linear relationship by taking logarithms: We will estimate the values for a and c for blue warehou.
The relationship looks linear after log-transformation Example-II The relationship looks linear after log-transformation
Example-III X Y XY X2 3.13 5.51 17.25 9.79 3.39 6.29 21.33 11.49 3.38 6.39 21.62 11.46 2.71 4.21 11.39 7.34 3.95 7.99 31.54 15.59 3.83 7.62 29.17 14.65 3.44 6.58 22.66 11.84 7.94 31.40 15.63 3.60 7.01 25.20 12.95 3.63 25.45 13.20 35.01 66.54 237.01 123.93
Example IV
Using EXCEL to Estimate a Regression Line-I Data – Data Analysis – Regression
Using EXCEL to Estimate a Regression Line-II The function LINEST implements the regression dialog box. INDEX(LINEST(Y1:Y10,X1:X10),1) – slope INDEX(LINEST(Y1:Y10,X1:X10),2) – intercept. This function includes several other features which we will discover in the next lecture.
Application of Regression Lines Use the regression line from the first example to find the logarithm of the weight of a fish whose length is 50cm.
Multiple Linear Regression-I It is not uncommon to be able to make better predictions using more than one independent (explanatory) variable. A equation has the form: where are the explanatory variables. multiple regression