HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.2.

Slides:



Advertisements
Similar presentations
Section 10-3 Regression.
Advertisements

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.2.
Eight backpackers were asked their age (in years) and the number of days they backpacked on their last backpacking trip. Is there a linear relationship.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Correlation and Regression
Correlation and Regression Analysis
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Chapter 9: Correlation and Regression
Introduction to Linear Regression.  You have seen how to find the equation of a line that connects two points.
Least Squares Regression
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Correlation and Linear Regression
Correlation and Linear Regression
Correlation and Linear Regression Chapter 13 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Graphing Scatter Plots and Finding Trend lines
The Line of Best Fit Linear Regression. Definition - A Line of Best or a trend line is a straight line on a Scatter plot that comes closest to all of.
Linear Regression.
SECTION 2.2 BUILDING LINEAR FUNCTIONS FROM DATA BUILDING LINEAR FUNCTIONS FROM DATA.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Linear Regression and Correlation
Slide Copyright © 2008 Pearson Education, Inc. Chapter 4 Descriptive Methods in Regression and Correlation.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.1.
5-7 Scatter Plots. _______________ plots are graphs that relate two different sets of data by displaying them as ordered pairs. Usually scatter plots.
Relationships between Variables. Two variables are related if they move together in some way Relationship between two variables can be strong, weak or.
Sections 9-1 and 9-2 Overview Correlation. PAIRED DATA Is there a relationship? If so, what is the equation? Use that equation for prediction. In this.
Linear Regression To accompany Hawkes lesson 12.2 Original content by D.R.S.
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.4.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 3.2.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section 12.2 Linear Regression HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.3.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.1.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.1.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.3.
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
Correlation tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition, we would like to.
Scatter Plots, Correlation and Linear Regression.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.1.
S CATTERPLOTS Correlation, Least-Squares Regression, Residuals Get That Program ANSCOMBE CRICKETS GESSEL.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 9.3.
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
Least Squares Regression.   If we have two variables X and Y, we often would like to model the relation as a line  Draw a line through the scatter.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.2.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
Linear Regression Day 1 – (pg )
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
Section 1.3 Scatter Plots and Correlation.  Graph a scatter plot and identify the data correlation.  Use a graphing calculator to find the correlation.
EXCEL DECISION MAKING TOOLS AND CHARTS BASIC FORMULAE - REGRESSION - GOAL SEEK - SOLVER.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.1.
The Practice of Statistics Third Edition Chapter 15: Inference for Regression Copyright © 2008 by W. H. Freeman & Company.
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
Copyright © 2017, 2014 Pearson Education, Inc. Slide 1 Chapter 4 Regression Analysis: Exploring Associations between Variables.
Correlation and Regression
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Section 12.2 Linear Regression
Review and Preview and Correlation
Sections Review.
Note: In this chapter, we only cover sections 10-1 through 10-3
CHAPTER 10 Correlation and Regression (Objectives)
Scatter Plots and Line of Best Fit
Section 10-1 Correlation © 2012 Pearson Education, Inc. All rights reserved.
Correlation and Regression
Presentation transcript:

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.2 Linear Regression With added comments and content by D.R.S., University of Cordele

Linear Regression

The Theory behind it. Least-Squares Regression Line The least-squares regression line is the line for which the average variation from the data is the smallest, also called the line of best fit, given by The Line Of Best Fit The Line Of Best Fit means that the sum of the squares of the vertical distance from each point to the line is at a minimum. Your textbook has the awful formulas to determine the equation of the line. That’s what the calculator uses to come up with its results. (This slide is mostly from Bluman’s 5 th edition, © McGraw Hill)

Living with Inconsistent Notation

Know your equations of lines

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Slope of the Least-Squares Regression Line The slope of the least-squares regression line for paired data from a sample is given by where n is the number of data pairs in the sample, x i is the i th value of the explanatory variable, and y i is the i th value of the response variable. (This frightening formula is presented for shock value and informational purposes only.)

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. y-Intercept of the Least-Squares Regression Line The y-intercept of the least-squares regression line for paired data from a sample is given by where n is the number of data pairs in the sample, x i is the i th value of the explanatory variable, y i is the i th value of the response variable, and b 1 is the slope of the least-squares regression line. (This frightening formula is presented for shock value and informational purposes only.)

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.10: Finding a Least-Squares Regression Line Using a TI-83/84 Plus Calculator The local school board wants to evaluate the relationship between class size and performance on the state achievement test. It decides to collect data from various schools in the district, and the data from a sample of eight classes are shown in the following table. Each pair of data values represents the class size and corresponding average score on the achievement test for one class.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.10: Finding a Least-Squares Regression Line Using a TI-83/84 Plus Calculator (cont.) Class Sizes and Average Test Scores Class Size Average Test Score Determine if there is a significant linear relationship between class size and average test score at the 0.05 level of significance. If the relationship is significant, find the least-squares regression line for these data.

Example Inputs Put x and y values in two lists, as usual. Recall: STAT, TESTS, ALPHA F on 84, E on 83 VARS, Y-VARS, 1, 1 to get The Y 1 into the RegEQ

Example Outputs Do we have a significant linear relationship here? Compare the p-Value to the Level Of Significance α=0.05 If significant, then y = a + bx with these values of a and b is the “Line Of Best Fit”, but you do NOT have to retype them, see next screen! r and r 2 are here as usual.

Example Outputs Because you told LinRegTTest “RegEQ: Y1”, the y = a + bx is assembled for you in the Y= screen as equation Y 1. AWESOME!AWESOME!

Example Outputs You can then set up a STAT PLOT to do the scatter plot. ZOOM 9:ZoomStat plots both the scatter plot and the Line of Best Fit together.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. A prediction should not be made with a regression model if… 1.The data do not fall in a linear pattern when graphed on a scatter plot. 2.The correlation coefficient is not statistically significant. 3.You wish to make a prediction about a value outside the range of the sample data. 4.The population is different than that from which the sample data were drawn.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.11: Making Predictions Using a Least-Squares Regression Line Use the equation of the regression line from the previous example, to predict what the average achievement test score will be for the following class sizes. a.16 b.19 c.25 d.45 But instead of plugging in and computing by hand, use the TI-84 to directly do function notation, as shown here: VARS Y-VARS, 1: Function 1:Y1 ( the x-value ) ENTER

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.11: Making Predictions Using a Least-Squares Regression Line Use the equation of the regression line from the previous example, to predict what the average achievement test score will be for the following class sizes. a.16 b.19 c.25 d.45 But instead of plugging in and computing by hand, use the TI-84 to directly do function notation, as shown here:

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.11: Making Predictions Using a Least-Squares Regression Line (cont.) Solution d.It is not meaningful to predict the value of y for this class size because the value x = 45 is outside the range of the original data. The original data only considered class sizes between 15 and 29, so we should only predict the average achievement test scores for class sizes within this range. They used rounded-off values for slope and y- intercept and got slightly different results compared to what we did with the Y 1 (16), etc.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.12: Finding the Least-Squares Regression Line for a Given Data Set The following table lists data collected on the selling prices of used Land Rover Freelanders and their ages in years. Find a linear regression model for predicting the price of a used Land Rover Freelander based on its age in years, if appropriate at the 0.05 level of significance. Selling Prices and Ages of Used Land Rover Freelanders Age (in Years) Price (in Dollars) 15,50014,99530,79528,99523,99520,90020,50019,99519,88829,995

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.12: Finding the Least-Squares Regression Line for a Given Data Set (cont.) Solution To begin, we must determine which variable is the explanatory variable (x) and which variable is the response variable (y). We want to use the age of a used Freelander to predict its selling price, thus age (x) is the explanatory variable and price (y) is the response variable. Use LinRegTTest again and see if you agree with their results on the following slides.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.12: Finding the Least-Squares Regression Line for a Given Data Set (cont.) Thus, it is appropriate to consider the linear regression model. The slope of the regression line is a = b 1 ≈  and the y-intercept of the regression line is b = b 0 ≈ 34, Therefore the regression line, in the form is as follows.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.13: Making Predictions Using a Linear Regression Model Use the linear regression model from the previous example, to predict the following. a.The selling price of a Land Rover Freelander that is 2.5 years old $ ________________ b.The selling price of a Land Rover Freelander that is 10 years old NO, because ___________________ c.The selling price of a Land Rover Range Rover that is 3 years old NO, because ____________________

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.14: A Linear Regression Model, Start to Finish The following table gives the average monthly temperatures and corresponding monthly precipitation totals for one year in Key West, Florida. Average Temperatures and Precipitation Totals in Key West, Florida Average Temperature (in °F) Precipitation (in Inches) Average Temperature (in °F) Precipitation (in Inches)

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.14: A Linear Regression Model, Start to Finish (cont.) a.Create a scatter plot for the data. Does there appear to be a linear relationship between x and y? b.Calculate the correlation coefficient, r. c.Verify that the correlation coefficient is statistically significant at the 0.05 level of significance. d.Determine the equation of the line of best fit. e.Calculate and interpret the coefficient of determination, r 2. f.If appropriate, predict the monthly precipitation total in Key West for a month in which the average temperature is 80 deg. g.If appropriate, predict the monthly precipitation in Destin, Florida, for a month in which the average temp is 83 degrees.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.14: A Linear Regression Model, Start to Finish (cont.) If r is statistically significant for the variables, a linear regression model would be appropriate.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.14: A Linear Regression Model, Start to Finish (cont.) a.Does there appear to be a linear relationship between x and y? _________ b.Calculate the correlation coefficient, r = _________ c.Using the p value from LinRegTTest, is there a statistically significant relationship? ________ And if so… d. From the calculator, we see that the equation of the line of best fit is ____________________________.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.14: A Linear Regression Model, Start to Finish (cont.) e.The coefficient of determination is approximately _____. This tells us that approximately ____% of the variation in precipitation can be attributed to the linear relationship between temperature and precipitation. The remaining ____% of the variation is from unknown sources. f.Because r is statistically significant, we can use the regression line to make predictions regarding the variables.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 12.14: A Linear Regression Model, Start to Finish (cont.) f. If appropriate, predict the monthly precipitation total in Key West for a month in which the average temperature is 80 degrees. TI-84 Command: _________________________ Result: _________________ (include units of measure) g.If appropriate, predict the monthly precipitation in Destin, Florida, for a month in which the average temperature is 83 degrees.

More TI-84 Notes If you’re starting a new problem and you want to clear out lists L 1 and L 2, here’s an efficient way: STAT 4 2 ND 1 comma 2 ND 2 ENTER

More TI-84 Notes If your LinRegTTest results are no longer on the screen and you have a question that asks for those results, you don’t have to redo the LinRegTTest command. VARS 5:Statistics EQ submenu There you will find 7: r (the ___________ coefficient) 8: r 2 (the coefficient of ________) also RegEQ, a, b, and more.

Excel – Data tab, Data Analysis add-in, Regression

Excel > Data > Data Analysis > Regression A lot of choices to make:

Excel Regression tool with class size and test score example *

*

Some familiar things in the output (yellow) But you have to know where to find them A lot of new and different advanced stuff Some of it’s discussed in Lessons 12.3 and 12.4 Some of it’s from advanced courses.