MATH 2400 Chapter 5 Notes. Regression Line Uses data to create a linear equation in the form y = ax + b where “a” is the slope of the line (unit rate.

Slides:



Advertisements
Similar presentations
Regression BPS chapter 5 © 2006 W.H. Freeman and Company.
Advertisements

Chapter 4 The Relation between Two Variables
Agresti/Franklin Statistics, 1 of 52 Chapter 3 Association: Contingency, Correlation, and Regression Learn …. How to examine links between two variables.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
CHAPTER 3 Describing Relationships
Chapter 3 Association: Contingency, Correlation, and Regression
CHAPTER 3 Describing Relationships
Ch 2 and 9.1 Relationships Between 2 Variables
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Chapter 2 Looking at Data - Relationships. Relations Among Variables Response variable - Outcome measurement (or characteristic) of a study. Also called:
Regression BPS chapter 5 © 2010 W.H. Freeman and Company.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Lesson Correlation and Regression Wisdom. Knowledge Objectives Recall the three limitations on the use of correlation and regression. Explain what.
1 Association  Variables –Response – an outcome variable whose values exhibit variability. –Explanatory – a variable that we use to try to explain the.
Lecture 5 Chapter 4. Relationships: Regression Student version.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
SWBAT: Calculate and interpret the residual plot for a line of regression Do Now: Do heavier cars really use more gasoline? In the following data set,
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.2 Least-Squares Regression.
Linear Regression Day 1 – (pg )
Business Statistics for Managerial Decision Making
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
SWBAT: Calculate and interpret the equation of the least-squares regression line Do Now: If data set A of (x, y) data has correlation r = 0.65, and a second.
CHAPTER 3 Describing Relationships
Chapter 5 Sections 1-3.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Chapter 3: Describing Relationships
Cautions about Correlation and Regression
Least-Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

MATH 2400 Chapter 5 Notes

Regression Line Uses data to create a linear equation in the form y = ax + b where “a” is the slope of the line (unit rate of change) “b” is the y-intercept (initial value) Can be used generalize a set of data, to estimate a value, or predict a value.

Example 1 (Exercise 5.1) We expect a car’s highway gas mileage to be related to its city gas mileage. Data for all 1040 vehicles in the government’s 2010 Fuel Economy Guide give the regression line HWY MPG = (1.016 x CITY MPG) for predicting highway mileage from city mileage. a)What is the slope of this line? Say in words what the numerical value of the slope tells you. b)What is the intercept? Explain why the value of the intercept is not statistically meaningful. c)Find the predicted highway mileage for a car that gets 16 mpg in the city. Do the same for a car with city mileage 28 mpg.

Example 2 (Exercise 5.2…sort of) You use the same bottle of body wash every day. The volume was initially 355 ml. You estimate you use 7 ml of body wash each day. What is the equation of the regression line for predicting the volume of body wash left in the bottle after each day?

Least-Square Regression Line

Example 3 This table displays the data regarding 8 U.S airports and their total number of passengers for the year 1992 and Use the 1992 data as the explanatory variable and the 2005 data as the response variable. Create a least-squares regression line and use that line to estimate how many passengers Raleigh- Durham International had in 2005 if the airport had 4.9 million passengers in1992.

r and r 2 r tells us if there is a positive or negative relationship between the explanatory variable and the response variable. r also tells us how strong of a relationship the variables have. r 2 tells us what portion of the linear relationship between the variables can be explained by the explanatory variable. 1 – r 2 tells us what portion of the linear relationship between the variables can not be explained by the explanatory variable. Ex: If r = 0.6,  r 2 = % of the linear relationship can be explained by the explanatory variable and 64% cannot be explained. Ex: If r = -1,  r 2 = % of the linear relationship can be explained by the explanatory variable and 0% cannot be explained.

Example 4

Example 5

Residuals A residual is the difference between an observed value of the response variable and the value predicted by the regression line. That is, a residual is the prediction error that remains after we have chosen the regression line: Residual = observed y – predicted y = y -

Residuals…continued A residual plot makes it easier to see unusual observations and patterns. The regression line is horizontal (think about it…).

Residual Graphing Use the following data to create a least-squares regression line and plot the residuals on the graph provided. AGEHEIGHT

CAUTION!!! Correlation and regression lines describe only linear relationships. Correlation and least-squares regression lines are not resistant to influential data (data drastically outside the norm). We should always plot our data and look for observations that might be influential. Ecological Correlation is based on averages rather than on individuals. Ex: There is a large positive correlation between average income and number of years of education. The correlation is smaller if we compare the incomes of individuals with number of years of education. The correlation based on average income ignores the large variation in the incomes of individuals having the same amount of education.

CAUTION!!! Extrapolation is the use of a regression line for prediction far outside the range of values of the explanatory variable that you used to obtain the line. Ex: Using the least-squares regression line for the height of the child from ages 0-9 to predict their height at age 30. Lurking Variables should always be thought about before drawing conclusions based on correlation or regression.

Correlation  Causation??? NO!!! A serious study once found that people with two cars live longer than people who own only one car. Owning three cars is even better, and so on. There is a substantial positive correlation between number of cars x and length of life y. Lurking variables?

HW 5.17

HW 5.25

HW 5.27

HW 5.29

HW 5.53