Chapter 9: Regression Alexander Swan & Rafey Alvi.

Slides:



Advertisements
Similar presentations
Chapter 9. * No regression analysis is complete without a display of the residuals to check the linear model is reasonable. * The residuals are what is.
Advertisements

 Objective: To identify influential points in scatterplots and make sense of bivariate relationships.
Correlation and Linear Regression
Chapter 8 Linear regression
Chapter 8 Linear regression
Extrapolation: Reaching Beyond the Data
Regression Wisdom Chapter 9.
CHAPTER 8: LINEAR REGRESSION
Regression Wisdom.
Chapter 9: Regression Wisdom
Getting to Know Your Scatterplot and Residuals
Chapter 9 Regression Wisdom
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 8, Slide 1 Chapter 8 Regression Wisdom.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
The Practice of Statistics Third Edition Chapter 4: More about Relationships between Two Variables Copyright © 2008 by W. H. Freeman & Company Daniel S.
Copyright © 2010 Pearson Education, Inc. Chapter 9 Regression Wisdom.
Chapter 9 Regression Wisdom
Regression Wisdom.  Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can’t take it for granted.)
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 9 Regression Wisdom.
AP Statistics Chapter 8 & 9 Day 3
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 9 Regression Wisdom.
Linear Regression Chapter 8.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter 3.3 Cautions about Correlations and Regression Wisdom.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Copyright © 2010 Pearson Education, Inc. Chapter 9 Regression Wisdom.
Slide 9-1 Copyright © 2004 Pearson Education, Inc.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Copyright © 2010 Pearson Education, Inc. Slide The lengths of individual shellfish in a population of 10,000 shellfish are approximately normally.
Chapter 8 Linear Regression HOW CAN A MODEL BE CREATED WHICH REPRESENTS THE LINEAR RELATIONSHIP BETWEEN TWO QUANTITATIVE VARIABLES?
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8- 1.
Chapter 9 Regression Wisdom math2200. Sifting residuals for groups Residuals: ‘left over’ after the model How to examine residuals? –Residual plot: residuals.
Chapter 9 Regression Wisdom
Regression Wisdom Chapter 9. Getting the “Bends” Linear regression only works for linear models. (That sounds obvious, but when you fit a regression,
Regression Wisdom. Getting the “Bends”  Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can’t.
Chapter 9 Regression Wisdom. Getting the “Bends” Linear regression only works for data with a linear association. Curved relationships may not be evident.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 9 Regression Wisdom.
Regression Wisdom Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Do Now Examine the two scatterplots and determine the best course of action for other countries to help increase the life expectancies in the countries.
Statistics 9 Regression Wisdom. Getting the “Bends” Linear regression only works for linear models. (That sounds obvious, but when you fit a regression,
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 8, Slide 1 Chapter 8 Regression Wisdom.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 9 Regression Wisdom.
AP Statistics.  Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can’t take it for granted.)
 Understand how to determine a data point is influential  Understand the difference between Extrapolation and Interpolation  Understand that lurking.
CHAPTER 3 Describing Relationships
Chapter 9 Regression Wisdom Copyright © 2010 Pearson Education, Inc.
Cautions about Correlation and Regression
Chapter 8 Regression Wisdom.
Chapter 8 Part 2 Linear Regression
Week 5 Lecture 2 Chapter 8. Regression Wisdom.
CHAPTER 3 Describing Relationships
Review of Chapter 3 Examining Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3.2 Regression Wisdom.
Chapter 9 Regression Wisdom.
Chapter 9 Regression Wisdom.
Honors Statistics Review Chapters 7 & 8
Review of Chapter 3 Examining Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Chapter 9: Regression Alexander Swan & Rafey Alvi

Residuals Grouping ●No regression analysis is complete without a display of the residuals to check that the linear model is reasonable. ●Residuals often reveal subtleties that were not clear from a plot of the original data.

Residuals Grouping ●Sometimes the subtleties we see are additional details that help confirm or refine our understanding. ●Sometimes they reveal violations of the regression conditions that require our attention.

Subsets Some important information: All the data must come from the same group. When we discover that there is more than one group in a regression, neither modeling the groups together nor modeling them apart is correct.

Subsets

Extrapolation Extrapolations are dubious because they require the additional—and very questionable— assumption that nothing about the relationship between x and y changes even at extreme values of x. Extrapolations can get you into deep trouble. You’re better off not making extrapolations.

Outliers Any point that stands away from the others is called an outlier and strongly influences a regression.

Leverage, influential A data point can be unusual if its x-value is far from the mean of the x-values. These kind of points have high leverage. A data point is influential if omitted it will give a very different model.

Lurking Variable There is no way to conclude from a regression alone that a variable causes the other. With observational data, as opposed to data from a designed experiment, there is no way to be sure that a lurking variable is not the cause of any apparent association.

Summary values Scatterplots of summary statistics show less scatter than the baseline data on individuals. Scatterplots of statistics summarized over groups tend to show less variability than if measured with same variable on individuals.

Question 3 Suppose you wanted to predict the trend in marriage age for American women into the early part of this century. a.How could you use this data graphed in Exercise 1 to get a good prediction? Marriage ages in selected years starting in 1990 are listed below. Use all or part of these data to create an appropriate model for predicting the average age at which women will first marry in (10 year intervals): 21.9, 21.6, 21.2, 21.3, 21.5, (5 year intervals): 20.2, 20.2, 20.6, 20.8, 21.1, 22.0, 23.3, 23.9, 24.5 To predict average age you would use the most recent ages, from , which are straight enough for a linear regression model. The linear model used to predict the marriage age would come out to be Age = (Year). The residual plot showed no pattern, but according to the plot the average age of marriage for women would be years old.

Question 3 b.How much faith do you place in this prediction? Explain. I don’t have very much faith in the prediction because the prediction is for a year that is 10 years higher than the highest year we are given. b.Do you think your model would produce an accurate prediction about your grandchildren, say, 50 years from now? Explain. NO! If the prediction from 10 years higher would be unlikely, a prediction 50 years later would not be possible following the trend from

Question 5 In justifying his choice of a model, a student wrote, “I know this is the correct model because R² = 99.4%.” a.Is this reasoning correct? Explain. No, you would need a scattered plot to make this prediction. a.Does this model allow the student to make accurate predictions? Explain. No, the data could possibly be curved.

Vocabulary to know ●The Outlier Condition means two things: nPoints with large residuals or high leverage (especially both) can influence the regression model significantly. ●lurking variable