DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slides:



Advertisements
Similar presentations
Copyright © 2010 Pearson Education, Inc. Slide A least squares regression line was fitted to the weights (in pounds) versus age (in months) of a.
Advertisements

 Objective: To determine whether or not a curved relationship can be salvaged and re-expressed into a linear relationship. If so, complete the re-expression.
Chapter 10: Re-expressing data –Get it straight!
Chapter 10: Re-Expressing Data: Get it Straight
Linear Regression Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Extrapolation: Reaching Beyond the Data
Chapter 10 Re-Expressing data: Get it Straight
Get it Straight!! Chapter 10
Chapter 3 Bivariate Data
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Lesson Quiz: Part I 1. Change 6 4 = 1296 to logarithmic form. log = 4 2. Change log 27 9 = to exponential form = log 100,000 4.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Chapter 10 Re-expressing the data
Re-expressing data CH. 10.
Re-expressing the Data: Get It Straight!
Stat 112: Lecture 15 Notes Finish Chapter 6: –Review on Checking Assumptions (Section ) –Outliers and Influential Points (Section 6.7) Homework.
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Transforming to achieve linearity
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Scatterplots, Associations, and Correlation
1 Chapter 7 Scatterplots, Association, and Correlation.
Chapter 10: Re-Expressing Data: Get it Straight AP Statistics.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter 10 Re-expressing the data
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 10 Re-expressing Data: Get It Straight!. Slide Straight to the Point We cannot use a linear model unless the relationship between the two.
Chapter 10: Re- expressing Data by: Sai Machineni, Hang Ha AP STATISTICS.
Lecture 6 Re-expressing Data: It’s Easier Than You Think.
Copyright © 2010 Pearson Education, Inc. Slide A least squares regression line was fitted to the weights (in pounds) versus age (in months) of a.
Bivariate Data Analysis Bivariate Data analysis 4.
Answers to Part 2 Review Pg #1  % over 50 – r=.69  % under 20 – r=-.71  Full time Fac – r=.09  Gr. On time-r=-.51.
Chapter 8 Linear Regression HOW CAN A MODEL BE CREATED WHICH REPRESENTS THE LINEAR RELATIONSHIP BETWEEN TWO QUANTITATIVE VARIABLES?
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8- 1.
Chapter 7 Scatterplots, Association, and Correlation.
If the scatter is curved, we can straighten it Then use a linear model Types of transformations for x, y, or both: 1.Square 2.Square root 3.Log 4.Negative.
Linear Regression Chapter 8. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King.
Chapter 5 Lesson 5.4 Summarizing Bivariate Data 5.4: Nonlinear Relationships and Transformations.
Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency.
Chapter 9 Regression Wisdom
Module 11 Scatterplots, Association, and Correlation.
Copyright © 2010 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Chapter 10 Notes AP Statistics. Re-expressing Data We cannot use a linear model unless the relationship between the two variables is linear. If the relationship.
Statistics 7 Scatterplots, Association, and Correlation.
MATH 2311 Section 5.4. Residuals Examples: Interpreting the Plots of Residuals The plot of the residual values against the x values can tell us a lot.
Chapter 4 More on Two-Variable Data. Four Corners Play a game of four corners, selecting the corner each time by rolling a die Collect the data in a table.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 7- 1.
Statistics 10 Re-Expressing Data Get it Straight.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
 Understand why re-expressing data is useful  Recognize when the pattern of the data indicates that no re- expression will improve it  Be able to reverse.
Let’s Get It Straight! Re-expressing Data Curvilinear Regression
Chapter 10: Re-Expression of Curved Relationships
Re-expressing the Data: Get It Straight!
Chapter 7: Scatterplots, Association, and Correlation
Chapter 10 Re-Expressing data: Get it Straight
Re-expressing Data: Get it Straight!
Chapter 8 Part 2 Linear Regression
Re-expressing the Data: Get It Straight!
Re-expressing the Data: Get It Straight!
1. Describe the Form and Direction of the Scatterplot.
Re-expressing Data:Get it Straight!
Lecture 6 Re-expressing Data: It’s Easier Than You Think
Chapter 3.2 Regression Wisdom.
Chapter 9 Regression Wisdom.
Re-expressing Data: Get it Straight!
Presentation transcript:

DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions: Answer the following questions: What is a purpose of re-expressing data? What is a purpose of re-expressing data? What can we check to see that the re-expression makes the linear model appropriate? What can we check to see that the re-expression makes the linear model appropriate?

Previous Average Current Average Total Change Failures Thanksgiving EC Points # of Students w/ HW EC 3rd 78%83%5 pts th 74%77%3 pts3995 5th 74%81%7 pts119712

HW CHECK - #7 7. Movie Dramas. a) The units for the slopes of these lines are millions of dollars per minutes of running time. b) The slopes of the regression lines are the same. Dramas and movies from other genres have costs for longer movies that increase at the same rate. c) The regression line for dramas has a lower y-intercept. Regardless of running time, dramas cost about 20 million dollars less than other genres of movies of the same running time.

HW CHECK - #8 8. Movie Ratings. a) The slopes of the regression lines are approximately the same. The costs increase at about the same rate for all genres as the movies get longer. b) Although the costs per minute are about the same, it costs about 20 million dollars less to make an R-rated movies than a movie of the other rating type with the same running time. c) Omitting King Kong would make the slope for the PG-13 movies steeper. We would conclude that the cost per minute of PG-13 movies was greater than the cost per minute of movies with other rating.

HW CHECK - #12 D 1) The point has high leverage and a small residual. 2) The point is not influential. It has the potential to be influential, because its position far from the mean of the explanatory variable gives it high leverage. However, the point is not exerting much influence, because it reinforces the association. 3) If the point were removed, the correlation would become weaker. The point heavily reinforces the association. Removing it would weaken the association. 4) The slope would remain roughly the same, since the point is not influential.

HW CHECK - #16 Suppose that researchers find a moderately strong positive correlation between the amount of time that a child spends playing computer games and the aggressiveness they display at school. 16. What’s the effect? 1) Playing computer games may make kids more violent. 2) Violent kids may like to play computer games. 3) Playing computer games and violence may both be caused by a lurking variable such as the child’s home life or a genetic predisposition to aggressiveness.

Slide STRAIGHT TO THE POINT (CONT.) The relationship between fuel efficiency (in miles per gallon) and weight (in pounds) for late model cars looks fairly linear at first:

Slide STRAIGHT TO THE POINT (CONT.) A look at the residuals plot shows a problem:

CONVERTING UNITS 3 feet = ____ inches 3 feet = ____ inches 30 inches = ____ feet 30 inches = ____ feet 50 yards = ____ feet 50 yards = ____ feet Does changing the units change the meaning of the quantity? Does changing the units change the meaning of the quantity?

Slide STRAIGHT TO THE POINT (CONT.) We can re-express fuel efficiency as gallons per hundred miles (a reciprocal) and eliminate the bend in the original scatterplot:

Slide STRAIGHT TO THE POINT (CONT.) A look at the residuals plot for the new model seems more reasonable:

Slide STRAIGHT TO THE POINT We cannot use a linear model unless the relationship between the two variables is linear. Often re-expression can save the day, straightening bent relationships so that we can fit and use a simple linear model. We cannot use a linear model unless the relationship between the two variables is linear. Often re-expression can save the day, straightening bent relationships so that we can fit and use a simple linear model. Two simple ways to re-express data are with logarithms and reciprocals. Two simple ways to re-express data are with logarithms and reciprocals. Re-expressions can be seen in everyday life— everybody does it. Re-expressions can be seen in everyday life— everybody does it.

Slide WHAT CAN GO WRONG? Beware of multiple modes. Beware of multiple modes. Re-expression cannot pull separate modes together. Re-expression cannot pull separate modes together. Watch out for scatterplots that turn around. Watch out for scatterplots that turn around. Re-expression can straighten many bent relationships, but not those that go up then down, or down then up. Re-expression can straighten many bent relationships, but not those that go up then down, or down then up.

Slide GOALS OF RE-EXPRESSION Goal 1: Make the distribution of a variable (as seen in its histogram, for example) more symmetric.

Slide GOALS OF RE-EXPRESSION (CONT.) Goal 2: Make the spread of several groups (as seen in side-by-side boxplots) more alike, even if their centers differ.

Slide GOALS OF RE-EXPRESSION (CONT.) Goal 3: Make the form of a scatterplot more nearly linear.

Slide GOALS OF RE-EXPRESSION (CONT.) Goal 4: Make the scatter in a scatterplot spread out evenly rather than thickening at one end. This can be seen in the two scatterplots we just saw with Goal 3: This can be seen in the two scatterplots we just saw with Goal 3:

TEXTBOOK #1 Suppose you have fit a linear model to some data and now take a look at the residuals. For each of the following possible residuals plots, tell whether you would try a re- expression and, if so, why.

TEXTBOOK #3 Here is the residual plot for a linear model describing the trend in the number of passengers departing from the Oakland (CA) airport each month since the start of Can you account for the pattern shown here? 2.Would a re-expression help us deal with this pattern? Explain.

TEXTBOOK #7 One of the important factors determining a car’s Fuel Efficiency is its Weight. Let’s examine this relationship again, for 11 cars. Describe the association between these variables shown in the scatterplot.

TEXTBOOK #7 One of the important factors determining a car’s Fuel Efficiency is its Weight. Let’s examine this relationship again, for 11 cars. The linear model for this data is Fuel Efficiency = – 7.65Weight. What does the slope of the line say about this relationship?

TEXTBOOK #7 One of the important factors determining a car’s Fuel Efficiency is its Weight. Let’s examine this relationship again, for 11 cars. The linear model for this data is Fuel Efficiency = – 7.65Weight. Let’s examine the residuals plot for this linear regression. Is this model appropriate?

TEXTBOOK #7 Let’s re-express the variable Fuel Consumption (gal/100 mi) to examine the fuel efficiency of the 11 cars. The revised linear regression is Fuel Efficiency = Weight. Explain why this model appears to be better than the linear model. Interpret the slope of this line.