So how do we know what type of re-expression to use?

Slides:



Advertisements
Similar presentations
Straightening a curved scatterplot
Advertisements

Copyright © 2010 Pearson Education, Inc. Slide A least squares regression line was fitted to the weights (in pounds) versus age (in months) of a.
Re-Expressing Data Get it Straight!. Page 192, #2, 4, 15, 19, 22 Residuals Pg 193, # 11, 23, 27, 33, 45 Pg 195, 16, 22, 23,25,37 Regression Wisdom Pg.
 Objective: To determine whether or not a curved relationship can be salvaged and re-expressed into a linear relationship. If so, complete the re-expression.
Chapter 10: Re-expressing data –Get it straight!
Polynomial Regression and Transformations STA 671 Summer 2008.
Chapter 10 Re-Expressing data: Get it Straight
Get it Straight!! Chapter 10
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Chapter 10 Re-expressing the data
Re-expressing data CH. 10.
Re-expressing the Data: Get It Straight!
Chapter 10 Re-expressing Data: Get it Straight!!
Transforming to achieve linearity
Chapter 4: More on Two-Variable (Bivariate) Data.
Statistics Review Chapter 10. Important Ideas In this chapter, we have leaned how to re- express the data and why it is needed.
M25- Growth & Transformations 1  Department of ISM, University of Alabama, Lesson Objectives: Recognize exponential growth or decay. Use log(Y.
“Numbers are like people… torture them long enough and they’ll tell you anything...” Linear transformation.
Chapter 10 Re-expressing the data
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 10 Re-expressing Data: Get It Straight!. Slide Straight to the Point We cannot use a linear model unless the relationship between the two.
Chapter 10: Re- expressing Data by: Sai Machineni, Hang Ha AP STATISTICS.
Lecture 6 Re-expressing Data: It’s Easier Than You Think.
Copyright © 2010 Pearson Education, Inc. Slide A least squares regression line was fitted to the weights (in pounds) versus age (in months) of a.
Bivariate Data Analysis Bivariate Data analysis 4.
Chapter 8 Linear Regression HOW CAN A MODEL BE CREATED WHICH REPRESENTS THE LINEAR RELATIONSHIP BETWEEN TWO QUANTITATIVE VARIABLES?
Reexpressing Data. Re-express data – is that cheating? Not at all. Sometimes data that may look linear at first is actually not linear at all. Straight.
Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
If the scatter is curved, we can straighten it Then use a linear model Types of transformations for x, y, or both: 1.Square 2.Square root 3.Log 4.Negative.
Chapter 5 Lesson 5.4 Summarizing Bivariate Data 5.4: Nonlinear Relationships and Transformations.
Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency.
Copyright © 2010 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Chapter 4 More on Two-Variable Data. Four Corners Play a game of four corners, selecting the corner each time by rolling a die Collect the data in a table.
Statistics 10 Re-Expressing Data Get it Straight.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
10.2 Logarithms and Logarithmic functions. Graphs of a Logarithmic function verse Exponential functions In RED y = 10 x.
 Understand why re-expressing data is useful  Recognize when the pattern of the data indicates that no re- expression will improve it  Be able to reverse.
Statistical Data Analysis - Lecture /04/03
BAE 5333 Applied Water Resources Statistics
Chapter 10: Re-expressing Data (Get it Straight)
WARM UP: Use the Reciprocal Model to predict the y for an x = 8
Let’s Get It Straight! Re-expressing Data Curvilinear Regression
Transforming to Achieve Linearity (C10 BVD)
Chapter 10: Re-Expression of Curved Relationships
Transforming Relationships
Ch. 10 – Scatterplots, Association and Correlation (Day 1)
Re-expressing the Data: Get It Straight!
Active Learning Lecture Slides
Chapter 7: Scatterplots, Association, and Correlation
Chapter 7 Linear Regression.
Bell Ringer Make a scatterplot for the following data.
Chapter 10 Re-Expressing data: Get it Straight
Re-expressing Data: Get it Straight!
Chapter 8 Part 2 Linear Regression
Re-expressing the Data: Get It Straight!
Re-expressing the Data: Get It Straight!
Graphs in Physics.
Re-expressing Data:Get it Straight!
CHAPTER 12 More About Regression
Chapter 10 Re-expression Day 1.
Transforming Relationships
Solving Linear Equations
Lecture 6 Re-expressing Data: It’s Easier Than You Think
CHAPTER 12 More About Regression
Chapters Important Concepts and Terms
CHAPTER 12 More About Regression
Re-expressing Data: Get it Straight!
Presentation transcript:

So how do we know what type of re-expression to use?

The Ladder of Powers There is a family of simple re-expressions that move data toward our goals in a consistent way. This collection of re-expressions is called the Ladder of Powers. The Ladder of Powers orders the effects that the re-expressions have on data. This can help you figure out where to start, and from there you can find a good re-expression.

The Ladder of Powers (Pg 237)

The Ladder of Powers: The Ladder of Powers orders the effect that the re-expression has on your data. Example: If the square root doesn’t straighten it enough, try to move down further on the ladder (use the logarithm or reciprocal root). Those re-expressions will have a stronger effect on your data. Don’t forget that when you take a negative power, the direction of the graph will change, but that is OKAY. You will just analyze the graph in a different context.

How to Re-Express Data Observe Flow Chart: “How to Find the Best Linear Model”

Just Checking Box Pg 237 Solutions: Start with the square root. Counts are often best transformed by using the square root. Don’t do anything. It is already straight. Even though these are counts, I would use the logarithm for this one. This is because populations grow exponentially, so something stronger would be better.

Now back to these graphs: Let’s start the process of re-expressing. Heart rate is in beats/minute. Let’s try taking the negative reciprocal re-expression and use minutes/beat. Let’s look at the re-expressed data…

Re-Expressed:

Were the re-expressions successful? The scatterplot is more straight, but may be slightly concave down. The histogram is slightly skewed to the left. Most of the boxplots have no outliers. These ones look much better than the ones with the raw data.

Were the re-expressions successful? Overall, it looks like it may have moved a bit too far. If this happens, try a re-expression that is lower on the ladder. Now, let’s try something halfway between 1 and -1, like the logarithm.

Were the re-expressions successful?

Were the re-expressions successful? The data is now more linear and the histogram is symmetric. The boxplots are still a little skewed to the right, but not as much as the original. Never expect real data to cooperate perfectly. This may be the best one that we get.

Classwork: Calculator: Re-Expressing Data to Achieve Linearity

Homework: Pg 248 – 252, Exercises 8, 10, 12, 15, 17, 24 Read Chapter 9 Complete the Guided Reading

Plan B: Attack of the Logarithms The ladder of Powers is often successful at finding an effective re-expression. Sometimes the curvature can be very stubborn though, and we may not be happy with the residuals plots of the re-expression. When none of the data values is zero or negative, logarithms can be a helpful ally in the search for a useful model. Try taking the logs of both the x- and y-variable. Then re-express the data using some combination of x or log(x) vs. y or log(y). You may find that one of these works pretty well.

Observe the Chart:

Warning! Don’t expect to be able to straighten every curved scatterplot that you see. Some may just not have an effective re-expression. Don’t set your sights too high and remember that you won’t even find a “perfect” model. We seek a “useful” model, not perfection.

Classwork: Calculator Activity: Using Logarithmic Re-Expressions

Multiple Benefits We often choose a re-expression for one reason and then discover that it has helped other aspects of an analysis. For example, a re-expression that makes a histogram more symmetric might also straighten a scatterplot or stabilize variance.

Why Not Just Use a Curve? If there’s a curve in the scatterplot, why not just fit a curve to the data?