Presentation is loading. Please wait.

Presentation is loading. Please wait.

Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency.

Similar presentations


Presentation on theme: "Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency."— Presentation transcript:

1 Re-Expressing Data

2 Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency

3 Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency Re-Expression!!

4 Goal 1: Make distribution more symmetric!  Allows use of 68 – 95 – 99.7 Rule  Take log of x variable

5 Goal 2: Make spread more alike even if centers are different  Groups with common spread are easier to compare  Take log of x

6 Goal 3: Make scatter plot nearly linear  Linear is easier to model  Take log of x

7 Goal 4: Make scatter plot more spread out than thick at one end or the other  Take log x

8 Ladder of Powers There is a family of simple re-expressions that move data toward our goals in a consistent way. This collection of re-expressions is called the Ladder of Powers. The Ladder of Powers orders the effects that the re-expressions have on data. Ladder of Powers

9 Ratios of two quantities (e.g., mph) often benefit from a reciprocal. The reciprocal of the data–1 An uncommon re-expression, but sometimes useful. Reciprocal square root–1/2 Measurements that cannot be negative often benefit from a log re-expression. We’ll use logarithms here“0” Counts often benefit from a square root re-expression. Square root of data values½ Data with positive and negative values and no bounds are less likely to benefit from re-expression. Raw data1 Try with unimodal distributions that are skewed to the left. Square of data values2CommentNamePower

10 Plan B: Attack of the Logarithms We seek a “useful” model, not perfection!!!!

11 Why Not Just Use a Curve?  If there’s a curve in the scatterplot, why not just fit a curve to the data?

12 Why Not Just Use a Curve? (cont.)  The mathematics and calculations for “curves of best fit” are considerably more difficult than “lines of best fit.”  Besides, straight lines are easy to understand.  We know how to think about the slope and the y - intercept.


Download ppt "Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency."

Similar presentations


Ads by Google