Chapter 10: Re-expressing Data (Get it Straight)

Slides:



Advertisements
Similar presentations
Copyright © 2010 Pearson Education, Inc. Slide A least squares regression line was fitted to the weights (in pounds) versus age (in months) of a.
Advertisements

Re-Expressing Data Get it Straight!. Page 192, #2, 4, 15, 19, 22 Residuals Pg 193, # 11, 23, 27, 33, 45 Pg 195, 16, 22, 23,25,37 Regression Wisdom Pg.
 Objective: To determine whether or not a curved relationship can be salvaged and re-expressed into a linear relationship. If so, complete the re-expression.
Chapter 10: Re-expressing data –Get it straight!
Chapter 10 Re-Expressing data: Get it Straight
Get it Straight!! Chapter 10
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Chapter 10 Re-expressing the data
Re-expressing data CH. 10.
Re-expressing the Data: Get It Straight!
Chapter 10 Re-expressing Data: Get it Straight!!
+ Hw: pg 788: 37, 39, 41, Chapter 12: More About Regression Section 12.2b Transforming using Logarithms.
Transforming to achieve linearity
Statistics Review Chapter 10. Important Ideas In this chapter, we have leaned how to re- express the data and why it is needed.
Chapter 10: Re-expressing Data It’s easier than you think!
Chapter 10 Re-expressing the data
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 10 Re-expressing Data: Get It Straight!. Slide Straight to the Point We cannot use a linear model unless the relationship between the two.
Chapter 10: Re- expressing Data by: Sai Machineni, Hang Ha AP STATISTICS.
Lecture 6 Re-expressing Data: It’s Easier Than You Think.
Copyright © 2010 Pearson Education, Inc. Slide A least squares regression line was fitted to the weights (in pounds) versus age (in months) of a.
Chapter 8 Linear Regression HOW CAN A MODEL BE CREATED WHICH REPRESENTS THE LINEAR RELATIONSHIP BETWEEN TWO QUANTITATIVE VARIABLES?
Reexpressing Data. Re-express data – is that cheating? Not at all. Sometimes data that may look linear at first is actually not linear at all. Straight.
Chapter 5 Lesson 5.4 Summarizing Bivariate Data 5.4: Nonlinear Relationships and Transformations.
Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency.
Solar System Distance Model The planets nearest the Sun are very different from the planets farther out in composition and structure.
Copyright © 2010 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Statistics 10 Re-Expressing Data Get it Straight.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
It takes 88 days for Mercury to orbit the Sun. This is 0.2 years less days to orbit the Sun than Earth.
Scientific Notation Definition: A method of expressing numbers in terms of a decimal number between 1 and 10 multiplied by a power of 10. Example: The.
 Understand why re-expressing data is useful  Recognize when the pattern of the data indicates that no re- expression will improve it  Be able to reverse.
Scientific Notation Definition: A method of expressing numbers in terms of a decimal number between 1 and 10 multiplied by a power of 10. Example: The.
WARM UP: Use the Reciprocal Model to predict the y for an x = 8
Let’s Get It Straight! Re-expressing Data Curvilinear Regression
Creating our solar system
THE SOLAR SYSTEM by Hunter
Chapter 10: Re-Expression of Curved Relationships
How spaced out are planets? How big are they compared with the Earth?
Re-expressing the Data: Get It Straight!
Diameter: 1,391,000 km (864,400 mi) Day (rotation): Earth days
Bell Ringer Make a scatterplot for the following data.
Chapter 10 Re-Expressing data: Get it Straight
Space.
Re-expressing Data: Get it Straight!
Re-expressing the Data: Get It Straight!
Re-expressing the Data: Get It Straight!
Section 4.4 Logarithmic Scales
Use the Reciprocal Model to find the regression line equation.
So how do we know what type of re-expression to use?
Venus Mercury Earth Meteoroid Comet Neptune Mars Jupiter
Re-expressing Data:Get it Straight!
Far and Away.
CHAPTER 12 More About Regression
Chapter 11 Looking at the Universe
Smallest to Largest ? Universe Solar System Galaxy.
Lecture 6 Re-expressing Data: It’s Easier Than You Think
Solar System.

Homework: pg. 266 #3 A. The relationship is strong, negative, and slightly curved. B. Yes. The scatterplot for the transformed data shows a clear linear.
CHAPTER 12 More About Regression
Our solar system is huge! Our Solar System has 8 planets.
The Planets of our Solar System The Terrestrial Planets
Chapters Important Concepts and Terms
Solar System.
CHAPTER 12 More About Regression
Re-expressing Data: Get it Straight!
Presentation transcript:

Chapter 10: Re-expressing Data (Get it Straight) Jami Copeland Shriya Varma Semester Project

Straightening Relationships In order to compare two variables using a linear regression model, the relationship between them must be linear To re-express data you can use square roots, reciprocals or logarithms

Goals of Re-expression Make the distribution of a variable more symmetric Make the spread of several groups (as seen in side-by-side boxplots) more alike Make the form of a scatterplot more nearly linear Make the scatter in a scatterplot spread out evenly rather than following a fan shape.

The Ladder Of Powers The Ladder of Powers helps to decide how to re-express data It offers a range of options and when each option should be used It also orders the effects of the re-expressions, from weakest to strongest

The Ladder of Powers Power Name Comment 2 Square of data values Try with unimodal distributions that are skewed to the left. 1 Raw data Data with positive and negative values and no bounds are less likely to benefit from re-expression. 1/2 Square root of data values Counts often benefit from a square root re-expression. “0” We’ll use logarithms here Measurements that cannot be negative often benefit from a log re-expression. -1/2 Reciprocal square root An uncommon re-expression, but sometimes useful. -1 The reciprocal of the data Ratios of two quantities (e.g., mph) often benefit from a reciprocal.

Plan B: Attack of the Logarithms If a stronger re-expression is needed, you can use Logarithms When none of the data is zero or negative, logarithms can be used in different combinations

Plan B: Attack of the Logarithims Model Name X-axis Y-axis Comment Exponential x log(y) This model is the “0’ power in the ladder approach, useful for values that grow by percentage increases. Logarithmic Log(x) y A wide range of x-values, or a scatterplot descending rapidly at the left but leveling off toward the right, may benefit from trying this model. Power Log(y) The Goldilocks model: When one of the ladder’s powers is too big and the next is too small, this one may be just right.

Problem 13: Planet distances and order Let’s look again at the pattern in the locations of the planets in our solar system seen in the given table. Use re-expressed data to create a model for the distance from the sun based on the planet’s position There is some debate among astronomers as to whether Pluto is truly a planet or actually a large member of the Kuiper Belt of comets and other icy bodies. Does your model suggest that Pluto may not belong in the planet group? Explain.

Problem 13 Planet Position Number Distance from Sun Mercury 1 36 Venus 2 67 Earth 3 93 Mars 4 142 Jupiter 5 484 Saturn 6 887 Uranus 7 1784 Neptune 8 2796 Pluto 9 3666

Problem 13a Re-express the “Distance from the sun” data using logarithms and your calculator to find the new distances, log(y).

Problem 13a Planet Position Number Distance from Sun Log(y) Mercury 1 36 1.5563 Venus 2 67 1.8261 Earth 3 93 1.9685 Mars 4 142 2.1523 Jupiter 5 484 2.6848 Saturn 6 887 2.9479 Uranus 7 1784 3.2514 Neptune 8 2796 3.4465 Pluto 9 3666 3.5642

Problem 13b This problem is just asking whether Pluto should be counted as a planet. The linear model predicts that Pluto will be 5741 million miles away, while the data shows it is only 3666 million miles away. This means it doesn’t fit very well and supports the claim that Pluto doesn’t behave like a planet.

Problem 15: Quaoar Planet Caltech astronomers discovered a new large body orbiting around the Sun, a billion miles beyond Jupiter named Quaoar. Quaoar orbits at a distance of about 4 billion miles. It is classified as a member of the Kuiper Belt instead of a planet. There are many reasons for suspecting that Pluto is unlike other planets.

Problem 15: Quaoar Planet Omit Pluto from your count of planets, and consider Quaoar as a candidate for the new planet. Based on its position, how does Quaoar’s distance from the sun compare with the prediction made by your model? Refit the model using Quaoar’s distance and position in the model instead of Pluto’s. Now how well does your model predict the re-expressed distance and position?

Problem 15a To solve this, you find the predicted distance from the re-expressed linear regression model. The predicted distance is 3.635. Pluto’s distance is 3.564. Quaoar’s is 3.602. Quaoar is therefore a better fit on the model.

Problem 15b To refit the model, you replace Pluto’s data in the original chart with Quaoar's data. This makes the R^2 value go up to 99.5% which means it is a better fit.