Transforming Relationships

Slides:



Advertisements
Similar presentations
Chapter 4 Review: More About Relationship Between Two Variables
Advertisements

4.1: Linearizing Data.
Forecasting Models With Linear Trend. Linear Trend Model If a modeled is hypothesized that has only linear trend and random effects, it will be of the.
Chapter 10 Re-Expressing data: Get it Straight
Chapter Four: More on Two- Variable Data 4.1: Transforming to Achieve Linearity 4.2: Relationships between Categorical Variables 4.3: Establishing Causation.
Lesson Quiz: Part I 1. Change 6 4 = 1296 to logarithmic form. log = 4 2. Change log 27 9 = to exponential form = log 100,000 4.
S ECTION 4.1 – T RANSFORMING R ELATIONSHIPS Linear regression using the LSRL is not the only model for describing data. Some data are not best described.
Logarithms ISP 121.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Lesson Nonlinear Regression: Transformations.
VCE Further Maths Least Square Regression using the calculator.
Transforming to achieve linearity
Chapter 4: More on Two-Variable (Bivariate) Data.
Transforming Relationships
Correlation & Regression – Non Linear Emphasis Section 3.3.
Transforming Relationships Chapter 4.1: Exponential Growth and Power Law Models Part A: Day 1: Exponential Growth.
Regression Regression relationship = trend + scatter
Exponential Regression Section Starter The city of Concord was a small town of 10,000 people in Returning war veterans and the G.I.
3.2 - Least- Squares Regression. Where else have we seen “residuals?” Sx = data point - mean (observed - predicted) z-scores = observed - expected * note.
4.1 Modeling Nonlinear Data.  Create scatter plots of non linear data  Transform nonlinear data to use for prediction  Create residual plots.
Nonlinear modeling Problem 4.6 Gypsy moths. Since gypsy moths were introduced to North America, they have proliferated unchecked and resulted in deforestation.
Chapter 10 Notes AP Statistics. Re-expressing Data We cannot use a linear model unless the relationship between the two variables is linear. If the relationship.
Chapter 4 More on Two-Variable Data. Four Corners Play a game of four corners, selecting the corner each time by rolling a die Collect the data in a table.
Lecture Slides Elementary Statistics Twelfth Edition
Topics
Steve Greer hrsbstaff.ednet.ns.ca/sgreer
CHAPTER 3 Describing Relationships
FW364 Ecological Problem Solving Class 6: Population Growth
Bring project data to enter into Fathom
Unit 4 LSRL.
Sections Review.
WARM UP: Use the Reciprocal Model to predict the y for an x = 8
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Transforming Relationships
Section 3.4 Solving Exponential and Logarithmic Equations
Chapter 5 LSRL.
Active Learning Lecture Slides
CHAPTER 12 More About Regression
Bell Ringer Make a scatterplot for the following data.
Chapter 10 Re-Expressing data: Get it Straight
Warm Up A method to estimate the age of a lobster was investigated in a 2007 study. The length of the exterior shell (in mm) was compared to the known.
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Describing Bivariate Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Examining Relationships
Chapter 3 Describing Relationships Section 3.2
Residuals and Residual Plots
Transforming Relationships
Chapter 5 LSRL.
Correlation and Regression
CHAPTER 12 More About Regression
Section 4.1 Exponential Modeling
Ch 4 : More on Two-Variable Data
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
3.2 – Least Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 12 More About Regression
Homework: pg. 276 #5, 6 5.) A. The relationship is strong, negative, and curved. The ratios are all Since the ratios are all the same, the exponential.
CHAPTER 3 Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Which graph best describes your excitement for …..
Chapters Important Concepts and Terms
CHAPTER 3 Describing Relationships
Presentation transcript:

Transforming Relationships Chapter 4.1: Exponential Growth and Power Law Models Part A: Day 1: Exponential Growth

Beyond Linear What if your data is clearly curved in some manner? Are there models we can use, and prediction equations we can develop? Of course … and we will examine two basic types … Exponential Growth & The Power Law But ARE we really beyond LINEAR?

The Exponential Model Y = abx “a” is the initial value when x = 0, it is the y-intercept. “a” is often unrealistically small depending on the manner in which the data is entered. Each subsequent Y value is obtained by multiplying by a factor “b”. Taking a Logarithm (LOG) of the y-value, and using the old x-value will linearize the data.

Transforming Data Enter the following data and observe the curved pattern. Do a LinReg on L1, L2 ŷ = -18081823.7 + 9083.3487(x); r = .96501 Check out the residuals, just to reinforce the non-linearity. Cell Phone Subscribers in the U.S., 1990-1999 Year 1990 1993 1994 1995 1996 1997 1998 1999 Subscribers (1000’s) 5283 16,009 24,134 33,784 44,043 55,312 69,209 86,047

Logarithmic Transformation Now, obtain the data from the Y-value list (L2), and “take the LOG” of each value. Place the resulting LOGGED DATA into L3 Re-plot the L1/L3 data. Are things perfectly linear? Explore with LinReg, r-value. Comment. Log (ŷ) = -263.203 + 0.13417(x); r = .99116 Year 1990 1993 1994 1995 1996 1997 1998 1999 Log of Subscribers (1000’s) 3.7229 4.2044 4.3826 4.5287 4.6439 4.7428 4.8402 4.9347

Still not perfect – Is it? Eliminate all data except the last 4 years Dump this data into L4/L5 Do a LinReg on L4/L5 … better? Log (ŷ) = -188.951 + 0.09699(x); r = .99995 How about that Residual Plot still? Grrrrrr. Year 1996 1997 1998 1999 Log of Subscribers (1000’s) 4.6439 4.7428 4.8402 4.9347

Predictions? OK, regardless of the suspicious Residual Plot … we move on. Let’s use the last Prediction Equation to Predict for the year 2000. Log (ŷ) = -188.951 + 0.09699(2000) Log (ŷ) = 5.032878574 ŷ =10^ 5.032878574 = 107, 864.5

Conclusions? If a variable grows exponentially, then its logarithm grows linearly High r and R-square values are not the total picture. Near perfect (.99999) r values and R-Square values still are incomplete. Residuals tell a big tale. But the magnitude of the error can still warrant usage, if we are simply trying to predict! Plug into the “Log Equation”, then raise answer to the 10th power.

Another Problem! The combined American Indian, Eskimo, Aleut, Asian and Pacific Islander population grew in the US from 1950 to 1990 …as shown below … When entering the year, enter 50 for 1950, 60 for 1960, etc. Perform the Regression after transforming the data. Make a prediction of this combined population in the year 2000 … Year 1950 1960 1970 1980 1990 Population(1000’s) 1131 1620 2557 5150 9534

So how’d ya do? Log (ŷ) = 1.8246 + 0.023539(x); r = .992025 Log (ŷ) = 1.8246 + 0.023539(100) … note we are using “100” to represent the year 2000. Log (ŷ) = 4.17853333477 ŷ = 10 ^ 4.17853333477 = 15084.5839 So … the population would be predicted to be : 15,084,584 people. Do you think your prediction (Extrapolation) is too high or too low as compared to the actual population in 2000? Why?

Gypsy Moths Enter the data into L1 and L2 for the year (x–List 1) and the Acres of land defoliated by the Gypsy Moth (y-List 2). Year Acres 1978 63,042 1979 226,260 1980 907,075 1981 2,826,095

Gypsy Moths P.212/#4.6 – A) Plot the number of acres defoliated (y) against the year (x). B) Check out the three consecutive ratios of the Acreage … to verify the approximate exponential growth. What is that approximate growth RATIO (to the nearest integer)? C) “Linearize” the data – i.e. Transform the y-values, and plot the results. D) Calculate the LSRL for the transformed data. Log (ŷ) = -1094.51 + 0.5558(x); r = .999293 E) Construct and interpret the residual plot. F) Perform an inverse transformation to express ŷ as an exponential function of year. G) Predict the number of acres defoliated in 1982. Log (ŷ) = -1094.51 + 0.5558(1982) = 7.0302 ŷ = 10 ^ 7.0302 = 10,719,964.92 acres.