Transformations.

Slides:



Advertisements
Similar presentations
Inference for Linear Regression (C27 BVD). * If we believe two variables may have a linear relationship, we may find a linear regression line to model.
Advertisements

Chapter 10: Re-Expressing Data: Get it Straight
Chapter 10 Re-Expressing data: Get it Straight
Multivariate distributions. The Normal distribution.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Business Statistics - QBM117 Statistical inference for regression.
Transformations. Transformation (re-expression) of a Variable A very useful transformation is the natural log transformation Transformation of a variable.
Continuous Probability Distributions A continuous random variable can assume any value in an interval on the real line or in a collection of intervals.
© 2002 Thomson / South-Western Slide 6-1 Chapter 6 Continuous Probability Distributions.
Transformations to Achieve Linearity
9 - 1 Intrinsically Linear Regression Chapter Introduction In Chapter 7 we discussed some deviations from the assumptions of the regression model.
Inference for regression - Simple linear regression
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
Bivariate Data When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable.
Applications The General Linear Model. Transformations.
Marginal and Conditional distributions. Theorem: (Marginal distributions for the Multivariate Normal distribution) have p-variate Normal distribution.
Using SPSS for Windows Part II Jie Chen Ph.D. Phone: /6/20151.
Linear Regression Hypothesis testing and Estimation.
Statistics Review Chapter 10. Important Ideas In this chapter, we have leaned how to re- express the data and why it is needed.
Chapter 10: Re-Expressing Data: Get it Straight AP Statistics.
Transformations. Transformations to Linearity Many non-linear curves can be put into a linear form by appropriate transformations of the either – the.
Chapter 10 Re-expressing the data
Lecture 6 Re-expressing Data: It’s Easier Than You Think.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Hypothesis testing and Estimation
Applied Quantitative Analysis and Practices
Transformations.
If the scatter is curved, we can straighten it Then use a linear model Types of transformations for x, y, or both: 1.Square 2.Square root 3.Log 4.Negative.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Chapter 10 Notes AP Statistics. Re-expressing Data We cannot use a linear model unless the relationship between the two variables is linear. If the relationship.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Assessing Normality Are my data normally distributed?
Statistics 10 Re-Expressing Data Get it Straight.
Chapter 13 Lesson 13.2a Simple Linear Regression and Correlation: Inferential Methods 13.2: Inferences About the Slope of the Population Regression Line.
 Understand why re-expressing data is useful  Recognize when the pattern of the data indicates that no re- expression will improve it  Be able to reverse.
Thursday, May 12, 2016 Report at 11:30 to Prairieview
Continuous Random Variables
MATH 2311 Section 5.5.
Inferences for Regression
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Checking Regression Model Assumptions
Bell Ringer Make a scatterplot for the following data.
Chapter 10 Re-Expressing data: Get it Straight
Evaluating Bivariate Normality
Hypothesis testing and Estimation
Statistical Methods For Engineers
Theme 5 Standard Deviations and Distributions
Continuous Random Variables
Checking Regression Model Assumptions
Transformations.
Introduction to Probability and Statistics
The Normal Probability Distribution Summary
Re-expressing Data:Get it Straight!
Hypothesis testing and Estimation
Chapter 10 Re-expression Day 1.
Chapter 12 Review Inference for Regression
Transformations to Achieve Linearity
CHAPTER 12 More About Regression
Statistics for Managers Using Microsoft® Excel 5th Edition
MATH 2311 Section 5.5.
Lecture 6 Re-expressing Data: It’s Easier Than You Think
Diagnostics and Remedial Measures
Inferences for Regression
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
The Normal Distribution
Diagnostics and Remedial Measures
Presentation transcript:

Transformations

Transformation (re-expression) of a Variable Transformation of a variable can change its distribution from a skewed distribution to a normal distribution (bell-shaped, symmetric about its centre A very useful transformation is the natural log transformation For any value of x, ln(x) can be: Looked up in tables Calculated by most calculators Calculated by most statistical packages

Graph of ln(x)

The effect of the transformation

The effect of the ln transformation It spreads out values that are close to zero Compacts values that are large

Transforming data to a normal distribution allows one to use powerful statistical procedures (discussed later on) that assumes the data is normally distributed.

Transformations to Linearity Many non-linear curves can be put into a linear form by appropriate transformations of the either the dependent variable Y or the independent variable X or both. This leads to the wide utility of the Linear model. Another use of trans

Intrinsically Linear (Linearizable) Curves 1 Hyperbolas y = x/(ax-b) Linear form: 1/y = a -b (1/x) or Y = b0 + b1 X Transformations: Y = 1/y, X=1/x, b0 = a, b1 = -b

2. Exponential y = a ebx = aBx Linear form: ln y = lna + b x = lna + lnB x or Y = b0 + b1 X Transformations: Y = ln y, X = x, b0 = lna, b1 = b = lnB

3. Power Functions y = a xb Linear from: ln y = lna + blnx or Y = b0 + b1 X Transformations: Y = ln y, X = ln x, b0 = lna, b1 = b

Summary Transformations can be useful for: Changing data from a skewed distribution to a Normal (bell- shaped) distribution Straightening out Non-linear data A common transformation is the natural log transformation ln(x)

Example – Motor Vehicle Data The data is in an Excel file – MtrVeh.xls Dependent = mpg Independent = Engine size, horsepower and weight

The data in an SPSS file

We will try to fit a model predicting mpg with Engine (engine size). First a scatter plot: The dialog box selecting the variables:

The scatter-plot

Similar to: 2. Exponential y = a ebx = aBx Linear form: ln y = lna + b x = lna + lnB x or Y = b0 + b1 X Transformations: Y = ln y, X = x, b0 = lna, b1 = b = lnB

To perform a ln transformation in SPSS Go to the menu Transform->Compute

In this dialogue box you define the tansformation Press OK and the trasformation will be performed

The new variable has been added to the SPSS spreadsheet

The scatterplot showing a better fit to a straight line using the new variable lnmpg.

Transformations summary Transformations can be used to convert non-normal data to normally (bell-shaped) distributed data (allowing for the use of the more powerful techniques assuming normality) Transformations can be used to convert non-linear data linear (straight line) data.

Next topic Probability