REGRESSION. Regression is a mathematical means for finding the best fitting line for bivariate data. ? ? ? ? ?

Slides:



Advertisements
Similar presentations
Sleeping and Happiness
Advertisements

Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Regression Greg C Elvers.
Chapter 10 Curve Fitting and Regression Analysis
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
Review for the chapter 6 test 6. 1 Scatter plots & Correlation 6
Regression What is regression to the mean?
Overview Correlation Regression -Definition
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Correlation Correlation is the relationship between two quantitative variables. Correlation coefficient (r) measures the strength of the linear relationship.
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
CORRELATON & REGRESSION
Statistics for the Social Sciences
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Business Statistics - QBM117 Least squares regression.
Multiple Regression Research Methods and Statistics.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Relationships Among Variables
Lecture 16 Correlation and Coefficient of Correlation
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
SIMPLE LINEAR REGRESSION
Simple Linear Regression. Correlation Correlation (  ) measures the strength of the linear relationship between two sets of data (X,Y). The value for.
Chapter 6 & 7 Linear Regression & Correlation
Regression. Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used.
CHAPTER 8 LEAST SQUARES LINE. IMPORTANT TERMS – WRITE THESE DOWN WE WILL WORK ON THE DEFINITIONS AS WE GO! Model Linear Model Predicted value Residuals.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Correlation and Regression PS397 Testing and Measurement January 16, 2007 Thanh-Thanh Tieu.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
BIOL 582 Lecture Set 11 Bivariate Data Correlation Regression.
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Part IV Significantly Different: Using Inferential Statistics
10B11PD311 Economics REGRESSION ANALYSIS. 10B11PD311 Economics Regression Techniques and Demand Estimation Some important questions before a firm are.
Educ 200C Wed. Oct 3, Variation What is it? What does it look like in a data set?
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Simple & Multiple Regression 1: Simple Regression - Prediction models 1.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
SIMPLE LINEAR REGRESSION AND CORRELLATION
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Simple Linear Regression
Regression Analysis AGEC 784.
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Regression Chapter 6 I Introduction to Regression
Chapter 5 LSRL.
Multiple Regression.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Chapter 15 Linear Regression
Least Squares Regression Line LSRL Chapter 7-continued
Correlation and Regression
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
Correlation and Regression
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Sleeping and Happiness
Chapter Thirteen McGraw-Hill/Irwin
Presentation transcript:

REGRESSION

Regression is a mathematical means for finding the best fitting line for bivariate data. ? ? ? ? ?

Age of 1st Computer use Age requiring Glasses ? y = 2x Rise Run Rise Run slope:

Two numbers tell us all we need to know about ANY line. Slope tells us how STEEP or angled the line is (rise/run) Intercept tells us simply at which value does the line CROSS the y-axis. Slope = 1/2

Two numbers tell us all we need to know about ANY line. Slope tells us how STEEP or angled the line is (rise/run) Intercept tells us simply at which value does the line CROSS the y-axis. Slope = 3

Two numbers tell us all we need to know about ANY line. Slope tells us how STEEP or angled the line is (rise/run) Intercept tells us simply at which value does the line CROSS the y-axis. Slope = 1

Two numbers tell us all we need to know about ANY line. Slope tells us how STEEP or angled the line is (rise/run) Intercept tells us simply at which value does the line CROSS the y-axis. Slope = -1

? ?? ? ? ? ? Regression finds the best fitting line to a set of data y = b x + a Correlation and Regression

Correlation finds the strength and direction of the best fitting line to the data. Mean weight = 135 lbs Mean height = 67 inches

Regression finds the the best fitting line to the data. y = bx + a Mean weight = 135 lbs Mean height = 67 inches a b Looks like something you probably already know: y = mx +b

Calculation Example I. set of paired heights and weights for 5 people Person Height Weight Let’s find the equation of the regression line to predict weight from height. You must know which variable is which! (unlike correlation)

Regression summarizes the data into two statistics: the slope, b, and the intercept, a, of the best fitting line. y = bx + a The SLOPE b = r xy (s y /s x ) The INTERCEPT a = y - bx _ _

Person Height Weight y =165.6 s y = 9.76 x = 55.6 s x = 3.05 _ r =.98 b =.98 (9.76/3.05) b = 3.14 a = – 3.14(55.6) a = _ The SLOPE b = r xy (s y /s x ) The INTERCEPT a = y - bx _ _

This IS the equation of the regression line—the line that BEST FITS our data: y = 3.14x What if I would like to predict the weight of someone who is 50 inches tall? y = 3.14x y = 3.14(50) y = y =

How good of a predictor is the independent variable? Coefficient of determination: r 2 Gives the percent of the variation that can be explained by the regression equation.

Coefficent of determination (r 2 ) = 96.04%

Homoscedacticity: Assumptions about the error of Prediction Homoscadadic Not Homoscadadic

Regression summarizes the data into two statistics: the slope, b, and the intercept, a, of the best fitting line. Predicting y from x: Predicting x from y: b: r x y (s y /s x ) b: r x y (s x /s y ) a: y – b x a: x – b y This is the same as simply always using y for the predicted variable and x for the predictor variable.

Person Height Weight

Disclaimer about Regression lines and their equations. If the best fitting line to predict y from x is: y = 2x +1 It does NOT mean that the best fitting line to predict x from y is: x = 2y + 1 NOR is it: x =.5y -.5 y = 3.12x y =.305x

Regression is used for prediction ? Height Weight H1H1 W1W1 ? No Observations H2H2 W2W2 Predicting Weight from Height from paired measurements of 18 people. Best fitting line H3H3 W3W3

Examples from everyday life: Waiter/Waitress Collect data on the size of tips with the time of your shift. Do some times of day bring larger more tips (and get you more money)? Baseball: You desperately need to pick players for your fantasy baseball team, and you need a way to know which players will do best. You may be able to figure out which variable (batting average, etc) predicts performance best. Eg: multiple regression Correlation will tell you the direction and STRENGTH of the relationship. Regression calculates the best-fitting line to your data, and allows for predictions to be made.

An example of Regression… Equation of a line: y= bx + a You are a waiter/waitress at a local restaurant. You would like to predict how much tip money to expect from customers on a 60- degree day.

An example: Temperature (Fahrenheit) Hourly Tip (dollars) You are a waiter/waitress at a local restaurant. You would like to predict how much tip money to expect from customers on a 60- degree day.  x N = = 62.8 Find the mean (x): _ r =.91 x y Find the standard deviation (s):  x = 314  x 2 = SS =  x 2 -  x) 2 N = – = s = SS N-1 √ = √

An example: Temperature (Fahrenheit) Hourly Tip (dollars) You are a waiter/waitress at a local restaurant. You would like to predict how much tip money to expect from customers on a 60- degree day.  y N = 62 5 = 12.4 Find the mean (y): _ r =.91 x y Find the standard deviation (s):  y = 62  y 2 = 886 SS =  y 2 -  y) 2 N = 886 – =117.2 s = SS N-1 √ = √

An example: Temperature (Fahrenheit) Hourly Tip (dollars) You are a waiter/waitress at a local restaurant. You would like to predict how much tip money to expect from customers on a 60- degree day. r =.91 x y x = 62.8 s x = y = s y = 5.41 To find the slope: b = r (s y /s x ) b =.91 (5.41/12.46) b =.40 To find the intercept: a = –.40(62.8) a = a = y – b x Equation of the regression line to predict tip from temperature: y =.40x – 12.72

An example: Temperature (Fahrenheit) Hourly Tip (dollars) You are a waiter/waitress at a local restaurant. You would like to predict how much tip money to expect from customers on a 60- degree day. r =.91 x y x = 62.8 s x = y = s y = 5.41 Equation of the regression line to predict tip from temperature: y =.40x – Prediction for a 60-degree day: y =.40(60) – y = 24 – y = 11.28

An example: Temperature (Fahrenheit) Hourly Tip (dollars) You are a waiter/waitress at a local restaurant. You would like to predict the temperature outside based on your hourly tip rate. Predict the weather if you earned 25 dollars hourly on a particular day. r =.91 y x y = 62.8 s y = x = s x = 5.41 To find the slope: b = r (s y /s x ) b =.91 (12.46/5.41) b = 2.10 To find the intercept: a = – 2.10(12.40) a = a = y – b x Equation of the regression line to predict tip from temperature: y = 2.10x

An example: Temperature (Fahrenheit) Hourly Tip (dollars) r =.91 y x y = 62.8 s y = x = s x = 5.41 Equation of the regression line to predict tip from temperature: y = 2.10x Prediction for a 25-dollar tip hourly: y = 2.10(25) y = y = You are a waiter/waitress at a local restaurant. You would like to predict the temperature outside based on your hourly tip rate. Predict the weather if you earned 25 dollars hourly on a particular day.

Another way to think about regression… Z-Score equation: Zy=  Zx  Where  is the same thing as r

An example: Temperature (Fahrenheit) Hourly Tip (dollars) You are a waiter/waitress at a local restaurant. You would like to predict how much tip money to expect from customers on a 60- degree day. r =.91 x y x = 62.8 s x = y = s y = 5.41 Zy=  Zx Zy=  Zx z(60) = ( )/12.46 =-.22 Zy=  (-.22) Zy=  y=  Raw(-.20)= (-.20)5.41