Regression. The Basic Problem How do we predict one variable from another?How do we predict one variable from another? How does one variable change as.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Chapter 12 Simple Linear Regression
Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Linear regression models
Cal State Northridge  320 Andrew Ainsworth PhD Regression.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Linear Regression.
The Simple Regression Model
Intro to Statistics for the Behavioral Sciences PSYC 1900
SIMPLE LINEAR REGRESSION
Accuracy of Prediction How accurate are predictions based on a correlation?
Chapter Topics Types of Regression Models
Cal State Northridge  320 Andrew Ainsworth PhD Correlation.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Regression The basic problem Regression and Correlation Accuracy of prediction in regression Hypothesis testing Regression with multiple predictors.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
INTERPRETING REGRESSION COEFFICIENTS. OUTLINE 1.Back to Basics 2.Form: The Regression Equation 3.Strength: PRE and r 2 4.The Correlation Coefficient r.
Simple Linear Regression Analysis
Cal State Northridge 427 Ainsworth
Linear Regression.  Uses correlations  Predicts value of one variable from the value of another  ***computes UKNOWN outcomes from present, known outcomes.
Relationships Among Variables
Example of Simple and Multiple Regression
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Fundamental Statistics for the Behavioral Sciences, 5th edition David C. Howell Chapter 10 Regression © 2003 Brooks/Cole Publishing Company/ITP.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 1 of 20 Chapter 4 Section 2 Least-Squares Regression.
1 Lecture 4 Main Tasks Today 1. Review of Lecture 3 2. Accuracy of the LS estimators 3. Significance Tests of the Parameters 4. Confidence Interval 5.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
PS 225 Lecture 20 Linear Regression Equation and Prediction.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 6 (cont.) Difference Estimation. Recall the Regression Estimation Procedure 2.
1 Regression & Correlation (1) 1.A relationship between 2 variables X and Y 2.The relationship seen as a straight line 3.Two problems 4.How can we tell.
Practice You collect data from 53 females and find the correlation between candy and depression is Determine if this value is significantly different.
Midterm Review Ch 7-8. Requests for Help by Chapter.
With the growth of internet service providers, a researcher decides to examine whether there is a correlation between cost of internet service per.
9.2 Linear Regression Key Concepts: –Residuals –Least Squares Criterion –Regression Line –Using a Regression Equation to Make Predictions.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
Regression The basic problem Regression and Correlation
Multiple Regression.
CHAPTER 29: Multiple Regression*
6-1 Introduction To Empirical Models
Fundamental Statistics for the Behavioral Sciences, 4th edition
Simple Linear Regression
Correlation and Regression
24/02/11 Tutorial 2 Inferential Statistics, Statistical Modelling & Survey Methods (BS2506) Pairach Piboonrungroj (Champ)
Regression & Prediction
Regression & Correlation (1)
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Introduction to Regression
MGS 3100 Business Analysis Regression Feb 18, 2016
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Regression

The Basic Problem How do we predict one variable from another?How do we predict one variable from another? How does one variable change as the other changes?How does one variable change as the other changes? Cause and effectCause and effect

An Example Cigarettes and CHD Mortality from Chapter 9Cigarettes and CHD Mortality from Chapter 9 Data repeated on next slideData repeated on next slide We want to predict level of CHD mortality in a country averaging 10 cigarettes per day.We want to predict level of CHD mortality in a country averaging 10 cigarettes per day.

The Data

Why Do We Care? We may want to make a prediction.We may want to make a prediction. More likely, we want to understand the relationship.More likely, we want to understand the relationship. XHow fast does CHD mortality rise with a one unit increase in smoking? XNote we speak about predicting, but often don’t actually predict.

Regression Line FormulaFormula X = the predicted value of Y (CHD mortality) XX = smoking incidence for that country

Regression Coefficients “Coefficients” are a and b“Coefficients” are a and b b = slopeb = slope XChange in predicted Y for one unit change in X a = intercepta = intercept Xvalue of when X = 0

Calculation SlopeSlope InterceptIntercept

For Our Data Cov XY = 11.13Cov XY = s 2 X = = 5.43s 2 X = = 5.43 b = 11.13/5.43 = 2.04b = 11.13/5.43 = 2.04 a = *5.95 = 2.37a = *5.95 = 2.37 See SPSS printout on next slideSee SPSS printout on next slide Answers are not exact due to rounding error and desire to match SPSS.

SPSS Printout

Note: The values we obtained are shown on printout.The values we obtained are shown on printout. The intercept is labeled “constant.”The intercept is labeled “constant.” Slope is labeled by name of predictor variable.Slope is labeled by name of predictor variable.

Making a Prediction Assume that we want CHD mortality when cigarette consumption of 6.Assume that we want CHD mortality when cigarette consumption of 6. We predict people/10,000 in that country will die of coronary heart disease.We predict people/10,000 in that country will die of coronary heart disease.

Accuracy of Prediction Finnish smokers smoke 6 cigarettes/adult/dayFinnish smokers smoke 6 cigarettes/adult/day We predict deaths/10,000We predict deaths/10,000 They actually have 23 deaths/10,000They actually have 23 deaths/10,000 Our error (“residual”) = = 8.39Our error (“residual”) = = 8.39 Xa large error

Cigarette Consumption per Adult per Day CHD Mortality per 10, Residual Prediction

Errors of Prediction Residual varianceResidual variance XThe variability of predicted values Standard error of estimateStandard error of estimate XThe standard deviation of predicted values

Standard Error of Estimate A common measure of the accuracy of our predictionsA common measure of the accuracy of our predictions XWe want it to be as small as possible.

r 2 as % Predictable Variability Define Sum of SquaresDefine Sum of Squares The remaining error divided by the original errorThe remaining error divided by the original error

For Our Data r =.713r =.713 r 2 = =.508r 2 = =.508 Approximately 50% in variability of incidence of CHD mortality is associated with variability in smoking.Approximately 50% in variability of incidence of CHD mortality is associated with variability in smoking. Elaborate on what this means.Elaborate on what this means.

Hypothesis Testing Null hypothesesNull hypotheses Xb * = 0 Xa * = 0 Define b* and a*Define b* and a* Xpopulation correlation (  ) = 0 We saw how to test the last one in Chapter 9.We saw how to test the last one in Chapter 9.

Testing Slope and Intercept These are given in computer printout as a t test.These are given in computer printout as a t test.

Testing The t values in the second from right column are tests on slope and intercept.The t values in the second from right column are tests on slope and intercept. The associated p values are next to them.The associated p values are next to them. The slope is significantly different from zero, but not the intercept.The slope is significantly different from zero, but not the intercept. Why do we care?Why do we care? Cont.

Testing--cont. What does it mean if slope is not significant?What does it mean if slope is not significant? XHow does that relate to test on r? What if the intercept is not significant?What if the intercept is not significant? Does significant slope mean we predict quite well?Does significant slope mean we predict quite well?