Linear Regression. PSYC 6130, PROF. J. ELDER 2 Correlation vs Regression: What’s the Difference? Correlation measures how strongly related 2 variables.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Linear regression models
Simple Linear Regression and Correlation
Simple Linear Regression
Simple Linear Regression 1. Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Chapter 12 Simple Regression
Intro to Statistics for the Behavioral Sciences PSYC 1900
Chapter Topics Types of Regression Models
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Part 18: Regression Modeling 18-1/44 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
Correlation and Regression Analysis
Simple Linear Regression Analysis
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Chapter 11 Simple Regression
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
Statistics for Business and Economics Chapter 10 Simple Linear Regression.
Ch4 Describing Relationships Between Variables. Pressure.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Chapter 10 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 A perfect correlation implies the ability to predict one score from another perfectly.
Lecture 10: Correlation and Regression Model.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Multiple Regression. PSYC 6130, PROF. J. ELDER 2 Multiple Regression Multiple regression extends linear regression to allow for 2 or more independent.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
LESSON 4.1. MULTIPLE LINEAR REGRESSION 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Chapter 11 REGRESSION Multiple Regression  Uses  Explanation  Prediction.
Regression Analysis AGEC 784.
REGRESSION G&W p
Simple Linear Regression
Chapter 11 Simple Regression
Chapter 13 Simple Linear Regression
BIVARIATE REGRESSION AND CORRELATION
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Simple Linear Regression
Simple Linear Regression
Simple Linear Regression and Correlation
Simple Linear Regression
Chapter 13 Additional Topics in Regression Analysis
3.2. SIMPLE LINEAR REGRESSION
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Linear Regression

PSYC 6130, PROF. J. ELDER 2 Correlation vs Regression: What’s the Difference? Correlation measures how strongly related 2 variables are. Regression provides a means for predicting the value of one variable based on the value of a related variable. The underlying mathematics are the same. Here we are dealing only with linear correlation and linear regression.

PSYC 6130, PROF. J. ELDER 3 Optimal Prediction using z Scores Consider 2 variables X and Y that may be related in some way. –e.g., X = midterm score, Y = final exam score X = reaction time, Y = error rate Suppose you know X for a particular case (e.g., individual, trial). What is your best guess at Y? The answer turns out to be pretty simple:

PSYC 6130, PROF. J. ELDER 4 Example: 6130A Assignment marks

PSYC 6130, PROF. J. ELDER 5 Graphical Representation Regression line PSYC 6130A Assignment 1 z-Score Assignment 2 z-Score

PSYC 6130, PROF. J. ELDER 6 The Raw-Score Regression Formula or where In terms of population parameters: In terms of sample statistics: or where

PSYC 6130, PROF. J. ELDER 7 Example: 6130A Assignment marks

PSYC 6130, PROF. J. ELDER 8 Graphical Representation PSYC 6130 Section A % 80% 85% 90% 95% 100% 80%85%90%95%100% Assignment 1 Grade Assignment 2 Grade y = 0.867x % Regression line

PSYC 6130, PROF. J. ELDER 9 Residuals The deviations of the actual Y values from the Y values predicted by the regression line are called residuals. The regression line minimizes the sum of squared residuals (and hence is called a mean-squared fit). PSYC 6130 Section A % 80% 85% 90% 95% 100% 80%85%90%95%100% Assignment 1 Grade Assignment 2 Grade

PSYC 6130, PROF. J. ELDER 10 Variance of the Estimate Total prediction error is expressed as the variance of the estimate (or mean-squared error) : In terms of population parameters:In terms of sample statistics:

PSYC 6130, PROF. J. ELDER 11 Explained and Unexplained Variance PSYC 6130 Section A % 80% 85% 90% 95% 100% 80%85%90%95%100% Assignment 1 Grade Assignment 2 Grade Unexplained Explained

PSYC 6130, PROF. J. ELDER 12 Summary of Variances

PSYC 6130, PROF. J. ELDER 13 Summary of Variances It can be shown that: i.e., the variance is equal to the sum of the explained and unexplained variances.

PSYC 6130, PROF. J. ELDER 14 Summary of Variances

PSYC 6130, PROF. J. ELDER 15 Coefficient of Determination The fraction of the total variance explained by the regression line is called the coefficient of determination It can be shown that this is just the square of the Pearson coefficient r: Population: Sample:

PSYC 6130, PROF. J. ELDER 16 Coefficient of Nondetermination The fraction of the total variance that remains unexplained by the regression line is called the coefficient of nondetermination It can be shown that this is just 1-r 2 : Population: Sample:

PSYC 6130, PROF. J. ELDER 17 Summary of Coefficients

PSYC 6130, PROF. J. ELDER 18 Components of Variance: SPSS Output ANOVA b a Regression Residual Total Model 1 Sum of SquaresdfMean SquareFSig. Predictors: (Constant), How tall are you without your shoes on (in cm.) a. Dependent Variable: How much do you weigh (in kilograms) b.

PSYC 6130, PROF. J. ELDER 19 Estimating the Variance of the Estimate Uncertainty in predictions can be estimated using the assumption of homoscedasticity. –(Etymology: hom- + Greek skedastikos able to disperse, from skedannynai to disperse) –Thought question: does this also explain the origin of the verb skedaddle? –In other words, homogeneity of variance in Y over the range of X.

PSYC 6130, PROF. J. ELDER 20 Confidence Intervals for Predictions

PSYC 6130, PROF. J. ELDER 21 Example: 6130A Assignment marks

PSYC 6130, PROF. J. ELDER 22 Underlying Assumptions Independent random sampling Linearity Normal Distribution Homoscedasticity

PSYC 6130, PROF. J. ELDER 23 Regressing X on Y Simply reverse the formulae, e.g., In terms of sample statistics: or where

PSYC 6130, PROF. J. ELDER 24 When to Use Linear Regression Prediction Statistical Control –Adjust for effects of confounding variable. –Also known as partialing out the effect of the confounding variable. Experimental Psychology: modeling effect of continuous independent variable on continuous dependent variable. –e.g., reaction time vs set size in visual search.

PSYC 6130, PROF. J. ELDER 25 Statistical Control Example: Mental Health Women report more bad mental health days than men, t(8176)=-7.1, p<.001, 2-tailed.

PSYC 6130, PROF. J. ELDER 26 Statistical Control Example: Physical Health

PSYC 6130, PROF. J. ELDER 27 Correlation Pearson’s r = 0.31

PSYC 6130, PROF. J. ELDER 28 After Partialing Out Physical Health

PSYC 6130, PROF. J. ELDER 29 Result of Partialing Out Physical Health Controlling for physical health, women report more bad mental health days than men, t(8176)=-5.7, p<.001, 2-tailed.