Regression. Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used.

Slides:



Advertisements
Similar presentations
Regression Inferential Methods
Advertisements

Regression Basics Predicting a DV with a Single IV.
Overview Correlation Regression -Definition
Simple Linear Regression 1. Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable.
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Basic Statistical Concepts
Statistics Psych 231: Research Methods in Psychology.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Introduction to Linear and Logistic Regression. Basic Ideas Linear Transformation Finding the Regression Line Minimize sum of the quadratic residuals.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Chapters 10 and 11: Using Regression to Predict Math 1680.
Least Squares Regression
Linear Regression.  Uses correlations  Predicts value of one variable from the value of another  ***computes UKNOWN outcomes from present, known outcomes.
Review Regression and Pearson’s R SPSS Demo
Relationships Among Variables
Least Squares Regression Line (LSRL)
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Introduction to Linear Regression and Correlation Analysis
Regression with 2 IVs Generalization of Regression from 1 to 2 Independent Variables.
MAT 254 – Probability and Statistics Sections 1,2 & Spring.
STAT 211 – 019 Dan Piett West Virginia University Lecture 2.
ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES
Introduction to Regression Analysis. Two Purposes Explanation –Explain (or account for) the variance in a variable (e.g., explain why children’s test.
Agenda Review Association for Nominal/Ordinal Data –  2 Based Measures, PRE measures Introduce Association Measures for I-R data –Regression, Pearson’s.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2008.
Elementary Statistics Correlation and Regression.
Aim: Review for Exam Tomorrow. Independent VS. Dependent Variable Response Variables (DV) measures an outcome of a study Explanatory Variables (IV) explains.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Least Squares Regression.   If we have two variables X and Y, we often would like to model the relation as a line  Draw a line through the scatter.
1 Correlation and Regression Analysis Lecture 11.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
رگرسیون چندگانه Multiple Regression
Political Science 30: Political Inquiry
Linear Regression and Correlation Analysis
LSRL Least Squares Regression Line
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Chapter 15 Linear Regression
Understanding Research Results: Description and Correlation
Module 12 Math 075.
Chapter 12 Regression.
Regression.
Regression.
Regression.
Regression.
Regression.
Presentation transcript:

Regression

Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used to predict values of one variable from values of the other (e.g., SAT to predict GPA).

Basic Ideas (2) Sample value: Intercept – place where X=0 Slope – change in Y if X changes 1 unit. Rise over run. If error is removed, we have a predicted value for each person at X (the line): Suppose on average houses are worth about $75.00 a square foot. Then the equation relating price to size would be Y’=0+75X. The predicted price for a 2000 square foot house would be $150,000.

Linear Transformation 1 to 1 mapping of variables via line Permissible operations are addition and multiplication (interval data) Add a constantMultiply by a constant

Linear Transformation (2) Centigrade to Fahrenheit Note 1 to 1 map Intercept? Slope? Degrees C Degrees F 32 degrees F, 0 degrees C 212 degrees F, 100 degrees C Intercept is 32. When X (Cent) is 0, Y (Fahr) is 32. Slope is 1.8. When Cent goes from 0 to 100 (rise), Fahr goes from 32 to 212, and = 180. Then 180/100 =1.8 is rise over run is the slope. Y = X. F=32+1.8C.

Regression Line (1) Basics 1. Passes thru both means. 2. Passes close to points. Note errors. 3. Described by an equation.

Regression Line (2) Slope Equation for a line is Y=mX+b in algebra. In regression, equation usually written Y=a+bX Y is the DV (weight), X is the IV (height), a is the intercept (-327) and b is the slope (7.15). The slope, b, indicates rise over run. It tells how many units of change in Y for a 1 unit change in X. In our example, the slope is a bit over 7, so a change of 1 inch is expected to produce a change a bit more than 7 pounds.

Regression Line (3) Intercept The Y intercept, a, tells where the line crosses the Y axis; it’s the value of Y when X is zero. The intercept is calculated by: Sometimes the intercept has meaning; sometimes not. It depends on the meaning of X=0. In our example, the intercept is –327. This means that if a person were 0 inches tall, we would expect them to weigh –327 lbs. Nonsense. But if X were the number of smiles,then a would have meaning.

Correlation & Regression Correlation & regression are closely related. 1. The correlation coefficient is the slope of the regression line if X and Y are measured as z scores. Interpreted as SD Y change with a change of 1 SD X. 2.For raw scores, the slope is: The slope for raw scores is the correlation times the ratio of 2 standard deviations. (These SDs are computed with (N-1), not N). In our example, the correlation was.96, so the slope can be found by b =.96*(33.95/4.54) =.96*7.45 = Recall that. Our intercept is *66.8  -327.

Correlation & Regression (2) 3.The regression equation is used to make predictions. The formula to do so is just: Suppose someone is 68 inches tall. Predicted weight is *68 =

Review What is the slope? What does it tell or mean? What is the intercept? What does it tell or mean? How are the slope of the regression line and the correlation coefficient related? What is the main use of the regression line?

Test Questions A B C D What is the approximate value of the intercept for Figure C? a.0 b.10 c.15 d.20

Test Questions In a regression line, the equation used is typically. What does the value a stand for?  independent variable  intercept  predicted value (DV)  slope

Regression of Weight on Height HtWt N=10 M=67M=150 SD=4.57SD= Correlation (r) =.94. Regression equation: Y’= X

Predicted Values & Errors NHtWtY'Error M SD Variance Numbers for linear part and error. Note M of Y’ and Residuals. Note variance of Y is V(Y’) + V(res).

Error variance In our example, Standard error of the Estimate – average distance from prediction In our example (Heiman’s notation for error is not standard. )

Variance Accounted for (Heiman’s notation for error is not standard. ) The basic idea is to try maximize r-square, the variance accounted for. The closer this value is to 1.0, the more accurate the predictions will be.

Sample Exam Data from Previous Class Exam 1 Exam 2 A sample of 10 scores from both exams Assuming these are representative, what can you say about the exams? The students?

Scatterplot & Boxplots of 2 Exams Exam 1Exam 2

Descriptive Stats Descriptives StatisticStd. Error Exam1Mean Median Variance Std. Deviation Minimum52.00 Maximum Range48.00 Exam2Mean Median Variance Std. Deviation Minimum24.00 Maximum Range76.00

Correlations Exam1Exam2 Exam1 Pearson Correlation ** Sig. (2-tailed).000 N Exam2 Pearson Correlation.420 ** 1 Sig. (2-tailed).000 N **. Correlation is significant at the 0.01 level (2-tailed).

Scatterplot with means and regression line Coefficients a Model Unstandardized Coefficients Standardized Coefficients tSig. BStd. ErrorBeta 1(Constant) Exam a. Dependent Variable: Exam2 Note that the correlation, r, is.42 and the squared correlation, R 2, is.177. R 2 is also the variance accounted for. We can predict a bit less than 20 percent of the variance in Exam 2 from Exam 1.

Predicted Scores Coefficients a Model Unstandardized Coefficients Standardized Coefficients tSig. BStd. ErrorBeta 1(Constant) Exam a. Dependent Variable: Exam2 Predicted Exam 2 = *Exam1 For example, if I got 85 on Exam 1, then my predicted score for Exam 2 is *85 = = 72 percent