Statistical Analysis SC504/HS927 Spring Term 2008 Session 5: Week 20: 15 th February OLS (2): assessing goodness of fit, extension to multiple regression.

Slides:



Advertisements
Similar presentations
Multivariate Data/Statistical Analysis SC504/HS927 Spring Term 2008 Week 18: Relationships between variables: simple ordinary least squares (OLS) regression.
Advertisements

NOTATION & ASSUMPTIONS 2 Y i =  1 +  2 X 2i +  3 X 3i + U i Zero mean value of U i No serial correlation Homoscedasticity Zero covariance between U.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Lecture 3 Cameron Kaplan
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Statistics for the Social Sciences
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
ASSESSING THE STRENGTH OF THE REGRESSION MODEL. Assessing the Model’s Strength Although the best straight line through a set of points may have been found.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Simple Linear Regression Analysis
Multiple Regression What is our research question?  How does IV1, IV2, IV3 influence the DV What unique contribution to variation in the DV does IV1 make.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Lecture 5: Simple Linear Regression
Multiple Regression Research Methods and Statistics.
Multiple Regression Dr. Andy Field.
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
SPSS Statistical Package for Social Sciences Multiple Regression Department of Psychology California State University Northridge
CHAPTER 5 REGRESSION Discovering Statistics Using SPSS.
Correlation and Regression
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Class 4 Ordinary Least Squares SKEMA Ph.D programme Lionel Nesta Observatoire Français des Conjonctures Economiques
Elements of Multiple Regression Analysis: Two Independent Variables Yong Sept
Analysis of Variance: Some Review and Some New Ideas
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2008.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Department of Cognitive Science Michael J. Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Regression 1 PSYC 4310/6310 Advanced Experimental.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Class 4 Ordinary Least Squares CERAM February-March-April 2008 Lionel Nesta Observatoire Français des Conjonctures Economiques
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 5 Multiple Regression.
Simple Linear Regression (SLR)
Simple Linear Regression (OLS). Types of Correlation Positive correlationNegative correlationNo correlation.
Environmental Modeling Basic Testing Methods - Statistics III.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Multiple Independent Variables POLS 300 Butz. Multivariate Analysis Problem with bivariate analysis in nonexperimental designs: –Spuriousness and Causality.
Michael J. Kalsher PSYCHOMETRICS MGMT 6971 Regression 1 PSYC 4310 Advanced Experimental Methods and Statistics © 2014, Michael Kalsher.
Analyze Of VAriance. Application fields ◦ Comparing means for more than two independent samples = examining relationship between categorical->metric variables.
Linear Regression 1 Sociology 5811 Lecture 19 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard)   Week 5 Multiple Regression  
Regression Analysis.
Regression Analysis AGEC 784.
Multiple Regression Prof. Andy Field.
Multiple Regression – Part II
CHAPTER 29: Multiple Regression*
Analysis of Variance: Some Review and Some New Ideas
Example 1 5. Use SPSS output ANOVAb Model Sum of Squares df
24/02/11 Tutorial 3 Inferential Statistics, Statistical Modelling & Survey Methods (BS2506) Pairach Piboonrungroj (Champ)
Simple Linear Regression
M248: Analyzing data Block D UNIT D2 Regression.
Simple Linear Regression
Multiple Linear Regression
Regression Part II.
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Statistical Analysis SC504/HS927 Spring Term 2008 Session 5: Week 20: 15 th February OLS (2): assessing goodness of fit, extension to multiple regression

2 How well does the estimated regression model fit the data?  How well does the estimate model describe the variation in the dependent variable?  Regression models try to account for (“explain”) deviations from the mean

3 Goodness of Fit  We need to compare the line of best fit with something to assess the ‘goodness of fit’. The most basic model we have of our data is the mean so we use that.  We are looking to see if our line of best fit is a better representation of the data than the mean (remember the example I read out of the Andy Field book last week)

4 The proportion of total variation of Y explained by the regression  To do this we work out the Sum of Squares (SS) for the mean which we call SS Total and we also work out the SS for the regression line which we call SS Residual  The difference between the SS Total and the SS Residual is known as the SS Model/Regression  To compute R 2 we just divide the SS Model/Regression by the SS Total

5 Proportion of Variance Explained by the Regression  R 2 can range from 0 to 1 with higher values indicating better fit

6 Example from Suicide data:

7 Multiple OLS regression  y i =a + b 1 x i1 + b 2 x i2 + …..+ b k x ik + e  a = value of y when all x’s =0  b 1 = change in y from a 1 unit change in x 1, holding all x 2, x 3 …………. x k constant i.e. controlling for all other independent variables in the model

8

9 E.g.: Using the data set alcohol

10 Multiple regression – beta coefficients  suppose Y = alcohol consumption, X 1 = income (in £s pw) and X 2 = age (in years)  how would you compare estimates for b 1 and b 2 ?  You would need to standardise the beta coefficient for b 1 = b 1 multiplied by standard deviation of X 1 relative to std. dev. of Y  the variable with the largest standardised beta coefficient has the largest impact on Y

11 F-test  an F-test can be used to test:  whether a subset of variables contributes significantly to the regression model (when doing hierarchical regression)  If the F-test is significant then the model as a whole is considered to be able to ‘explain’ or ‘account’ for a significant amount of the variance in the dependent variable.