Multiple regression.

Slides:



Advertisements
Similar presentations
Extension The General Linear Model with Categorical Predictors.
Advertisements

Correlation and Linear Regression.
Regression analysis Linear regression Logistic regression.
Correlation and Regression By Walden University Statsupport Team March 2011.
Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Quantitative Techniques
Simple Linear Regression 1. Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Lecture 6: Multiple Regression
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
Nemours Biomedical Research Statistics April 2, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Multiple Regression Research Methods and Statistics.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Correlation & Regression
INFERENTIAL STATISTICS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON
Introduction to Linear Regression and Correlation Analysis
ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Regression: Checking the Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Chapter 16 Data Analysis: Testing for Associations.
Analysis of covariance. When… ANCOVA is an ‘extra’ on an ANOVA ANOVA outcome = number of words learned IV = Sex ANCOVA adds a covariate covariate = size.
Correlation & Regression. The Data SPSS-Data.htmhttp://core.ecu.edu/psyc/wuenschk/SPSS/ SPSS-Data.htm Corr_Regr.
Chapter 6 Simple Regression Introduction Fundamental questions – Is there a relationship between two random variables and how strong is it? – Can.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
1 Preparation for Final Exam How to answer question related to computer output?
B AD 6243: Applied Univariate Statistics Multiple Regression Professor Laku Chidambaram Price College of Business University of Oklahoma.
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
Multiple Linear Regression An introduction, some assumptions, and then model reduction 1.
Lab 4 Multiple Linear Regression. Meaning  An extension of simple linear regression  It models the mean of a response variable as a linear function.
رگرسیون چندگانه Multiple Regression
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Special Topics in Multiple Regression Analysis
Chapter 14 Introduction to Multiple Regression
Chapter 20 Linear and Multiple Regression
Multiple Regression – Part I
Correlation, Regression & Nested Models
Essentials of Modern Business Statistics (7e)
Multiple Regression.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Chapter 12: Regression Diagnostics
BIVARIATE REGRESSION AND CORRELATION
Business Statistics Multiple Regression This lecture flows well with
Correlation and Simple Linear Regression
Multiple Regression – Part II
Example 1 5. Use SPSS output ANOVAb Model Sum of Squares df
Correlation and Simple Linear Regression
Simple Linear Regression
Simple Linear Regression and Correlation
Checking Assumptions Primary Assumptions Secondary Assumptions
Analysis of covariance
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Multiple regression

Overview Simple linear regression SPSS output Linearity assumption Multiple regression … in action; 7 steps checking assumptions (and repairing) Presenting multiple regression in a paper

Simple linear regression Class attendance and language learning Bob: 10 classes; 100 words Carol: 15 classes; 150 words Dave: 12 classes; 120 words Ann: 17 classes; 170 words Here’s some data. We expect that the more classes someone attends, the more words they learn.

The straight line is the model for the data The straight line is the model for the data. The definition of the line (y = mx + c) summarises the data.

SPSS output for simple regression (1/3) Model Summaryb Model R R Square Adjusted R Square Std. Error of the Estimate 1 .792a .627 .502 25.73131 a. Predictors: (Constant), classes b. Dependent Variable: vocabulary

SPSS output for simple regression (2/3)

SPSS output for simple regression (3/3) Coefficientsa Model Unstandardized Coefficients Standrdzd Coefficients t Sig. B Std. Error Beta 1 (Constant) -19.178 64.837 -.296 .787 classes 10.685 4.762 .792 2.244 .111 a. Dependent Variable: vocabulary

Linearity assumption Always check that the relationship between each predictor variable and the outcome is linear

Multiple regression More than one predictor e.g. predict vocabulary from classes + homework + L1vocabulary

Multiple regression in action Bivariate correlations & scatterplots – check for outliers Analyse / Regression Overall fit (R2) and its significance (F) Coefficients for each predictor (‘m’s) Regression equation Check mulitcollinearity (Tolerance) Check residuals are normally distributed

Bivariate outlier

Multivariate outlier Test Mahalanobis distance (In SPSS, click ‘Save’ button in Regression dialog) to test sig., treat as a chi-square value with df = number of predictors

Multicollinearity Tolerance should not be too close to zero T = 1 – R2 where R2 is for prediction of this predictor by the others If it fails, you need to reduce the number of predictors (you don’t need the extra ones anyway)

Failed normality assumption If residuals do not (roughly) follow a normal distribution … it is often because one or more predictors is not normally distributed  May be able to transform predictor

Categorical predictor Typically predictors are continuous variables Categorical predictors e.g. Sex (male, female) can do: code as 0, 1 Compare simple regression with t-test (vocabulary = constant + Sex)

Presenting multiple regression Table is a good idea: Include correlations (bivariate) R2 adjusted Report F (df, df), and its p, for the overall model Report N Coefficient, t, and p (sig.) for each predictor Mention that assumptions of linearity, normality, and absence of multicollinearity were checked, and satisfied

Further reading Tabachnik & Fidell (2001, 2007) Using Multivariate Statistics. Ch5 Multiple regression