Copyright © 1999-2008 Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.

Slides:



Advertisements
Similar presentations
Final Thoughts HRP 223 – 2013 December 4 th, 2013 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation.
Advertisements

Multiple Linear Regression
Chapter 3 Bivariate Data
Overview Correlation Regression -Definition
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
1 SAS Formats and SAS Macro Language HRP223 – 2011 November 9 th, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning:
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
1 Merging with SQL HRP223 – 2011 October 31, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation.
1 Lab 2 HRP223 – 2010 October 18, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected.
1 Processing Grouped Data HRP223 – 2011 November 14 th, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
1 Combining (with SQL) HRP223 – 2010 October 27, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation.
1 Creating and Tweaking Data HRP223 – 2010 October 24, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
1 Database Theory and Normalization HRP223 – 2010 November 14 th, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning:
Describing the Relation Between Two Variables
1 Business 260: Managerial Decision Analysis Professor David Mease Lecture 1 Agenda: 1) Course web page 2) Greensheet 3) Numerical Descriptive Measures.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
Interpreting Bi-variate OLS Regression
Multiple Regression Research Methods and Statistics.
Linear Regression and Correlation Topic 18. Linear Regression  Is the link between two factors i.e. one value depends on the other.  E.g. Drivers age.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Multiple Regression Dr. Andy Field.
SAS for Categorical Data Copyright © 2004 Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright.
Simple Linear Regression
CHAPTER 5 REGRESSION Discovering Statistics Using SPSS.
Correlation and Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.
CHAPTER 15 Simple Linear Regression and Correlation
HPR Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
Working with Data in Windows HRP223 – 2009 Sept 28 th, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
Selecting Variables and Avoiding Pitfalls Chapters 6 and 7.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
1 Lab 2 and Merging Data (with SQL) HRP223 – 2009 October 19, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning:
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
1 Data Manipulation (with SQL) HRP223 – 2010 October 13, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
Department of Cognitive Science Michael J. Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Regression 1 PSYC 4310/6310 Advanced Experimental.
Lecture 4 Introduction to Multiple Regression
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Applied Quantitative Analysis and Practices LECTURE#31 By Dr. Osman Sadiq Paracha.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
B AD 6243: Applied Univariate Statistics Multiple Regression Professor Laku Chidambaram Price College of Business University of Oklahoma.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
HRP 223 – 2007 lm.ppt - Linear Models Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected.
Venn diagram shows (R 2 ) the amount of variance in Y that is explained by X. Unexplained Variance in Y. (1-R 2 ) =.36, 36% R 2 =.64 (64%)
1 Data Manipulation (with SQL) HRP223 – 2009 October 12, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
Beginning Data Manipulation HRP Topic 4 Oct 14 th 2012 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
Michael J. Kalsher PSYCHOMETRICS MGMT 6971 Regression 1 PSYC 4310 Advanced Experimental Methods and Statistics © 2014, Michael Kalsher.
Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Simple Linear Regression
Topic 10 - Linear Regression
Correlation, Bivariate Regression, and Multiple Regression
Working with Data in Windows
SAS Output Delivery System
Lab 3 and HRP259 Lab and Combining (with SQL)
Lab 2 and Merging Data (with SQL)
Lab 2 HRP223 – 2010 October 18, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected.
Correlation and Covariance
Processing Grouped Data
Presentation transcript:

Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international treaties. Unauthorized reproduction of this presentation, or any portion of it, may result in severe civil and criminal penalties and will be prosecuted to maximum extent possible under the law. HRP Topic 9 - Regression

HRP Height and Resting Pulse  The spreadsheet RESTING.xls has height and pulse measures on 50 people. On average, does pulse go up or down with height?

Look before you leap!

HRP

 Root MSE = Estimated standard deviation of the error in the model (eta)  Dependent Mean = Mean of the outcome  CV = ratio of above * 100  In general r2 is interpreted as: –.1 small effect, 3. medium effect,.5 large effect  Adjusted R-square =1- ( (1- rsquare) * ((n-1)/n-m-1)) ) n=subjects m=variables – It penalizes you for putting extra terms in the model. – R-squared is typically reported if you have a single predictor variable. – Adjusted R-square is typically reported if you have several predictors.

HRP Oxygen  The next set of data looks at the relationship between oxygen inhaled and exhaled. You would hope that there would be close to a perfect relationship between the two factors.

Add the library to a new flowchart. Add the SAS data set to the project.

Look at the Data  This is bad news…. At least it is symmetric.

 Simple correlation is questionable.

HRP

Are the residuals about normal?

HRP Leave yourself a note on how to interpret the output.  Right click on the flowchart and choose New > Note.  Leave yourself some notes.  Right click on the Note icon > Link Note to > Quadratic

HRP Ice cream!  In this example you will predict ice cream sales based on factors like price and temperature.  Start by making a library (or copy and paste the existing one) in a new flowchart.  The data is in a text file. Import the data.

Load the Data

Add Celsius Celsius is ( (5/9) * (Fahr-32) )

HRP

Some people say VIF > 10 is a problem but that is arbitrary. If VIF is > 1/(1 - R- squared) then the factors are more related to other predictors than outcome.

HRP Severely Dehydrated Children

HRP A Look  Do univariate descriptive statistics. – Things look reasonable.  Do bivariate correlations. – Age and weight are correlated  Do univariate modeling. – There is a weak but statistically significant association.  Build a model with all 3 predictors and check variance inflation.

A Simpler Model  It explains a fair amount of the variability (45%). How can I check to make sure the model is working well and is not being driven by outliers?

HRP Outliers Images from: Statistics I: Introduction to ANOVA, Regression, and Logistic Regression Course Notes (2005) and Categorical Data Analysis Using Logistic Regression Course Notes (2005), SAS Press.

First Check Residuals

HRP What is influential?  Freund and Littell SAS System for Regression 3rd edition, page 70;  Variance inflation: – vifcheck = 1 /(1 – r2)  Leverage greater than this value: – leverageCheck = 2 * (predictors + 1) / records  Covariance more extreme than: – cov1Check = * (predictors+1) / records – cov1Check = * (predictors+1) / records  Dfits values with absolute value bigger than: – dffitsCheck = 2 * ((predictors + 1)/records) **.5

Influence Code