Data Analysis: Relationships Continued Regression

Slides:



Advertisements
Similar presentations
Managerial Economics in a Global Economy
Advertisements

Lesson 10: Linear Regression and Correlation
Dr. G. Johnson, Data Analysis: Regression Research Methods for Public Administrators Dr. Gail Johnson.
Forecasting Using the Simple Linear Regression Model and Correlation
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Soc 3306a: Path Analysis Using Multiple Regression and Path Analysis to Model Causality.
Regression single and multiple. Overview Defined: A model for predicting one variable from other variable(s). Variables:IV(s) is continuous, DV is continuous.
Introduction to Regression Analysis
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Chapter 10 Simple Regression.
Simple Linear Regression
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
1 Multiple Regression Here we add more independent variables to the regression. In this section I focus on sections 13.1, 13.2 and 13.4.
The Simple Regression Model
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Multiple Regression Research Methods and Statistics.
Simple Linear Regression Analysis
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Lecture 5 Correlation and Regression
Lecture 15 Basics of Regression Analysis
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Chapter 11 Simple Regression
Correlation and Linear Regression
Understanding Multivariate Research Berry & Sanders.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Bivariate Regression Analysis The most useful means of discerning causality and significance of variables.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
Multivariate Regression 11/19/2013. Readings Chapter 8 (pp ) Chapter 9 Dummy Variables and Interaction Effects (Pollock Workbook)
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
Chapter 16 Data Analysis: Testing for Associations.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Examining Relationships in Quantitative Research
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
1.What is Pearson’s coefficient of correlation? 2.What proportion of the variation in SAT scores is explained by variation in class sizes? 3.What is the.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Multiple Independent Variables POLS 300 Butz. Multivariate Analysis Problem with bivariate analysis in nonexperimental designs: –Spuriousness and Causality.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
INTRODUCTION TO MULTIPLE REGRESSION MULTIPLE REGRESSION MODEL 11.2 MULTIPLE COEFFICIENT OF DETERMINATION 11.3 MODEL ASSUMPTIONS 11.4 TEST OF SIGNIFICANCE.
Stats Methods at IC Lecture 3: Regression.
EXCEL: Multiple Regression
Regression and Correlation
Regression Analysis.
Simple Bivariate Regression
Regression Analysis AGEC 784.
Lecture 10 Regression Analysis
REGRESSION G&W p
Bivariate & Multivariate Regression Analysis
Introduction to Regression Analysis
REGRESSION (R2).
Political Science 30: Political Inquiry
Multiple Regression.
Regression Analysis.
Regression Analysis Week 4.
Simple Linear Regression
SIMPLE LINEAR REGRESSION
Regression Part II.
Presentation transcript:

Data Analysis: Relationships Continued Regression Research Methods Dr. Gail Johnson

Simple Regression Enables us to estimate the: Strength of relationship Expressed as the percent of variance explained How much change you can expect in the dependent variable based on a one unit change in the independent variable Enables you to make predictive estimates

Relationships Correlation is not causation. Statistical measurement includes the measurement of relationships. There are 2 ways to measure the strength of a relationship: 1. How great a difference the independent variable makes on the dependent variable (sometimes called the effect-description). This allows you to predict the effect of the IV on the DV but you have to have interval data.

Relationships 2. How completely the dependent variable is explained by the independent variable. (Correlational). (R squared).

Simple Regression Assumes a linear relationship Interval level (or dichotomous: which means coded 0 or 1) data Independent Variable: interval level Random or census

Simple Regression Y = a + bX + error Where: a = the constant or Y intercept b = the regression coefficient, or slope Y = predicted value of the dependant variable X = the independent variable.

Simple Regression Estimate car repair costs for motor pool Y= car repair costs X = miles driven Collect data and crunch it. You get these results: Y = -267 and .018X

Simple Regression Estimate car repair costs Y = -267 and .018X Interpretation: for every mile driven, the repair costs goes up by 1.8 cents. For every 100 miles driven, costs go up by $1.80

Simple Regression Y = -267 and .018X If you expect the cars to be driven a total of 100,000 miles, how much will car repair costs likely be? 100,000 x .018 = $1,800 Solve equation: Y = -267 + 1,800 = $1,763

Simple Regression r= correlation coefficient (overall fit) (measure of association but non-directional; zero-order correlational coefficient). r2 = proportion of explained variation 1-r2 = proportion of unexplained variation

Life is more complex Rarely will any one single variable cause something to happen Life is inherently multivariate What are the possible causes for urban decay?

What are the possible causes for urban decay? lack of jobs high % of absentee landlords low % of homeowners poor quality of schools increased concentration of poor increase in drugs, crime aging housing stock flight of middle class to suburbs corruption aging infrastructure business flight to suburbs

What caused drop in crime? Changing demographics? Better policing? Strong economy? Gun control laws? Concealed weapons laws? Increased use of death penalty? Increase in number of police? Rising prison population? Waning Crack epidemic? Legalization of abortion?

Multiple Regression Multiple Regression lets you do four things: test your hypothesis predict the dependent variable if you know the values for independent variables Predicts the independent effect of each independent variable while controlling for the others tells you the relative strength of each of the independent variable using the beta weights

Multiple Regression Y = a1 + bX1 + bX2 + bX3 + b X4 + e. Y = dependent variable X1 = independent variable 1, controlling for X2, X3, X4 X2 = independent variable 2 controlling for X1, X3, X4 X3 = independent variable 3 controlling for X1, X2, X4 X4= independent variable 4 controlling for X1, X2, X3

Multiple Regression Income as a function of education and seniority? Y = Income (dep. Var.) Y (Income) = a + education + seniority Y= 6000 + 400X1 + 200X2 based on Lewis-Beck example

Multiple Regression Y= 6000 + 400X1 + 200X2 R square. = .67 67% of the variation in income is explained by these two variables. Excellent! For every year of education, holding seniority constant, income increases by $400. For every year of seniority, holding education constant, income increases by $200.

Multiple Regression Y= 6000 + 400X1 + 200X2 Example: Estimate the income of someone who has 10 years of education and 5 years of seniority: Y=6000 + 400(10) + 200(5) Y= $ 11,000

Multiple Regression Relationship between contributions to political campaigns as a function of age and income? Y= campaign contribution (dollars) x1 = age (years) X2 = income (dollars)

Multiple Regression Relationship between contributions to political campaigns as a function of age and income. Y = 8 + 2X1 + .010X2 (age) (income) For every increase in age, contributions go up by $2. For every increase in income, contributions go up .01 dollars

Multiple Regression Y = 8 + 2X1 + .010X2 Y= campaign contribution (dollars) But which is stronger? Need to look at the Beta weights Age = .15 Income = .45

Beta Weights Workbook: Table 10.15

Quick Analysis Whenever you are dealing with a correlation (regression analysis) First check the R squared value. A good study will have this If it is low, then you know that it is not a strong model and they shouldn’t be making grand conclusions Make sure they meet 4 conditions necessary for causality

Example Study tried to determine what explained why some cities introduced reinvention. Sent out a survey, respectable response rate Tested 13 factors they thought would explain reinvention R squared was .05 What do you conclude?

Example They ran a second model Included managers’ attitudes about innovation R squared was .22 What do you conclude?

The Levitt Article What data does he show? What kind of question is he asking? Does he show correlations? Does he build a multivariate model? Did anyone see an R squared in this article?