Dummies (no, this lecture is not about you) POL 242 Renan Levine February 13/15, 2007.

Slides:



Advertisements
Similar presentations
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Advertisements

Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Bivariate Regression Analysis
Lecture 3 Cameron Kaplan
Statistics for the Social Sciences
Statistics for Managers Using Microsoft® Excel 5th Edition
1 Qualitative Independent Variables Sometimes called Dummy Variables.
An Introduction to Logistic Regression
Multiple Regression 2 Sociology 5811 Lecture 23 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Multiple Regression 1 Sociology 8811 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
Dummy Variables. Outline Objective Why forming dummy variables to use nominal variables as independent variables in regressions are important. How to.
Leedy and Ormrod Ch. 11 Gray Ch. 14
Example of Simple and Multiple Regression
Understanding Multivariate Research Berry & Sanders.
Multiple Regression 1 Sociology 5811 Lecture 22 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Regression. Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used.
Bivariate Regression Analysis The most useful means of discerning causality and significance of variables.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Chapter 18 Four Multivariate Techniques Angela Gillis & Winston Jackson Nursing Research: Methods & Interpretation.
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Interactions POL 242 Renan Levine March 13/15, 2007.
Multivariate Regression 11/19/2013. Readings Chapter 8 (pp ) Chapter 9 Dummy Variables and Interaction Effects (Pollock Workbook)
Multiple Regression 3 Sociology 5811 Lecture 24 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Multiple Regression Lab Chapter Topics Multiple Linear Regression Effects Levels of Measurement Dummy Variables 2.
Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief.
Warsaw Summer School 2015, OSU Study Abroad Program Regression.
Multiple Regression. Multiple Regression  Usually several variables influence the dependent variable  Example: income is influenced by years of education.
Department of Cognitive Science Michael J. Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Regression 1 PSYC 4310/6310 Advanced Experimental.
Review. POL 242 – Strong Correlation. Positive or Negative?
Regression analysis and multiple regression: Here’s the beef* *Graphic kindly provided by Microsoft.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Data Lab #8 July 23, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Chapter 16 Data Analysis: Testing for Associations.
Scatterplots & Regression Week 3 Lecture MG461 Dr. Meredith Rolfe.
Lecture 4 Introduction to Multiple Regression
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Interpreting multivariate OLS and logit coefficients Jane E. Miller, PhD.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
Multiple Regression Review Sociology 229A Copyright © 2008 by Evan Schofer Do not copy or distribute without permission.
Review for Final Examination COMM 550X, May 12, 11 am- 1pm Final Examination.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
[title of your research project] [author] INST 381 Fall 2015.
Dummy Variables; Multiple Regression July 21, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Specification: Choosing the Independent.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Regression Lab Exercises. Crosstabs Good to give a general picture Good to give a general picture Good to compare dependent varialbes between groups (countries,
DTC Quantitative Research Methods Regression I: (Correlation and) Linear Regression Thursday 27 th November 2014.
1 Regression Review Population Vs. Sample Regression Line Residual and Standard Error of Regression Interpretation of intercept & slope T-test, F-test.
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
Another way to select sample size The sample R-squared method.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Michael J. Kalsher PSYCHOMETRICS MGMT 6971 Regression 1 PSYC 4310 Advanced Experimental Methods and Statistics © 2014, Michael Kalsher.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard)   Week 5 Multiple Regression  
Statistics for Political Science Levin and Fox Chapter 11:
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Lecture 10 Regression Analysis
Political Science 30: Political Inquiry
Multiple Regression Analysis and Model Building
Bi-variate #1 Cross-Tabulation
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
QM222 A1 On tests and projects
Multiple Regression 1 Sociology 8811 Copyright © 2007 by Evan Schofer
Regression Part II.
Presentation transcript:

Dummies (no, this lecture is not about you) POL 242 Renan Levine February 13/15, 2007

Regression Interpretation Review Interpretation of a Regression “Countries" dataset.  The dependent variable is military expenditures as a % of GDP.  There are two independent variables: % of GDP from industry [INDUSTRY] and Civil Liberties Index [CIVLIB] Variables in the Equation Variable B SE B Beta T Sig T INDUSTRY CIVLIB (Constant)

What to look at first The primary question you are asking when you run a regression is "what is the effect of x [independent variable] on y [dependent variable]?" So first look at the value of the unstandardized coefficients (B) The unstandardized coefficient tells you how much the value of the dependent variable is expected to change if the value of the independent variable increases by one unit controlling for all other independent variables.  Ex: For INDUSTRY, B=  You can write this in a sentence like this: All else being equal, for every one percent increase in the % of GDP from industry, the amount of military expenditures as a % of GDP goes up 0.04 points [or 4 hundredths of a percent of GDP].

Are you sure? Can we be sure that this is not the result of sampling error?  For this we look at the significance (Sig T).  Ex: P =  You can write this result in a sentence like this: The relationship between the % of GDP from industry and the % of military expenditures is significant [P < 0.5].

Are you worried about multicollinearity? In this case, both variables are significant, so we do not need to check the tolerance. If two or more variables are insignificant, what should you examine?

Which variable has the biggest effect? Now that we have observed the effect of each independent variable on the dependent variable, we can turn to the question of which variable has the biggest effect on the dependent variable.  For this, we look at the standardized coefficient (Beta).  In this example, the largest beta is for Civil Liberties.  You can report this result in three sentences like this: According to the standardized coefficients, the level civil liberties has a bigger effect on a country's percentage of GDP spent on military expenditures than the percentage of GDP from industry. A change in one standard deviation in the level of civil liberties causes a 0.35 decrease in the percentage of GDP spent on military expenditures when controlling for percentage of GDP from industry. In comparison, a change in one standard in the percentage of GDP from industry causes a 0.13 increase in the percentage of GDP spent on military expenditures.

Last Finally, we can turn to the final question of how much total variation the model explains. The R-squared of the model provides this information. Remember that R-squared values will be lower if your DV has fewer categories.  R-squared tends to be very high when researching country-level data.

Multivariate Regression You finished a worksheet, demonstrating you know mechanics of a regression. Now you need to start developing a model of the world. You are trying to explain your DV. You must choose all those IVs that you (or others) think explains the DV Today: how to use nominal IVs. Two tools to make your model a more accurate description of the social world

Coefficient Remember that the unstandardized coefficient tell us how much a one-unit increase in the independent variable affects the dependent variable. The dependent variable must be (at least) ordinal.  The change from “strongly agree” to “agree” or “disagree” to “strongly disagree” causes…  For each additional percentage point increase in literacy…

Nominal Variables Makes no sense to use nominal variables this way.  The change from New Brunswick to Ontario or P.E.I to Quebec causes !?!?!?!  Being a Liberal rather than a Conservative, or being Green rather than a NDP supporter?!?!? But you can say..  The difference between New Brunswick and all other provinces OR Ontario and all other provinces is, OR Atlantic provinces and all others…  NDP supporters vs Liberals AND/OR Conservatives vs. Liberals, Bloc vs. Liberals AND/OR Others vs. Liberals.

Make Nominal Vars Dichotomous! Quebec vs. R.O.C. Alberta vs. all other provinces Men vs. Women Africa vs. all other countries City residents vs. all others NDP supporters only Catholics only Or even…  People over 65 only  University educated vs. non-University educated  Wealthiest countries vs. all others

Rules for Dummies “Dummy” Variables are dichotomous. Can use dummy variables for each category of a nominal variable minus one. Example. Original variable has five categories (Liberal, Conservative, NDP, Bloc, & Other)  Minimum: use just one dummy 1= NDP, 0= All others  Maximum: use # of categories – 1 = 4 1= NDP, 0= Liberals; * 1= Tory, 0= Liberals; * 1= Bloc, 0= Liberals; * 1= Other, 0= Liberals;* * 0= Liberals when # of categories-1 dummies are created if Liberals are excluded.  All options in between are also okay! One remaining category is the reference

What is the reference? Reference category is the baseline. When you use maximum number of categories, you are comparing each dummy to the [omitted] reference.  Often is the most popular category. Ex. For Canadian partisanship, Liberals are most popular, so if we have four dummies we often omit the Liberals and compare NDP-ers, Tories, Separatists and Others to the Liberals. Your call as to what you want baseline to be.

What does a dummy do? Y X Constant (Y intercept) Dummy’s B { Since X is zero or one, it is just like adding another constant!

Next Work on the lab! You have a write-up to do. You can use previous results, but you should really think about DV and IVs  You may use dummies in your write-up.  You may need to use dummies (and/or interactions) in your final paper (when we most care about the accuracy of your model).