Logistic Regression July 28, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

Brief introduction on Logistic Regression
Regression analysis Linear regression Logistic regression.
Qualitative Variables and
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
Logistic Regression and Odds Ratios
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
An Introduction to Logistic Regression
Logistic Regression Source
Multiple Regression – Basic Relationships
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Logistic Regression – Basic Relationships
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
Dummy Variables. Outline Objective Why forming dummy variables to use nominal variables as independent variables in regressions are important. How to.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Leedy and Ormrod Ch. 11 Gray Ch. 14
Wednesday PM  Presentation of AM results  Multiple linear regression Simultaneous Simultaneous Stepwise Stepwise Hierarchical Hierarchical  Logistic.
Logistic Regression- Dichotomous Dependent Variables March 21 & 23, 2011.
Measurement and Variables May 14, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Hierarchical Binary Logistic Regression
Tests of Significance June 11, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Cross-Tabulation Analysis; Making Comparisons; Controlled Comparisons June 2, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Data Lab # 2 Descriptive Statistics May 28, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Multiple Regression Lab Chapter Topics Multiple Linear Regression Effects Levels of Measurement Dummy Variables 2.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
Logistic (regression) single and multiple. Overview  Defined: A model for predicting one variable from other variable(s).  Variables:IV(s) is continuous/categorical,
Linear correlation and linear regression + summary of tests
Logistic Regression. Conceptual Framework - LR Dependent variable: two categories with underlying propensity (yes/no) (absent/present) Independent variables:
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Data Lab # 3 June 4, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Data Lab #8 July 23, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Data Lab # 4 June 16, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Dummy Variables; Multiple Regression July 21, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Logistic Regression Analysis Gerrit Rooks
Week 13 Application! DPPG 8504 Sara Spowart, PhD, MPA
Dates Presentations Wed / Fri Ex. 4, logistic regression, Monday Dec 7 th Final Tues. Dec 8 th, 3:30.
CSE 5331/7331 F'07© Prentice Hall1 CSE 5331/7331 Fall 2007 Regression Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
Describing Variables & Hypotheses Testing May 26, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Data Lab #6 July 9, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Nonparametric Statistics
Measures of Association June 25, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Lecture note on statistics, data analysis planning – week 14 Elspeth Slayter, M.S.W., Ph.D.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Chapter 13 LOGISTIC REGRESSION. Set of independent variables Categorical outcome measure, generally dichotomous.
Nonparametric Statistics
BINARY LOGISTIC REGRESSION
REGRESSION G&W p
Logistic Regression.
Political Science 30: Political Inquiry
Drop-in Sessions! When: Hillary Term - Week 1 Where: Q-Step Lab (TBC) Sign up with Alice Evans.
Ivan Katchanovski, Ph.D. POL 242Y-Y
BIVARIATE REGRESSION AND CORRELATION
Multiple logistic regression
Nonparametric Statistics
Bivariate Linear Regression July 14, 2008
STA 282 – Regression Analysis
Regression III.
Presentation transcript:

Logistic Regression July 28, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y

Binary Logistic Regression Appropriate when the dependent variable is a dummy variable – Dummy variable: a variable that includes two categories which assume values 1 and 0 – Example: “Conservative party supporter”: Yes=1; No=0 – Binary: two values One or many independent variables Assumes non-linear relationship 2

Regression Coefficients and Odds Ratio Regression coefficients: – Interpretation is similar to interpretation of unstandardized regression coefficients in linear regression Effect of a change of one unit of an independent variable on the logged odds of the dependent variable Logged odds are not very easy to grasp Odds Ratio: – Effect of a change of one unit of an independent variable on the change in the odds of the dependent variable Better to grasp If odds ratio more than 1: positive relationship If odds ratio less than 1: negative relationship If odds ratio equal to1: no relationship 3

Statistical Significance Statistical significance of a regression coefficient: – Statistically significant if p(obtained)<p(critical)=.05 or.01 or.001 – Statistically nonsignificant if p(obtained)>p(critical)=.05 Direction of association should be reported only for statistically significant regression coefficients 4

Pseudo R Square R Square analogs in logistic regression – Power of independent variables in predicting the dependent variable Cox & Snell R square – Ranges between 0 (no association) and less than 1 (perfect association) Nagelkerke R square – Adjusts Cox & Snell R square so that its maximum value can equal 1 – Ranges between 0 (no association) and 1 (perfect association) 5

Example: Multiple Research Hypotheses First : The level of economic development has a positive effect on the odds that countries are democratic Second: Former British colonies are more likely to be democratic compared to other countries Third : Protestant countries are more likely to be democratic compared to other countries Fourth: Ethnic and linguistic homogeneity has a positive effect on the odds of countries being democratic 6

Example: Variables Dataset: World Dependent Variable: – Democracy (Is country democratic?) Dummy variable Independent Variables: – GDP per capita ($1000) Interval-ratio – Ethno-linguistic heterogeneity Ordinal treated as interval-ratio – Colony variable Transformed into dummy variables – Religious culture variable Transformed into dummy variables 7

Binary Logistic Regression: SPSS Commands SPSS Command: Analyze-Regression-Binary Logistic “Dependent” box: Select the dependent variable “Covariates” box: Select independent variables Method: “Enter” 8

Table: Determinants of democracy Regression coefficients B (Standard error) Odds ratio Exp(B) GDP per cap ($1000).336*** (.105) French colony (1.148).198 Spanish colony.433 (.823) Other country (1.035) Catholic.389 (1.218) Muslim (1.253).336 Other religion (1.149).824 Ethno-linguistic heterogeneity (.434).705 Constant (1.557).531 Nagelkerke R square.645 N92 9 *** Statistically significant at the.01 level, ** statistically significant at the.05 level, * statistically significant at the.1 level

Example: Statistical Significance Number of cases: N=92.1 or 10% significance level can be used Regression coefficient of the GDP variable: SPSS: p(obtained)=.001 <p(critical)=.01=1% Statistically significant at the.01 or 1% level Regression coefficients of the other independent variables: SPSS: p(obtained)=from.159 to.866 >p(critical)=.1 Statistically insignificant 10

Example: Regression Coefficients and Odds Ratio Regression Coefficient of GDP per capita variable=.336 Increase of $1000 in the level of GDP per capita increases the logged odds of country being democratic by.336 Odds ratio of GDP per capita: Increase of $1000 in the level of GDP per capita increases the odds of country being democratic by about 1.4 times 11

Example: Interpretation Nagelkerke R square=.645 The logistic regression model has a strong predictive power The first research hypothesis is supported by logistic regression analysis The level of economic development has a positive and statistically significant effect on the odds of countries being democracies All other research hypotheses are not supported by logistic regression analysis 12