Topic 10 - Categorical Outcomes

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

Chapter 2 Describing Contingency Tables Reported by Liu Qi.
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Brief introduction on Logistic Regression
M2 Medical Epidemiology
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Logistic Regression.
Lecture 16: Logistic Regression: Goodness of Fit Information Criteria ROC analysis BMTRY 701 Biostatistical Methods II.
Lecture 22: Evaluation April 24, 2010.
Logistic Regression Example: Horseshoe Crab Data
Overview of Logistics Regression and its SAS implementation
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
Lecture 17: Regression for Case-control Studies BMTRY 701 Biostatistical Methods II.
Midterm Review Goodness of Fit and Predictive Accuracy
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Handling Categorical Data. Learning Outcomes At the end of this session and with additional reading you will be able to: – Understand when and how to.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Generalized Linear Models
Logistic Regression In logistic regression the outcome variable is binary, and the purpose of the analysis is to assess the effects of multiple explanatory.
STAT E-150 Statistical Methods
Regression and Correlation
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
Statistics for clinical research An introductory course.
Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.
Hierarchical Binary Logistic Regression
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
Logistic Regression Database Marketing Instructor: N. Kumar.
Different Distributions David Purdie. Topics Application of GEE to: Binary outcomes: – logistic regression Events over time (rate): –Poisson regression.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
The binomial applied: absolute and relative risks, chi-square.
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Logistic Regression. Linear Regression Purchases vs. Income.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
AMMBR II Gerrit Rooks. Checking assumptions in logistic regression Hosmer & Lemeshow Residuals Multi-collinearity Cooks distance.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Logistic Regression Analysis Gerrit Rooks
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
Analysis of matched data Analysis of matched data.
BUS 308 Entire Course (Ash Course) For more course tutorials visit BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308.
Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.
Chapter 13 LOGISTIC REGRESSION. Set of independent variables Categorical outcome measure, generally dichotomous.
Nonparametric Statistics
BINARY LOGISTIC REGRESSION
Logistic Regression When and why do we use logistic regression?
EHS Lecture 14: Linear and logistic regression, task-based assessment
Logistic Regression APKC – STATS AFAC (2016).
CHAPTER 7 Linear Correlation & Regression Methods
An Interactive Tutorial for SPSS 10.0 for Windows©
Notes on Logistic Regression
The binomial applied: absolute and relative risks, chi-square
Generalized Linear Models
BUS 308 HELPS Perfect Education/ bus308helps.com.
Diagnosis II Dr. Brent E. Faught, Ph.D. Assistant Professor
Multiple logistic regression
Nonparametric Statistics
ביצוע רגרסיה לוגיסטית. פרק ה-2
Statistical Analysis using SPSS
Logistic Regression.
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Lexico-grammar: From simple counts to complex models
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Categorical Data Analysis
Applied Statistics Using SPSS
Applied Statistics Using SPSS
Logistic Regression.
Presentation transcript:

Topic 10 - Categorical Outcomes HRP 223 - 2008 Topic 10 - Categorical Outcomes

Pearson Less Adverse Events Modeling (χ2 /N) ^ .5 Effect Size +/- 1 for 2x2 tables with.1 .3 .5 Odds ratio = 22/44 ÷ 28/24 Relative Risk 1 = 22/66 ÷ 28/52

Chi-Square Working χ2 = (Observed # - Expected #)2 / Expected # χ2 determines if an association exits χ2 does not measure the strength of the association χ2 depends on and reflects sample size

Right click on the flowchart or File > New…

Tables Bigger than 2 x 2

2 Type “ Yes” with a leading blank 1 4 3

5 6 7 8

(16 * 1 + 26 * 2 + 29 * 3) / 71 = mean for ACT (24 * 1 + 26 * 2 + 18 * 3) / 71 = mean for Placebo

OR: 29/23 ÷ 40/60 = 1.891

If you model the wrong outcome, tweak this. You can control parameterization here.

Watch for missing data. Use the response variable sort order if you are predicting the wrong outcome. Impact of drug relative to an on placebo baseline The model found an answer. If you have prefect prediction (0 frequencies in the contingency tables), expect problems.

Akaike’s Information Criterion, Schwartz Criterion, -2 log likelihood are here. All are used to compare nested models. AIC and SC penalize you for the number of parameters. Overall, is the model any good?

Concordant = Given a pair of observations with different values in the response variable, the model correctly assigns the higher probability of the outcome to the person with the outcome. Domer’s D (nc-nd) / t Gamma (nc –nd) / (nc + nd) Tau-a (nc – nd) / (.5 * N * N-1) c = (nc + .5 * (t-nc –nd)) / t nc = # concordant pairs Nd = # discordant pairs t = pairs with different response values c = area under the ROC curve Somer’s D = 2(c-.5)

Sensitivity: Correctly identify the presence of a condition in those with the disease. Specificity: Correctly identify the lack of a condition in those without the disease.

Odds of a Month in Remission

Profile likelihood are more accurate (especially for small samples) but they need extra horsepower to calculate.

Confidence Intervals

Don’t interpret these as the % of variance accounted for in your model.

Image from: Categorical Data Analysis Using Logistic Regression Course Notes (2005) SAS.

Like leverage How good does the model do at predicting each person Standardized change in parameters if this observation is dropped Overall change in parameters if this observation is dropped. Like cooks distance.

Poorly fit are in the upper corners. Rough rule of thumb is > 4 Poorly fit are in the upper corners. Rough rule of thumb is > 4. See Hosmer and Lemeshow (2000). Events (healthy) Not events (sick)

Modify the code to get a better plot showing the impact on the CLs.

Look for big bubbles and you can hover the mouse over the dots to see the subject-observation number.

Are you missing interactions? Compare you model vs. a saturated model. It does not make sense if you have continuous variables in the model.

Use H and L if you have continuous predictors in the model.

Logistic “issues”

Conditional Logistic You want to predict BBD, so change response variable sort order to descending. They forgot the strata task role! Use “Group analysis by” then tweak the code.

Change “by” into strata.