Case Replacement for Logistic Regression


Case Replacement for Logistic Regression
Replace cases with null hypothesis cases.
What is a null hypothesis case? One in which the probability of success is independent of the predictor.
In practice, you switch some treatment success cases to treatment failure cases.

Neighborhood Effects: Even Logistic Leverages Product of Associations
[Figure labels: odds ratio relating treatment to outcome; odds ratio relating omitted variable to outcome; odds ratio relating omitted variable to treatment.]
Increments of Γ matter more as Δ increases. The correlation between the product ΓΔ and the odds ratio is -.96.
Source: Harding, D. J. (2003). Counterfactual models of neighborhood effects: The effect of neighborhood poverty on dropping out and teenage pregnancy. American Journal of Sociology, 109(3), 676-719.

Replacement of Cases for Logistic: Toy Example

            Failure  Success  Total
Control        16        9      25
Treatment       4       21      25
Total          20       30      50

Odds ratio = (16*21)/(4*9) = 9.33
ln(odds ratio) = 2.234
se = sqrt(1/16 + 1/9 + 1/4 + 1/21) = .686
Threshold = t critical * se = 1.96 * .686 = 1.345
% bias to invalidate = 1 - 1.345/2.2335 = 40%
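The arithmetic on this slide can be reproduced in a few lines of R. This is a minimal sketch that assumes the toy table above and the slide's critical value of 1.96; the variable names (`tab`, `log_or`, `pct_bias`) are illustrative and do not come from the original deck.

```r
# Toy 2x2 table from the slide: rows = condition, columns = outcome
tab <- matrix(c(16,  9,
                 4, 21),
              nrow = 2, byrow = TRUE,
              dimnames = list(c("Control", "Treatment"),
                              c("Failure", "Success")))

# Odds ratio = (control failures * treatment successes) /
#              (treatment failures * control successes)
odds_ratio <- (tab["Control", "Failure"] * tab["Treatment", "Success"]) /
              (tab["Treatment", "Failure"] * tab["Control", "Success"])

log_or    <- log(odds_ratio)       # ln(odds ratio)
se        <- sqrt(sum(1 / tab))    # sqrt(1/16 + 1/9 + 1/4 + 1/21)
threshold <- 1.96 * se             # smallest ln(odds ratio) still significant
pct_bias  <- 1 - threshold / log_or

print(round(c(odds_ratio = odds_ratio, log_or = log_or, se = se,
              threshold = threshold, pct_bias = pct_bias), 3))
```

The standard error is the square root of the sum of reciprocal cell counts, which is why the slide's .686 matches sqrt(0.4712) rather than 0.4712 itself.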

Replacement of Cases for Logistic = Switching Cases: Toy Example

Original table:

            Failure  Success  Total
Control        16        9      25
Treatment       4       21      25
Total          20       30      50

After switching 3 cases from treatment success to treatment failure:

            Failure  Success  Total
Control        16        9      25
Treatment       7       18      25
Total          23       27      50

New odds ratio = (16*18)/(9*7) = 4.57

How many treatment successes must you replace with null hypothesis cases (p = 30/50 = .6) to invalidate the inference? Equivalently, how many must you switch from treatment success to treatment failure?
% bias to invalidate = 1 - 1.345/2.2335 = 40%
Replace 40% of the treatment successes (n = 21): about 8 cases.
The replacements are null hypothesis cases (p of success = .6), so switch 8*(1-.6) = about 3.2 cases from treatment success to treatment failure.
# of cases to switch = % bias to invalidate * (treatment success cases) * (1 - overall probability of success).
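The switching rule can be sketched in R as follows. The sketch follows the slide's own steps, including rounding the number of replaced cases to 8 before applying (1 - .6); the variable names are illustrative assumptions, not part of the original deck.

```r
# Quantities taken from the slide
pct_bias      <- 1 - (1.96 * 0.686) / 2.2335   # about 0.40
n_trt_success <- 21                            # treatment success cases
p_success     <- 30 / 50                       # overall probability of success

# Cases to replace with null hypothesis cases (slide rounds 8.4 to 8)
n_replace <- pct_bias * n_trt_success

# A replaced case only changes cell when the null case is a failure
n_switch <- round(n_replace) * (1 - p_success)   # 8 * 0.4 = 3.2
k        <- round(n_switch)                      # whole cases to move

# Move k cases from treatment success to treatment failure
new_tab <- matrix(c(16,     9,
                    4 + k, 21 - k),
                  nrow = 2, byrow = TRUE,
                  dimnames = list(c("Control", "Treatment"),
                                  c("Failure", "Success")))
new_or <- (new_tab[1, 1] * new_tab[2, 2]) / (new_tab[1, 2] * new_tab[2, 1])

print(new_tab)
print(round(new_or, 2))
```

Because the count is an approximation based on the original standard error, it is worth re-running the significance test on `new_tab` rather than assuming the threshold has been crossed.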

Replacement of Cases for Logistic: Toy Example

            Failure  Success  Total
Control        16        9      25
Treatment       4       21      25
Total          20       30      50

[Figure: the same table redrawn with one icon per case in each cell.]

Replacement of Cases for Logistic: Toy Example
[Figure: the table redrawn with one icon per case in each cell.]
Replace 8 cases = switch 3 cases.
Model: ln(p/(1-p)) = β0 + β1 treatment
H0: β1 = 0, so ln(p/(1-p)) = β0 and p = .6
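To connect the 2 by 2 arithmetic to the logistic model, the toy data can be expanded to one row per case and fit with `glm`. This is a sketch under the assumption that the 50 cases are exactly those in the toy table; the data frame `toy` and its column names are made up for illustration.

```r
# One row per case: 25 control cases (16 failures, 9 successes),
# then 25 treatment cases (4 failures, 21 successes)
toy <- data.frame(
  treatment = rep(c(0, 1), each = 25),
  success   = c(rep(c(0, 1), times = c(16, 9)),
                rep(c(0, 1), times = c(4, 21)))
)

# Full model: ln(p/(1-p)) = b0 + b1 * treatment
fit <- glm(success ~ treatment, family = binomial, data = toy)

# Null model (H0: b1 = 0): ln(p/(1-p)) = b0
null <- glm(success ~ 1, family = binomial, data = toy)

b1     <- unname(coef(fit)["treatment"])                        # ln(odds ratio)
se_b1  <- summary(fit)$coefficients["treatment", "Std. Error"]  # its standard error
p_null <- unname(plogis(coef(null)[1]))                         # success probability under H0

print(round(c(b1 = b1, se_b1 = se_b1, p_null = p_null), 3))
```

The treatment coefficient should agree with the ln(odds ratio) and standard error computed by hand from the table, and the null model's fitted probability is the overall success rate of 30/50.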

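R code: brute force

# Start from a clean workspace. The original slide also called setwd() on a
# machine-specific Dropbox path; nothing below reads or writes files.
rm(list = ls())

A <- 29
B <- 26
C <- 15
D <- 40
x <- matrix(c(A, B, C, D), byrow = TRUE, 2, 2)  # this is the 2 by 2 table we start with

# Switch cases from D to C, one at a time, until the test is no longer significant
p.CD <- p.value <- chisq.test(x, correct = FALSE)$p.value
N.CD <- 0
while (p.value < 0.05) {
  C <- C + 1
  D <- D - 1
  N.CD <- N.CD + 1
  x <- matrix(c(A, B, C, D), byrow = TRUE, 2, 2)  # rebuild the table after each switch
  print(x)
  print(chisq.test(x, correct = FALSE))
  p.value <- chisq.test(x, correct = FALSE)$p.value
  p.CD <- c(p.CD, p.value)
}

# Restore the original second row, then switch cases from A to B in the same way
C <- 15
D <- 40
x <- matrix(c(A, B, C, D), byrow = TRUE, 2, 2)
p.AB <- p.value <- chisq.test(x, correct = FALSE)$p.value
N.AB <- 0
while (p.value < 0.05) {
  A <- A - 1
  B <- B + 1
  N.AB <- N.AB + 1
  x <- matrix(c(A, B, C, D), byrow = TRUE, 2, 2)
  p.value <- chisq.test(x, correct = FALSE)$p.value
  p.AB <- c(p.AB, p.value)
}

# so p.AB and p.CD record the p values each time we switch one case
# N.AB = 5 (5 cases needed to be changed from A to B)
# N.CD = 4 (4 cases needed to be changed from D to C)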

KonFound-it for Logistic Regression https://jmichaelrosenberg.shinyapps.io/shinykonfound/

Reflection
What part is most confusing to you? Why? Is there more than one interpretation?
Talk with one other person and share.
Find a new partner and discuss problems and solutions.