Adjusting for extraneous factors Topics for today Stratified analysis of 2x2 tables Regression Readings Jewell Chapter 9.

Slides:



Advertisements
Similar presentations
The %LRpowerCorr10 SAS Macro Power Estimation for Logistic Regression Models with Several Predictors of Interest in the Presence of Covariates D. Keith.
Advertisements

M2 Medical Epidemiology
2013/12/10.  The Kendall’s tau correlation is another non- parametric correlation coefficient  Let x 1, …, x n be a sample for random variable x and.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Logistic Regression.
Simple Logistic Regression
Logistic Regression Part I - Introduction. Logistic Regression Regression where the response variable is dichotomous (not continuous) Examples –effect.
Analysis of frequency counts with Chi square
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Header= Verdana 28 pt., Red 1 STA 517 – Chapter 3: Inference for Contingency Tables 3. Inference for Contingency Tables 3.1 Confidence Intervals for Association.
1 More on exposure/response associations Readings Jewell Chapters 3 & 7.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
EPI 809/Spring Multiple Logistic Regression.
Sociology 601 Class12: October 8, 2009 The Chi-Squared Test (8.2) – expected frequencies – calculating Chi-square – finding p When (not) to use Chi-squared.
Regression Topics for today Readings Jewell Chapters 12, 13, 14 & 15.
1 Modeling Ordinal Associations Section 9.4 Roanna Gee.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
1 SOC 3811 Basic Social Statistics. 2 Reminder  Hand in your assignment 5  Remember to pick up your previous homework  Final exam: May 12 th (Saturday),
Deaths of snails vs exposure by species. Deaths of snails vs exposure by temperature.
Categorical Data Analysis: Stratified Analyses, Matching, and Agreement Statistics Biostatistics March 2007 Carla Talarico.
Incomplete Block Designs
WLS for Categorical Data
Adjusting for extraneous factors Topics for today More on logistic regression analysis for binary data and how it relates to the Wolf and Mantel- Haenszel.
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Confounding, Effect Modification, and Stratification.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
Statistics for clinical research An introductory course.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
EIPB 698E Lecture 10 Raul Cruz-Cano Fall Comments for future evaluations Include only output used for conclusions Mention p-values explicitly (also.
POTH 612A Quantitative Analysis Dr. Nancy Mayo. © Nancy E. Mayo A Framework for Asking Questions Population Exposure (Level 1) Comparison Level 2 OutcomeTimePECOT.
1 Ratio estimation under SRS Assume Absence of nonsampling error SRS of size n from a pop of size N Ratio estimation is alternative to under SRS, uses.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Xuhua Xia Polynomial Regression A biologist is interested in the relationship between feeding time and body weight in the males of a mammalian species.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
The binomial applied: absolute and relative risks, chi-square.
Preparing for the final - sample questions with answers.
1 Topic 2 LOGIT analysis of contingency tables. 2 Contingency table a cross classification Table containing two or more variables of classification, and.
CPE 619 One Factor Experiments Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in.
A short introduction to epidemiology Chapter 9: Data analysis Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Topic 4 : Ordered Logit Analysis. 2 Often we deal with data where the responses are ordered – e.g. : (i) Eyesight tests – bad; average; good (ii) Voting.
Sigmoidal Response (knnl558.sas). Programming Example: knnl565.sas Y = completion of a programming task (1 = yes, 0 = no) X 2 = amount of programming.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Analysis of matched data Analysis of matched data.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
CHAPTER 7 Linear Correlation & Regression Methods
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Saturday, August 06, 2016 Farrokh Alemi, PhD.
Lecture 10 Comparing 2xk Tables
Logistic Regression.
Statistical Process Control
Statistical Inference for the Mean: t-test
Presentation transcript:

Adjusting for extraneous factors Topics for today Stratified analysis of 2x2 tables Regression Readings Jewell Chapter 9

Berkeley Admissions Data 1973 study showed that 45% of 2691 male applicants were admitted, compared with only 30% of 1835 female applicants. The odds ratio is 1.84 with 95% confidence interval (1.62, 2.08). Is this evidence of sex bias? AdmitReject Male Female Log odds ratio = 95% conf interval:

Berkeley Admissions Data The picture changes completely once we look at admissions by department! Bickel, P.J., J.W. Hammel and J.W. O'Connell (1975) "Sex bias in graduate admissions: Data from Berkeley" in Science, 187: ) # applicants (% admit) DeptMaleFemale %10882% %2568% %59334% %37535% %39324% 63736%3417%

Stratified analysis Consider relationship between a disease outcome (D in Jewell, often Y in practice) and an exposure (E in Jewell, often X in practice), but we also want to adjust for an additional factor such as age or sex that can be divided up into I distinct strata. Suppose that the data from the ith stratum can be represented as follows: Jewell Tables 9.2 & 9.3 give two examples DiseasedNot Diseased Exposedaiai bibi Unexposedcici didi

What do we want to do? 1.Ask whether there is a significant association between disease (D) and exposure (E), after adjusting for the additional stratification factor 2.Estimate an adjusted odds ratio, that appropriately takes into account the stratification factor. Lets start with 1. but first, we need to quickly go over another way to assess whether there is a significant association for a 2x2 table

Assessing association - Berkeley Admissions again We already determined that there is a significant association in this 2x2 table, based on the 95% confidence interval for the odds ratio. An alternative approach is a chi-squared test There are several variations. But basic idea is to compare observed data to what would be expected if there were no association (see J p 69) Observed data AdmitReject Male Female Expected data AdmitReject Male Female

Chi-Squared test for a 2x2 table The test statistic is And its “significance” can be determined by looking up the chi-squared tables with 1 degree of freedom. For the Berkeley data, we get:

Back to the stratified analysis Cochran-Mantel-Haenszel test combines the differences between observed and expected values over all the strata. It focuses only on the “a” element of each 2x2 table Stratum iDNot D Eaiai bibi Not Ecici didi

Berkeley Admissions MaleFemale stratum a b c d

Estimating a common effect Wolf method (averages the log odds ratios) Mantel-Haenszel (averages the odds ratios) Regression-based

Wolf’s average log-odds ratio Can add.5 to cell entries if sample sizes are small

Applying Wolf method to Berkeley data stratumabcdlorvw=1/vw*lor Wolf estimate of LOR is.03, with variance What is 95% CI? Corresponding OR estimate is

Wolf’s average log-odds ratio Can add.5 to cell entries if sample sizes are small

Applying Wolf method to Berkeley data stratumabcdlorvw=1/vw*lor Wolf estimate of LOR is.03, with variance What is 95% CI? Corresponding OR estimate is

Mantel-Haenszel average odds ratio

Applying Wolf method to Berkeley data stratumabcdlorvw=1/vw*lor Wolf estimate of LOR is.03, with variance What is 95% CI? Corresponding OR estimate is

Regression-based analysis for Berkeley data data berkeley; input stratum male a b ; cards; run; data berkeley; set berkeley; n=a+b; Unstratified analysis; proc genmod; model a/n=male/dist=binomial; run; Code continued

Results of unstratified analysis Standard 95% Confidence Chi- Parameter DF Estimate Error Limits Square P Intercept <.0001 male <.0001 Scale Compare with our initial analysis

Stratified analysis proc genmod; class stratum; model a/n=male stratum/dist=binomial; run; Standard 95% Conf Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 male stratum <.0001 stratum <.0001 stratum <.0001 stratum <.0001 stratum <.0001 stratum Scale

More general modeling We can add additional factors into the logistic regression model so as to obtain an estimate of the log-odds ratio, adjusting for all these additional factors. Example, smoking in the Epilepsy study. Lets look in SAS: proc freq ; table one3*cig2 /chisq; run;

Epilepsy data in SAS

Standard Wald 95% Confidence Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 DRUG DRUG DRUG Scale Standard Wald 95% Confidence Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 DRUG DRUG DRUG CIG Scale

Why don’t drug estimates change much?? Hint – look at association between drug and smoking

proc freq ; table one3*cig2 /chisq; run;