Cumulative Geographic Residual Test Example: Taiwan Petrochemical Study Andrea Cook.

Slides:



Advertisements
Similar presentations
Sources and effects of bias in investigating links between adverse health outcomes and environmental hazards Frank Dunstan University of Wales College.
Advertisements

Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Two-sample tests. Binary or categorical outcomes (proportions) Outcome Variable Are the observations correlated?Alternative to the chi- square test if.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Multivariate Methods Pattern Recognition and Hypothesis Testing.
Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li November 4, 2004.
Chapter 10 Simple Regression.

ANOVA: ANalysis Of VAriance. In the general linear model x = μ + σ 2 (Age) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + σ 2 (ε) Each of the.
Generalised linear models
Clustered or Multilevel Data
Log-linear and logistic models
Nemours Biomedical Research Statistics April 23, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Chapter 11 Multiple Regression.
Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li December 2, 2004.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Analysis
Generalized Linear Models
Conditional Logistic Regression for Matched Data HRP /25/04 reading: Agresti chapter 9.2.
June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
7.1 - Motivation Motivation Correlation / Simple Linear Regression Correlation / Simple Linear Regression Extensions of Simple.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding.
HSRP 734: Advanced Statistical Methods June 19, 2008.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
The binomial applied: absolute and relative risks, chi-square.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Multiple Regression BPS chapter 28 © 2006 W.H. Freeman and Company.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Confidential and Proprietary. Copyright © 2011 Educational Testing Service. All rights reserved. 2/3/2016 Detection of Unusual Administrations Using a.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Proportional Hazards Model Checking the adequacy of the Cox model: The functional form of a covariate The link function The validity of the proportional.
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
Analysis of matched data Analysis of matched data.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
CHAPTER 7 Linear Correlation & Regression Methods
Chapter 13 Nonlinear and Multiple Regression
M.Sc. in Economics Econometrics Module I
Generalized Linear Models
CHAPTER 29: Multiple Regression*
Narrative Reviews Limitations: Subjectivity inherent:
Fixed, Random and Mixed effects
Inference for Regression
Presentation transcript:

Cumulative Geographic Residual Test Example: Taiwan Petrochemical Study Andrea Cook

Outline 1.Motivation Petrochemical exposure in relation to childhood brain and leukemia cancers Petrochemical exposure in relation to childhood brain and leukemia cancers 2.Cumulative Geographic Residuals Unconditional Unconditional Conditional Conditional 3.Application Childhood Leukemia Childhood Leukemia Childhood Brain Cancer Childhood Brain Cancer

Taiwan Petrochemical Study Matched Case-Control Study 3 controls per case 3 controls per case Matched on Age and Gender Matched on Age and Gender Resided in one of 26 of the overall 38 administrative districts of Kaohsiung County, Taiwan Resided in one of 26 of the overall 38 administrative districts of Kaohsiung County, Taiwan Controls selected using national identity numbers (not dependent on location). Controls selected using national identity numbers (not dependent on location).

Study Population Due to dropout approximately 50% 3 to 1 matching, 40% 2 to 1 matching, and 10% 1 to 1 matching. Leukemia Brain Cancer Cases Controls

Map of Kaohsiung

Cumulative Residuals Unconditional (Independence) Unconditional (Independence) Model definition using logistic regression Model definition using logistic regression Extension to Cluster Detection Extension to Cluster Detection Conditional (Matched Design) Conditional (Matched Design) Model definition using conditional logistic regression Model definition using conditional logistic regression Extension to Cluster Detection Extension to Cluster Detection

Logistic Model Assume the logistic model where, and the link function,

Residual Formulation Then define a residual as, Assuming the model is correctly specified would imply there is no pattern in residuals. => Use Residuals to test for misspecification. Cumulative Residuals for Model Checking; Lin, Wei, Ying 2002

Hypothesis Test Hypothesis of interest, Geographic Location, (r i, t i ) Independent of Outcome, Y i |X i  Cumulative Geographic Residual Moving Block Process is Patternless

Unconditional Cluster Detection Define the Cumulative Geographic Residual Moving Block Process as,

Asymptotic Distribution However, the distribution of, is hard to define analytically, but we have found another distribution that is asymptotically equivalent, which consists of a fixed component of data and random variables

Significance Test Testing the NULL Simulate N realizations of Simulate N realizations of by repeatedly simulating, while fixing the data at their observed values. Calculate P-value Calculate P-value

Conditional Logistic Model Type of Matching: 1 case to M s controls Data Structure: Assume that conditional on, an unobserved stratum-specific intercept, and given the logit link, implies, The conditional likelihood, conditioning on is,

Conditional Residual Then define a residual as, => Use these correlated Residuals to test for patterns based on location.

Conditional Cumulative Residual However, the distribution of, is hard to define analytically, but we have found another distribution that is asymptotically equivalent, which consists of a fixed component of data and random variables

Significance Test Testing the NULL Simulate N realizations of by repeatedly simulating, while fixing the data at their observed values. Calculate P-value

Application Study: Study: Kaohsiung, Taiwan Matched Case-Control Study Method: Method: Conditional Cumulative Geographic Residual Test (Normal and Mixed Discrete)

Results Odds Ratio (p-values) Marginally Significant Clustering for both outcomes without adjusting for smoking history.

Childhood Leukemia

Childhood Brain Cancer

Discussion Cumulative Geographic Residuals Unconditional and Conditional Methods for Binary Outcomes Unconditional and Conditional Methods for Binary Outcomes Can find multiple significant hotspots holding type I error at appropriate levels. Can find multiple significant hotspots holding type I error at appropriate levels. Not computer intensive compared to other cluster detection methods Not computer intensive compared to other cluster detection methods Taiwan Study Found a possible relationship between Childhood Leukemia and Petrochemical Exposure, but not with the outcome Childhood Brain Cancer. Found a possible relationship between Childhood Leukemia and Petrochemical Exposure, but not with the outcome Childhood Brain Cancer.