Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li December 2, 2004.

Slides:



Advertisements
Similar presentations
STATISTICS Joint and Conditional Distributions
Advertisements

Sources and effects of bias in investigating links between adverse health outcomes and environmental hazards Frank Dunstan University of Wales College.
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
M2 Medical Epidemiology
Longitudinal and Multilevel Methods for Models with Discrete Outcomes with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
A Spatial Scan Statistic for Survival Data Lan Huang, Dep Statistics, Univ Connecticut Martin Kulldorff, Harvard Medical School David Gregorio, Dep Community.
Sociology 601 Class 19: November 3, 2008 Review of correlation and standardized coefficients Statistical inference for the slope (9.5) Violations of Model.
EVAL 6970: Meta-Analysis Fixed-Effect and Random- Effects Models Dr. Chris L. S. Coryn Spring 2011.
Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li November 4, 2004.
Chapter 10 Simple Regression.
ANOVA: ANalysis Of VAriance. In the general linear model x = μ + σ 2 (Age) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + σ 2 (ε) Each of the.
Generalised linear models
BIOST 536 Lecture 14 1 Lecture 14 – Stratified Models Outline  Description of stratified models  Exact and approximate conditional likelihood  Comparison.
Clustered or Multilevel Data
Nemours Biomedical Research Statistics April 23, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Generalised linear models Generalised linear model Exponential family Example: Log-linear model - Poisson distribution Example: logistic model- Binomial.
Cumulative Geographic Residual Test Example: Taiwan Petrochemical Study Andrea Cook.
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Generalized Linear Models
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Study Design and Analysis in Epidemiology: Where does modeling fit? Meaningful Modeling of Epidemiologic Data, 2010 AIMS, Muizenberg, South Africa Steve.
Conditional Logistic Regression for Matched Data HRP /25/04 reading: Agresti chapter 9.2.
9. Binary Dependent Variables 9.1 Homogeneous models –Logit, probit models –Inference –Tax preparers 9.2 Random effects models 9.3 Fixed effects models.
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
Using ArcGIS/SaTScan to detect higher than expected breast cancer incidence Jim Files, BS Appathurai Balamurugan, MD, MPH.
Analysis of Categorical Data
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
7.1 - Motivation Motivation Correlation / Simple Linear Regression Correlation / Simple Linear Regression Extensions of Simple.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding.
HSRP 734: Advanced Statistical Methods June 19, 2008.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Repeated Measures  The term repeated measures refers to data sets with multiple measurements of a response variable on the same experimental unit or subject.
1 Modeling Coherent Mortality Forecasts using the Framework of Lee-Carter Model Presenter: Jack C. Yue /National Chengchi University, Taiwan Co-author:
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
The binomial applied: absolute and relative risks, chi-square.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 4.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
ELEC 303 – Random Signals Lecture 17 – Hypothesis testing 2 Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 2, 2009.
1 Part09: Applications of Multi- level Models to Spatial Epidemiology Francesca Dominici & Scott L Zeger.
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
Construction Engineering 221 Probability and Statistics.
Analysis of matched data Analysis of matched data.
BPS - 5th Ed. Chapter 231 Inference for Regression.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Mining Statistically Significant Co-location and Segregation Patterns.
CHAPTER 7 Linear Correlation & Regression Methods
Chapter 13 Nonlinear and Multiple Regression
M.Sc. in Economics Econometrics Module I
THE LOGIT AND PROBIT MODELS
The binomial applied: absolute and relative risks, chi-square
Linear Mixed Models in JMP Pro
Maximum Likelihood Estimation
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
THE LOGIT AND PROBIT MODELS
CHAPTER 6 Statistical Inference & Hypothesis Testing
Presentation transcript:

Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li December 2, 2004

Outline 1.Motivation Petrochemical exposure in relation to childhood brain and leukemia cancers 2.Cumulative Geographic Residuals Unconditional Conditional 3.Application Childhood Leukemia Childhood Brain Cancer 4.Discussion Future Research

Taiwan Petrochemical Study Matched Case-Control Study 3 controls per case Matched on Age and Gender Resided in one of 26 of the overall 38 administrative districts of Kaohsiung County, Taiwan Controls selected using national identity numbers (not dependent on location).

Study Population Due to dropout approximately 50% 3 to 1 matching, 40% 2 to 1 matching, and 10% 1 to 1 matching. LeukemiaBrain Cancer Cases Controls

Map of Kaohsiung

Conditional Logistic Model Type of Matching: 1 case to M s controls Data Structure: Assume that conditional on, an unobserved stratum- specific intercept, and given the logit link, implies, The conditional likelihood, conditioning on is,

Conditional Residual Then define a residual as, where is the solution to. => Use these correlated Residuals to test for patterns based on location.

Conditional Cluster Detection Define the Cumulative Geographic Residual Moving Block Process as,

Conditional Cumulative Residual However, the distribution of, is hard to define analytically, but we have found another distribution that is asymptotically equivalent, which consists of a fixed component of data and random variables

Significance Test Testing the NULL Simulate N realizations of by repeatedly simulating, while fixing the data at their observed values. Calculate P-value

Application Study: Kaohsiung, Taiwan Matched Case-Control Study Method: Conditional Cumulative Geographic Residual Test (Normal and Mixed Discrete)

Results Odds Ratio (p-values) Marginally Significant Clustering for both outcomes without adjusting for smoking history.

Childhood Leukemia

Childhood Brain Cancer

Discussion Cumulative Geographic Residuals Unconditional and Conditional Methods for Binary Outcomes Can find multiple significant hotspots holding type I error at appropriate levels. Not computer intensive compared to other cluster detection methods Taiwan Study Found a possible relationship between Childhood Leukemia and Petrochemical Exposure, but not with the outcome Childhood Brain Cancer.

Discussion Future Research Failure Time Data Recurrent Events Relocation of Study Participants Surveillance