Using ArcGIS/SaTScan to detect higher than expected breast cancer incidence Jim Files, BS Appathurai Balamurugan, MD, MPH.

Slides:



Advertisements
Similar presentations
Summary of A Spatial Scan Statistic by M. Kulldorff Presented by Gauri S. Datta SAMSI September 29, 2005.
Advertisements

Summary of A Spatial Scan Statistic by M. Kulldorff Presented by Gauri S. Datta Mid-Year Meeting February 3, 2006.
Hotspot/cluster detection methods(1) Spatial Scan Statistics: Hypothesis testing – Input: data – Using continuous Poisson model Null hypothesis H0: points.
Global Clustering Tests. Tests for Spatial Randomness H 0 : The risk of disease is the same everywhere after adjustment for age, gender and/or other covariates.
Ellen F. Heineman, Ph.D. National Cancer Institute Epidemiology & Genetics Research Program The GIS for the Long Island Breast Cancer Study Project.
Statistical approaches for detecting clusters of disease. Feb. 26, 2013 Thomas Talbot New York State Department of Health Bureau of Environmental and Occupational.
Rubén Solano 1, 2, D. Gómez-Barroso 4,2, F. Simón 2, 4, S. Lafuente 3, J. Caylà 3 1 Field Epidemiology Training Program (PEAC-Spain). 2 National Centre.
*Wahida Kihal 1, Cindy Padilla 1,2, Benoit Lalloué 1,2,3, Marcello Gelormini1, Denis Zmirou-Navier 1,2,3, Séverine Deguen 1,2 1 EHESP School of Public.
Early Detection of Disease Outbreaks Prospective Surveillance.
The Geographical and Temporal Distribution of Vital Statistics in Champaign County, from 2005 to 2009 Lan Luo Supervisor: Awais Vaid (C-UPHD) Dr.Ruiz (University.
Empirical/Asymptotic P-values for Monte Carlo-Based Hypothesis Testing: an Application to Cluster Detection Using the Scan Statistic Allyson Abrams, Martin.
A Spatial Scan Statistic for Survival Data Lan Huang, Dep Statistics, Univ Connecticut Martin Kulldorff, Harvard Medical School David Gregorio, Dep Community.
Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li November 4, 2004.
 Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
A Tree-Based Scan Statistic for Database Disease Surveillance Martin Kulldorff University of Connecticut Joint work with: Zixing Fang, Stephen Walsh.
Scan Statistics via Permutation Tests David Madigan.
Cumulative Geographic Residual Test Example: Taiwan Petrochemical Study Andrea Cook.
Why Geography is important.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li December 2, 2004.
Sample Size Determination
Sample Size and Statistical Power Epidemiology 655 Winter 1999 Jennifer Beebe.
Spatial Clusters and Pattern Analysis
Spatial Statistics for Cancer Surveillance Martin Kulldorff Harvard Medical School and Harvard Pilgrim Health Care.
Mapping Rates and Proportions. Incidence rates Mortality rates Birth rates Prevalence Proportions Percentages.
Geographic Information Science
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Math 227 Elementary Statistics
The Spatial Scan Statistic. Null Hypothesis The risk of disease is the same in all parts of the map.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
SPONSOR JAMES C. BENNEYAN DEVELOPMENT OF A PRESCRIPTION DRUG SURVEILLANCE SYSTEM TEAM MEMBERS Jeffrey Mason Dan Mitus Jenna Eickhoff Benjamin Harris.
Health Consultation: Evaluation of Cancer Incidence in Census Tracts of Attleboro and Norton, Massachusetts: Suzanne K. Condon Associate Commissioner.
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Temporal, Geographic and Demographic Trends of Early Onset Breast Cancer (EOBC) in Hampden County, MA Holly S. Mason MD FACS Jane Garb MS.
Section Inference for Experiments Objectives: 1.To understand how randomization differs in surveys and experiments when comparing two populations.
Enhancing Disease Surveillance with Spatial-temporal Results Patricia Araki, MPH County of Los Angeles – Department of Public Health Acute Communicable.
Cluster Detection Comparison in Syndromic Surveillance MGIS Capstone Project Proposal Tuesday, July 8 th, 2008.
Literature searching & critical appraisal Chihaya Koriyama August 15, 2011 (Lecture 2)
Data Sources-Cancer Betsy A. Kohler, MPH, CTR Director, Cancer Epidemiology Services New Jersey Department of Health and Senior Services.
Showcase /06/2005 Towards Computational Epidemiology Using Stochastic Cellular Automata in Modeling Spread of Diseases Sangeeta Venkatachalam, Armin.
Section A Confidence Interval for the Difference of Two Proportions Objectives: 1.To find the mean and standard error of the sampling distribution.
Materials and Methods GIS Development A GIS was constructed from historical records of known villages reporting human anthrax between the years 1937 and.
Incorporating Multiple Evidence Sources for the Assessment of Breast Cancer Policies and Practices J. Jackson-Thompson, Gentry White, Missouri Cancer Registry,
Power and Sample Size Anquan Zhang presents For Measurement and Statistics Club.
Math 4030 – 9a Introduction to Hypothesis Testing
Coffee and Cardiovascular Disease
Analyzing the Geospatial Imbalance of the Primary Care Physician Labor Supply in the Contiguous United States By Russ Frith University of W. Florida Capstone.
The Role of Poverty in Prostate Cancer in African-Americans M. Norman Oliver, M.D., M.A., Assistant Professor, University of Virginia Departments of Family.
Statistical Significance: Tests for Spatial Randomness.
GEOGRAPHIC DISTRIBUTION OF BREAST CANCER IN MISSOURI, Faustine Williams, MS., MPH, Stephen Jeanetta, Ph.D. Department of Rural Sociology, Division.
GIS Software Applications in Epidemiology Marcus Liscombe Brent Croft GISC GIS MANAGEMENT AND IMPLEMENTATION.
Extension of the Cohort Analysis for Genetic Epidemiology (CAGE) Program to Assess Excess Risk of Cancer Mei Liu.
Cancer in Ontario: Overview A Statistical Report.
Premature deaths due to Prostate Cancer: The Role of Diagnosis and Treatment Appathurai Balamurugan MD, MPH S William Ross MD Chris Fisher, BS Jim Files,
BREAST CANCER BY STAGE OF DISEASE AT DIAGNOSIS, CENTRAL OKLAHOMA Arthur Owora, MPH; Aaron Wendelboe, PhD; David Thompson, PhD; Janis Campbell, PhD The.
Spatial Scan Statistic for Geographical and Network Hotspot Detection C. Taillie and G. P. Patil Center for Statistical Ecology and Environmental Statistics.
GEOGRAPHIC CLUSTERS OF HEAD & NECK CANCER IN FLORIDA Recinda Sherman, MPH, CTR Florida Cancer Data Systems NAACCR Detroit, June 7, 2007.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
General Elliptical Hotspot Detection Xun Tang, Yameng Zhang Group
Copyright © 2008 Delmar. All rights reserved. Chapter 4 Epidemiology and Public Health Nursing.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Early Detection of Disease Outbreaks with Applications in New York City Martin Kulldorff University of Connecticut Farzad Mostashari and James Miller.
A genetic algorithm for irregularly shaped spatial clusters Luiz Duczmal André L. F. Cançado Lupércio F. Bessegato 2005 Syndromic Surveillance Conference.
Dept of Biostatistics, Emory University
16 May 2011 Insert name of presentation on Master Slide
The Role of Poverty in Prostate Cancer in African-Americans
Inferences on Two Samples Summary
Software Cluster, Aldrich-Wayne Free, Old
South Carolina Statewide Cancer Prevention & Control Program
Presentation transcript:

Using ArcGIS/SaTScan to detect higher than expected breast cancer incidence Jim Files, BS Appathurai Balamurugan, MD, MPH

Overview Breast Cancer incidence Study objectives Methods Results Conclusions and recommendations

Study Objectives To identify geographic areas in AR with higher proportion of excess cases of breast cancer. To plan treatment and rehabilitative services for women with breast cancer in these areas.

Methods Using SaTScan/ArcGIS to identify geographic areas with higher proportion of excess cases. Models used: - Poisson Model - Space-Time Permutation

SaTScan SaTScan Software is available for free from NCI SaTScan uses the Spatial Scan Statistic developed by Martin Kulldorff for the National Cancer Institute Gives health agencies ability to quickly assess potential cancer clusters.

Spatial scan statistic Circles of different sizes (from zero up to 50 % of the population size) For each circle a likelihood ratio statistic is computed based on the number of observed and expected cases within and outside the circle and compared with the likelihood L0 under the null hypothesis.

SaTScan To evaluate reported spatial or space-time disease clusters, to see if they are statistically significant. To test whether a disease is randomly distributed over space, over time or over space and time. To perform geographical surveillance of disease, to detect areas of significantly high or low rates. To perform repeated time-periodic disease surveillance for the early detection of disease outbreaks.

Poisson Model With the Poisson model, the number of cases in each location is Poisson- distributed. Under the null hypothesis, and when there are no covariates, the expected number of cases in each area is proportional to its population size, or to the person- years in that area. Purely spatial analysis was conducted using poisson model

Space-Time Permutation Model For the Space-Time Permutation model, the number of observed cases in a cluster is compared to what would have been expected if the spatial and temporal locations of all cases were independent of each other so that there is no space-time interaction.

Data Analysis Incidence cases from ACCR 2000 Census block groups ArcGIS for data geocoding, preparation and display.

Data Prep Model

Results from Poisson Model Locations with most likely clusters identified and displayed using ArcGIS Expected cases – 2,395 Observed cases – 3,016 Observed / expected – Test Statistic – P-Value

Results from Poisson Model

Inference from Poisson Model Most likely areas with higher than expected cases of breast cancer are centered around - Hot Spring, Pulaski, and Dallas Counties in the Central region. - Greene, Craighead, and Mississippi Counties in the northeast region

Inference from Poisson Model Pros - Expected number of cases proportional to population size - Diseases of long latency Cons - Purely spatial (Less time specific) - Less sensitive for a dynamic population

Results from STP Model Locations with most likely clusters identified and displayed using ArcGIS Time frame: 2003/8/ /1/31 Expected cases – 30 Observed cases – 63 Observed / expected – Test Statistic – P-value - <0.05

Results from STP Model

Inference from STP Model Most likely areas with higher than expected cases of breast cancer are centered around -Cleburne, Van Buren, and White Counties in the north central part of the state

Inference from STP Model Pros - Information on cases alone sufficient - Accounts for time changes Cons - Population shift bias: Ignores population dynamics over time - Longer study period

Overlay of the Two Models

Inference Poisson Model is preferred to calculate higher than expected cases in our scenario due to following reasons: Since breast cancer is a disease of long latency Arkansas has a relatively stable population

Recommendations Future methods should focus on accounting for time and space in calculating higher than expected cases for diseases of long latency. Also, adjusting for covariates like age, race, SES, and urban/rural would be critical.

Any Questions? Jim Files GIS Coordinator Arkansas Central Cancer Registry Web: Tel: