Spatio – Temporal Cluster Detection Using AMOEBA

Slides:



Advertisements
Similar presentations
Lectures 6&7: Variance Reduction Techniques
Advertisements

Perceptron Learning Rule
Geoprocessing; Useful Tools You Should Know in ArcToolbox Unlock the hidden secrets of ArcToolbox to discover tools that make your work easier and analysis.
Spatial Autocorrelation using GIS
Statistical approaches for detecting clusters of disease. Feb. 26, 2013 Thomas Talbot New York State Department of Health Bureau of Environmental and Occupational.
Spatial statistics Lecture 3.
Early Detection of Disease Outbreaks Prospective Surveillance.
Crime Mapping & Analysis William Jarvis & Ibrahim Sabek CSCI 5715 Prof. Shashi Shekhar Wilson, Ronald and Filbert, Katie. “Crime Mapping and Analysis.”
The Geographical and Temporal Distribution of Vital Statistics in Champaign County, from 2005 to 2009 Lan Luo Supervisor: Awais Vaid (C-UPHD) Dr.Ruiz (University.
Empirical/Asymptotic P-values for Monte Carlo-Based Hypothesis Testing: an Application to Cluster Detection Using the Scan Statistic Allyson Abrams, Martin.
Introducing ArcGIS Desktop
A Spatial Scan Statistic for Survival Data Lan Huang, Dep Statistics, Univ Connecticut Martin Kulldorff, Harvard Medical School David Gregorio, Dep Community.
IS 466 ADVANCED TOPICS IN INFORMATION SYSTEMS LECTURER : NOUF ALMUJALLY 20 – 11 – 2011 College Of Computer Science and Information, Information Systems.
 Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Baylisascaris procyonis, Raccoon Roundworm. Classification Raccoon Roundworm KingdomAnimalia PhylumNemathelminthes ClassNematoda OrderAscaridida FamilyAscarididae.
Introduction Time-to-event data (survival data) analysis consists of the study of the time until a specific event, such as malaria, takes place. In many.
Spatial Analysis, Geoprocessing,
Innovative Uses of Geographic Information Systems Lance A. Waller Department of Biostatistics Rollins School of Public Health Emory University
Marine GIS Applications using ArcGIS Global Classroom training course Marine GIS Applications using ArcGIS Global Classroom training course By T.Hemasundar.
Tse-Chuan Yang, Ph.D The Geographic Information Analysis Core Population Research Institute Social Science Research Institute Pennsylvania State University.
Investigating the Geographic Distributions of Lung Cancer Incidence and Trace Elements Exposure in Appalachian Kentucky W. Jay Christian, MPH.
Spatial Clusters and Pattern Analysis
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
Introduction to the course January 9, Points to Cover  What is GIS?  GIS and Geographic Information Science  Components of GIS Spatial data.
Spatial Statistics for Cancer Surveillance Martin Kulldorff Harvard Medical School and Harvard Pilgrim Health Care.
Geographic Information Science
Habitat Analysis in ArcGIS Use of Spatial Analysis to characterize used resources Thomas Bonnot
Using ArcGIS/SaTScan to detect higher than expected breast cancer incidence Jim Files, BS Appathurai Balamurugan, MD, MPH.
The Spatial Scan Statistic. Null Hypothesis The risk of disease is the same in all parts of the map.
2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio.
Applied Cartography and Introduction to GIS GEOG 2017 EL
EUROHEIS 2 Dr Linda Beale October 2007 – September 2010.
Health Datasets in Spatial Analyses: The General Overview Lukáš MAREK Department of Geoinformatics, Faculty.
1 Enhancing Comprehensive HIV Prevention Planning A Syndemic Spatial Analysis of HIV and STI Burden Mike Janson, MPH, Virginia Hu, MPH, Kai-Jen Cheng,
Spatial Data Analysis Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What is spatial data and their special.
Thomas Talbot Chief, Environmental Health Surveillance Section NYS Department of Health April 18, 2013.
Cluster Detection Comparison in Syndromic Surveillance MGIS Capstone Project Proposal Tuesday, July 8 th, 2008.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Introduction of Geoprocessing Topic 7a 4/10/2007.
Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.
Particle Filters for Shape Correspondence Presenter: Jingting Zeng.
Extending Spatial Hot Spot Detection Techniques to Temporal Dimensions Sungsoon Hwang Department of Geography State University of New York at Buffalo DMGIS.
Detecting Influenza Outbreaks by Analyzing Twitter Messages By Aron Culotta Jedsada Chartree 02/28/11.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
CHAPTER 11 VECTOR DATA ANALYSIS 11.1 Buffering
Materials and Methods GIS Development A GIS was constructed from historical records of known villages reporting human anthrax between the years 1937 and.
NR 143 Study Overview: part 1 By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
What’s the Point? Working with 0-D Spatial Data in ArcGIS
Geographic Visualization to Support Epidemiology in Bulgaria Anthony C. Robinson GeoVISTA Center Department of Geography The Pennsylvania State University.
Introduction of Geoprocessing Lecture 9. Geoprocessing  Geoprocessing is any GIS operation used to manipulate data. A typical geoprocessing operation.
Local Indicators of Categorical Data Boots, B. (2003). Developing local measures of spatial association for categorical data. Journal of Geographical Systems,
Analyzing the Geospatial Imbalance of the Primary Care Physician Labor Supply in the Contiguous United States By Russ Frith University of W. Florida Capstone.
The Statistical Urban Zoning. The Experience of the Municipality of Firenze La zonizzazione statistica in ambito urbano. L’esperienza del Comune di Firenze.
A Spatial-Temporal Model for Identifying Dynamic Patterns of Epidemic Diffusion Tzai-Hung Wen Associate Professor Department of Geography,
Exploring Microsimulation Methodologies for the Estimation of Household Attributes Dimitris Ballas, Graham Clarke, and Ian Turton School of Geography University.
Introduction to Geographic Information Systems Fall 2013 (INF 385T-28620) Dr. David Arctur Research Fellow, Adjunct Faculty University of Texas at Austin.
GIS Software Applications in Epidemiology Marcus Liscombe Brent Croft GISC GIS MANAGEMENT AND IMPLEMENTATION.
Introduction of Geoprocessing Lecture 9 3/24/2008.
Towards efficient prospective detection of multiple spatio-temporal clusters Bráulio Veloso, Andréa Iabrudi and Thais Correa. Universidade Federal de Ouro.
Spatial Scan Statistic for Geographical and Network Hotspot Detection C. Taillie and G. P. Patil Center for Statistical Ecology and Environmental Statistics.
GEOGRAPHIC CLUSTERS OF HEAD & NECK CANCER IN FLORIDA Recinda Sherman, MPH, CTR Florida Cancer Data Systems NAACCR Detroit, June 7, 2007.
Spatial statistics Lecture 3 2/4/2008. What are spatial statistics Not like traditional, a-spatial or non-spatial statistics But specific methods that.
Early Detection of Disease Outbreaks with Applications in New York City Martin Kulldorff University of Connecticut Farzad Mostashari and James Miller.
A genetic algorithm for irregularly shaped spatial clusters Luiz Duczmal André L. F. Cançado Lupércio F. Bessegato 2005 Syndromic Surveillance Conference.
Aldo Aviña Environmental and Occupational Health
Spatial analysis Measurements - Points: centroid, clustering, density
Dept of Biostatistics, Emory University
Lecture 22: Using ArcToolbox Tools in Python
Epidemic Alerts EECS E6898: TOPICS – INFORMATION PROCESSING: From Data to Solutions Alexander Loh May 5, 2016.
Lecture 6 Implementing Spatial Analysis
Presentation transcript:

Spatio – Temporal Cluster Detection Using AMOEBA Jimmy Kroon Pennsylvania State University   Advisor: Dr. Frank Hardisty

This is a parody – Original Art: http://projectswordtoys. blogspot

Outline Introduction – Clustering and Project Direction The Spatial Scan Statistic and SatScan AMOEBA Proposed Spatio-Temporal AMOEBA Method Software, Data, and Progress

Cluster Detection Cluster: “a geographically and/or temporally bounded group of occurrences of sufficient size and concentration to be unlikely to have occurred by chance” (Knox, 1989) Two Typical Uses Disease Surveillance Week of 2/7/2010 Data: Google Flu Trends – Analysis: GeoDa Epidemiological Studies Brain Cancer in NM Kulldorff et al. 1998 Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Time in Spatial Analysis Time Matters: Many geographic phenomena are dynamic. Spatial patterns we see probably change over time The American Association of Geographers describes temporal geography as a ‘frontier’ of GIScience. Spatio-temporal clusters may exhibit behaviors not seen in purely spatial clusters. Growth Movement Splits / Joins Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress Research Problem Primary: No method exists for the determining the true extent of irregularly shaped clusters in spatio-temporal datasets. Secondary: Spatial AMOEBA has not been implemented in R Project Goals A demonstration of spatio-temporal cluster detection based on the AMOEBA procedure. R scripts for running spatial and spatio-temporal AMOEBA will be contributed to the R community. Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

The Spatial Scan Statistic Scan data with a moving ‘window’, calculating local autocorrelation for spatial units that fall within the window. Select the window(s) with the highest calculated autocorrelation value as possible cluster(s). The spatial scan statistic is by far the most popular cluster detection technique, largely due to the availability of SaTScan software by Martin Kulldorff. Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

The Spatial Scan Statistic Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Drawbacks of the Spatial Scan Statistic Clusters that are not similar in shape to the scanning window can produce errors. False inclusions False exclusions Identify thin clusters as multiple small clusters Cannot detect holes in clusters Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

The Elliptical Spatial Scan Statistic Must choose shapes a priori to avoid pre-selection bias See Kulldorff et al. 2006 Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress Ecotope-Based – Regions of contiguous spatial units that are related in terms of z-value Multidirectional – Search in all directions. Optimum – Procedure takes place at the finest spatial scale possible and is capable of revealing all spatial association present in the dataset (Aldstadt and Getis, 2006). AMOEBA Clusters Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress Defining an Ecotope Add a seed location (one polygon) to the ecotope Calculate Gi* (Getis-Ord local autocorrelation statistic) Search in all directions for contiguous polygons Those that increase Gi* are added to the growing ecotope for that seed location Keep searching for more neighbors, growing the ecotope until Gi* no longer increases Repeat – creating ecotopes for each polygon in the dataset Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress The R Neighbor Object Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Finding an Ecotope with AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress From Ecotopes to Clusters Rank ecotopes by final Gi* Select that with the highest Gi* as a cluster Eliminate intersecting ecotopes Select the ecotope with the next highest Gi* as a second cluster Repeat Probability of clusters can be tested using Monte Carlo simulation Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Incorporating Time into AMOEBA Remember - Spatio-temporal clusters may exhibit behaviors not seen in purely spatial clusters. Growth Movement Splits / Joins Visualize temporal data as layers of data with time extending vertically through the layers. Each spatio-temporal unit has spatial neighbors and temporal neighbors Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

The Spatio-Temporal Scan Statistic See Kulldorff et al. 1998 Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Software Environment and Test Data The R Project Free, open source statistical software Extendable with user contributed packages www.r-project.org Google Flu Trends Estimates flu incidence levels using aggregated data about user searches for certain keywords 90% accurate compared to CDC data State-level data - updated daily www.google.org/googleflu SEER (Surveillance Epidemiology and End Results) National Cancer Institute incidence, survival, and mortality data Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

AMOEBA ArcToolbox for ArcGIS Python Scripts by Jared Aldstadt and Yeming Fan (Aldstadt, 2010) Google Flu Trends – Feb 1, 2009 Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Spatio-Temporal AMOEBA in Python: 2009 Flu Epidemic Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress Hmmm… Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

R Programming Progress Compete … Geoprocessing tasks Create spatio-temporal neighbor list Delineate ecotopes Sort and eliminate intersecting ecotopes Returns primary cluster PolyID’s that match the Python results To Do … Monte Carlo simulation Process results and add to the output shapefile Test, test, test Clusters : SaTScan : AMOEBA : ST AMOEBA : Progress

References Aldstadt, Jared, and Arthur Getis. 2006. Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters. Geographical Analysis 38: 327-343. Aldstadt, Jared. 2010. Spatial Analysis Tools (ArcGIS). Spatial Analysis Tools. http://www.acsu.buffalo.edu/~geojared/tools.htm. Bellec, S, D Hémon, J Rudant, A Goubin, and J Clavel. 2006. Spatial and space–time clustering of childhood acute leukaemia in France from 1990 to 2000: a nationwide study. British Journal of Cancer Duczmal, Luiz, Martin Kulldorff, and Lan Huang. 2006. Evaluation of Spatial Scan Statistics for Irregularly Shaped Clusters. Journal of Computational and Graphical Statistics 15(2): 428-442. Knox, G. 1989. Detection of Clusters. In Methodology of Enquiries into Disease Clustering, ed. P Elliott, 17-22. London: Small Area Health Statistics Unit. Kulldorff, Martin, Athas, William, Feuer, Eric, Miller, Barry, and Key, Charles. 1998. Evaluating cluster alarms: A space-time scan statistic and brain cancer in Los Alamos, New Mexico. American Journal of Public Health 88(9): 1377-1380. Kulldorff, Martin, Lan Huang, Linda Pickle, and Luiz Duczmal. 2006. An elliptic spatial scan statistic. Statistics in Medicine 25(22): 3929. Kulldorff, Martin. 1999. Geographic Information Systems (GIS) community health: Some statistical issues. Journal of Public Health Management and Practice 5(2): 100-106. Original artwork for parody title slide: http://projectswordtoys.blogspot.com/2009/05/project-sword-annual-1967.html