The Space-Time Scan Statistic for Multiple Data Streams

Slides:



Advertisements
Similar presentations
Using ESSENCE IV An Overview Objectives Explain ESSENCE and its impact Define surveillance Define syndromic surveillance and its importance Demonstrate.
Advertisements

Summary of A Spatial Scan Statistic by M. Kulldorff Presented by Gauri S. Datta SAMSI September 29, 2005.
Hotspot/cluster detection methods(1) Spatial Scan Statistics: Hypothesis testing – Input: data – Using continuous Poisson model Null hypothesis H0: points.
Perspectives on Outreach from the NYC Department of Health and Mental Hygiene Benjamin Tsoi, MD, MPH Bureau of HIV/AIDS Prevention and Control NYC Department.
Presenting Medicare 101 and Kaiser Permanente Senior Advantage (HMO) Welcome to Kaiser Permanente Gwinnett County Government Anni Kuechenmeister Medicare.
1 Monitoring Pediatric Emergency Room Use with the National Health Interview Survey Renee M. Gindi Division of Health Interview Statistics, NCHS National.
Statistical approaches for detecting clusters of disease. Feb. 26, 2013 Thomas Talbot New York State Department of Health Bureau of Environmental and Occupational.
 2005 Carnegie Mellon University A Bayesian Scan Statistic for Spatial Cluster Detection Daniel B. Neill 1 Andrew W. Moore 1 Gregory F. Cooper 2 1 Carnegie.
ARIZONA CHILDHOOD LEAD POISONING OFFICE OF ENVIROMENTAL HEALTH Christine Cervantez Young PREVENTION PROGRAM.
From Data to Signals to Screenshots: Recent Developments in NYCDOHMH ED Syndromic Surveillance. Marc Paladini New York City Department of Health and Mental.
Early Detection of Disease Outbreaks Prospective Surveillance.
Bayesian Biosurveillance Gregory F. Cooper Center for Biomedical Informatics University of Pittsburgh The research described in this.
Empirical/Asymptotic P-values for Monte Carlo-Based Hypothesis Testing: an Application to Cluster Detection Using the Scan Statistic Allyson Abrams, Martin.
Exact Logistic Regression Larry Cook. Outline Review the logistic regression model Explore an example where model assumptions fail –Brief algebraic interlude.
A Spatial Scan Statistic for Survival Data Lan Huang, Dep Statistics, Univ Connecticut Martin Kulldorff, Harvard Medical School David Gregorio, Dep Community.
FAMILY MEDICINE PRACTICE EXPERIENCES FROM TURKEY Dokuz Eylül University Medicine Faculty Family Medicine Department December 2014, Zagreb.
Syndromic Surveillance in New York State Loretta A. Santilli, MPH James R. Miller, MPH, MD Hwa-Gan Chang, PhD New York State Department of Health Division.
Rapid Detection of Significant Spatial Clusters Daniel B. Neill Andrew W. Moore The Auton Lab Carnegie Mellon University School of Computer Science
 2004 University of Pittsburgh Bayesian Biosurveillance Using Multiple Data Streams Weng-Keen Wong, Greg Cooper, Denver Dash *, John Levander, John Dowling,
What’s Strange About Recent Events (WSARE) v3.0: Adjusting for a Changing Baseline Weng-Keen Wong (Carnegie Mellon University) Andrew Moore (Carnegie Mellon.
HANTAVIRUS An Increasing Public Health Concern in Urban Areas. Walters Y Mbimba PhD Student in Public Health. Walden University PUBH 8165 Environmental.
 2004 University of Pittsburgh Bayesian Biosurveillance Using Multiple Data Streams Greg Cooper, Weng-Keen Wong, Denver Dash*, John Levander, John Dowling,
Conclusions On our large scale anthrax attack simulations, being able to infer the work zip appears to improve detection time over just using the home.
Spatiotemporal Cluster Detection in ESSENCE Biosurveillance Systems Panelist: Howard Burkom National Security Technology Department, John Hopkins University.
Space-Time Scan Statistics for Early Warning Systems Martin Kulldorff Department of Ambulatory Care and Prevention Harvard University Medical School and.
Population-Wide Anomaly Detection Weng-Keen Wong 1, Gregory Cooper 2, Denver Dash 3, John Levander 2, John Dowling 2, Bill Hogan 2, Michael Wagner 2 1.
Bayesian Network Anomaly Pattern Detection for Disease Outbreaks Weng-Keen Wong (Carnegie Mellon University) Andrew Moore (Carnegie Mellon University)
1 Bayesian Network Anomaly Pattern Detection for Disease Outbreaks Weng-Keen Wong (Carnegie Mellon University) Andrew Moore (Carnegie Mellon University)
Methods for Real-Time Detection and Assessment of Disease Outbreaks Using Information Technology Michael Wagner, M.D., Ph.D. Director, Real-Time Outbreak.
Overview of ‘Syndromic Surveillance’ presented as background to Multiple Data Source Issue for DIMACS Working Group on Adverse Event/Disease Reporting,
Spatial Statistics for Cancer Surveillance Martin Kulldorff Harvard Medical School and Harvard Pilgrim Health Care.
Geographic Information Science
Aaron Fleischauer, PhD, MPH
Clinical Pathological Conference Kartikya Ahuja, M.D. Resident Physician Department of Medicine NYU School of Medicine July 20 th, 2007.
Das Gesundheitssystem in Deutschland
Using ArcGIS/SaTScan to detect higher than expected breast cancer incidence Jim Files, BS Appathurai Balamurugan, MD, MPH.
2013 BROADBAND SUMMIT: BROADBAND ADOPTION AND USAGE – WHAT HAVE WE LEARNED? February 7, 2013.
The Spatial Scan Statistic. Null Hypothesis The risk of disease is the same in all parts of the map.
SPONSOR JAMES C. BENNEYAN DEVELOPMENT OF A PRESCRIPTION DRUG SURVEILLANCE SYSTEM TEAM MEMBERS Jeffrey Mason Dan Mitus Jenna Eickhoff Benjamin Harris.
Robin A. Cohen, PhD National Center for Health Statistics National Conference on Health Statistics August 7, 2012 Financial burden of medical care: Looking.
Non-infectious Chronic Diseases Control & Prevention in Zhabei District Jane Xiong Zhabei District CDC, Shanghai, China Aug 29,2006.
Snapshot of IMS LifeLink Claims Database 10% Random Sample
Overview of the National Health Care Survey Thomas McLemore Division of Health Care Statistics October 10, 2003 U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES.
Additional Data For Harmonized Use Case for Biosurveillance HINF 5430 Final Project By Maria Metty, Priyaranjan Tokachichu &Resty Namata December 13, 2007.
There are two main purposes in statistics; (Chapter 1 & 2)  Organization & ummarization of the data [Descriptive Statistics] (Chapter 5)  Answering.
Surveillance Overview Julia Gunn Boston Public Health Commission.
HOW TO GET AND USE YOUR HEALTH INSURANCE Cabrini Clinic.
What’s Strange About Recent Events (WSARE) Weng-Keen Wong (University of Pittsburgh) Andrew Moore (Carnegie Mellon University) Gregory Cooper (University.
Enhancing Disease Surveillance with Spatial-temporal Results Patricia Araki, MPH County of Los Angeles – Department of Public Health Acute Communicable.
Cluster Detection Comparison in Syndromic Surveillance MGIS Capstone Project Proposal Tuesday, July 8 th, 2008.
Innovations in Medical Education: Teaching Contagious Disease Outbreak Awareness to Medical Students and Residents Larissa May, M.D. Department of Emergency.
Practical Aspects of Alerting Algorithms in Biosurveillance Howard S. Burkom The Johns Hopkins University Applied Physics Laboratory National Security.
Emergency Department Syndromic Surveillance (EDSS): A public health unit perspective alPHa Meeting Feb 1, 2007.
~PPT Howard Burkom 1, PhD Yevgeniy Elbert 2, MSc LTC Julie Pavlin 2, MD MPH Christina Polyak 2, MPH 1 The Johns Hopkins University Applied Physics.
1 Copyright © 2012 by Mosby, an imprint of Elsevier Inc. Copyright © 2008 by Mosby, Inc., an affiliate of Elsevier Inc. Chapter 24 Public Health Surveillance.
Towards efficient prospective detection of multiple spatio-temporal clusters Bráulio Veloso, Andréa Iabrudi and Thais Correa. Universidade Federal de Ouro.
Health Care Systems Delivering Health Care to the Community.
Epi Program Overview Disease Surveillance and Reporting.
Katarina Grande, HIV Surveillance Coordinator Casey Schumann, HIV Epidemiologist Wisconsin Department of Health Services Statewide Action Planning Group.
GEOGRAPHIC CLUSTERS OF HEAD & NECK CANCER IN FLORIDA Recinda Sherman, MPH, CTR Florida Cancer Data Systems NAACCR Detroit, June 7, 2007.
Correlation of National Influenza Surveillance Data to the Local Experience Kate Goodin, MPH Florida Department of Health Bureau of Epidemiology 6 th Annual.
Albany New York (1) G. P. Patil. Albany New York (2) G. P. Patil.
Early Detection of Disease Outbreaks with Applications in New York City Martin Kulldorff University of Connecticut Farzad Mostashari and James Miller.
A genetic algorithm for irregularly shaped spatial clusters Luiz Duczmal André L. F. Cançado Lupércio F. Bessegato 2005 Syndromic Surveillance Conference.
Dept of Biostatistics, Emory University
Prehospital Emergency Medicine in Argentina
New Directions in Pre-Syndromic and Subpopulation Health Surveillance
One Health Early Warning Alert
Physician/ Clinical Services Other Personal Health Care
The Research Question Does changing prescription medication labels to conform to the United States Pharmacopeia (USP) patient-centered, more understandable,
Presentation transcript:

The Space-Time Scan Statistic for Multiple Data Streams Martin Kulldorff, Katherine Yih, Ken Kleinman, Richard Platt, Harvard Medical School and Harvard Pilgrim Health Care Farzad Mostashari, New York City Department of Health and Mental Hygiene Luiz Duczmal, Univ Fed Minas Gerais, Brazil

Different Data Sources For example: OTC Drug Sales, from pharmacy chains Nurses Hotline Calls, from Optum Regular Physician Visits, from HMOs/VA Emergency Department Visits, from hospitals Ambulance Dispatches, from 911 call centers Lab Test Results, from laboratories

Different Types of Data from the Same Data Source For example, HMO data concerning: Telephone Calls to Physicians Regular Physician Visits Emergency Department Visits Lab Test Requests Lab Test Results Drug Prescriptions

Different Groupings in the Same Type of Data Children, Young Adults, Adults age 65+ Male, Female Diarrhea, Vomiting

Early Work Burkom HS, Biosurveillance Applying Scan Statistics with Multiple, Disparates Data Sources, Journal of Urban Health, 80i:57-65, 2003 Wong WK, Moore A, Cooper G, Wagner M. WSARE: What’s strange about recent events? Journal of Urban Health, 80i:66-75, 2003.

Why Multivariate Detection Methods? We do not know whether an outbreak will create a signal in one or more data streams. The informational content is different in different data streams.

Outline Method: Space-Time Permutation Scan Statistic Example: Gastrointestinal telephone calls, urgent care visits and regular physician visits in Boston

The Spatial Scan Statistic Create a regular or irregular grid of centroids covering the whole study region. Create an infinite number of circles around each centroid, with the radius anywhere from zero up to a maximum so that at most 50 percent of the population is included.

A small sample of the circles used

Space-Time Scan Statistic Use a cylindrical window, with the circular base representing space and the height representing time. We will only consider cylinders that reach the present time.

Space-Time Permutation Scan Statistic 1. For each cylinder, calculate the expected number of cases conditioning on the marginals μst = Cs Ct / C where Cs = # cases in location s Ct = # cases in time interval t C = total number of cases

Space-Time Permutation Scan Statistic Let cst = # cases in the cylinder covering location s and time interval t.

Space-Time Permutation Scan Statistic 2. For each cylinder, calculate the Poisson likelihood Tst = [cst / μst ]cst x [(C-cst)/(C- μst)] C-cst if cst / μst > 1, Tst = 1 otherwise 3. Test statistic T = maxst log [ Tst ]

Statistical Inference 4. Generate random replicas of the data set conditioned on the marginals, by permuting the pairs of spatial locations and times. 5. Compare test statistic in real and random data sets using Monte Carlo hypothesis testing (Dwass, 1957): p = rank(Treal) / (1+#replicas)

log[ T[1]st ] +log[ T[2]st ] +log[ T[3]st ] Multiple Data Streams For each cylinder, add the Poisson log likelihoods: Tst = log[ T[1]st ] +log[ T[2]st ] +log[ T[3]st ] Test statistic T = maxst Tst

Syndromic Surveillance in Boston: Upper and Lower GI Harvard Pilgrim Health Care HMO members cared for by Harvard Vanguard Medical Associates Historical Data from Jan 1 to Dec 31, 2002 Mimicking Surveillance from Sept 1 to Dec 31, 2002

Three Data Streams Telephone Calls ( ~ 20 / day) Urgent Care Visits ( ~ 9 / day) Regular Physician Visits ( ~ 22 / day) Multiple contacts by the same person removed.

Strongest Signal: October 18 p= Recurrence Int. Tele: 0.001 < 1 / 1000 days Urgent 0.91 ~ every day Regular: 0.84 ~ every day Multiple DS: 0.001 < 1 / 1000 days

October 18 Signal Friday Number of Cases: 5 Expected Cases: 0.04 Location: Zip Code 01740 Time Length: One Day

October 18 Signal Friday Number of Cases: 5 Expected Cases: 0.04 Location: Zip Code 01740 Time Length: One Day Diagnosis: Pinworm Infestation (all 5)

October 18 Signal Friday Number of Cases: 5 (all tele) Expected Cases: 0.04 Location: Zip Code 01740 Time Length: One Day Diagnosis: Pinworm Infestation (all 5) Same Family: Mother, Father, 3 Kids

2nd Strongest Signal: December 20 p= Recurrence Int. Tele: 0.03 1 / 32 days Urgent 0.71 ~ every day Regular: 0.003 1 / 333 days Multiple DS: 0.002 1 / 500 days

December 20 Signal Number of Cases: 16 (7 tele, 7 regular, 2 urgent) Expected Cases: 3.5 Location: Zips 01810,26,45,50,52,76 Time Length: Two Days (Thu, Fri) Strong signals on the two following days.

December 20 Signal Mostly diverse vague GI diagnoses: Esophageal Reflux (3), Nausea (2), Abdominal Pain (2), Noninfectious GI (2), Acute pharyngitis, Mastodynia, Diarrhea, Anemia, Hypertension, Blood in stool, Holiday parties?

3rd Strongest Signal: October 26 p= Recurrence Int. Tele: 0.07 1 / 14 days Urgent 0.85 ~ every day Regular: 0.18 1 / 6 days Combined: 0.007 1 / 142 days

October 26 Signal Saturday Number of Cases: 8 ( 5 tele, 3 regular) Expected Cases: 0.9 Location: Zip Codes 01902,07,15,45,70 Time Length: Two Days (Fri, Sat) Various specific diagnoses.

Research Funded By Methods: Alfred P Sloan Foundation Data, National Bioterrorism Syndromic Surveillance Demonstration Program: National Center for Infectious Diseases, Centers for Disease Control and Prevention

SaTScan v 5.1 www.satscan.org Free Software SaTScan v 5.1 www.satscan.org