
Bayesian Biosurveillance of Disease Outbreaks
RODS Laboratory, Center for Biomedical Informatics, University of Pittsburgh
Gregory F. Cooper, Denver H. Dash, John D. Levander, Weng-Keen Wong, William R. Hogan, Michael M. Wagner

Outline
Biosurveillance goals
Bayesian biosurveillance
A Bayesian biosurveillance model (PANDA)
Summary and future plans

Biosurveillance Detection Goals
Detect an unanticipated biological disease outbreak in the population as rapidly and as accurately as possible.
Determine the people who already have the disease.
Predict the people who are likely to get the disease.

Bayesian Biosurveillance

PANDA: Population-wide ANomaly Detection and Assessment
PANDA models outbreaks using a causal Bayesian network.
The causal Bayesian network in PANDA represents probabilistic causal relationships that link outbreak etiologies to available evidence, such as emergency department (ED) visits.
The network's probabilities are assessed from training data and from knowledge of outbreak diseases reported in the literature.
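
The link between an outbreak etiology and ED evidence can be illustrated with a deliberately tiny sketch: one outbreak node, one evidence node, and Bayes' rule. This is not PANDA's network, which has many more nodes; the function name and all probabilities below are hypothetical and chosen only for illustration.

```python
def posterior_outbreak(prior, lik_outbreak, lik_no_outbreak):
    """Return P(outbreak | evidence) for a two-node network via Bayes' rule.

    prior           -- P(outbreak)
    lik_outbreak    -- P(evidence | outbreak)
    lik_no_outbreak -- P(evidence | no outbreak)
    """
    joint_outbreak = prior * lik_outbreak
    joint_none = (1.0 - prior) * lik_no_outbreak
    return joint_outbreak / (joint_outbreak + joint_none)


# Hypothetical numbers: a rare outbreak and evidence that is much more
# likely when an outbreak is under way.
p = posterior_outbreak(prior=0.001, lik_outbreak=0.30, lik_no_outbreak=0.01)
print(p)
```

Even with evidence that is thirty times more likely under an outbreak, the posterior stays small when the prior is small, which is one reason a system of this kind would accumulate evidence over many patients.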

Example of a PANDA Bayesian Network that Models a Disease Outbreak Due to an Airborne Release of Anthrax
[Figure: network diagram with global nodes (G), interface nodes (I), and person models (P1, P2, P3, P4)]

Person Model

Some Current Model Details
The probabilities in the person-network models were estimated from U.S. Census data, from historical ED data from Allegheny County, and from the anthrax literature.
The population currently being modeled consists of all ~1.4M people in Allegheny County.
The smallest region modeled is a Zip code, and all Zip codes in Allegheny County are included.

Equivalence Classes
The 1.4M people in the modeled population can be partitioned into approximately 48,000 equivalence classes.
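
One way to picture the partition is to group people who share the same values on every modeled attribute and keep only a count per group. The sketch below assumes three attributes (`age_group`, `gender`, `home_zip`); the slides do not list the attributes PANDA actually uses, so these names are hypothetical.

```python
from collections import Counter

# Hypothetical attributes; the slides do not say which ones define a class.
CLASS_KEYS = ("age_group", "gender", "home_zip")


def equivalence_classes(people, keys=CLASS_KEYS):
    """Count how many people share each combination of attribute values."""
    return Counter(tuple(person[k] for k in keys) for person in people)


people = [
    {"age_group": "20-29", "gender": "F", "home_zip": "15213"},
    {"age_group": "20-29", "gender": "F", "home_zip": "15213"},
    {"age_group": "60-69", "gender": "M", "home_zip": "15217"},
]
classes = equivalence_classes(people)
print(len(classes), "classes for", sum(classes.values()), "people")
```

People in the same class are indistinguishable to the model, so inference only needs one computation per class plus the class size.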

Modeling an Entire Population
Define the background population (e.g., using census data).
As patients enter the ED, they get moved from their background class to a patient class corresponding to their symptoms.
After sufficient time passes, patients get moved back into their background class, while other patients get added.
[Figure labels: people not seen in the ED; people seen in the ED]
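
The bookkeeping described above can be sketched as two counters, one for people not seen in the ED and one for people seen in the ED. The function names `admit` and `release`, and the class labels, are hypothetical and only illustrate the movement between classes.

```python
from collections import Counter


def admit(background, in_ed, bg_class, ed_class):
    """Move one person from a background class to an ED patient class."""
    if background[bg_class] < 1:
        raise ValueError("no one left in that background class")
    background[bg_class] -= 1
    in_ed[ed_class] += 1


def release(background, in_ed, bg_class, ed_class):
    """Move one person from an ED patient class back to the background."""
    if in_ed[ed_class] < 1:
        raise ValueError("no one in that ED patient class")
    in_ed[ed_class] -= 1
    background[bg_class] += 1


bg = ("20-29", "F", "15213")           # hypothetical background class
ed = bg + ("respiratory",)             # same person type plus a symptom
background = Counter({bg: 2})
in_ed = Counter()

admit(background, in_ed, bg, ed)       # patient arrives at the ED
print(background[bg], in_ed[ed])
```

The total population size never changes; only the class counts do.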

Tractably Modeling an Entire Population
Pre-compute the probability of observing the entire background population, and replace all equivalence classes with a single (binary) master node:

Simple Adjustment Rule
As a person moves from equivalence class E_i to class E_j, we can easily adjust the probability table of E to reflect the change using:
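
The slide's formula is not preserved in this transcript, so the following is only a sketch of how such an adjustment could work, under an assumption that is not confirmed by the source: for a fixed state of the parent nodes, people are conditionally independent, so the probability of the whole population is a product of one per-class term for each person. Moving a person from class i to class j then multiplies that product by p_j / p_i. The names `log_prob_population`, `adjust`, and `class_prob` are hypothetical.

```python
import math


def log_prob_population(counts, class_prob):
    """Log-probability of the population: sum of n_k * log(p_k) over classes."""
    return sum(n * math.log(class_prob[c]) for c, n in counts.items())


def adjust(log_p, class_prob, src, dst):
    """Update the log-probability when one person moves from src to dst.

    Under the independence assumption this multiplies the probability by
    p_dst / p_src, so there is no need to recompute the whole product.
    """
    return log_p + math.log(class_prob[dst]) - math.log(class_prob[src])


# Hypothetical per-person class probabilities for one parent-node state.
class_prob = {"background": 0.9, "respiratory": 0.1}
counts = {"background": 5, "respiratory": 0}

log_p = log_prob_population(counts, class_prob)
log_p_after = adjust(log_p, class_prob, "background", "respiratory")
print(log_p, log_p_after)
```

Working in log space avoids underflow when the product runs over a population of more than a million people.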

Evaluation
For testing, an outdoor anthrax release was simulated using the anthrax cases output by the BARD system.
The BARD-simulated cases of infected individuals who visited the ED were overlaid onto actual historical ED data.
Ninety-six such scenarios were generated, and for each the data stream of ED cases was given as input to PANDA.
Each simulated hour, PANDA generated a posterior probability of an anthrax outbreak.
We plotted time-to-detection versus the false-positive rate of detection.
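
A curve of this kind can be traced by sweeping an alert threshold over the hourly posteriors. The sketch below assumes one particular pair of definitions, which the slides do not spell out: the false-positive rate is the fraction of outbreak-free hours at or above the threshold, and time-to-detection is the number of hours from release until the posterior first reaches the threshold. The function names and the posterior values are hypothetical.

```python
def false_positive_rate(baseline_posteriors, threshold):
    """Fraction of outbreak-free hours whose posterior reaches the threshold."""
    alarms = sum(p >= threshold for p in baseline_posteriors)
    return alarms / len(baseline_posteriors)


def time_to_detection(posteriors, onset, threshold):
    """Hours from onset until the posterior first reaches the threshold.

    Returns None if the outbreak is never detected at this threshold.
    """
    for hour in range(onset, len(posteriors)):
        if posteriors[hour] >= threshold:
            return hour - onset
    return None


# Hypothetical hourly posteriors, for illustration only.
baseline = [0.01, 0.02, 0.30, 0.05]            # no outbreak present
scenario = [0.01, 0.02, 0.05, 0.20, 0.60, 0.90]  # release at hour 2

for threshold in (0.25, 0.50, 0.95):
    print(threshold,
          false_positive_rate(baseline, threshold),
          time_to_detection(scenario, onset=2, threshold=threshold))
```

Raising the threshold lowers the false-positive rate but delays detection, which is the trade-off the plot summarizes.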

Results

PANDA Spatial Model

Spatial Model

Optimized Spatial Model

Optimized Spatial Model Versus a Control Chart Method

Timing Results
The following timing results are based on monitoring historical ED data over six days using PANDA running on an AMD Opteron 248 (2.19 GHz and 4 GB RAM).
Original Model: 4 to 5 seconds of machine time
Original Model with Season, Day of Week, Time of Day: 15 seconds
Spatial Model: 20 seconds
Spatial Model with Season, Day of Week, Time of Day: 52 seconds

Summary
Biosurveillance can be viewed as ongoing diagnosis of an entire population.
Causal networks provide a flexible and expressive means of coherently modeling a population in performing biosurveillance.
Inference on causal networks can derive the type of posterior probabilities needed for biosurveillance.
Initial results from a simulation study are promising, but preliminary.
Inference can be computationally tractable when modeling non-contagious disease outbreaks, such as an outbreak due to the outdoor release of anthrax spores.

Future Work Includes …
Modeling contagious diseases
Including over-the-counter (OTC) data
Constructing realistic decision models about when to raise an alert
Developing explanations of alerts
Performing additional evaluations

Thank you
RODS Laboratory:
Bayesian Biosurveillance: