Bayesian Biosurveillance of Disease Outbreaks

Slides:



Advertisements
Similar presentations
A Tutorial on Learning with Bayesian Networks
Advertisements

State Estimation and Kalman Filtering CS B659 Spring 2013 Kris Hauser.
Representations for KBS: Uncertainty & Decision Support
Bayesian Health Technology Assessment: An Industry Statistician's Perspective John Stevens AstraZeneca R&D Charnwood Bayesian Statistics Focus Team Leader.
2005 Syndromic Surveillance1 Estimating the Expected Warning Time of Outbreak- Detection Algorithms Yanna Shen, Weng-Keen Wong, Gregory F. Cooper RODS.
 2005 Carnegie Mellon University A Bayesian Scan Statistic for Spatial Cluster Detection Daniel B. Neill 1 Andrew W. Moore 1 Gregory F. Cooper 2 1 Carnegie.
1 A Tutorial on Bayesian Networks Weng-Keen Wong School of Electrical Engineering and Computer Science Oregon State University.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 10: The Bayesian way to fit models Geoffrey Hinton.
Optimizing Disease Outbreak Detection Methods Using Reinforcement Learning Masoumeh Izadi Clinical & Health Informatics Research Group Faculty of Medicine,
Bayesian Biosurveillance Gregory F. Cooper Center for Biomedical Informatics University of Pittsburgh The research described in this.
World Statistics Day Statisical Modelling of Complex Systems Jouko Lampinen Finnish Centre of Excellence in Computational Complex Systems Research.
 2004 University of Pittsburgh Bayesian Biosurveillance Using Multiple Data Streams Weng-Keen Wong, Greg Cooper, Denver Dash *, John Levander, John Dowling,
 2004 University of Pittsburgh Bayesian Biosurveillance Using Multiple Data Streams Greg Cooper, Weng-Keen Wong, Denver Dash*, John Levander, John Dowling,
1 Bayesian Reasoning Chapter 13 CMSC 471 Adapted from slides by Tim Finin and Marie desJardins.
Model N : The total number of patients in an anthrax outbreak who are seen by clinicians. DT : The time to detect the anthrax outbreak Detection : The.
Bayesian Biosurveillance Using Causal Networks Greg Cooper RODS Laboratory and the Laboratory for Causal Modeling and Discovery Center for Biomedical Informatics.
Weng-Keen Wong, Oregon State University © Bayesian Networks: A Tutorial Weng-Keen Wong School of Electrical Engineering and Computer Science Oregon.
Conclusions On our large scale anthrax attack simulations, being able to infer the work zip appears to improve detection time over just using the home.
Population-Wide Anomaly Detection Weng-Keen Wong 1, Gregory Cooper 2, Denver Dash 3, John Levander 2, John Dowling 2, Bill Hogan 2, Michael Wagner 2 1.
Value of Information for Complex Economic Models Jeremy Oakley Department of Probability and Statistics, University of Sheffield. Paper available from.
1 Bayesian Network Anomaly Pattern Detection for Disease Outbreaks Weng-Keen Wong (Carnegie Mellon University) Andrew Moore (Carnegie Mellon University)
Jeff Howbert Introduction to Machine Learning Winter Classification Bayesian Classifiers.
Downscaling in time. Aim is to make a probabilistic description of weather for next season –How often is it likely to rain, when is the rainy season likely.
1 / 41 Inference and Computation with Population Codes 13 November 2012 Inference and Computation with Population Codes Alexandre Pouget, Peter Dayan,
What’s Strange About Recent Events (WSARE) Weng-Keen Wong (University of Pittsburgh) Andrew Moore (Carnegie Mellon University) Gregory Cooper (University.
Digital Statisticians INST 4200 David J Stucki Spring 2015.
Visibility Graph. Voronoi Diagram Control is easy: stay equidistant away from closest obstacles.
Bayesian networks. Motivation We saw that the full joint probability can be used to answer any question about the domain, but can become intractable as.
1 Chapter 14 Probabilistic Reasoning. 2 Outline Syntax of Bayesian networks Semantics of Bayesian networks Efficient representation of conditional distributions.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 11: Bayesian learning continued Geoffrey Hinton.
Bayesian statistics Probabilities for everything.
Copyright © 2012, SAS Institute Inc. All rights reserved. ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY,
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Uncertainty. Assumptions Inherent in Deductive Logic-based Systems All the assertions we wish to make and use are universally true. Observations of the.
CAUSALITY ASSESSMENT OF SUSPECTED AEs Dr. Retesh Kumar Head, Global PhV Department 12/13/2015.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Motivation and Overview
Multi-target Detection in Sensor Networks Xiaoling Wang ECE691, Fall 2003.
TEMPLATE DESIGN © Vertically Integrated Seismological Analysis I : Modeling Nimar S. Arora, Michael I. Jordan, Stuart.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
Bayesian Disease Outbreak Detection that Includes a Model of Unknown Diseases Yanna Shen and Gregory F. Cooper Intelligent Systems Program and Department.
CSC321: Introduction to Neural Networks and Machine Learning Lecture 17: Boltzmann Machines as Probabilistic Models Geoffrey Hinton.
Probabilistic Robotics Probability Theory Basics Error Propagation Slides from Autonomous Robots (Siegwart and Nourbaksh), Chapter 5 Probabilistic Robotics.
Bayesian Biosurveillance of Disease Outbreaks RODS Laboratory Center for Biomedical Informatics University of Pittsburgh Gregory F. Cooper, Denver H.
23 Jan 2012 Background shape estimates using sidebands Paul Dauncey G. Davies, D. Futyan, J. Hays, M. Jarvis, M. Kenzie, C. Seez, J. Virdee, N. Wardle.
CS 2750: Machine Learning Directed Graphical Models
Presented BY: Asif Jamil Chowdhury AHMED SHEHAB KHAN (4/21/2016)
Reasoning Under Uncertainty: More on BNets structure and construction
The Importance of Adequately Powered Studies
Read R&N Ch Next lecture: Read R&N
From last time: on-policy vs off-policy Take an action Observe a reward Choose the next action Learn (using chosen action) Take the next action Off-policy.
CSC321: Neural Networks Lecture 19: Boltzmann Machines as Probabilistic Models Geoffrey Hinton.
Bayesian Networks: A Tutorial
Reasoning Under Uncertainty: More on BNets structure and construction
Learning Bayesian Network Models from Data
Markov ó Kalman Filter Localization
Course: Autonomous Machine Learning
State Estimation Probability, Bayes Filtering
Bayesian Models in Machine Learning
Estimating the Expected Warning Time of Outbreak-Detection Algorithms
CS 188: Artificial Intelligence Fall 2008
CS 188: Artificial Intelligence Fall 2007
CS 188: Artificial Intelligence Fall 2007
Causal Models Lecture 12.
LECTURE 09: BAYESIAN LEARNING
LECTURE 07: BAYESIAN ESTIMATION
Belief Networks CS121 – Winter 2003 Belief Networks.
CS639: Data Management for Data Science
Probabilistic Modelling of Brain Imaging Data
Exceedence Diagnostics
Presentation transcript:

Bayesian Biosurveillance of Disease Outbreaks Gregory F. Cooper Denver H. Dash John D. Levander Weng-Keen Wong William R. Hogan Michael M. Wagner

Motivation Early, reliable detection of disease outbreaks is a critical problem today The cost of not detecting it early: as many as 30,000 people per day could die long-term economic costs were estimated to be as high as 250 million dollars per hour

Outbreaks often present signals that are weak and noisy It likely will be necessary to integrate multiple weak signals Combining spatial and temporal data is an important instance of such integration

Because of the noise in signals early in the event, early detection is almost always detection under uncertainty. They use probability as a measure of uncertainty. Modeling of risk factors, diseases, and symptoms often is causal They use causal Bayesian networks as their probabilistic modeling method Concentration is on modeling non-contagious outbreak diseases, such as airborne anthrax

In this model: Each person is represented by 14-node subnetwork  this model contains approximately 20 million nodes Given current data about individuals, Bayesian network is used to infer the posterior probabilities of outbreak diseases. the biosurveillance system should “keep up” with the data streaming in. Once the probability of an outbreak exceeds a particular threshold, an alert is generated

Methodology Modeling: BNs to explicitly model an entire population of individuals an object-oriented perspective: each person model can be viewed as a class

G: nodes that represent global features common to all people. I: contains factors that directly influence the status of the outbreak of disease in people Pi :represents a person in the population X = G U I U Pi

Assumptions Assumption 1: for non-contagious outbreak diseases

Inference At any given time, there are two general sources of evidence we consider: 1. General evidence at the global level: g = { G = g: G G}, and 2. The collective set of evidence e that we observe from the population of people: e = {X = e: X Pi, Pi P}.

Given e and g, Both in G There are too many nodes!

Equivalence classes Incremental updating

Model for outbreak detection Two differences No Terror Alert Level node More complex person node

Simulation Model PANDA Population-wide ANomaly Detection and Assessment They inject simulated ED cases into a background of actual ED cases. Given weather conditions, parameters for location and amount of airborne anthrax a Gaussian plume model derives the concentration of anthrax spores that are estimated to exist in each zip code Historical meteorological conditions for a region

The output of the simulator: A full evidence vector for a case including age and gender It cannot be used directly for PANDA Partially patient cases produced by the simulator is taken and the age and gender are probabilistically assigned using BN

Results 12 × 8 = 96 data sets month Random day

Spatial distribution

Spatial vs. non-spatial