The LLNL FMD Decision Support System: Concise Description of Features and Output DIMACS Workshop March 2006 “Data Mining and Epidemiological Modeling”

Slides:



Advertisements
Similar presentations
FHM TRAINING TOOLS This training presentation is part of FHM’s commitment to creating and keeping safe workplaces. Be sure to check out all the training.
Advertisements

Is America’s Agriculture still vulnerable to Bioterrorism???? Yes, and it will always be……. Dr. Jimmy Tickel NCDA& CS Emergency Programs.
Governor’s Freight Industry Summit Draft November 12, 2012 Sandra K. Beaupre’ Wisconsin DOT Governor’s Freight Industry Summit Draft November 12, 2012.
Update on PEDV Lisa Becton, DVM, MS National Pork Board.
Public Health Event Reporting: Lecture Template
Modelling – progress update Stephen Catterall, BioSS 28 th November 2007.
 A public health science (foundation of public health)  Impacts personal decisions about our lifestyles  Affects government, public health agency and.
University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks Taehyong Kim Computer Science and Engineering State University.
Presentation Topic : Modeling Human Vaccinating Behaviors On a Disease Diffusion Network PhD Student : Shang XIA Supervisor : Prof. Jiming LIU Department.
Joe Dinius ECE Oct  Introduction  Review of the Literature  Hypotheses  Model Description  Parameters  Results  Sensitivity Analysis.
Foot and Mouth Analysis Mike Delorme Rachelle Miron.
Introduction to Quantitative Techniques
Secure Pork Supply Plan Pilot Project A Continuity of Business Plan for the Pork Industry in the Face of a Foreign Animal Disease Pam Zaabel, DVM Center.
Comparison of Private vs. Public Interventions for Controlling Influenza Epidemics Joint work with Chris Barrett, Jiangzhuo Chen, Stephen Eubank, Bryan.
Harvard Center for Risk Analysis Evaluation of the Potential for BSE in the United States Joshua T. Cohen Keith Duggar George M. Gray Silvia Kreindel Harvard.
Infectious Disease Epidemiology Sharyn Orton, Ph.D. American Red Cross, Rockville, MD Suggested reading: Modern Infectious Disease Epidemiology (1994)
Effects Of Animal Identification On Cattle Market Structure Prepared by: Darrell R. Mark, Ph.D. Asst. Professor & Extension Livestock Marketing Specialist.
Methods to Study and Control Diseases in Wild Populations Steve Bellan, MPH Department of Environmental Sci, Pol & Mgmt University of California at Berkeley.
Track A: Basic Science This track highlighted all aspects of HIV structure, replication, and the host immune responses and led to a greater understanding.
Epidemiology Tools and Methods Session 2, Part 1.
Epidemiology.
Introduction to Operation Research
Emerging Infectious Disease: A Computational Multi-agent Model.
Simulation Examples ~ By Hand ~ Using Excel
1 Performance Evaluation of Computer Networks: Part II Objectives r Simulation Modeling r Classification of Simulation Modeling r Discrete-Event Simulation.
Spatial patterns on the edge of stability: measles in the Sahel.
Surveillance Activities during Pandemic Phases Anne-Luise Winter APHEO-COMOH Workshop Toronto February 1, 2007.
January 27, th BVDV Symposium –BVDV Variability: Impact on Virulence, Host Range, and Control Bovine Viral Diarrhea Virus (BVDV) Research Needs.
Surveillance, Epidemiology, and Tracing Surveillance Part 1: The Surveillance Plan Adapted from the FAD PReP/NAHEMS Guidelines: Surveillance, Epidemiology,
Public Health in Tropics :Further understanding in infectious disease epidemiology Taro Yamamoto Department of International Health Institute of Tropical.
1 Operational and Logistical Aspects of Biodefense Moshe Kress CEMA, Israel.
Modelling infectious diseases Jean-François Boivin 25 October
A Data Intensive High Performance Simulation & Visualization Framework for Disease Surveillance Arif Ghafoor, David Ebert, Madiha Sahar Ross Maciejewski,
Discrete Distributions The values generated for a random variable must be from a finite distinct set of individual values. For example, based on past observations,
Quarantine and Movement Control Zones, Areas, and Premises Adapted from the FAD PReP/NAHEMS Guidelines: Quarantine and Movement Control (2014)
EpiFast: A Fast Algorithm for Large Scale Realistic Epidemic Simulations on Distributed Memory Systems Keith R. Bisset, Jiangzhuo Chen, Xizhou Feng, V.S.
Aemen Lodhi (Georgia Tech) Amogh Dhamdhere (CAIDA)
Jack DeWeese Computer Systems Research Lab. Purpose  Originally intended to create my own simulation with easily modified variables  Halfway through.
Outline of Chapter 9: Using Simulation to Solve Decision Problems Real world decisions are often too complex to be analyzed effectively using influence.
Showcase /06/2005 Towards Computational Epidemiology Using Stochastic Cellular Automata in Modeling Spread of Diseases Sangeeta Venkatachalam, Armin.
A Stochastic Model of Paratuberculosis Infection In Scottish Dairy Cattle I.J.McKendrick 1, J.C.Wood 1, M.R.Hutchings 2, A.Greig 2 1. Biomathematics &
1 Immunisation Strategies for a Community of Households Niels G Becker ( with help from David Philp ) National Centre for Epidemiology and Population Health.
Capital Budgeting.
Dynamic Random Graph Modelling and Applications in the UK 2001 Foot-and-Mouth Epidemic Christopher G. Small Joint work with Yasaman Hosseinkashi, Shoja.
Influenza epidemic spread simulation for Poland – A large scale, individual based model study.
Surveillance During Animal Disease Emergencies Overview.
Coevolution of Epidemics, Social Networks, and Individual Behavior: A Case Study Joint work with Achla Marathe, and Madhav Marathe Jiangzhuo Chen Network.
So, what’s the “point” to all of this?….
An Agent Epidemic Model Toward a general model. Objectives n An epidemic is any attribute that is passed from one person to others in society è disease,
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
1 Food and Agriculture Organization of the United Nation Regional Office for Asia and the Pacific Emergency Center for Transboundary Animal Diseases Special.
Diseases and the Immune/Lymphatic System. Define these terms: Put your answers where you would start your notes. Define with your partner using only your.
Foot-and-Mouth Disease Attack on the Midwest U. S.
1 Finding Spread Blockers in Dynamic Networks (SNAKDD08)Habiba, Yintao Yu, Tanya Y., Berger-Wolf, Jared Saia Speaker: Hsu, Yu-wen Advisor: Dr. Koh, Jia-Ling.
Country report for “Peste des Petits Ruminants(PPR).” Dr. Yousef S N S SH Al Azemi Head, Epidemiology & Zoonosis, Animal Health Department Public Authority.
Is for Epi Epidemiology basics for non-epidemiologists.
FMD situation in Egypt Dr: Shams Amin Preventive medicine specialist
Biological Attack Model (BAM) Progress Report March 8 Sponsor: Dr. Yifan Liu Richard Bornhorst Robert Grillo Deepak Janardhanan Shubh Krishna Kathryn Poole.
Alberta Agriculture, Food and Rural Development
Lesson 3 Page 1 of 24 Lesson 3 Considerations in Planning Public Health Surveillance.
QUALITATIVE RISK ASSESSMET
Sangeeta Venkatachalam, Armin R. Mikler
Foot and Mouth Disease Ministry Of Agriculture, Kingdom of Saudi Arabia
Meningitis surveillance
The value chain of veal & Preliminary FMD risk assessment along the value-chain in Kazakhstan Gulzhan Nurtazina EuFMD Progressive Control Practitioners’
Epidemiological Modeling to Guide Efficacy Study Design Evaluating Vaccines to Prevent Emerging Diseases An Vandebosch, PhD Joint Statistical meetings,
Epidemiology of Foot and Mouth Disease in Georgia
Measuring contact patterns with wearable sensors: methods, data characteristics and applications to data-driven simulations of infectious diseases  A.
Markus Schwehm and Martin Eichner free  traced, quarantine
ECONOMIC COSTS AND EFFECTS OF ACTIVITIES TO PREVENT FOOT AND MOUTH DISEASE IN DENMARK S. Denver1, L. Alban2, A. Boklund3, T. Halasa3, H. Houe1, S. Mortensen*4,
Presentation transcript:

The LLNL FMD Decision Support System: Concise Description of Features and Output DIMACS Workshop March 2006 “Data Mining and Epidemiological Modeling” Tanya Kostova T. Bates, C. Melius, S. Smith, A. Robertson, S. Hazlett, P. Hullinger, Lawrence Livermore National Laboratory

LLNL is developing a decision support system for evaluation of the economic impact of FMD epidemics Effort funded by the Department of Homeland Security DHS has numerous S&T investments in research projects for agriculture security countermeasures and requires tools to help evaluate future investments

LLNL is developing a decision support system for evaluation of the economic impact of FMD epidemics Effort funded by the Department of Homeland Security DHS has numerous S&T investments in research projects for agriculture security countermeasures and requires tools to help evaluate future investments Numerous FMD epidemiological models exist but… –They are not national in scale –Current models target natural or accidental introduction not an intentional act –Epidemiological and economic models are not coupled

GENERAL FEATURES OF THE EPIDEMIC MODEL Agent-based spatially-explicit discrete-time computational model Time progresses in increments of 1 unit (=1 day)

GENERAL FEATURES Agent-based spatially-explicit discrete-time computational model Time progresses in increments of 1 unit (=1 day) In a time stepping agent based model, at each time increment some of the agents change some of their attributes depending on their previous state and on the previous states of some of the other agents.

GENERAL FEATURES The FMD model agents are the animal facilities. Agent-based spatially-explicit discrete-time computational model Time progresses in increments of 1 unit (=1 day) In a time stepping agent based model, at each time increment some of the agents change some of their attributes depending on their previous state and on the previous states of some of the other agents.

GENERAL FEATURES The FMD model agents are the animal facilities. Facilities are groups of animals managed in a specific manner. Agent-based spatially-explicit discrete-time computational model Time progresses in increments of 1 unit (=1 day) In a time stepping agent based model, at each time increment some of the agents change some of their attributes depending on their previous state and on the previous states of some of the other agents. Farms, Markets, Feedlots, Slaughter houses …

THE ATTRIBUTES OF THE FACILITY AGENT Type (incl. species, size and operation) Spatial coordinates Static Dynamic Disease states Average Number of Contacts (to and from), Method of disease spread – specific network of contacts Availability Seasonal factors Change due to interaction Change externally and independently of interaction

THE ATTRIBUTES OF THE FACILITY AGENT Type The current model version deals with 34 types of animal facilities: Beef(B), Dairy(S), Dairy(M), Dairy(L), Dairy(B), Grazing(S), Grazing(L), Feedlot(S), Feedlot(L), Stocker(S), Stocker(L) Swine(B), SwineFWean(S), SwineFWean(L), SwineFinish(S), SwineFinish(L), SwineNursery(S), SwineNursery(L), SwineFFeeder(S), SwineFFeeder(L), SwineFarFin(S), SwineFarFin(L), Sheep(S), Sheep(L), Sheep(B), Goats, Goats(B), Market, Market(Cattle), Market(Swine), Market(Other), Market(L), Market(C-L), DCalfHeifer(L)

Beef (S) The spatial coordinates of each facility are exact “up to the county level” The NASS data supplies the numbers of different facility types in each county Swine (S) Dairy (S) There are 1.2M facilities (according to NASS data) with 160M animals. These do not include markets which come from another database. Thus, we model 1.2M+ facilities and their contacts. THE ATTRIBUTES OF THE FACILITY AGENT

Hogs and pigs Cattle and cows Sheep The spatial coordinates of the facilities are generated using a random algorithm based on the county-based data. THE ATTRIBUTES OF THE FACILITY AGENT

Type (incl. species, size and operation) Spatial coordinates Static Dynamic Disease states Average Number of Contacts (to and from), Method of disease spread – specific network of contacts Availability Seasonal factors Change due to interaction Change externally and independently of interaction

THE ATTRIBUTES OF THE FACILITY AGENT Average Number of Contacts (to and from), Method of disease spread – specific network of contacts Depends on the size and type of facility and determined for each specific facility as random number drawn from a given probability distribution obtained from survey data

THE ATTRIBUTES OF THE FACILITY AGENT Average Number of Contacts (to and from) Method of disease spread – specific network of contacts Depends on the size and type of facility and determined for each specific facility as random number drawn from a given probability distribution obtained from survey data Direct (regional and inter-state) Indirect (high risk and low risk)

THE ATTRIBUTES OF THE FACILITY AGENT Type (incl. species, size and operation) Spatial coordinates Static Dynamic Disease states Average Number of Contacts (to and from), Method of disease spread – specific network of contacts Availability Seasonal factors Change due to interaction Change externally and independently of interaction

THE ATTRIBUTES OF THE FACILITY AGENT Disease states     Susceptible Latent (infected) Subclinically infectious Clinically infectious Immune Infection Waning of immunity Vaccinated SuspectedConfirmedCulled S - Susceptible (healthy) L- Latent U- Subclinically infectious I- Clinically infectious W – Vaccinated and susceptible V- Vaccinated M- Immune P- Suspected F- Confirmed X - Culled ?

The disease state attributes of each facility are calculated by an “intra-facility model” (IFM) THE ATTRIBUTES OF THE FACILITY AGENT The intra-facility model is a “time-since infection” Reed-Frost type modelmodel Represents a discrete-time system of difference equations representing the number of animals on the facility that are in each state S, L, I, U, V, W, M

The intra-facility model is a “time-since infection” Reed-Frost type modelmodel Represents a discrete-time system of difference equations representing the number of animals on the facility that are in each state S, L, I, U, V, W, M The output of the IFM is used to calculate the probability that an infected facility will infect other facilities This is done by using a “spread model “ The disease state attributes of each facility are calculated by an “intra-facility model” (IFM) THE ATTRIBUTES OF THE FACILITY AGENT

Average Number of Contacts (to and from) Method of disease spread – specific network of contacts Availability Seasonal factors These attributes are used by the Spread Model to calculate the newly infected facilities

Infected agents can spread the epidemic via various methods along method-specific networks For each method, the infection can be spread within a predefined set of facilities specific to the method. Thus, an infected facility will spread the infection to the facilities within the networks to which it belongs. Examples of methods - direct (movement of animals) - indirect: personnel movements; - inter-state direct movements “Truck routes” network “Vet routes” network infected not infected The Spread Model calculates the newly infected facilities

The epidemic spread is modeled by a random process Uses information about the Average Number of adequate Contacts ANC of the infected facility by each of the methods The daily number of adequate contacts RANC mi is obtained from a Poisson process with mean ANC For each method of infection m For each infected facility i: - A probability density function P mi (j) defined on each of the nodes j of the network S mi of m and i is calculated - For each node j of S mi the probability C mj is calculated P mi (j) is the probability that facility j will get a contact with facility i by method m. Distance dependent C mj is the probability that an adequate contact to facility j will cause infection. P mi (j) is used in a roulette algorithm to determine which facilities receive an adequate contact C mj is used to determine which of the contacted facilities become infected RANC mi, P mi (j) and C mj are used to trace back the cause of infection of j A contact originating from a facility that can cause infection is an adequate contact. An adequate contact that actually infects a target facility is an effective contact. STEP1STEP1 STEP2STEP2 STEP3STEP3

P mi (j) depends on - the average number of m-type contacts received by j - size of the facility j - seasonal factors - control measure factors - distance between i and j - frequency of contacts between i and j The Spread Model involves factors sampled from PDFs C mj depends on - the fraction of vaccinated animals on the facility - control measure factors - probability that a contact of type m would cause infection Many of these factors are uncertain or involve variability and are sampled from probability density functions.

The Control Measures Component “Control measures” include Vaccination Culling Contact restrictions Isolation Increased detection Control measures are applied regionally

IFM Control Measures Spread model Events during one increment of time

AGGREGATION ALGORITHMS Our model is of US - national scale; however to keep calculations to a minimum: - We do not calculate all facilities at all times. - Only facilities in infected and their neighboring counties are initialized - Intra-facility model calculated only for infected facilities - Counties and states that have not been yet infected are considered as aggregated entities; if a contact happens to in such a county, it gets disaggregated.

OUTPUTS A simulation is made of N MC runs N O(10 2 ) - O(10 3 )

OUTPUTS A simulation is made of N MC runs N O(10 2 ) - O(10 3 ) A run implements M time steps M O(10 2 ), usually days or until a certain criterion is met (epidemic comes to end)

OUTPUTS A simulation is made of N MC runs N O(10 2 ) - O(10 3 ) A run implements M time steps M O(10 2 ), usually days or until a certain criterion is met (epidemic comes to end) At each time step we keep track of the number P of facilities that are currently involved in the epidemic (i.e. the ones that are infected or in the neighborhoods of infected facilities. P O(10 2 ) - O(10 5 ) ???

OUTPUTS A simulation is made of N MC runs N O(10 2 ) - O(10 3 ) A run implements M time steps M O(10 2 ), usually days or until a certain criterion is met (epidemic comes to end) At each time step we keep track of the number P of facilities that are currently involved in the epidemic (i.e. the ones that are infected or in the neighborhoods of infected facilities. P O(10 2 ) - O(10 5 ) ??? For each facility the important data (current states, costs, trace-back facilities) is O(10 1 )

OUTPUTS Thus, the total output of a simulation could be in the range of or more. O(10 10 ) Naturally, we do not keep all this output although what we do not keep may be important for the analysis What do we keep currently?

OUTPUTS Daily Numbers of facilities of the 34 types that are in the 9 disease states L- Latent U- Subclinically infectious I- Clinically infectious W – Vaccinated and susceptible V- Vaccinated M- Immune P- Suspected F- Confirmed X - Culled Numbers of facilities that have just acquired a new state Numbers of facilities that have ever been in some disease state Total numbers of infected, vaccinated, culled facilities Daily and total numbers of infected, vaccinated, culled animals of different species

OUTPUTS Durations: Lengths of time for which the 34 types of facilities were in some disease state Duration of total epidemic Costs associated with epidemic and control measures

OUTPUTS Currently, output is in Excel spreadsheet format and is used for visualization Duration Duration of epidemic Cumulative Frequency Days after index herd infected As well as to calculate statistics (means, quantiles, skewness, kurtosis, etc.) of MC output.

Epidemic model outputs and data mining Question: How can modern data mining tools help in the analysis of output data generated by a large-scale epidemic model?

Epidemic model outputs and data mining Question: How can modern data mining tools help in the analysis of output data generated by a large-scale epidemic model? Specifically, can data mining help uncover important relations between - scope of epidemic and spatial distributions of facilities? - how control measures are applied and the cost of the epidemic?

Epidemic model outputs and data mining Further, can data-mining tools help … Identify sources (infected facilities), likely transmission mechanisms? Classify of outbreaks into "natural" vs. "intentional" to help policy makers develop correct response strategies?

Epidemic model outputs and data mining Further, can data-mining tools help … Identify sources (infected facilities), likely transmission mechanisms? Classify of outbreaks into "natural" vs. "intentional" to help policy makers develop correct response strategies? Identify key facilities/locations for surveillance? Identify which control mechanisms are having the largest impact?

Epidemic model outputs and data mining Further, can data-mining tools help … Identify sources (infected facilities), likely transmission mechanisms? Classify of outbreaks into "natural" vs. "intentional" to help policy makers develop correct response strategies? Identify key facilities/locations for surveillance? Identify which control mechanisms are having the largest impact? Evaluate new technologies? Evaluate vulnerability of different industries and regions of the country?

If the answer is “yes” to at least some of our questions, which are the recommended data mining tools? Are they available?