Animal transport: spread of disease and …. Uno Wennergren Tom Lindström Annie Jonsson Nina Håkansson Jenny Lennartsson Spatio-Temporal Biology Division.

Slides:



Advertisements
Similar presentations
Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Advertisements

Model checking in mixture models via mixed predictive p-values Alex Lewin and Sylvia Richardson, Centre for Biostatistics, Imperial College, London Mixed.
Introduction to Monte Carlo Markov chain (MCMC) methods
Bayesian Estimation in MARK
The Model Following these assumptions, I propose a hierarchical model with these characteristics: where is the number of goals scored by a team’s offense.
Markov-Chain Monte Carlo
Chapter 4: Linear Models for Classification
Computing the Posterior Probability The posterior probability distribution contains the complete information concerning the parameters, but need often.
Results Method Local population dynamics followed the Ricker equation, changing from over- to under compensating growth with parameter b, and with environmental.
Computational statistics 2009 Random walk. Computational statistics 2009 Random walk with absorbing barrier.
Machine Learning CUNY Graduate Center Lecture 7b: Sampling.
Particle filters (continued…). Recall Particle filters –Track state sequence x i given the measurements ( y 0, y 1, …., y i ) –Non-linear dynamics –Non-linear.
Treating populations and landscapes as signals. A step towards research collaborations? (1)spread of disease, (2)animal welfare vs transport, (3)endangered.
Today Introduction to MCMC Particle filters and MCMC
End of Chapter 8 Neil Weisenfeld March 28, 2005.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Bayesian Filtering for Location Estimation D. Fox, J. Hightower, L. Liao, D. Schulz, and G. Borriello Presented by: Honggang Zhang.
Bayesian Analysis for Extreme Events Pao-Shin Chu and Xin Zhao Department of Meteorology School of Ocean & Earth Science & Technology University of Hawaii-
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Introduction to Monte Carlo Methods D.J.C. Mackay.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Bayes Factor Based on Han and Carlin (2001, JASA).
Gaussian process modelling
The horseshoe estimator for sparse signals CARLOS M. CARVALHO NICHOLAS G. POLSON JAMES G. SCOTT Biometrika (2010) Presented by Eric Wang 10/14/2010.
Bayesian parameter estimation in cosmology with Population Monte Carlo By Darell Moodley (UKZN) Supervisor: Prof. K Moodley (UKZN) SKA Postgraduate conference,
Priors, Normal Models, Computing Posteriors
Machine Learning Lecture 23: Statistical Estimation with Sampling Iain Murray’s MLSS lecture on videolectures.net:
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
St5219: Bayesian hierarchical modelling lecture 2.1.
1 Gil McVean Tuesday 24 th February 2009 Markov Chain Monte Carlo.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 11: Bayesian learning continued Geoffrey Hinton.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
Fast Simulators for Assessment and Propagation of Model Uncertainty* Jim Berger, M.J. Bayarri, German Molina June 20, 2001 SAMO 2001, Madrid *Project of.
Randomized Algorithms for Bayesian Hierarchical Clustering
Bayes’ Nets: Sampling [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available.
Example: Bioassay experiment Problem statement –Observations: At each level of dose, 5 animals are tested, and number of death are observed.
Three Frameworks for Statistical Analysis. Sample Design Forest, N=6 Field, N=4 Count ant nests per quadrat.
Academic Research Academic Research Dr Kishor Bhanushali M
Tutorial I: Missing Value Analysis
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Introduction to Sampling Methods Qi Zhao Oct.27,2004.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Markov-Chain-Monte-Carlo (MCMC) & The Metropolis-Hastings Algorithm P548: Intro Bayesian Stats with Psych Applications Instructor: John Miyamoto 01/19/2016:
Contact patterns between herds: methods and visions (some results) Uno Wennergren (Tom Lindström) Linköping University Sweden.
Kevin Stevenson AST 4762/5765. What is MCMC?  Random sampling algorithm  Estimates model parameters and their uncertainty  Only samples regions of.
Overfitting, Bias/Variance tradeoff. 2 Content of the presentation Bias and variance definitions Parameters that influence bias and variance Bias and.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Chapter 6 Sampling and Sampling Distributions
SIR method continued. SIR: sample-importance resampling Find maximum likelihood (best likelihood × prior), Y Randomly sample pairs of r and N 1973 For.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Generalization Performance of Exchange Monte Carlo Method for Normal Mixture Models Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Markov Chain Monte Carlo in R
Bayesian Semi-Parametric Multiple Shrinkage
Reducing Photometric Redshift Uncertainties Through Galaxy Clustering
MCMC Output & Metropolis-Hastings Algorithm Part I
Using the pig trade networks and the geographical distance among farms to model the spatio-temporal dynamics of porcine reproductive & respiratory syndrome.
CS 4/527: Artificial Intelligence
Remember that our objective is for some density f(y|) for observations where y and  are vectors of data and parameters,  being sampled from a prior.
Diagnostics and Transformation for SLR
CAP 5636 – Advanced Artificial Intelligence
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
CS 188: Artificial Intelligence
Model generalization Brief summary of methods
Diagnostics and Transformation for SLR
Presentation transcript:

Animal transport: spread of disease and …. Uno Wennergren Tom Lindström Annie Jonsson Nina Håkansson Jenny Lennartsson Spatio-Temporal Biology Division of Theoretical Biology Linköping University Sweden

Animal transport Aims of different projects The context – Research groups, their expertise – Data base on animal movements Specific research questions Estimating probability of animal movements – Tom Lindström

Projects - aims - groups Spread of disease: Foot and mouth disease. – Prepare to optimize intervention Animal welfare – Reduce stress and distance transported

Spread of disease: Foot and mouth disease. Funded by Swedish Civil Contingencies Agency (Swedish DHS): 2 grants, PI’s: UW and SSL at SVA – Prepare to optimize intervention Spatio-Temporal Biology (4 persons) – Biology/Ecology – Mathematics – Scientific Computing National Veterinary Institute (SVA) (3 persons) – Disease control and epidemiology – Veterinary medicine

Animal welfare – Reduce stress and distance per animal Funded by Swedish Board of Agriculture (Swedish USDA) PI: UW Spatio-Temporal Biology (3 persons) – Biology/Ecology – Mathematics – Scientific Computing Dept. of Animal Environment and Health, Swedish University of Agricultural Sciences (2 persons) – Animal welfare – Veterinary medicine Skogforsk, LiU, NHH (3 persons) – Optimization –Logistics – route planning –

Database All animal (cattle and pigs) movements between farms and farms to slaughterhouses. Not per vehicle – Cattle on individual level: birth, sale purchase, export, import, temporarily away (pasture), return from pasture, slaughter/house, death – Pig, on group level: as above Report within seven days Farms and slaughterhouses in Sweden. Dots: blue –farms, red – large slaughterhouses. Green - smaller slaughterhouses. From Håkansson et al Sweden

Database -specifics 12 months - cattle approximately reports of sales and purhase Important: errors in reports 10% – Possible to edit the database and reduce to 1% error by logical corrections (database cleaning) Spatial and temporal investigation of reported movements, births and deaths of cattle and pigs in Sweden. Submitted. Nöremark, Håkansson, Lindström, Wennergren, and Sternberg Lewerin.

Specific research questions 1.Other contacts between farms - questionnaire to farmers (SVA) 2.From measured contacts to probability of contact 3.Spread: Modeling disease specifics 4.Route planning of animal transport – effect on contacts and movement distance. 5.Production units: composition and configuration 6.Networks 1.Analysing transport network 2.Testing efficiency of network measures as predictors 1.Generating netorks 2.Testing linkdensity on network formation 3.Testing measures as predictors

1 2 ?

Specific research questions From measured contacts to probability of contact Estimating probabilities Tom Lindström

Animal movements between holdings Which farms are likely to have contacts through animal movements? – Mathematical description. – Estimation from data. Distance – Contacts between nearby farms are more common – Several different processes – Preventive Veterinary Medicine (any day now…)

A word on the data Should be good… Pigs reported at transport level by the receiving farmer Cattle reported at individual level by farmers at both origin and end. – Cattle moved on the same day between same farms constitute one transport – Mismatch – “Cleaning” using the identity of cattle Locations of many cattle farms not in the database but areas of valid for subsidies Inactive farms in the data base

Quantifying distance dependence Distance dependence needs two measurements. Probability of contacts has – Scale Measured as Variance (or Squared Displacement) – Shape Measured as Kurtosis

Variance Distance P

Kurtosis Distance P

Why these measures? Important to have quantities for comparison – Between epidemics – Between types of contacts – Between years Theoretical connection to biological invasions – Squared displacement relates to diffusion constant. – Discrete representation of space (i.e. farms has X,Y coordinates) => Fat tails more important

Kernel function A generalized normal distribution Variance and Kurtosis given by a and b. Extended to two dimensions (X,Y coordinates) – S normalizes the kernel, Volume=1.

Kernel function normalization With discrete representation of farm distribution normalization over all possible destination farms d is distance, i is start farm, k is possible destination farms (k≠i) and N is number of farms.

Kernel function normalization This separates spatial pattern of farms from distance dependence in contacts. Important if farm distribution is non random. Farm density in Sweden (farms/km 2 )

And USA From Shields and Mathews, 2003

Is the kernel function good enough? A single distribution may not be sufficient to fit data on multiple scales (both short and long distance contacts). An alternative model – A mixture model – Part distance dependent and part uniform (Mass Action Mixing) Models applied to pig and cattle transports (all transports during one year)..

An alternative model f 1 is distance dependent part: f 2 is MAM part: w is proportion of distance dependence.

Fitting to data Bayesian approach Increasingly common at least in ecological literature. Ellison 2008

Markov Chain Monte Carlo Parameters obtained through Markov Chain Monte Carlo (MCMC). Well suitable for epidemiological problems. A simple model can be expanded to include complexity. Drawback is computation time, and effective parallelization is difficult.

Markov Chain Monte Carlo Repeated (correlated) random draws from the posterior distribution of parameters. Gibbs Sampling – Direct draws from known distributions conditional on other parameter values Metropolis-Hastings – Values are proposed and subsequently accepted or rejected dependent on likelihood ratios

Markov Chain Monte Carlo Also allows for model selection by comparing the full posterior distribution of model probabilities. In our study, the mixture model was a much better model. Pigs Cattle

Comparing models and observed data Bars: observed transport distances. Dotted line: predictions by Model 1. Solid line: predictions by Model 2 CattlePigs

Network measures Will differences have consequences for estimation of disease spread dynamics? Networks generated with the different models Network measures Nodes (farms) and links (transports) A B C D

Network measures Density – proportion of farms connected Model 1 Model 2

Network measures Clustering Coefficient – proportion of “triplets” – If A is connected to B and C, are B and C connected? A B C D Model 1 Model 2

Network measures Fragmentation index – measures the amount of fragments not connected to the rest. A B C D E F D Model 1 Model 2

Network measures Betweeness centralization index – Are some nodes more central than others? A B CD E F D A B CD E F D Model 1 Model 2

Animal transports Higher Cluster Coefficient and lower Density for Model 2 – Depends on difference in short distance contacts – Depletion of susceptibles Group Betweeness higher for Model 1 in Cattle. – Due to long distance transport being more rare Conclusion: Model 2 is a better model (higher likelihood) and the difference may have impact on disease spread prediction.

More than distance? Why not compare to observed networks? Is there something but distance that matter? Some work in progress…

More than distance? Pig industry very structured, production types – Multiplying herd – Sow pool central unit – Sow pool satellite herd – Fattening herd – Farrow to finish herd – Piglet producing herd – Nucleus herd

From To

Production types in cattle? Dairy and beef producers Male calves on dairy farms are often sold to beef producers (at lest in Sweden) Other differences in production types? – Roping? – Organic farming? – Climate/geografic factors

More than distance? Reality is messy… – Data base not perfect – Missing production types – Several production types per farms Weights in the model – A farm is a fraction of each possible type. One parameter estimation per combination (sender/receiver) of production types.

More than distance? Size dependence – Size (Capacity) – Two different sizes Adult sows Piglets Different production types have different response – Different for sending or receiving – Total 64x4 parameters just for size… – Modeled as power function (Size θ )

More than distance? Distance dependence – Different for different production types – Variance, Kurtosis and mixing parameter for each combination

More than distance? Many parameters… 9*64=576 Some combinations of production types have few transports => uncertain estimations. Variance and kurtosis not clearly different from ∞. Using a prior may help – But it’s nicer to be objective…

Hierarchical Bayesian We can let the data decide the prior – Hyper parameters Hierarchical Bayesian model “Borrowing strength”

Animal transports – part 2 θ1θ1 θ2θ2 θ3θ3 θnθn Data P(θ )

Hierarchical Bayesian When would this make sense? – If parameters values are expected to be different but not totally different – E.g. distance… Parameter estimations based on much data… – Little influence of hierarchical prior Parameter estimations with little data… – Highly influenced by the hierarchical prior. Increases the variance of the prior distribution.

Thank you Questions?