Layered continuous time processes in biology Combining causal statistical time series with fossil measurements. Tore Schweder and Trond Reitan CEES, University.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

State Estimation and Kalman Filtering CS B659 Spring 2013 Kris Hauser.
A.M. Alonso, C. García-Martos, J. Rodríguez, M. J. Sánchez Seasonal dynamic factor model and bootstrap inference: Application to electricity market forecasting.
Rutgers CS440, Fall 2003 Review session. Rutgers CS440, Fall 2003 Topics Final will cover the following topics (after midterm): 1.Uncertainty & introduction.
Use of Kalman filters in time and frequency analysis John Davis 1st May 2011.
0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Modeling Uncertainty over time Time series of snapshot of the world “state” we are interested represented as a set of random variables (RVs) – Observable.
An Introduction to Variational Methods for Graphical Models.
Observers and Kalman Filters
Modelling phenotypic evolution using layered stochastic differential equations (with applications for Coccolith data) How to model layers of continuous.
Introduction to Sampling based inference and MCMC Ata Kaban School of Computer Science The University of Birmingham.
Bayesian statistics – MCMC techniques
Suggested readings Historical notes Markov chains MCMC details
Visual Recognition Tutorial
Resolving the paradox of stasis: models with stabilizing selection explain evolutionary divergence on all timescales Suzanne Estes & Stevan J. Arnold.
Stanford CS223B Computer Vision, Winter 2007 Lecture 12 Tracking Motion Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens.
Bootstrap in Finance Esther Ruiz and Maria Rosa Nieto (A. Rodríguez, J. Romo and L. Pascual) Department of Statistics UNIVERSIDAD CARLOS III DE MADRID.
End of Chapter 8 Neil Weisenfeld March 28, 2005.
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
Estimation and the Kalman Filter David Johnson. The Mean of a Discrete Distribution “I have more legs than average”
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Space-time Modelling Using Differential Equations Alan E. Gelfand, ISDS, Duke University (with J. Duan and G. Puggioni)
Bayesian Estimation (BE) Bayesian Parameter Estimation: Gaussian Case
A Unifying Review of Linear Gaussian Models
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
0 Pattern Classification, Chapter 3 0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda,
Modelling Phenotypic Evolution by Stochastic Differential Equations Combining statistical timeseries with fossil measurements. Tore Schweder and Trond.
Computer vision: models, learning and inference Chapter 19 Temporal models.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
A toolbox for continuous time series analysis.. Why continuous time? a) If you can handle continuous time, you can handle discrete observations also,
Introduction to Linear Regression
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Overview Particle filtering is a sequential Monte Carlo methodology in which the relevant probability distributions are iteratively estimated using the.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
The Logistic Growth SDE. Motivation  In population biology the logistic growth model is one of the simplest models of population dynamics.  To begin.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
CS Statistical Machine learning Lecture 24
ECE-7000: Nonlinear Dynamical Systems Overfitting and model costs Overfitting  The more free parameters a model has, the better it can be adapted.
Principles of the Global Positioning System Lecture 12 Prof. Thomas Herring Room ;
Sequential Monte-Carlo Method -Introduction, implementation and application Fan, Xin
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
3.1 Selection as a Surface Stevan J. Arnold Department of Integrative Biology Oregon State University.
Trond Reitan (Division of statistics and insurance mathematics, Department of Mathematics, University of Oslo) Statistical modelling and latent variables.
Kalman Filtering And Smoothing
Tracking with dynamics
3.2 Evolution on Adaptive Landscapes Stevan J. Arnold Department of Integrative Biology Oregon State University.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Matrix Models for Population Management & Conservation March 2014 Lecture 10 Uncertainty, Process Variance, and Retrospective Perturbation Analysis.
ESTIMATION METHODS We know how to calculate confidence intervals for estimates of  and  2 Now, we need procedures to calculate  and  2, themselves.
Chapter 2: Bayesian hierarchical models in geographical genetics Manda Sayler.
Bioinf.cs.auckland.ac.nz Juin 2008 Uncorrelated and Autocorrelated relaxed phylogenetics Michaël Defoin-Platel and Alexei Drummond.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Hierarchical models. Hierarchical with respect to Response being modeled – Outliers – Zeros Parameters in the model – Trends (Us) – Interactions (Bs)
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
Introduction to Sampling based inference and MCMC
Modelling Phenotypic Evolution by Stochastic Differential Equations
Advanced Statistical Computing Fall 2016
CH 5: Multivariate Methods
Multimodal Learning with Deep Boltzmann Machines
Trond Reitan CEES, University of Oslo
Course: Autonomous Machine Learning
Pattern Classification, Chapter 3
Hidden Markov chain models (state space model)
Filtering and State Estimation: Basic Concepts
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
An Introduction to Variational Methods for Graphical Models
Boltzmann Machine (BM) (§6.4)
Parametric Methods Berlin Chen, 2005 References:
Multivariate Methods Berlin Chen, 2005 References:
Presentation transcript:

Layered continuous time processes in biology Combining causal statistical time series with fossil measurements. Tore Schweder and Trond Reitan CEES, University of Oslo Jorijntje Henderiks University of Uppsala BISP7, Madrid

Overview 1.Introduction - Motivating example: Coccolith data (microfossils) Phenotypic evolution: irregular time series related by (possibly common) latent processes 2.Causality in continuous time processes 3.Stochastic differential equation vector processes Ito representation and diagonalization Tracking processes and hidden layers Kalman filtering Model variants 4.Inference and results Bayesian inference on model properties, models, process parameters and process states Results for the coccolith data. 5.Second application: Phenotypic evolution on a phylogenetic tree Primates - preliminary results 6.Conclusion 2

The size of a single cell algae (Coccolithus) is measured by the diameter of its fossilized coccoliths (calcite platelets). Want to model the evolution of a lineage found at six sites. Phenotype (size) is a process in continuous time and changes continuously. Irregular time series related by latent processes: evolution of body size in Coccolithus Henderiks – Schweder - Reitan 19,899 coccolith measurements, 205 sediment samples (1< n < 400) of body size by site and time (0 to -60 my). 3

Our data – Coccolith size measurements 205 Sample mean log coccolith size (1 < n < 400) by time and site. 4

Evolution of size distribution Fitness = expected number of reproducing offspring. The population tracks the fitness curve (natural selection) The fitness curve moves about, the population follow. With a known fitness, µ, the mean phenotype should be an Ornstein-Uhlenbeck process (Lande 1976). With fitness as a process, µ(t),, we can make a tracking model: 5

The Ornstein-Uhlenbeck process Attributes: Stationary  Normally distributed  long-term level:   Standard deviation: s=  /  2α Markovian  α: pull  corr(x(0),x(t))=e -  t  Time for the correlation to drop to 1/e:  t =1/α   1.96 s  s tt The parameters ( ,  t, s) can be estimated from the data. In this case:  1.99,  t=1/α  0.80Myr, s 

One layer tracking another Black process (  t 1 =1/  1 =0.2, s 1 =2 ) tracking red process (  t 2 =1/  2 =2, s 2 =1 ) Auto-correlation of the upper (black) process, compared to a one-layered SDE model. A slow-tracking-fast can always be re-scaled to a fast-tracking slow process. Impose identifying restriction:  1 ≥  2 7

Process layers - illustration Layer 1 – local phenotypic expression External series Stationary expectancy Observations T Layer 2 – local fitness optimum Layer 3 – hidden climate variations or primary optimum

Causality in SDE processes We want to express in process terms the fact that climate affects the phenotypic optimum which again affects the actual phenotype. Local independence ( Schweder 1970 ):  Context: Composable continuous time processes.  Component A is locally independent of B iff transition prob. of A does not depend on B.  If B is locally dependent on A but not vice versa, we write A→B. Local dependence can form a notion of causality in cont. time processes the same way Granger causality does for discrete time processes. 9

Stochastic differential equation (SDE) vector processes 10

Stochastic differential equation (SDE) vector processes 11 The zeros in this matrix determines the local independencies.

Model variants for Coccolith evolution Model variations: 1, 2 or 3 layers (possibly more) Inclusion of external time series or extra internal time series in connection to the one we are modelling In a single layer:  Local or global parameters  Correlation between sites (inter-regional correlation)  Deterministic response to the lower layer  Random walk (no tracking) 12

Likelihood: Kalman filter 13 The Ito solution gives, together with measurement variances, what is needed to calculate the likelihood using the Kalman filter: and Process Observations Need a linear, normal Markov chain with independent normal observations:

Kalman smoothing (state inference) A tracking model with 3 layers and fixed parameters North Atlantic Red curve: expectancy Black curve: realization Green curve: uncertainty Snapshot: 14

Parameter and model inference  Wide but informative prior distributions respecting identifying restrictions  MCMC for parameter inference on individual models:  MCMC samples+state samples from Kalman smoother -> Possible to do inference on the process state conditioned only on the data.  Model likelihood (using an importance sampler) for model comparison (posterior model probabilities).  Posterior weight of a property C from model likelihoods: 15

Bayesian inference on the 723 models 16

Results Best 5 models in good agreement. (together, 19.7% of summed integrated likelihood):  Three layers.  Common expectancy in bottom layer.  No impact of exogenous temperature series.  Lowest layer: Inter-regional correlations,   0.5. Site- specific pull.  Middle layer: Intermediate tracking.  Upper layer: Very fast tracking. Middle layers: fitness optima Top layer: population mean log size 17

Phenotypic evolution on a phylogenetic tree: Body size of primates 18

First results for some primates 19

Why linear SDE processes? Parsimonious: Simplest way of having a stochastic continuous time process that can track something else. Tractable: The likelihood, L(  )  f(Data |  ), can be analytically calculated by the Kalman filter or directly by the parameterized multi- normal model for the observations. (  = model parameter set) Can code for causal structure (local dependence/independence). Some justification in biology, see Lande (1976), Estes and Arnold (2007), Hansen (1997), Hansen et. al (2008). Great flexibility, widely applicable... Thinking and modelling might then be more natural in continuous time. Allows for varying observation frequency, missing data and observations arbitrary spaced in time. Extensions: Non-linear SDE models… Non-Gaussian instantaneous stochasticity (jump processes). 20

Bibliography  Commenges D and Gégout-Petit A (2009), A general dynamical statistical model with causal interpretation. J.R. Statist. Soc. B, 71,  Lande R (1976), Natural Selection and Random Genetic Drift in Phenotypic Evolution, Evolution 30,  Hansen TF (1997), Stabilizing Selection and the Comparative Analysis of Adaptation, Evolution, 51-5,  Estes S, Arnold SJ (2007), Resolving the Paradox of Stasis: Models with Stabilizing Selection Explain Evolutionary Divergence on All Timescales, The American Naturalist, 169-2,  Hansen TF, Pienaar J, Orzack SH (2008), A Comparative Method for Studying Adaptation to a Randomly Evolving Environment, Evolution 62-8,  Schuss Z (1980). Theory and Applications of Stochastic Differential Equations. John Wiley and Sons, Inc., New York.  Schweder T (1970). Decomposable Markov Processes. J. Applied Prob. 7, 400–410 Source codes, examples files and supplementary information can be found at 21