Global Analyzing community data with joint species distribution models abundance, traits, phylogeny, co-occurrence and spatio-temporal structures Otso.

Slides:



Advertisements
Similar presentations
Space is ecologically meaningful: about the spatial component of the ecological niche, with the help of spectral analysis François Munoz *, Pierre-Olivier.
Advertisements

Null models in Ecology Diane Srivastava Sept 2010.
Community and gradient analysis: Matrix approaches in macroecology The world comes in fragments.
Commonness and rarity in species distribution Sophia Qian Niu Graduate seminar: Lost in space.
Functional traits, trade-offs and community structure in phytoplankton and other microbes Elena Litchman, Christopher Klausmeier and Kyle Edwards Michigan.
TNO orbit computation: analysing the observed population Jenni Virtanen Observatory, University of Helsinki Workshop on Transneptunian objects - Dynamical.
The current status of fisheries stock assessment Mark Maunder Inter-American Tropical Tuna Commission (IATTC) Center for the Advancement of Population.
All for one or One for All? Mapping many species individually vs. simultaneously with random forest. Emilie Henderson, Janet Ohmann, Matthew Gregory, Heather.
Null models and observed patterns of native and exotic diversity: Does native richness repel invasion? Rebecca L. Brown, 1,2 Jason D. Fridley, 1 and John.
Species interaction models. Goal Determine whether a site is occupied by two different species and if they affect each others' detection and occupancy.
Multiple Linear Regression Model
Using the Maryland Biological Stream Survey Data to Test Spatial Statistical Models A Collaborative Approach to Analyzing Stream Network Data Andrew A.
CHAPTER 30 Structural Equation Modeling From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
State-Space Models for Biological Monitoring Data Devin S. Johnson University of Alaska Fairbanks and Jennifer A. Hoeting Colorado State University.
Vogelwarte.ch BIO 232 Macroecology, niche evolution and climate change by Niklaus. E. Zimmermann Today: Damaris Zurell.
How does phylogeny influence ecological patterns? As species of the same genus have usually, though by no means invariably, some similarity in habitats.
Chapter 14 Inferential Data Analysis
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Correlation & Regression
OUR Ecological Footprint …. Ch 20 Community Ecology: Species Abundance + Diversity.
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
Bioscience – Aarhus University Pin-point plant cover data Christian Damgaard Bioscience Aarhus University.
BIOE 293 Quantitative ecology seminar Marm Kilpatrick Steve Munch Spring Quarter 2015.
Trait-based representation of diatom diversity in a Plankton Functional Type model N. T ERSELEER 1, J. B RUGGEMAN 2, C. L ANCELOT 1 AND N. G YPENS 1 1.
Bioscience – Aarhus University Measuring plant abundance and ecological processes with the pin-point method Christian Damgaard Department of Bioscience.
Predicting diversity of large herbivores from moisture and nutrient availability based on the article of Olff et al. (2002): “Global environmental controls.
Multiple regression models Experimental design and data analysis for biologists (Quinn & Keough, 2002) Environmental sampling and analysis.
Regression. Population Covariance and Correlation.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Components of plant species diversity in the New Zealand forest Jake Overton Landcare Research Hamilton.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Regression. Types of Linear Regression Model Ordinary Least Square Model (OLS) –Minimize the residuals about the regression linear –Most commonly used.
Introducing model-data fusion to graduate students in ecology Topics of discussion: The impact of NEON on ecology What are the desired outcomes from a.
Paul Eastwood Centre for Environment, Fisheries and Aquaculture Science (Cefas) Lowestoft, UK INCOFISH WP3 Brazil workshop.
Statistical approach Statistical post-processing of LPJ output Analyse trends in global annual mean NPP based on outputs from 19 runs of the LPJ model.
Model Selection and Validation. Model-Building Process 1. Data collection and preparation 2. Reduction of explanatory or predictor variables (for exploratory.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
I231B QUANTITATIVE METHODS ANOVA continued and Intro to Regression.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Lecture 6 Your data and models are never perfect… Making choices in research design and analysis that you can defend.
Workshop on Applied Hierarchical Modeling in BUGS and unmarked Patuxent Wildlife Research Center November 2015.
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
Integrating large-scale survey data sets with climate and land use data to model species distribution dynamics Andrew M. Latimer and John A. Silander Department.
 1 Species Richness 5.19 UF Community-level Studies Many community-level studies collect occupancy-type data (species lists). Imperfect detection.
Monitoring and Estimating Species Richness Paul F. Doherty, Jr. Fishery and Wildlife Biology Department Colorado State University Fort Collins, CO.
Using Regional Models to Assess the Relative Effects of Stressors Lester L. Yuan National Center for Environmental Assessment U.S. Environmental Protection.
1 Occupancy models extension: Species Co-occurrence.
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 18 Part 5 Analysis and Interpretation of Data DIFFERENCES BETWEEN GROUPS AND RELATIONSHIPS.
 1 Modelling Occurrence of Multiple Species. 2 Motivation Often there may be a desire to model multiple species simultaneously.  Sparse data.  Compare/contrast.
 Multi-state Occupancy. Multiple Occupancy States Rather than just presence/absence of the species at a sampling unit, ‘occupancy’ could be categorized.
Functional Traits and Niche-based tree community assembly in an Amazonian Forest Kraft et al
OUR Ecological Footprint …. Fall 2008 IB Workshop Series sponsored by IB academic advisors Study Abroad for IB Majors Thursday, October 30 4:00-5:00PM.
 Occupancy Model Extensions. Number of Patches or Sample Units Unknown, Single Season So far have assumed the number of sampling units in the population.
Combining pin-point and Braun-Blanquet plant cover data
Hierarchical models. Hierarchical with respect to Response being modeled – Outliers – Zeros Parameters in the model – Trends (Us) – Interactions (Bs)
Bioscience – Aarhus University Modelling the joint abundance of more plant species - pin-point cover data Christian Damgaard Department of Bioscience Aarhus.
Multiple Regression.
Mathematical Biology Group
Combining Species Occupancy Models and Boosted Regression Trees
Multiple Regression.
Tabulations and Statistics
Department of Bioscience
OVERVIEW OF LINEAR MODELS
Non-marine biological evidence
SPATIAL ANALYSIS IN MACROECOLOGY
Structural Equation Modeling
Presentation transcript:

Global Analyzing community data with joint species distribution models abundance, traits, phylogeny, co-occurrence and spatio-temporal structures Otso Ovaskainen University of Helsinki, Finland NTNU Trondheim, Norway

What structures the assembly and dynamics of communities? Leibold et al. (2004): The dynamics and distributions of communities are shaped by the interplay between i.environmental filtering ii.species interactions iii.spatial and stochastic processes Logue et al. (2011): Metacommunity theories are still poorly linked with data. There is a lack of statistical frameworks that would enable one to infer metacommunity processes from data typically available in community ecological studies.

Data typically available for community ecological studies Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time

Global Regional Local A statistical framework for community ecology global species pool regional species pool Spatial and neutral processes phylogenetic relationships species traits local species pool Biotic interactions Environmental filtering different diversity measures Environmental variation observed community Sampling process Dorazio and Royle 2005, Dorazio et al. 2006, Kery et al. 2009, Russell et al. 2009, Dorazio et al. 2010, Zipkin et al. 2010, Ovaskainen and Soininen 2011, Jackson et al. 2012, Olden et al. 2014, Dunstan et al. 2011, Hui et al. 2013, Ovaskainen et al. 2015ab le Roux et al. 2014, Pellissier et al. 2013, Ovaskainen et al. 2010, Sebastian-Gonzalez et al. 2010, Pollock et al. 2014, Clark et al. 2014, Ovaskainen et al. 2015a Pollock et al. 2012, Brown et al. 2014, Ovaskainen et al. 2015a. Dorazio and Connor 2014 Helmus et al. 2007, Ives and Helmus 2011 Latimer et al. 2009, Blangiardo et al. 2013, Borcard and Legendre 2002, Dray et al. 2006, Dray et al. 2012, Thorson et al. 2015, Ovaskainen et al 2015b Dorazio and Royle 2005, Dorazio et al. 2006, Kery et al. 2009, Russell et al. 2009, Dorazio et al. 2010, Zipkin et al Evolutionary processes

Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time SDM (species distribution model)

Linear predictor for sampling unit j Species occurrence: environmental covariates regression parameters SDM (species distribution model) Latent occurrence score: Example link function: probit regression for presence-absence data Residual:

JSDM (joint species distribution model) Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time

Latent occurrence score for species i in sampling unit j residual environmental covariates Approaches to community modelling (Ferrier and Guisan, 2006): ‘assemble first, predict later’ ‘predict first, assemble later’ ‘assemble and predict together’ regression parameters JSDM (joint species distribution model)

residual environmental covariates Approaches to community modelling (Ferrier and Guisan, 2006): ‘assemble first, predict later’ ‘predict first, assemble later’ ‘assemble and predict together’ JSDM (joint species distribution model) regression parameters Latent occurrence score for species i in sampling unit j

residual environmental covariates Species level Community level JSDM (joint species distribution model) regression parameters Latent occurrence score for species i in sampling unit j

Number of species Number of sites independent models, prior 1 independent models, prior 2 community model, prior 1 community model, prior 2 training data full data Ovaskainen and Soininen (Ecology, 2011) Oldén et al. (Plos one, 2014) Example: borrowing information from other species to parameterize models for rare species 500 diatom species surveyed for presence- absence on 105 sampling units (streams) Training data: 35 sampling units Validation data: 70 sampling units

residual environmental covariates JSDM (joint species distribution model) regression parameters Latent occurrence score for species i in sampling unit j Ovaskainen et al. (Methods in Ecology and Evolution, 2015a) Warton et al. (TREE, 2015) factor loadings latent factors Modelling co-occurrence through latent factors

P(negative association)>0.95 P(positive association)>0.95 Example: co-occurrence among wood-inhabiting fungi Ovaskainen et al. (Methods in Ecology and Evolution, 2015a)

Resource unitPlotForestTotal Co-occurrence can be estimated at multiple spatial scales Ovaskainen et al. (Methods in Ecology and Evolution, 2015a)

Prevalence Ovaskainen et al. (Methods in Ecology and Evolution, 2015a) Accounting for co-occurrence improves model predictions Prediction based on covariates and the occurrences of other species Prediction based on covariates only

Latent variables can be viewed as model based ordination Model-based biplots for alpine plant data, from Warton et al. (TREE, 2015)

TJSDM (trait-based joint species distribution model) Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time

residual environmental covariates TJSDM (trait-based joint species distribution model) regression parameters Latent occurrence score for species i in sampling unit j

traits environmental covariates Species level Community level TJSDM (trait-based joint species distribution model) regression parameters regression parameters: how traits influence the species responses to environmental covariates Latent occurrence score for species i in sampling unit j

agaricoid resupinate corticioid pileate corticioid discomycetoid resupinate polyporoid pileate polyporoid ramarioid stromatoid tremelloid spore size spore ornamentation spore cell wall presence of asexual structures 30 µm 50 µm 0%40% 0%15%30%70% Life-form Example: distribution of fungal traits Most abundant group Natural forests Least abundant group Abrego, Norberg and Ovaskainen (in prep)

agaricoid resupinate corticioid pileate corticioid discomycetoid resupinate polyporoid pileate polyporoid ramarioid stromatoid tremelloid spore size spore ornamentation spore cell wall presence of asexual structures 30 µm 50 µm 0%40% 0%15%30%70% Life-form Example: distribution of fungal traits Most abundant group P(difference between natural and managed forests)>0.95 Natural forests Managed forests More common in managed forestsLess common in managed forests Least abundant group Abrego, Norberg and Ovaskainen (in prep)

residual environmental covariates TJSDM (trait-based joint species distribution model) regression parameters Latent occurrence score for species i in sampling unit j

PTJSDM (phylogenetically constrained trait-based joint species distribution model) Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time

Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time PTJSDM (phylogenetically constrained trait-based joint species distribution model)

residual environmental covariates Species level Traits Phylogenetic relationship matrix Strength of phylogenetic signal PTJSDM (phylogenetically constrained trait-based joint species distribution model) regression parameters Latent occurrence score for species i in sampling unit j Ives and Helmus (Ecological Monographs, 2011)

agaricoid resupinate corticioid pileate corticioid discomycetoid resupinate polyporoid pileate polyporoid ramarioid stromatoid tremelloid spore size spore ornamentation spore cell wall presence of asexual structures 30 µm50 µm 0%40% 0%15%30%70% Life-form Example: distribution of fungal traits is correlated with phylogeny Most abundant group Natural forests Least abundant group Abrego, Norberg and Ovaskainen (in prep)

STSDM (spatio-temporal species distribution model) Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time

residual environmental covariates Spatial, temporal or spatio- temporal covariance STSDM (spatio-temporal species distribution model) regression parameters Latent occurrence score for species i in sampling unit j

Jousimo et al. (in prep) Example: inferring spatio-temporal population dynamics of wolf from winter-track data The dataThe fitted model

STJSDM (spatio-temporal joint species distribution model) Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time

residual environmental covariates STJSDM (spatio-temporal joint species distribution model) regression parameters Latent occurrence score for species i in sampling unit j

Latent factors Training and validation data Covariates Example: modelling the distributions of 55 butterfly species in GB Ovaskainen et al. (Methods in Ecology and Evolution, 2015b)

The inclusion of spatially structured latent factors improved the model’s ability to predict the validation data Covariates and latent factors, mean = 0.42 Covariates only, mean = 0.30 Prevalence

STPTJSDM (spatio-temporal phylogenetically constrained trait-based joint species distribution model) Y species sampling units X covariates C species T traits Occurrence Environment Phylogeny Traits Space and time Ovaskainen et al. (ms)

Software Environmental covariates Traits Presence-absence data Co-occurrence through latent variables Abundance (and other kinds of) data Phylogenetic correlations Spatio-temporal latent variables Latent variables that co-vary with measured covariates Time-series models Etc. In preparation Interested in contributing? Post-doc (and other) funding available for Contact:

Global Conclusions There is a lack of statistical frameworks that would enable one to infer metacommunity processes from data typically available in community ecological studies. Joint species distribution modelling is one fast developing area which tries to fill this gap. A lot of relevant structures can be built into generalized hierarchical linear mixed models: hierarchical layers, covariance structures, error structures and link functions. The joint species distribution models presented here are of general nature and thus applicable to many kinds of study systems and study questions. More refined information on specific systems may be obtained by other approaches (e.g. process-based state-space models).