Community assembly through trait selection ( CATS ): Modelling from incomplete information.

Slides:



Advertisements
Similar presentations
Contrasting tissue strategies explain functional beta diversity in Amazonian trees C. Fortunel, C.E.T. Paine, N. Kraft, P.V.A. Fine, C. Baraloto*
Advertisements

Population Ecology. Population Demographics Demographics are the various characteristics of a population including, Population Size, Age Structure, Density,
Community and gradient analysis: Matrix approaches in macroecology The world comes in fragments.
Diversity determinants. To be present in a community, a species has 1. To be able to reach the site (overcome the dispersal limitation) 2. To be able.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Evolution of Biodiversity
Commonness and rarity in species distribution Sophia Qian Niu Graduate seminar: Lost in space.
Species-Abundance Distribution: Neutral regularity or idiosyncratic stochasticity? Fangliang He Department of Renewable Resources University of Alberta.
1 Multiple Regression Interpretation. 2 Correlation, Causation Think about a light switch and the light that is on the electrical circuit. If you and.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
The neutral model approach Stephen P. Hubbell (1942- Motoo Kimura ( )
Review: What influences confidence intervals?
Maximum likelihood estimates What are they and why do we care? Relationship to AIC and other model selection criteria.
Linear Regression.
. PGM: Tirgul 10 Parameter Learning and Priors. 2 Why learning? Knowledge acquisition bottleneck u Knowledge acquisition is an expensive process u Often.
Maximum Entropy, Maximum Entropy Production and their Application to Physics and Biology Roderick C. Dewar Research School of Biological Sciences The Australian.
Maximum Entropy Model LING 572 Fei Xia 02/07-02/09/06.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Information Theory and Security
Lecture II-2: Probability Review
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
OUR Ecological Footprint …. Ch 20 Community Ecology: Species Abundance + Diversity.
The Triangle of Statistical Inference: Likelihoood
Introduction: Why statistics? Petter Mostad
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Chapter 5 Characterizing Genetic Diversity: Quantitative Variation Quantitative (metric or polygenic) characters of Most concern to conservation biology.
Department of Cognitive Science Michael J. Kalsher PSYC 4310 COGS 6310 MGMT 6969 © 2015, Michael Kalsher Unit 1B: Everything you wanted to know about basic.
Ecology 8310 Population (and Community) Ecology. Context.
National Accounts and SAM Estimation Using Cross-Entropy Methods Sherman Robinson.
On information theory and association rule interestingness Loo Kin Kong 5 th July, 2002.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
The Triangle of Statistical Inference: Likelihoood Data Scientific Model Probability Model Inference.
Extent and Mask Extent of original data Extent of analysis area Mask – areas of interest Remember all rasters are rectangles.
Niches versus Neutrality Reviews of Neutral Models Levine & HilleRisLambers (2009) Nature 461:254.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Confidence Interval & Unbiased Estimator Review and Foreword.
 Evolution is the change in the inherited traits of a population of organisms through successive generations  Two factors at work:  Processes that.
Retain H o Refute hypothesis and model MODELS Explanations or Theories OBSERVATIONS Pattern in Space or Time HYPOTHESIS Predictions based on model NULL.
Chp. 36 What impact did BP disaster have on the ocean ecosystem and population?? Reflect on this disaster….
Blocking The phenomenon of blocking tells us that what happens to one CS depends not only on its relationship to the US but also on the strength of other.
PCB 3043L - General Ecology Data Analysis.
Global Analyzing community data with joint species distribution models abundance, traits, phylogeny, co-occurrence and spatio-temporal structures Otso.
Lecture 1: Basic Statistical Tools. A random variable (RV) = outcome (realization) not a set value, but rather drawn from some probability distribution.
Lecture 3: MLE, Bayes Learning, and Maximum Entropy
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
Ecology 8310 Population (and Community) Ecology Communities in Space (Metacommunities) Island Biogeography (an early view) Evolving views Similarity in.
© 2015 Pearson Education, Inc. POPULATION STRUCTURE AND DYNAMICS.
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Functional Traits and Niche-based tree community assembly in an Amazonian Forest Kraft et al
OUR Ecological Footprint …. Fall 2008 IB Workshop Series sponsored by IB academic advisors Study Abroad for IB Majors Thursday, October 30 4:00-5:00PM.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
How do we define Biodiversity Quantitatively? The National Institute for Mathematical and Biological Synthesis.
MULTIPLE GENES AND QUANTITATIVE TRAITS
Models.
PCB 3043L - General Ecology Data Analysis.
AP Environmental Chapter 6
CHAPTER 29: Multiple Regression*
MULTIPLE GENES AND QUANTITATIVE TRAITS
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Significance Tests: The Basics
Incremental Partitioning of Variance (aka Hierarchical Regression)
Null models in community ecology
DESIGN OF EXPERIMENTS by R. C. Baker
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Community assembly through trait selection ( CATS ): Modelling from incomplete information

A seminar in three parts The ecological concept The maximum entropy formalism Edwin Jaynes The CATS model

Part 1 The ecological concept

Trait-based filtering Regional species pool: determined by traits + history abiotic filters (determined by traits) biotic filters (determined by traits) Immigration rate: determined by abundance in region + traits Local community Relative abundance in local community

Philip Grime A plant strategy is a suite of physiological, morphological or phenological traits of individuals that affect probabilities of survival, reproduction or immigration and that is systematically associated with particular environmental conditions The most common trait values in a local community will be possessed by those individuals having the greatest probabilites of survival, reproduction and immigration. relative abundance~ probability of passing filters A B C D E species t A t B t C t d t E traits of species This average trait value will reflect the selective advantage/disadvantage of this trait in passing through the various abiotic and biotic filters community mean trait value

Philip Grime Two consequences of Grime’s ideas for community assembly… 1.If we know the values of particular abiotic variables determining the filters, then we should be able to predict the « typical » values of the functional traits found in this local community; i.e. community-weighted means. 2.If we know the « typical » values of the functional traits that are found in this community, and we know the actual trait values of the species in a regional species pool, then we should be able to predict which of these species will be dominants, which will be subordinates, and which will be rare or absent.

The three basic parts of the CATS model

λ: selection on trait in the local community (greater probability of passing if smaller…) q : distribution of relative abundance of species in the metacommunity Trait (t) p : distribution of relative abundance of species in the local community Trait (t) metacommunity Local community

Measuring abundances Abundance of plant species can be measured as - numbers of stems - biomass or indirect measures of this (cover, dbh …) Units of observation are ambiguous - numbers of what? (ramets ≠ genets, biomass units?) Units of observations are never independent Local abundance Most values will be zero (species present in region but not in local site)

Measuring abundances Meta-community abundances Sometimes no information Sometimes vague information (very common, common, rare) Sometimes more quantitative information

Part 2 The maximum entropy formalism Edwin Jaynes Jaynes, E.T Probability theory. The logic of science. Cambridge U Press.

How can we quantify learning? If the sun does set tonight (and our previous information told us it would, p= ) then we will have learned almost nothing new (almost no new information) If the sun doesn’t set tonight (and our previous information told us it would, 1-p= ) then we will have learned something incredible (lots of new information) The amount of learning (new information): Historical log 2, we will use log e =ln Claude Shannon

Average information content (i.e. new information) Specify all of the logically possible states in which some phenomenon can exist (i=1,2,…,S) Based on what we know before observing the actual state, assign values between 0 and 1 to each possible state: p=(p 1, p 2, …, p S ) Information entropy The amount of new information that we will learn if state i occurs is The average amount of new information that we will learn is

Information content and uncertainty New information = information that we don’t yet possess = uncertainty Information entropy measures the amount of new information we will gain once we learn the truth Information entropy therefore measures the amount of information that we don’t yet possess Information entropy is a measure of the degree of uncertainty that we have about the state of a phenomenon Maximum uncertainty = maximum entropy

A betting game: game A I bring you, blindfolded, to a field outside Sherbrooke. I tell you that there is a plant at your feet that belongs to either species A or species B. You have assign probabilities to the two possible states (species A or B) and will get that proportion of $1,000,000 once you learn the species name. What is your answer?

A betting game: game B New game with some new clues: (i) The site is a former cultivated field that was abandoned by a farmer last year. (ii) Species A is an annual herb. Species B is a climax tree. You have assign probabilities to the two possible states and will get that proportion of $1,000,000. What is your answer? If you changed your bet in this second game then these new clues provided you with some new, but incomplete, information before you learned the answer.

Maximum entropy (maximum uncertainty) at p=(0.5,0.5) p(annual herb)=0.9 p(climax tree)=0.1) Amount of information contained in the ecological clues Amount of remaining uncertianty, given the ecological clues

Relative entropy (Kullback-Leibler divergence) Let q i =1/S and S be the fixed number of unordered, discrete states; q is a uniform distribution Relative to a “prior” or reference distribution (q) Information entropy α relative entropy given a uniform distribution Maximizing relative entropy = maximizing entropy

The maximum entropy formalism in an ecological context: A three-step program Edwin Jaynes Jaynes, E.T Probability theory. The logic of science. Cambridge U Press.

The maximum entropy formalism in an ecological context Edwin Jaynes Jaynes, E.T Probability theory. The logic of science. Cambridge U Press. Specify a reference (q, prior) distribution that encodes what you know about the relative abundances of each species in the species pool before obtaining any information about the local community. Step 1: specifying the relative abundances in the regional meta- community (the prior distribution) “All I know is that there are S species in the meta-community” “All I know is that there are S species in the meta-community and I know their relative abundances in this meta-community” qiqi

The maximum entropy formalism in an ecological context Edwin Jaynes Jaynes, E.T Probability theory. The logic of science. Cambridge U Press. Step 2: quantifying what we know about trait-based community assembly in the local community “I have measured the average value of my functional traits (community-weighted traits values)” “ I have measured the environmental conditions, and know the function linking these environmental conditions to the community-weighted trait values”

The maximum entropy formalism in an ecological context Edwin Jaynes Jaynes, E.T Probability theory. The logic of science. Cambridge U Press. Step 3: choose a probability distribution which agrees with what we know, but doesn’t include any more information (i.e. don’t lie). Choose values of p that agree with what we know: And that maximizes the remaining uncertainty

The maximum entropy formalism in an ecological context Edwin Jaynes Jaynes, E.T Probability theory. The logic of science. Cambridge U Press. The solution is a general exponential distribution t ij : Trait value of trait j of species i q i : Prior distribution of species i λ j :The amount by which a one-unit increase in trait j will result in a proportional change in relative abundance (p i )

The maximum entropy formalism in an ecological context Edwin Jaynes Jaynes, E.T Probability theory. The logic of science. Cambridge U Press. Practical considerations Except for maxent models with only a few constraints, we need numerical methods in order to fit them to data. I use a proportionality between the maximum likelihood of a multinomial distribution and the λ values of the solution to the maxent problem (Improved Iterative Scaling), available in the maxent() function of the FD library in R. Della Pietra, S. Della Pietra, V., Lafferty, J Inducing features of random fields. IEEE Transactions Patern Analysis and Machine Intelligence 19:1-13 Laliberté, E., Shipley, B Measuring functional diversity from multiple traits, and other tools for functional ecology R. package, Vienna, Austria By permuting trait vectors relative to species’ observed relative abundances, one can develop permutation tests of significance concerning model fit Shipley, B Ecology 91:

There are now many empirical applications of this model in many places around the world, and applied at many geographical scales. Two examples… CATS

96 1m2 quadrats containing the understory herbaceous plants of ponderosa pine forests (Arizona USA). Quadrats were distributed across seven permanent sites within a 120 km2 landscape between m altitude. SHIPLEY et al A strong test of a maximum entropy model of trait-based community assembly. Ecology 92: 507–517. Daniel Laughlin’s PhD thesis Available information 12 environmental variables measured in each quadrat 20 functional traits measured per species Relative abundance of each species in each quadrat

Significant predictive ability by 3 traits After ~7 traits, mostly redundant information If species is present, and not rare (>10%), CATS predicts its abundance well If species is present, but rare (<10%), CATS can’t distinguish degrees of rarity If species is absent (X) then CATS will predict it to be rare

If we only know the environmental conditions of a site, and the general relationship between community-weighted traits and the environment, how well can CATS do? Actual measured community-weighted value for this site = 9.5 General relationship: Y= X Predicted value for community-weighted trait, given that we know the environmental value is 7 = 7.61

The best possible prediction given the environment would be obtained with 79 separate generalized additive (form-free) regressions – one for each species in the species pool – of relative abundance vs. the environmental variables. gam(S~X)

Shipley, Paine & Baraloto Quantifying the importance of local niche-based and stochastic processes to tropical tree community assembly.. Ecology 93: Stephen Hubbell The unified neutral theory of biodiversity and biogeography The per capita probabilities of immigration from the meta-community, and the per capita probabilities of survival and reproduction of all species are equal (demographic neutrality). Subsequent population dynamics in the local community are determined purely by random drift. Local relative abundance ~meta-community relative abundance + drift « Neutral prior » = meta-community relative abundances Second example: tropical forests in French Guiana

1. Fit model using traits but a uniform prior (i.e. no effect of meta-community immigration) 2. Fit model permuting traits but with a neutral prior (i.e. no effects of local trait-based selection, but contribution from immigration) 3. Fit model with traits and with neutral prior(i.e. locat trait-based selection plus immigration from meta-community Partition the total variance explained into: 1.That due only to immigration 2.That due only to local trait-based selection 3.That due jointly to immigration and traits (correlations with meta-community 4.Unexplained variation due to demographic stochasticity

We did this at three spatial scales: 1, 0.25 and 0.01 ha. Dispersal limitation from meta- community Trait-based selection Demographic stochasticity

Practical considerations. How do we fit this model to empirical data? A trick involving likelihood the number of independent allocations to species i A total of A independent allocations of individuals or units of biomass into each of the S species in the species pool The probability of a single allocation going to species i (i.e. the probability of sufficient resources being captured to produce one individual or unit of biomass for species i); a function of its traits (T i ), the strength of the trait on selection (λ), and its meta-community abundance (q i )

The likelihood: Taking logarithms, dividing both sides by A, and re-arranging, one obtains In practice we can never know this, since neither individuals or resources (biomass) are ever allocated independently…

Maximising this… Will maximise the likelihood of the unknown multinomial distribution But we already know that

Choose values of λ that maximise this function, given the vector of traits (t i ) for each species in the species pool and their meta-community abundances (q i ): The dual solution (Della Pietra et al. 1997): the values of λ that maximise the relative entropy in the maximum entropy formalism are the same as the values that maximise the likelihood of a multinomial distribution. Della Pietra, S. Della Pietra, V., Lafferty, J Inducing features of random fields. IEEE Transactions Patern Analysis and Machine Intelligence 19:1-13. Improved Iterative Scaling algorithm  maxent() in the FD package of R. Laliberté, E., Shipley, B Measuring functional diversity from multiple traits, and other tools for functional ecology R. package, Vienna, Austria

Permutation tests for the CATS model Choose values of p that agree with what we know: And that maximizes the remaining uncertainty H 0 : trait values (t ij ) are independent of the observed relative abundances (o i )

2. Randomly permute the vector of trait values (T * i ) between the species (so traits are independent of observed relative abundances) 3. Calculate 4. Repeat steps 2 & 3 a large number of times. 1. CalculateThe smaller the value, the better the fit 5. Count the number of times f * > f. This is an estimate of the null probability. Permutation tests for the CATS model maxent.test() function in the FD library. Shipley, B Ecology 91: