Case Study 1 - Tree Species Abundance

Slides:



Advertisements
Similar presentations
Case Study 2 Neighborhood Models of the Allelopathic Effects of an Invasive Tree Species Gómez-Aparicio, L. and C. D. Canham Neighborhood analyses.
Advertisements

Case Study: Distribution and Abundance of Tree Species Along Climate Gradients (Sometimes the PDF is more interesting than the actual model…) Canham, C.
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Objectives (BPS chapter 24)
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Chapter 6 Continuous Random Variables and Probability Distributions
Excellence Justify the choice of your model by commenting on at least 3 points. Your comments could include the following: a)Relate the solution to the.
BCOR 1020 Business Statistics
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Chapter 5 Continuous Random Variables and Probability Distributions
Continuous probability distributions
Spreadsheet Modeling & Decision Analysis A Practical Introduction to Management Science 5 th edition Cliff T. Ragsdale.
Chapter 4 Continuous Random Variables and Probability Distributions
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Inference for regression - Simple linear regression
Statistical Techniques I
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Descriptive Statistics II: By the end of this class you should be able to: describe the meaning of and calculate the mean and standard deviation of a sample.
Chapter 6 Continuous Distributions The Gaussian (Normal) Distribution.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
The Normal Curve Packet #23. Normal Curve  Referred to as a bell- shaped curve  Perfect mesokurtic distribution.
Unit One Notes: Graphing How do we graph data?. Name the different types of graphs (charts).
Managerial Economics Demand Estimation & Forecasting.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Regression Regression relationship = trend + scatter
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
PCB 3043L - General Ecology Data Analysis.
Case Study 1: Distribution and Abundance of Tree Species Along Climate Gradients (Sometimes the PDF is more interesting than the actual model…) Canham,
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
Hypothesis test flow chart
BPS - 5th Ed. Chapter 231 Inference for Regression.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Tom.h.wilson Department of Geology and Geography West Virginia University Morgantown, WV.
The simple linear regression model and parameter estimation
Chapter 4: Basic Estimation Techniques
SUR-2250 Error Theory.
Case Study 1 - Tree Species Abundance
Physics 114: Lecture 13 Probability Tests & Linear Fitting
Chapter 4 Basic Estimation Techniques
Basic Estimation Techniques
Stat 31, Section 1, Last Time Sampling Distributions
Data Science Algorithms: The Basic Methods
PCB 3043L - General Ecology Data Analysis.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Analyzing and Interpreting Quantitative Data
Inverse Transformation Scale Experimental Power Graphing
The Practice of Statistics in the Life Sciences Fourth Edition
Basic Estimation Techniques
Introduction to Summary Statistics
Regression Inference.
Introduction to Summary Statistics
Case Study - Neighborhood Models of Allelopathy
Inferential Statistics
Week 3 Lecture Statistics For Decision Making
Basic Practice of Statistics - 3rd Edition Inference for Regression
6.1 Introduction to Chi-Square Space
Regression Statistics
Chapter 14 Inference for Regression
Ensemble forecasts and seasonal precipitation tercile probabilities
M248: Analyzing data Block A UNIT A3 Modeling Variation.
Z-Scores Locations of Scores & Standardized Distributions.
Continuous Probability Distributions
Advanced Algebra Unit 1 Vocabulary
Presentation transcript:

Case Study 1 - Tree Species Abundance C. D. Canham Likelihood Methods in Ecology April 25 - 29, 2011 Granada, Spain Case Study 1: Distribution and Abundance of Tree Species Along Climate Gradients (Sometimes the PDF is more interesting than the actual model…) Canham, C. D. and R. Q. Thomas. 2010. Frequency, not relative abundance, of temperate tree species varies along climate gradients in eastern North America. Ecology 91:3433-3440

The Data Relative abundance of the 24 most common tree species in the northeastern US In ~ 20,000 US Forest Service Forest Inventory and Analysis (FIA) plots from 19 northeastern states (Maine to Wisconsin, south to Kentucky and Virginia) Climate (mean annual temperature and average annual precipitation) for each plot, averaged over the period since the previous census Using 800-m resolution gridded PRISM climate data, with bi-linear interpolation to true plot locations

The Basic Question http://www.fs.fed.us/ne/delaware/atlas/s318.html How does the abundance of tree species vary along climate gradients? http://www.fs.fed.us/ne/delaware/atlas/s318.html

Basic Approach Develop regression models that predict abundance of a given tree species in a plot as a function of climate at the location of the plot… Initial decisions: What to use as a measure of abundance? I chose relative abundance over absolute abundance What sorts of functions could describe variation in relative abundance along climate gradients Compare a Gaussian function with a null model that was flat

But is a flat line an appropriate null hypothesis? What about range limits? And shouldn’t even the Gaussian model have truncated tails?

So, what do the data look like? Acer rubrum (red maple) Quercus prinus ( chestnut oak) It doesn’t look like there’s much hope for nice Gaussian niche curves…

A first regression model (in R) # Square Gaussian - predicts mean of 0 when below lo or above hi square.gauss.model <- function(a,m,b,lo,hi,X) { ifelse(X < lo,0,ifelse(X > hi,0,a*exp(-0.5*(((X-m)/b)^2)))) } 5 Parameters: “a” determines height of curve at mode “m” temperature at peak of curve “b” breadth of the curve “lo” lower temperature limit “hi” upper temperature limit Note that a separate “null” model is not really necessary, because if “b” is large, the curve is flat

What might be an appropriate PDF? The data are clearly not normally distributed… In fact, there are a whole lot of zeros….

Divide the analysis up into two parts Predict probability of “presence” (i.e. non-zero abundance) – Separately predict relative abundance when present These two components of abundance have very different ecological meanings….

A PDF for relative abundance when present: The Gamma Function One likely choice for plots where a species is present… Gamma PDF with scale parameter = 22

First Results Relative Abundance (when present) Bottom line: for all 24 species, relative abundance is highly variable, but shows very little trend in the mean across the entire range of a species’ climate niche limits Note: these results are for relative abundance, given that a species is present…

But what about all of those zeros But what about all of those zeros? Creating a PDF that can model both “presence” and “relative abundance” “Zero-Inflated Distributions” # ZERO INFLATED NORMAL PDF zinf_norm_PDF <- function(x,mean,sd,pz) { log(ifelse(x==0,pz + (1-pz)*dnorm(0,mean,sd,log=F), (1-pz)*dnorm(x,mean,sd,log=F))) } # ZERO INFLATED GAMMA PDF zinf_gamma_PDF_climate <- function(x,mean,scale,px) { shape <- mean/scale loglh <- log(ifelse(x==0,pz + (1-pz)*dgamma(0,shape=shape,scale=scale,log=F), (1-pz)*dgamma(x,shape=shape,scale=scale,log=F))) return(loglh) }

But why stop there? Is presence constant across climate gradients, or does it vary? # ZERO INFLATED GAMMA PDF with pz a gaussian function of the # independent (climate) variable zinf_gamma_PDF_climate <- function(x,mean,scale,pa,pm,pb,px) { shape <- mean/scale pz <- 1 - pa*(exp(-0.5*(((px-pm)/pb)^2))) # px is temp or precip in the plot loglh <- log(ifelse(x==0,pz + (1-pz)*dgamma(0,shape=shape,scale=scale,log=F), (1-pz)*dgamma(x,shape=shape,scale=scale,log=F))) return(loglh) }

One final complication… How can I factor range limits into the PDF? If probability of presence is modeled as a Gaussian function of climate, but with truncated tails (climatic limits), what likelihood should be assigned to plots outside the estimated climatic limits? (my answer – 1 in a million…) # CLIMATE DEPENDENT ZERO INFLATED GAMMA, WITH LIMITS PDF - # ARBITRARILY SET Prob(x>0|X=0) = 0.000001 zinf_limits_gamma_PDF_climate <- function(x,mean,scale,pa,pm,pb,px) { shape <- mean/scale pz <- 1 - pa*(exp(-0.5*(((px-pm)/pb)^2))) loglh <- log(ifelse(mean == 0, ifelse(x == 0,0.999999,0.000001), ifelse(x==0,pz + (1-pz) * dgamma(0,shape=shape,scale=scale,log=F), (1-pz)*dgamma(x,shape=shape,scale=scale,log=F)))) return(loglh) }

Case Study 1 - Tree Species Abundance C. D. Canham Please coordinate rides from your dorm with others…