Case Study: Distribution and Abundance of Tree Species Along Climate Gradients (Sometimes the PDF is more interesting than the actual model…) Canham, C.

Slides:



Advertisements
Similar presentations
Case Study 2 Neighborhood Models of the Allelopathic Effects of an Invasive Tree Species Gómez-Aparicio, L. and C. D. Canham Neighborhood analyses.
Advertisements

Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Objectives (BPS chapter 24)
CORRELATON & REGRESSION
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Chapter 6 Continuous Random Variables and Probability Distributions
Excellence Justify the choice of your model by commenting on at least 3 points. Your comments could include the following: a)Relate the solution to the.
BCOR 1020 Business Statistics
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Chapter 5 Continuous Random Variables and Probability Distributions
Continuous probability distributions
Spreadsheet Modeling & Decision Analysis A Practical Introduction to Management Science 5 th edition Cliff T. Ragsdale.
Chapter 4 Continuous Random Variables and Probability Distributions
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Inference for regression - Simple linear regression
Statistical Techniques I
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Descriptive Statistics II: By the end of this class you should be able to: describe the meaning of and calculate the mean and standard deviation of a sample.
Chapter 6 Continuous Distributions The Gaussian (Normal) Distribution.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Hypothesis of Association: Correlation
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
The Normal Curve Packet #23. Normal Curve  Referred to as a bell- shaped curve  Perfect mesokurtic distribution.
Unit One Notes: Graphing How do we graph data?. Name the different types of graphs (charts).
3.2 Least Squares Regression Line. Regression Line Describes how a response variable changes as an explanatory variable changes Formula sheet: Calculator.
Managerial Economics Demand Estimation & Forecasting.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Regression Regression relationship = trend + scatter
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Analysis of Residuals ©2005 Dr. B. C. Paul. Examining Residuals of Regression (From our Previous Example) Set up your linear regression in the Usual manner.
EAS31116/B9036: Statistics in Earth & Atmospheric Sciences Lecture 3: Probability Distributions (cont’d) Instructor: Prof. Johnny Luo
PCB 3043L - General Ecology Data Analysis.
Case Study 1: Distribution and Abundance of Tree Species Along Climate Gradients (Sometimes the PDF is more interesting than the actual model…) Canham,
ANOVA, Regression and Multiple Regression March
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
Hypothesis test flow chart
BPS - 5th Ed. Chapter 231 Inference for Regression.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Tom.h.wilson Department of Geology and Geography West Virginia University Morgantown, WV.
Altitude vs Atmpospere vs temp Purpose statement: I am going to investigate the relationship between Mean pressure and Tempurature (degrees C)
The simple linear regression model and parameter estimation
Chapter 4: Basic Estimation Techniques
SUR-2250 Error Theory.
Case Study 1 - Tree Species Abundance
Statistical Methods Michael J. Watts
Chapter 4 Basic Estimation Techniques
Statistical Methods Michael J. Watts
Basic Estimation Techniques
PCB 3043L - General Ecology Data Analysis.
Inverse Transformation Scale Experimental Power Graphing
Basic Estimation Techniques
Introduction to Summary Statistics
Case Study - Neighborhood Models of Allelopathy
Inferential Statistics
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
Chapter 14 Inference for Regression
Ensemble forecasts and seasonal precipitation tercile probabilities
M248: Analyzing data Block A UNIT A3 Modeling Variation.
Case Study 1 - Tree Species Abundance
Advanced Algebra Unit 1 Vocabulary
Presentation transcript:

Case Study: Distribution and Abundance of Tree Species Along Climate Gradients (Sometimes the PDF is more interesting than the actual model…) Canham, C. D. and R. Q. Thomas Frequency, not relative abundance, of temperate tree species varies along climate gradients in eastern North America. Ecology 91:

The Data l Relative abundance of the 24 most common tree species in the northeastern US - In ~ 20,000 US Forest Service Forest Inventory and Analysis (FIA) plots from 19 northeastern states (Maine to Wisconsin, south to Kentucky and Virginia) l Climate (mean annual temperature and average annual precipitation) for each plot, averaged over the period since the previous census - Using 800-m resolution gridded PRISM climate data, with bi- linear interpolation to true plot locations

The Basic Question l How does the abundance of tree species vary along climate gradients?

Basic Approach l Develop regression models that predict abundance of a given tree species in a plot as a function of climate at the location of the plot… Initial decisions: 1.What to use as a measure of abundance? I chose relative abundance over absolute abundance 2.What sorts of functions could describe variation in relative abundance along climate gradients Compare a Gaussian function with a null model that was flat

But is a flat line an appropriate null hypothesis? l What about range limits? l And shouldn’t even the Gaussian model have truncated tails?

So, what do the data look like? Acer rubrum (red maple)Quercus prinus ( chestnut oak) It doesn’t look like there’s much hope for nice Gaussian niche curves…

A first regression model (in R) # Square Gaussian - predicts mean of 0 when below lo or above hi square.gauss.model <- function(a,m,b,lo,hi,X) { ifelse(X hi,0,a*exp(-0.5*(((X-m)/b)^2)))) } 5 Parameters: “a” determines height of curve at mode “m” temperature at peak of curve “b” breadth of the curve “lo” lower temperature limit “hi” upper temperature limit Note that a separate “null” model is not really necessary, because if “b” is large, the curve is flat

What might be an appropriate PDF? The data are clearly not normally distributed… In fact, there are a whole lot of zeros….

Divide the analysis up into two parts l Predict probability of “presence” (i.e. non-zero abundance) – l Separately predict relative abundance when present These two components of abundance have very different ecological meanings….

A PDF for relative abundance when present: The Gamma Function Gamma PDF with scale parameter = 22 One likely choice for plots where a species is present…

First Results Relative Abundance (when present) Bottom line: for all 24 species, relative abundance is highly variable, but shows very little trend in the mean across the entire range of a species’ climate niche limits Note: these results are for relative abundance, given that a species is present…

But what about all of those zeros? Creating a PDF that can model both “presence” and “relative abundance” l “Zero-Inflated Distributions” # ZERO INFLATED NORMAL PDF zinf_norm_PDF <- function(x,mean,sd,pz) { log(ifelse(x==0,pz + (1-pz)*dnorm(0,mean,sd,log=F), (1-pz)*dnorm(x,mean,sd,log=F))) } # ZERO INFLATED GAMMA PDF zinf_gamma_PDF_climate <- function(x,mean,scale,px) { shape <- mean/scale loglh <- log(ifelse(x==0,pz + (1-pz)*dgamma(0,shape=shape,scale=scale,log=F), (1-pz)*dgamma(x,shape=shape,scale=scale,log=F))) return(loglh) }

But why stop there? Is presence constant across climate gradients, or does it vary? # ZERO INFLATED GAMMA PDF with pz a gaussian function of the # independent (climate) variable zinf_gamma_PDF_climate <- function(x,mean,scale,pa,pm,pb,px) { shape <- mean/scale pz <- 1 - pa*(exp(-0.5*(((px-pm)/pb)^2))) # px is temp or precip in the plot loglh <- log(ifelse(x==0,pz + (1-pz)*dgamma(0,shape=shape,scale=scale,log=F), (1-pz)*dgamma(x,shape=shape,scale=scale,log=F))) return(loglh) }

One final complication… l How can I factor range limits into the PDF? - If probability of presence is modeled as a Gaussian function of climate, but with truncated tails (climatic limits), what likelihood should be assigned to plots outside the estimated climatic limits? (my answer – 1 in a million…) # CLIMATE DEPENDENT ZERO INFLATED GAMMA, WITH LIMITS PDF - # ARBITRARILY SET Prob(x>0|X=0) = zinf_limits_gamma_PDF_climate <- function(x,mean,scale,pa,pm,pb,px) { shape <- mean/scale pz <- 1 - pa*(exp(-0.5*(((px-pm)/pb)^2))) loglh <- log(ifelse(mean == 0, ifelse(x == 0, , ), ifelse(x==0,pz + (1-pz) * dgamma(0,shape=shape,scale=scale,log=F), (1-pz)*dgamma(x,shape=shape,scale=scale,log=F)))) return(loglh) }