Dealing with Spatial Autocorrelation

Slides:



Advertisements
Similar presentations
Autocorrelation Functions and ARIMA Modelling
Advertisements

Autocorrelation and Heteroskedasticity
Spatial autoregressive methods
Analysis of variance and statistical inference.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
The General Linear Model Or, What the Hell’s Going on During Estimation?
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.
Ch11 Curve Fitting Dr. Deshi Ye
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
Spatial Autocorrelation Basics NR 245 Austin Troy University of Vermont.
Workshop 2: Spatial scale and dependence in biogeographical patterns Objective: introduce the following fundamental concepts on spatial data analysis:
From last time….. Basic Biostats Topics Summary Statistics –mean, median, mode –standard deviation, standard error Confidence Intervals Hypothesis Tests.
Design of Engineering Experiments - Experiments with Random Factors
Correlation and Autocorrelation
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Spatial autoregressive methods Nr245 Austin Troy Based on Spatial Analysis by Fortin and Dale, Chapter 5.
Statistics for Business and Economics
1 Testing spatial correlation (autocorrelation) 1.Moran’s I 2.Geary’s c 3.Variogram 4.Join counts Cliff, A. D. & Ord, J. K Spatial processes: models.
Applied Geostatistics
Chapter 11 Multiple Regression.
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
SA basics Lack of independence for nearby obs
REGRESSION AND CORRELATION
Linear Regression Models Powerful modeling technique Tease out relationships between “independent” variables and 1 “dependent” variable Models not perfect…need.
Correlation and Regression Analysis
Variance and covariance Sums of squares General linear models.
Rodolphe Devillers (Almost) everything you always wanted to know (or maybe not…) about Geographically Weighted Regressions JCU Stats Group, March 2012.
Lecture 5 Correlation and Regression
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Objectives of Multiple Regression
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Inference for regression - Simple linear regression
Regression Method.
Comparing Two Samples Harry R. Erwin, PhD
What does it mean? The variance of the error term is not constant
1 LES of Turbulent Flows: Lecture 1 Supplement (ME EN ) Prof. Rob Stoll Department of Mechanical Engineering University of Utah Fall 2014.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
12a - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part I.
Spatial Statistics in Ecology: Area Data Lecture Four.
Ecosystems are: Hierarchically structured, Metastable, Far from equilibrium Spatial Relationships Theoretical Framework: “An Introduction to Applied Geostatistics“,
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
Geographic Information Science
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Regression. Types of Linear Regression Model Ordinary Least Square Model (OLS) –Minimize the residuals about the regression linear –Most commonly used.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Mixed Effects Models Rebecca Atkins and Rachel Smith March 30, 2015.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
Lecture 6 Your data and models are never perfect… Making choices in research design and analysis that you can defend.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
Chap 9 Regression with Time Series Data: Stationary Variables
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
STATS 10x Revision CONTENT COVERED: CHAPTERS
Testing for equal variance Scale family: Y = sX G(x) = P(sX ≤ x) = F(x/s) To compute inverse, let y = G(x) = F(x/s) so x/s = F -1 (y) x = G -1 (y) = sF.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Strategies for Metabolomic Data Analysis Dmitry Grapov, PhD.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Spatial statistics: Spatial Autocorrelation
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Chapter 5 Part B: Spatial Autocorrelation and regression modelling.
Chapter 11: Simple Linear Regression
Spatial Autocorrelation
OVERVIEW OF LINEAR MODELS
SPATIAL ANALYSIS IN MACROECOLOGY
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Presentation transcript:

Dealing with Spatial Autocorrelation Spatial Analysis Seminar Spring 2009

Spatial Autocorrelation Defined “…the property of random variables taking values, at pairs of locations a certain distance apart, that are more similar (positive autocorrelation) or less similar (negative autocorrelation) than expected for randomly associated pairs of observations.” Legendre (1993)

Types of Spatial Autocorrelation Inherent autocorrelation: caused by “contagious biotic processes” vs. Induced spatial dependence: biological variables of interest are functionally dependent on one or more autocorrelated exogenous variable(s)

Why Should We Care? “natural systems almost always have autocorrelation in the form of patchiness or gradients…over a wide range of spatial and temporal scales.” Fortin & Dale (2005) → Autocorrelation is a “fact of life” for ecologists!

2 Views of Spatial Autocorrelation: It’s a nuisance that complicates statistical hypothesis testing It’s functionally important in many ecosystems, so we must revise our theories and models to incorporate spatial structure Either way, the first step involves describing the autocorrelation (i.e., the “spatial structure”)

Describing Spatial Autocorrelation Compute Moran’s I or Geary’s c coefficients over multiple distances Correlogram: plot distance on X-axis against correlation coefficient on Y-axis Mantel correlogram: multivariate response Semi-variogram/variogram

Example Data Wetland hardwood forest (5 x 5 m cells) Response variable: log of non-ground lidar points in 0-1 m vertical height bin n1 = 217, n2 = 68 Welch’s t-test (unequal variance, unequal sample sizes) results: t = 2.33, df = 181, p-value ≈ 0.021

Moran’s I correlograms

Now what do I do??? Adjusting the effective sample size Spatial statistical modeling methods Restricted randomization Other methods: canonical ordination, partial Mantel tests, etc.

Adjusting the Effective Sample Size Estimate of effective sample size (Fortin & Dale 2005, p. 223, Equation 5.15): For first-order autocorrelation ρ and large n:

Adjusting the Effective Sample Size For the “Recently Burned” example data: For the “Long Unburned” example data: Welch’s t-test results: t = 1.76, df = 123, p ≈ 0.080 BUT, this is a very simplistic model!

Detour: Autocorrelation Models Model 1 (“spatial independence”): Model 2 (“first-order autoregressive”): Model 3 (“induced autoregressive”): Model 4 (“doubly autoregressive”): SOURCE: Fortin & Dale (2005), pp. 213-216

Detour: Autocorrelation Models The models on the previous slide were one-dimensional, but most spatial data is two-dimensional (Lat-Long, XY-coordinates, etc.) The two-dimensional spatial autocorrelation model incorporates W, a “proximity matrix” of neighbor weights, which in turn affects the variance-covariance matrix (C):

Generalized Least Squares (GLS) Relatively easy way to introduce spatial autocorrelation structure to linear models Fits a parametric correlation function (exponential, Gaussian, spherical, etc.) directly to the variance-covariance matrix Assumes normally distributed errors, but errors are allowed to be correlated and/or have unequal variances Built-in R package: nlme

GLS Model – No Spatial Structure library(nlme) … ## Model A: spatial independence ModelA <- gls(LN_COUNT~BURNED,data=SAC_data) plot(Variogram(ModelA, form=~x+y))

GLS Models with Spatial Structure > ModelB <- gls(LN_COUNT~BURNED,data=SAC_data,corr=corAR1()) > ModelC <- gls(LN_COUNT~BURNED,data=SAC_data,corr=corExp(form=~x+y)) > ModelD <- gls(LN_COUNT~BURNED,data=SAC_data,corr=corGaus(form=~x+y)) > ModelE <- gls(LN_COUNT~BURNED,data=SAC_data,corr=corSpher(form=~x+y)) > AIC(ModelA,ModelB,ModelC,ModelD,ModelE) df AIC ModelA 3 702.1288 ModelB 4 677.3121 ModelC 4 591.7996 ModelD 4 607.3873 ModelE 4 604.7950 > anova(ModelA,ModelC) Model df AIC BIC logLik Test L.Ratio p-value ModelA 1 3 702.1288 713.0652 -348.0644 ModelC 2 4 591.7996 606.3814 -291.8998 1 vs 2 112.3293 <.0001 → Exponential GLS model seems to fit best

Other Autocorrelation Models Conditional autoregressive (CAR), simultaneous autoregressive (SAR), and moving average (MA) models See pp. 229-233 of Fortin & Dale (2005) Implemented in R package spdep, as well as SAM (Spatial Analysis for Macroecology) software Generalized linear mixed models (GLMMs): R built-in packages MASS, nlme But wait, there’s more: see Dormann et al. (2007) review paper in Ecography (30) 609-628.

Models and Reality “Much of the treatment of spatial autocorrelation in the statistical literature is predicated on the simplest AR model, which produces an exponential decline in autocorrelation as a function of distance (Figure 5.16).” Fortin & Dale (2005, pp. 247-248) BUT, simple corrections based on first-order AR don’t account for effects of potentially negative autocorrelation at greater distances

Restricted Randomization PROBLEM: randomization tests based on complete spatial randomness will destroy autocorrelation structure POTENTIAL SOLUTIONS: “Toroidal shift” randomization (Figure 5.12) Contiguity-constrained permutations (see Legendre et al. 1990 for algorithms)

Conclusion Incorporating spatial structure into ecological models was identified by Legendre as a “new paradigm” in 1993, BUT… …ecologists are still refining their methods for dealing with spatial autocorrelation OUR LAST HOPE?: Dale, M.R.T. and M.-J. Fortin. (in press). Spatial Autocorrelation and Statistical Tests: Some Solutions. Journal of Agricultural, Biological, and Environmental Statistics.

Spatial autocorrelation, don’t make me open this…