ORDINATION What is it? What kind of biological questions can we answer? How can we do it in CANOCO 4.5? Some general advice on how to start analyses.

Slides:

Advertisements

Similar presentations

Type of Data Matrix. Managing Dimensionality (but not acronyms) PCA, CA, RDA, CCA, MDS, NMDS, DCA, DCCA, pRDA, pCCA.

Advertisements

Gradient Analysis Approach to Ordination. Models of Species Response to Gradients.

Multivariate Description. What Technique? Response variable(s)... Predictors(s) No Predictors(s) Yes... is one distribution summary regression models...

What we Measure vs. What we Want to Know

Multivariate Description. What Technique? Response variable(s)... Predictors(s) No Predictors(s) Yes... is one distribution summary regression models...

Tables, Figures, and Equations

Step three: statistical analyses to test biological hypotheses General protocol continued.

Krishna Rajan Data Dimensionality Reduction: Introduction to Principal Component Analysis Case Study: Multivariate Analysis of Chemistry-Property data.

An Introduction to Multivariate Analysis

Multivariate analysis of community structure data Colin Bates UBC Bamfield Marine Sciences Centre.

Computer Vision – Image Representation (Histograms)

Lecture 7: Principal component analysis (PCA)

1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.

Community Ecology Conceptual Issues –Community integrity (Clements v Gleason) Individualistic responses versus super-organism –Community change St ate-transition.

Multivariate Methods Pattern Recognition and Hypothesis Testing.

Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.

Properties of Community Data in Ecology Adapted from Ecological Statistical Workshop, FLC, Daniel Laughlin.

CHAPTER 19 Correspondence Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.

Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of scientific research When you know the system: Estimation.

10/17/071 Read: Ch. 15, GSF Comparing Ecological Communities Part Two: Ordination.

CHAPTER 30 Structural Equation Modeling From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,

Goals of Factor Analysis (1) (1)to reduce the number of variables and (2) to detect structure in the relationships between variables, that is to classify.

Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of a scientific research When you know the system: Estimation.

Basic Concepts for Ordination Tanya, Nick, Caroline.

Principal Component Analysis. Philosophy of PCA Introduced by Pearson (1901) and Hotelling (1933) to describe the variation in a set of multivariate data.

Community Ordination and Gamma Diversity Techniques James A. Danoff-Burg Dept. Ecol., Evol., & Envir. Biol. Columbia University.

OUR Ecological Footprint …. Ch 20 Community Ecology: Species Abundance + Diversity.

¹Department of Biology, Faculty of Science, University of Guilan, Rasht,Iran ²Department of Biology, Faculty of Science, University of Mazandaran, Babolsar,

Summarizing Scores With Measures of Central Tendency

Statistical Techniques I EXST7005 Factorial Treatments & Interactions.

Dimensionality Reduction: Principal Components Analysis Optional Reading: Smith, A Tutorial on Principal Components Analysis (linked to class webpage)

Principal Components Analysis BMTRY 726 3/27/14. Uses Goal: Explain the variability of a set of variables using a “small” set of linear combinations of.

CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.

Canonical Correlation Analysis, Redundancy Analysis and Canonical Correspondence Analysis Hal Whitehead BIOL4062/5062.

Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.

Introduction to the gradient analysis. Community concept (from Mike Austin)

Basic concepts in ordination

Review of Statistics and Linear Algebra Mean: Variance:

DIRECT ORDINATION What kind of biological questions can we answer? How can we do it in CANOCO 4.5?

Vamsi Sundus Shawnalee Correspondence Analysis: Simple ( CA) and Detrended (DCA)

Multidimensional scaling MDS  G. Quinn, M. Burgman & J. Carey 2003.

From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon

Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.

SINGULAR VALUE DECOMPOSITION (SVD)

Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.

Exploring the Guttman Effect Statistics 300 Zhao Chen Alexander Hristov Darvin Yi.

Principal Component Analysis (PCA). Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite)

Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.

Principal Component Analysis

CSSE463: Image Recognition Day 25 This week This week Today: Applications of PCA Today: Applications of PCA Sunday night: project plans and prelim work.

Principal Components Analysis ( PCA)

The Data Collection and Statistical Analysis in IB Biology John Gasparini The Munich International School Part II – Basic Stats, Standard Deviation and.

Principal Component Analysis

Research in Computational Molecular Biology , Vol (2008)

Term Two and a Word about Multi-variate Statistical Methods ….

Summarizing Scores With Measures of Central Tendency

Dimension Reduction via PCA (Principal Component Analysis)

Lesson 7 Correspondence Analysis & Detrended Correspondence Analysis

Statistical Methods For Engineers

Historical Vegetation Analysis

Techniques for studying correlation and covariance structure

Descriptive Statistics vs. Factor Analysis

Combinations (= multimetrics)

Principal Components Analysis

Principal Component Analysis (PCA)

Multivariate Analysis of a Carbonate Chemistry Time-Series Study

Principal Component Analysis

The Examination of Residuals

Marios Mattheakis and Pavlos Protopapas

Presentation transcript:

ORDINATION What is it? What kind of biological questions can we answer? How can we do it in CANOCO 4.5? Some general advice on how to start analyses.

How different or similar is the vegetation at these two places? What are the patterns within each of them? Biomass Productivity Diversity Species composition

Ordination Analyses of data with many response variables Search for patterns We can also quantify and test the effect of one or many predictor variables (tomorrow!!)

But first: do communities exist?

A short answer after a long debate: No. Compositional variation in nature tends to be gradual.

How can we analyse species composition? PinusTsuga Site 1310 Site 251 Site 302 Site 448 Site Within some defined environment or area we sample a number of plots and register the species present

Pinus Tsuga Site 1 Site 3 Site 2 Site 4 Site 5 PinusTsuga Site 1310 Site 251 Site 302 Site 448 Site AcerBetula SPECIES SPACE

Site 1 site Acer Tsuga Betula Pine PinusTsuga Site 1310 Site 251 Site 302 Site 448 Site AcerBetula Site space

Data dimensions The sites differ in species abundances Each species is a variable – a dimension – –in a dataset with n species the differences between plots can be described exactly by their positions in a n-dimensional space Species are not distributed independently of each other –They respond to the same factors, affect each other… Can we somehow find a few dimensions that capture the bulk of the compositional information?

Pinus Tsuga Site 1 Site 3 Site 2 Site 4 Site 5

Pinus Tsuga Site 1 Site 3 Site 2 Site 4 Site 5

Site 1 Site 3 Site 2 Site 4 Site 5

10 This line describes the relative positions of sites along one dimension that captures the largest fraction possible of the variation in species composition We have done a Principal Component Analysis!!!!

Linear vs. Unimodal methods In the examples above we assumed that species abundance and the environment is linearly related This is sometimes true! (when we are within a ca SD ’window’ along an environmental gradient)

Linear vs. Unimodal methods But what if we want to analyse the whole gradient? A linear-based method will give a ’wrong’ solution! (which would give us a statistical artifact called the ’horseshoe effect’) There are unimodal-based methods (CA, DCA, …)

Correspondence analysis (CA) when the response is unimodal Sample where the species is present. (size indicates abundance) Weigthed average optimum of this species

In the same way you can find the optimum of a sample: the weighted average of the species it contains Species present in the sample. (size indicates abundance) Weigthed average optimum of the sample

Weighted averaging species scores are weighted averages of site scores –the weights are related to how common the species are in the sites site scores are weighted averages of species scores –the weights are (again) related to how commmon the species are in the sites ITERATIVE METHOD!

The arch problem After the first CA axis is constructed, the program will start ’looking for’ a second, uncorrelated axis. If no ’real’ gradient exists in the data, it will tend to ’find’ the folded axis 1 (which by definition uncorrelated, and half the lenght of the first axis)

Identifying the arch problem …and handling it The problem is easily identified by inspecting – The CA ordination diagram can you see an arch in the plot positions along axis 2? –The eigenvalues of the first and second axes Is the eigenvalue of axis 1 ca. 2* that of axis 2) The problem can be removed by detrending –Detrend by segments in indirect methods

The magic behind the ordination diagrams PCA CA

Biplot interpretation Species and sample positions along the axes can be presented as ordinaion diagrams These diagrams tell us something about the species composition the samples Interpretation differs between ordination diagrams from linear methods (PCA) and unimodal methods (CA)!

PCA

Etc

CA Decreasing probability of occurrence

CA Decreasing probability of occurrence

Summary unimodal vs. linear methods detrending in unimodal methods biplot vs. centroid interpretation