Prediction models perform better when including transition zones Sophie Vermeersch Plant Science and Nature Plant Science and Nature Management, Department.

Slides:



Advertisements
Similar presentations
Sociology 680 Multivariate Analysis Logistic Regression.
Advertisements

Advanced analytical approaches in ecological data analysis The world comes in fragments.
An Introduction to Multivariate Analysis
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
Turnover of wetland sediments on mineralization of carbon, nitrogen and phosphorus Qingren Wang* and Yuncong Li Tropical Research and Education Center.
SENSITIVITY ANALYSIS of the FOREST VEGETATION SIMULATOR Southern Variant (FVS-Sn) Nathan D. Herring Dr. Philip J. Radtke Virginia Tech Department of Forestry.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Modeling of Soil Nutrients – An Introduction to Logical Spreadsheeting Russell Yost Department of Tropical Plant and Soil Science, University of Hawai`i.
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Community Ecology Conceptual Issues –Community integrity (Clements v Gleason) Individualistic responses versus super-organism –Community change St ate-transition.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Multiple-group linear discriminant function maximum & contributing ldf dimensions concentrated & diffuse ldf structures follow-up analyses evaluating &
10/17/071 Read: Ch. 15, GSF Comparing Ecological Communities Part Two: Ordination.
CHAPTER 30 Structural Equation Modeling From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
Agenda: Block Watch: Random Assignment, Outcomes, and indicators Issues in Impact and Random Assignment: Youth Transition Demonstration –Who is randomized?
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
DATA QUALITY and ANALYSIS Strategy for Monitoring Post-fire Rehabilitation Treatments Troy Wirth and David Pyke USGS – Biological Resources Division Forest.
Norms & Norming Raw score: straightforward, unmodified accounting of performance Norms: test performance data of a particular group of test takers that.
Relationships Among Variables
Community Ordination and Gamma Diversity Techniques James A. Danoff-Burg Dept. Ecol., Evol., & Envir. Biol. Columbia University.
Objectives of Multiple Regression
Statistical Inference: Which Statistical Test To Use? Pınar Ay, MD, MPH Marmara University School of Medicine Department of Public Health
A comparison of remotely sensed imagery with site-specific crop management data A comparison of remotely sensed imagery with site-specific crop management.
Multivariate Data Analysis Chapter 8 - Canonical Correlation Analysis.
Canonical Correlation Analysis, Redundancy Analysis and Canonical Correspondence Analysis Hal Whitehead BIOL4062/5062.
Agronomic Spatial Variability and Resolution What is it? How do we describe it? What does it imply for precision management?
Introduction to the gradient analysis. Community concept (from Mike Austin)
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
DIRECT ORDINATION What kind of biological questions can we answer? How can we do it in CANOCO 4.5?
Funded through the ESRC’s Researcher Development Initiative Prof. Herb MarshMs. Alison O’MaraDr. Lars-Erik Malmberg Department of Education, University.
Predicting diversity of large herbivores from moisture and nutrient availability based on the article of Olff et al. (2002): “Global environmental controls.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Multivariate Data Analysis Chapter 5 – Discrimination Analysis and Logistic Regression.
From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
Available at Chapter 13 Multivariate Analysis BCB 702: Biostatistics
Multiple Discriminant Analysis
Species indicator values as an important tool in applied plant ecology Session: Databases and information systems 46th IAVS Symposium 8-14 June 2003 Naples.
ORDINATION What is it? What kind of biological questions can we answer? How can we do it in CANOCO 4.5? Some general advice on how to start analyses.
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Effects of parent material and land use on soil phosphorus forms in Southern Belgium Renneson 1 M., Dufey 2 J., Bock 1 L. and Colinet 1 G. 1 University.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Chapter 11: The ANalysis Of Variance (ANOVA)
Agronomic Spatial Variability and Resolution What is it? How do we describe it? What does it imply for precision management?
Chapter 5: Introductory Linear Regression
Chapter 14 Regression Summarized by Lana Hesler. Learning Objectives Understand the nature and purposes of regression analysis Recognize geographic problems.
DISCRIMINANT ANALYSIS. Discriminant Analysis  Discriminant analysis builds a predictive model for group membership. The model is composed of a discriminant.
Plant functional trait expression in the Rengen Grassland Experiment (RGE) Jürgen Schellberg 1, Katharina Brüne 1 and Michal Hejcman 2 1 University of.
IMPACT EVALUATION PBAF 526 Class 5, October 31, 2011.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Canonical Correlation Analysis (CCA). CCA This is it! The mother of all linear statistical analysis When ? We want to find a structural relation between.
Logistic Regression: Regression with a Binary Dependent Variable.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Machine Learning – Classification David Fenyő
Regression Techniques
Chapter 15 Linear Regression
Chapter 11: The ANalysis Of Variance (ANOVA)
Predict Failures with Developer Networks and Social Network Analysis
Multiple Regression Analysis: Further Issues
Least-Squares Regression
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
INTRODUCTION TO Machine Learning 3rd Edition
Presentation transcript:

Prediction models perform better when including transition zones Sophie Vermeersch Plant Science and Nature Plant Science and Nature Management, Department of Biology, Vrije Universiteit Brussel

Prediction models of vegetation communities perform better when including transition zones Environmental gradients: incorporation or omission? - limited number of factors that account for the major proportion of variability in vegetation composition - at least one explanatory variable for each environmental gradient identified by preliminary explanatory analyses - environmental gradients expressed as resource gradients vegetation gradients: species or communities? - similar species compositions correspond to similar environments within a geographical area - satisfying weighting mechanisms for diagnostic species

cutting mowing Alnion glutinosae Filipendulion Calthion Phragmition

Vegetation data training set N = 324 validation set N = 50 quadrat size: 225m² nomenclature by Schaminée et al. (1995), Schaminée et al. (1996) and Stortelder et al. (1999) Environmental data water content (%) pH (CaCl 2 – glass/calomel electrode) nutrients: available phosphate, nitrate and ammonium (calorimetric methods) K +, Na +, Ca 2+, Mg 2+ (AAS)

Eigenvalues: 1st axis: 0,585 2nd axis: 0,324 3rd axis: 0,227

Modelling procedure vegetation gradients: CCA to attribute weights to vegetation communities - pairwise CCA of homogeneous vegetation communities - introduction of the heterogeneous samples in the canonical space - calculation of Euclidean distance between the community centroid and the heterogeneous sample model precision - bootstrapping (leave-one-out) evaluation of attributed weights - comparison of 2 models constructed through multiple logistic regression: (weighted and unweighted models) - % variance explained by the models - ability of calculating the estimates of occurrences of vegetation communities for an independent data set

Cumulative % variance explained: 1st axis: 10,4 2nd axis: 17,3 3rd axis: 23,5 A AB B Carici elongatae- Alnetum Carici remotae- Fraxinetum

INPUT Soil variables (% moisture, pH, available fractions of PO4, NO3, NH4, K, Na, Ca, Mg) INPUT Relevés and classification PREDICTION VALIDATION Ecological interpretation Adaptation of input PREDICTION OF VEGETATION COMMUNITIES OMITTING GRADIENTS INTERVENTION

INPUT Soil variables in homogeneous environment INPUT Relevés in homogeneous environment and classification PREDICTION OF VEGETATION COMMUNITIES IN GRADIENT SITUATIONS INTERVENTION INPUT Relevés in gradients and classification INPUT Soil variables in gradients PAIRWISE REFERENCES CCA Multiple regression of standardised canonical coefficients and soil variables WEIGHTS for relevés in gradients PREDICTIONS Ecological interpretation VALIDATION Adaptation of input

Limitations if important explanatory variables are not incorporated lower amount of correct predictions for communities with less extreme preferences low number of relevés time dependent responses to the fertility gradient prediction of the potentiallity of occurrence of vegetation communities, NOT the actual occurrence

Conclusions the used set of edaphic factors in the CCA proved to be significant and able to validate model predictions incorporation of the communities in gradient situations leads to a better model fit, especially when these variables are weighted to environmental variables - higher % of variance (increase of +/- 5%) - larger proportion of corrected samples (78% vs. 56%)