Combinations (= multimetrics)

Presentation transcript:

Combinations (= multimetrics)
- First developed by Jim Karr for fish: the Index of Biotic Integrity (IBI)
- Based on the concept of economic indices
- Regardless, the idea is that no single measure will indicate the status of a site; therefore, it’s necessary to combine a number of different measures (= metrics)
- Metrics are chosen that represent a range of response types (e.g., richness, % composition, diversity, functional feeding groups (FFG), biotic indices)
- They are also chosen to maximize differences between reference and impaired sites, which need to be pre-defined
- These individual measures are scaled and combined additively (most often) and then often rescaled to range from 1 to 10
- Identifying impairment is based on a site’s “value” relative to the established range
- Well … if one measure is intractable, maybe adding up a bunch will make sense???
- It’s really not that bad, sorry.
- Go back to similarity slide
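The scale-combine-rescale step can be sketched in a few lines of Python. This is only an illustration, not Karr's IBI scoring: the metric names, the reference ranges, and the linear 0-to-1 scoring are all assumptions made for the example.

```python
def score_metric(value, low, high, decreases_with_impairment=True):
    """Scale one raw metric to 0..1 against an assumed reference range."""
    if high == low:
        raise ValueError("reference range must have non-zero width")
    frac = (value - low) / (high - low)
    frac = min(1.0, max(0.0, frac))  # clamp values outside the range
    # Metrics that rise with impairment (e.g. % tolerant taxa) are inverted.
    return frac if decreases_with_impairment else 1.0 - frac


def multimetric_index(metrics, ranges, directions, out_min=1.0, out_max=10.0):
    """Combine scaled metrics additively, then rescale to out_min..out_max."""
    scores = [
        score_metric(metrics[name], ranges[name][0], ranges[name][1], directions[name])
        for name in metrics
    ]
    mean_score = sum(scores) / len(scores)  # additive combination, normalized
    return out_min + mean_score * (out_max - out_min)


# Hypothetical metrics and ranges, for illustration only.
ranges = {"taxa_richness": (10, 30), "pct_tolerant": (20, 80)}
directions = {"taxa_richness": True, "pct_tolerant": False}
site = {"taxa_richness": 20, "pct_tolerant": 50}
print(multimetric_index(site, ranges, directions))
```

A site sitting in the middle of both ranges lands in the middle of the 1 to 10 scale; how far below the reference range counts as "impaired" still has to be decided separately.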

Multivariate Methods
- Many, many methods are used

Classification
- Similarity measures
  - Many types
  - Think about the anticipated effect
- Clustering algorithms
  - Most common: unweighted pair-group method with arithmetic averages (UPGMA)
- But … there are many types of similarity measures and clustering algorithms
- Dichotomous: a site is in or out of a group
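To make UPGMA concrete, here is a minimal pure-Python sketch. It assumes a precomputed table of pairwise distances (1 minus any similarity measure would do) and repeatedly merges the two clusters with the smallest average between-cluster distance. The site labels and distances are invented for the example, and the brute-force search is only meant for small inputs.

```python
from itertools import combinations


def average_distance(cluster_a, cluster_b, dist):
    """Mean of all pairwise distances between members of two clusters."""
    total = sum(dist[frozenset((i, j))] for i in cluster_a for j in cluster_b)
    return total / (len(cluster_a) * len(cluster_b))


def upgma(labels, dist):
    """Return the merge history as (cluster_a, cluster_b, height) tuples."""
    clusters = [(label,) for label in labels]
    merges = []
    while len(clusters) > 1:
        # Pick the pair of clusters with the smallest average distance.
        a, b = min(
            combinations(clusters, 2),
            key=lambda pair: average_distance(pair[0], pair[1], dist),
        )
        height = average_distance(a, b, dist)
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
        merges.append((a, b, height))
    return merges


# Hypothetical distances between four sites.
dist = {
    frozenset(("A", "B")): 0.1, frozenset(("C", "D")): 0.2,
    frozenset(("A", "C")): 0.8, frozenset(("A", "D")): 0.9,
    frozenset(("B", "C")): 0.7, frozenset(("B", "D")): 0.8,
}
for a, b, height in upgma(["A", "B", "C", "D"], dist):
    print(a, b, round(height, 3))
```

Cutting the resulting tree at a chosen height gives the dichotomous "in or out of a group" assignment; where to cut is a separate judgment call.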

Ordination
[Example species × sample abundance table: samples sa1 to sa4, species sp1 to sp3; cell layout not recoverable from the transcript]
- Many different types
- Principal objectives
  - Reduce the dimensionality of species × sample and/or environmental × sample data
  - Determine trends in space and time
  - Develop a species × sample correlate to real or derived environmental variables

Ordination

Ordination in species space

A site ordination

Uses/methods of Ordination
- Inference
  - Species distributions are used to infer environmental variables
  - Example: temperature in Montana
- Indirect gradient analysis
  - Variation in species distributions is determined
  - Without a priori knowledge regarding “controlling” environmental gradients
  - Often then related to environmental variables
  - Many different methods
- Direct gradient analysis
  - Often synonymous with constrained ordination
  - Searches for gradients in species data which are a “direct” function of environmental data
  - Sometimes “constrained” within the limits of the environmental data
  - Correspondence Analysis, Canonical Correlation Analysis (CCA)

Two (of the many) methods of assessment
- Univariate approaches
  - IBI (= multimetrics): Index of Biotic Integrity
  - RBP: Rapid Bioassessment Protocol
- Multivariate
  - Endless methods, but …
  - RIVPACS: River Invertebrate Prediction and Classification System
- All methods “require” the collection of physical/chemical data
- RBP → generally metric-driven
- RIVPACS → species-driven

RBP
- An a priori site classification is performed to establish two groups: impaired and least impaired (reference)
  - Physical/chemical/habitat (e.g., VHA) data are used
- Species × site data are collected
  - Various levels of effort (fixed count size and taxonomic) are used (RBP I, II, III)
- Metrics are derived
  - Richness, FFG, biotic indices, …
- Metrics are chosen based on their ability to differentiate impaired from reference
  - They could be chosen based on a hypothesized response to a known stressor, but …
- Chosen metrics are added to form a multimetric and rescaled
- Sites are classified into levels of impairment

RIVPACS
- Inverts are collected from least impaired sites that represent the “total” range of sites to be “tested”
  - “Reference” sites
- Physical/chemical/habitat data are collected at each site
  - Relatively little data are needed
- Sites are classified into similar groups based on their species composition
  - Using a classification routine (e.g., TWINSPAN, UPGMA w/ % similarity)
- The probability of a site being a member of a group is determined by Multiple Discriminant Function Analysis (MDFA)
  - MDFA differs from MLR in that the dependent variables are discrete, not continuous
  - Using p/c/h data that are not impacted by humans
- The probability of occurrence of each species at a site is calculated as:
  P(species occurs at site) = Σ over all groups of [P(site belongs to group) × proportion of sites within that group at which the species occurs]
- “Delete” rare species
  - Choose a probability of occurrence of a species (e.g., p > .75 or .5)
- List the taxa predicted based on the above probability of occurrence and sum their probabilities to form the number of taxa Expected
- Collect inverts using the same effort from “test” sites
- Compare as taxa observed / taxa expected (O/E)
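The last few RIVPACS steps reduce to simple arithmetic, sketched below in Python. The group-membership probabilities would come from the discriminant analysis; here they are simply given as inputs. The taxon names, group labels, and numbers are hypothetical, and counting only predicted taxa toward Observed is an assumption of this sketch.

```python
def occurrence_probabilities(group_probs, group_freqs):
    """P(taxon at site) = sum over groups of P(site in group) * taxon frequency in group."""
    taxa = set().union(*(freqs.keys() for freqs in group_freqs.values()))
    return {
        taxon: sum(group_probs[g] * group_freqs[g].get(taxon, 0.0) for g in group_probs)
        for taxon in taxa
    }


def observed_over_expected(probs, observed_taxa, threshold=0.5):
    """Drop taxa at or below the threshold, then compare observed to expected."""
    predicted = {taxon: p for taxon, p in probs.items() if p > threshold}
    expected = sum(predicted.values())  # E: sum of the retained probabilities
    observed = sum(1 for taxon in predicted if taxon in observed_taxa)  # O
    return observed / expected


# Hypothetical test site: 80% likely to belong to group G1, 20% to G2.
group_probs = {"G1": 0.8, "G2": 0.2}
group_freqs = {
    "G1": {"Baetis": 1.0, "Hydropsyche": 0.75, "Chironomus": 0.25},
    "G2": {"Baetis": 0.5, "Hydropsyche": 0.0, "Chironomus": 1.0},
}
probs = occurrence_probabilities(group_probs, group_freqs)
print(observed_over_expected(probs, {"Baetis", "Chironomus"}))
```

In this example Baetis (0.9) and Hydropsyche (0.6) pass the 0.5 cutoff, so E is 1.5; only Baetis was collected, so O is 1 and O/E is about 0.67.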

Compare and Contrast
Multimetric
- “Reference” sites are needed
- P/C/H data are needed
- Metrics are calculated and compared between ref and test
- Multimetric is “formed”
- Test sites are compared to reference sites
RIVPACS
- “Reference” sites are needed
- P/C/H data are needed
- Multivariate analyses are used to determine the probability of occurrence of individual taxa
- Species presence at test sites is compared to “reference” sites
Both methods need to establish a measure of how different is different

The (End) Beginning
- Read (critically) as much as possible
- LEARN your stats
- Question virtually everything
  - But also search for the answer
- Think critically
  - Think: “if I were a bug”
- Integrate all you’ve learned and ask: Does it make sense?

Jaccard Coefficient
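The slide content is not in the transcript, so as a reminder: the Jaccard coefficient in its standard presence/absence form is the number of taxa shared by two samples divided by the number of taxa found in either. A minimal Python sketch, with made-up taxon labels:

```python
def jaccard(sample_a, sample_b):
    """Shared taxa divided by total taxa present in either sample."""
    a, b = set(sample_a), set(sample_b)
    union = a | b
    if not union:
        raise ValueError("both samples are empty")
    return len(a & b) / len(union)


# Two samples share sp2 and sp3 out of four taxa overall.
print(jaccard({"sp1", "sp2", "sp3"}, {"sp2", "sp3", "sp4"}))
```

Because only presence matters, a taxon with one individual counts the same as a taxon with a thousand.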

Percentage Similarity
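The transcript does not show which formula the slide used. One common form, assumed here, sums for each taxon the smaller of its two relative abundances and expresses the result as a percentage. A minimal Python sketch with invented counts:

```python
def percentage_similarity(counts_a, counts_b):
    """100 * sum over taxa of the smaller relative abundance in the two samples."""
    total_a = sum(counts_a.values())
    total_b = sum(counts_b.values())
    if total_a == 0 or total_b == 0:
        raise ValueError("each sample needs at least one individual")
    taxa = set(counts_a) | set(counts_b)
    shared = sum(
        min(counts_a.get(t, 0) / total_a, counts_b.get(t, 0) / total_b)
        for t in taxa
    )
    return 100.0 * shared


# Hypothetical counts: half of sample B is a taxon absent from sample A.
sample_a = {"sp1": 50, "sp2": 50}
sample_b = {"sp1": 25, "sp2": 25, "sp3": 50}
print(percentage_similarity(sample_a, sample_b))
```

Unlike a presence/absence measure, this one responds to shifts in relative abundance even when the taxa list is unchanged.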