Basic concepts in ordination

Slides:



Advertisements
Similar presentations
TABLES and FIGURES BIOL 4001.
Advertisements

Multiple Analysis of Variance – MANOVA
An Introduction to Multivariate Analysis
Linear Regression t-Tests Cardiovascular fitness among skiers.
Random Sampling and Data Description
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Types of Data Displays Based on the 2008 AZ State Mathematics Standard.
Properties of Community Data in Ecology Adapted from Ecological Statistical Workshop, FLC, Daniel Laughlin.
CHAPTER 19 Correspondence Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.
10/17/071 Read: Ch. 15, GSF Comparing Ecological Communities Part Two: Ordination.
CHAPTER 30 Structural Equation Modeling From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,
Introduction to Mapping Sciences: Lecture #5 (Form and Structure) Form and Structure Describing primary and secondary spatial elements Explanation of spatial.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
Presenting information
Slide 1 Detecting Outliers Outliers are cases that have an atypical score either for a single variable (univariate outliers) or for a combination of variables.
Basic Concepts for Ordination Tanya, Nick, Caroline.
Separate multivariate observations
Community Ordination and Gamma Diversity Techniques James A. Danoff-Burg Dept. Ecol., Evol., & Envir. Biol. Columbia University.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 9: Quantitative.
Graphs in Science You Can Do It!!!.
CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
Lionel F. Lovett, II Jackson State University Research Alliance in Math and Science Computer Science and Mathematics Division Mentors: George Ostrouchov.
Inquiry Unit.
Data Presentation & Graphing Introduction to Mechanical Engineering The University of Texas-Pan American College of Science and Engineering.
Graphs Graphs are used to display data. They visually represent relationships between data. All graphs should have a title that identifies the variables.
8 th Grade Math Common Core Standards. The Number System 8.NS Know that there are numbers that are not rational, and approximate them by rational numbers.
Principal Component Analysis Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Page 1 We will cover: Data Tables Line Graphs Bar Graphs Circle Graphs We will cover: Data Tables Line Graphs Bar Graphs Circle Graphs.
1 Copyright © Cengage Learning. All rights reserved. 3 Descriptive Analysis and Presentation of Bivariate Data.
Multivariate Data Analysis  G. Quinn, M. Burgman & J. Carey 2003.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
Principal Component Analysis (PCA). Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite)
In the name of GOD. Zeinab Mokhtari 1-Mar-2010 In data analysis, many situations arise where plotting and visualization are helpful or an absolute requirement.
ORDINATION What is it? What kind of biological questions can we answer? How can we do it in CANOCO 4.5? Some general advice on how to start analyses.
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
Unit 2: Geographical Skills
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
Principal Component Analysis (PCA)
Principal Component Analysis Zelin Jia Shengbin Lin 10/20/2015.
Tom.h.wilson Department of Geology and Geography West Virginia University Morgantown, WV.
3/13/2016 Data Mining 1 Lecture 2-1 Data Exploration: Understanding Data Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB)
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 2 Section 2 – Slide 1 of 37 Chapter 2 Section 2 Organizing Quantitative Data.
Exploring High-D Spaces with Multiform Matrices and Small Multiples Presented by Ray Chen and Sorelle Friedler Authors: MacEachren, A., Dai, X., Hardisty,
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
TRIGG COUNTY MIDDLE SCHOOL. 6 th Grade Reading Standard% of Students scoring 80% or higher.
Exploring Microarray data
Principal Component Analysis (PCA)
Elementary Statistics
Quality Control at a Local Brewery
Analyzing Reliability and Validity in Outcomes Assessment Part 1
Assessing Normality.
Principal Component Analysis (PCA)
Descriptive Analysis and Presentation of Bivariate Data
Principal Component Analysis
Multidimensional Space,
Analyzing Reliability and Validity in Outcomes Assessment
Data exploration and visualization
Presentation transcript:

Basic concepts in ordination

What is ordination? Finding a concise and useful summary of the patterns within multivariate data. An arrangement of units in a uni- or multidimensional order. There are two forms of ordination: Direct ordinations - organizing observations along some known gradient( e.g. space, time, or elevation) Indirect ordinations – detect trends within data without the user needing to define end-points or gradients within their data. Indirect ordinations are powerful tools for probing and exploring multivariate data.

D Direct ordinations Plant community succession in time on sand dunes at the southern end of Lake Michigan (Olsen , 1958).

D Direct ordinations The distribution of plant communities away from a stream edge in a floodplain forest (Hughes and Cass, 1997).

D Indirect ordinations Indirect ordinations produce a set of gradients which are inherent in the data, called ordination axes. An approach to producing a meaningful summary of the patterns underlying multivariate data. Data space – an abstract high-dimensional mathematical space where the number of dimensions equals the number of variables being plotted against each other. Each variable is considered an axis and each axis is oriented at 90 degrees to all previous axes. Samples which consists of similar measurements will be close to each other in data space and samples which have very different composition will be widely separated.

D Indirect ordinations All variables are of equal importance in ordination techniques. Multivariate data is difficult to visualize because it contains too many dimensions to allow for easy plotting of all possible graphs. Ordination techniques take as input an object in high-dimensional data space, and produce as output an object in a new lower-dimensional data space. Both the raw data space and the new data space consist of axes meeting at 90 degrees.

D Indirect ordinations All ordination techniques involve calculating new variables, called ordination or axis scores. The axes produced by an ordination will be in descending order of importance, with the first axis being the most informative, the second axis the second most informative, etc. The most useful ordination diagram plots the first ordination axis against the second. Ordination techniques do not directly provide probability levels, an ordination cannot be said to be statistical significant.

Indirect ordinations Plot of bivariate data: data points with similar values for both properties (one distinct cluster). property 2 data points with similar values for property 1 (1 or 2 clusters). property 1

Indirect ordinations Ordination diagram: D Ordination diagrams look like a bivariate graph with axes labeled with an axis number (ordination axis) not with the name of a measured variables. Ordination axis derives from the raw data by the techniques which was invoked. Each point defines the properties of the entire row of values collected for each observation. Variables Observations Axis 2 Axis 1

Indirect ordinations Bi-plots: D Ordination of properties of the columns of the matrix (variables) are overlain on top of the main ordination diagram in order to highlight relationships between the two sets of information. The second set of information is shown as arrows that run from (0,0) to the coordinate in question. Variables Cluster with high values of v5 and low values of v6. v1 v2 v3 v4 v5 v6 Observations Axis 2 v5 v6 Axis 1

Indirect ordinations The choice of variables: No division of the data into “dependent” and “independent” variables. Example of selection of variables: study of soil (pH, elemental contents, cation exchange capacity ..) morphometric study (available body dimensions) biological community ( all available species) Each variable should be checked for normality and outliers.

D Indirect ordinations Log-transform the data (log(x+1) for data containing zeros) - makes it more likely to pick up underlying trends. Decide if variables that are valid but appear to be of little importance should be excluded. Sensitivity to the inclusion of scarce species: Robust (Bray-Curtis ordination, Principal Component Analysis) Overemphasize the importance of rare species (correspondent analysis and DCA).