Multivariate Analysis of a Carbonate Chemistry Time-Series Study

Slides:

Advertisements

Similar presentations

An Introduction to Multivariate Analysis

Advertisements

Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11

1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.

Principal Components Analysis Babak Rasolzadeh Tuesday, 5th December 2006.

Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.

Techniques for studying correlation and covariance structure

The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.

Introduction to Descriptive Statistics Objectives: 1.Explain the general role of statistics in assessment & evaluation 2.Explain three methods for describing.

Computer Graphics and Image Processing (CIS-601).

Principal Component Analysis (PCA). Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite)

Project Presentation Template (May 6)  Make a 12 minute presentation of your results (14 students ~ 132 mins for the entire class) NOTE: send ppt by mid-night.

ORDINATION What is it? What kind of biological questions can we answer? How can we do it in CANOCO 4.5? Some general advice on how to start analyses.

Christina Bonfanti University of Miami- RSMAS MPO 524.

Principal Component Analysis

Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.

Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.

Methods of multivariate analysis Ing. Jozef Palkovič, PhD.

Chapter 3: Describing Relationships

Chapter 11 Analysis of Variance

Unsupervised Learning

I. ANOVA revisited & reviewed

MATH-138 Elementary Statistics

DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 20th February 2014

Multiple Regression Prof. Andy Field.

Chapter 3: Describing Relationships

Linear Filters and Edges Chapters 7 and 8

What we’ll cover today Transformations Inferential statistics

Analyzing Redistribution Matrix with Wavelet

APPROACHES TO QUANTITATIVE DATA ANALYSIS

Basic Statistics Overview

Fundamentals of regression analysis

Applied Statistical Analysis

Applied Statistical Analysis

Multivarite Analysis Goals

Quality Control at a Local Brewery

Stats Club Marnie Brennan

Multivariate environmental characterization of samples

Interpreting Principal Components

Multivariate Analysis of Trace Elements from Coral Cores

Elementary Statistics: Looking at the Big Picture

Chapter 3: Describing Relationships

Project Presentation Template

The survival and growth of deep sea gorgonians at the Waikiki Aquarium

Stranding Patterns in Stenella spp.

Coral Species distribution and Benthic Cover type He’eia HI

Multivariate Analysis on Stenella Longirostris Pathology Reports in the Main Hawaiian Islands Haley Boyd.

Fish Communities Before and After a Bleaching Event

Chapter 3: Describing Relationships

Descriptive and Inferential

CHAPTER 10 Comparing Two Populations or Groups

Chapter 3: Describing Relationships

Ordination for Body Condition and Cause of Death in Adult Bonin Petrels (Pterodroma hypoleuca) Goal: 1) Develop Body Condition Index (BCI) to describe.

Chapter 3: Describing Relationships

Principal Components Analysis

Basic Practice of Statistics - 3rd Edition Inference for Regression

Principal Component Analysis (PCA)

Principal Component Analysis

PCA of Waimea Wave Climate

Chapter 3: Describing Relationships

Dataset: Time-depth-recorder (TDR) raw data 1. Date 2

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships

Marios Mattheakis and Pavlos Protopapas

Unsupervised Learning

Presentation transcript:

Multivariate Analysis of a Carbonate Chemistry Time-Series Study Kellie Teague Mars 6300 Spring 2018

Objective Study A time-series observation of changes in carbonate chemistry parameters within 2 coral reefs off of Waimanalo, Oahu, Hawaii. Water samples were collected once per week (April 2010 – May 2011) at sunrise, solar noon, and sunset. Goals Identify any statistical patterns inherent in the carbonate chemistry data, particularly with respect to Total Alkalinity (TA) and Dissolved Inorganic Carbon (DIC). Determine the driving environmental influences to changes in the carbonate system surrounding coral reefs. Hypothesis: Because the independent variables are highly cross-correlated, an ordination analysis should show a distinct gradient based on time of day and time of year sampled. Location should not have a strong role, as the sampling sites are quite close together and share the same source water.

Dataset Description 294 discrete water samples – 147 from each site named by location (K or M), time of day (M, N, or E) and sampling date Ex: K1.M.040110 5 independent variables Sea Surface Temperature, SST (℃) Salinity (psu) pH Total Alkalinity (TA), µmol kg-1 Dissolved Inorganic Carbon (DIC), µmol kg-1

Dataset Processing Row/Column Summary -1 < skewness < 1 0% empty cells Initial Outlier Analysis Relative Euclidean, + 2 SD 13 samples identified None were omitted (all within 3 SD) Relativization General relativization (p = 1) was used to give all variables equal weight Final Outlier Analysis Euclidean Distance, + 3 SD 7 samples identified None were omitted (in order to maintain an equal sample size across groups) Results of the final outlier analysis, after data relativization

Dataset Exploration Cross-Correlations pH strongly negatively correlated with TA and DIC TA and DIC strongly correlated with each other Temperature is moderately correlated with pH, TA, & DIC Salinity is weakly correlated with the other variables All pair-wise correlations were found to be significant

Dataset Analysis This dataset has a normal distribution (according to the skewness values), no empty cells in the matrix, and a robust sample size. Therefore, it is ideally suited for a Principle Component Analysis (PCA) ordination. Hypothesis Because the independent variables are highly cross-correlated, an ordination analysis should show a distinct gradient based on time of day and time of year sampled. Location should not have a strong role, as the sampling sites are quite close together and share the same source water. Used covariance matrix with Euclidean distance

Results Interpretation

Results Interpretation Stopping Rules p-value : 1 axis average eigenvalue: 1 axis broken-stick eigenvalue: 1 axis Axis 1 vs. Axis 2 – orthogonality = 100% A 1-axis solution provides the best result and explains 81.4% of the variance

Results Interpretation The strongest loadings for both Axis 1 and Axis 2 are from Temperature and DIC Axis 1 – Temperature is stronger and the two variables oppose one another Axis 2 – DIC is stronger and the two variables go together

Results Interpretation

Results Interpretation Correlations with Temperature Correlations with DIC Correlations with temperature: Axis 1 -0.932, Axis 2 0.358 Correlations with DIC: Axis 1 0.869, Axis 2 0.496

Discussion - The Method The results of this PCA suggest that temperature has a strong influence on the various parameters of the carbonate chemistry system. The gradient along Axis 1 shows a clear distinction between samples taken in the morning (M), at solar noon (N), and in the evening (E). Grouping the ordination results by location confirmed that sampling site did not play a major role in the gradient distribution. As expected based on the cross-correlations, the eigenvectors for TA and DIC were very similar and pH showed an opposing trend. This method was helpful in supporting suspected patterns within the dataset.

Discussion – Next Steps For a re-analysis, I propose using a PerMANOVA to quantify the effect size of the different categorical variables (location, time of day, season of year). PerMANOVA is the best way to compare the between group dissimilarity to the within group dissimilarity via the F statistic using a distance-based approach. For a PerMANOVA analysis, the groups must be balanced (which is one reason no outliers were omitted from the original PCA). The next steps for this study should include determining if temperature is a true driving force in the carbonate system of coral reefs, or if other influences like light availability and water residence time more strongly affect coral reef metabolic processes.

Questions?