Multivariate Analysis of a Carbonate Chemistry Time-Series Study

Slides:



Advertisements
Similar presentations
An Introduction to Multivariate Analysis
Advertisements

Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Principal Components Analysis Babak Rasolzadeh Tuesday, 5th December 2006.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Techniques for studying correlation and covariance structure
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Introduction to Descriptive Statistics Objectives: 1.Explain the general role of statistics in assessment & evaluation 2.Explain three methods for describing.
Computer Graphics and Image Processing (CIS-601).
Principal Component Analysis (PCA). Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite)
Project Presentation Template (May 6)  Make a 12 minute presentation of your results (14 students ~ 132 mins for the entire class) NOTE: send ppt by mid-night.
ORDINATION What is it? What kind of biological questions can we answer? How can we do it in CANOCO 4.5? Some general advice on how to start analyses.
Christina Bonfanti University of Miami- RSMAS MPO 524.
Principal Component Analysis
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter 3: Describing Relationships
Chapter 11 Analysis of Variance
Unsupervised Learning
I. ANOVA revisited & reviewed
MATH-138 Elementary Statistics
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 20th February 2014  
Multiple Regression Prof. Andy Field.
Chapter 3: Describing Relationships
Linear Filters and Edges Chapters 7 and 8
What we’ll cover today Transformations Inferential statistics
Analyzing Redistribution Matrix with Wavelet
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Basic Statistics Overview
Fundamentals of regression analysis
Applied Statistical Analysis
Applied Statistical Analysis
Multivarite Analysis Goals
Quality Control at a Local Brewery
Stats Club Marnie Brennan
Multivariate environmental characterization of samples
Interpreting Principal Components
Multivariate Analysis of Trace Elements from Coral Cores
Elementary Statistics: Looking at the Big Picture
Chapter 3: Describing Relationships
Project Presentation Template
The survival and growth of deep sea gorgonians at the Waikiki Aquarium
Stranding Patterns in Stenella spp.
Coral Species distribution and Benthic Cover type He’eia HI
Multivariate Analysis on Stenella Longirostris Pathology Reports in the Main Hawaiian Islands Haley Boyd.
Fish Communities Before and After a Bleaching Event
Chapter 3: Describing Relationships
Descriptive and Inferential
CHAPTER 10 Comparing Two Populations or Groups
Chapter 3: Describing Relationships
Ordination for Body Condition and Cause of Death in Adult Bonin Petrels (Pterodroma hypoleuca) Goal: 1) Develop Body Condition Index (BCI) to describe.
Chapter 3: Describing Relationships
Principal Components Analysis
Basic Practice of Statistics - 3rd Edition Inference for Regression
Principal Component Analysis (PCA)
Principal Component Analysis
PCA of Waimea Wave Climate
Chapter 3: Describing Relationships
Dataset: Time-depth-recorder (TDR) raw data 1. Date 2
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Marios Mattheakis and Pavlos Protopapas
Unsupervised Learning
Presentation transcript:

Multivariate Analysis of a Carbonate Chemistry Time-Series Study Kellie Teague Mars 6300 Spring 2018

Objective Study A time-series observation of changes in carbonate chemistry parameters within 2 coral reefs off of Waimanalo, Oahu, Hawaii. Water samples were collected once per week (April 2010 – May 2011) at sunrise, solar noon, and sunset. Goals Identify any statistical patterns inherent in the carbonate chemistry data, particularly with respect to Total Alkalinity (TA) and Dissolved Inorganic Carbon (DIC). Determine the driving environmental influences to changes in the carbonate system surrounding coral reefs. Hypothesis: Because the independent variables are highly cross-correlated, an ordination analysis should show a distinct gradient based on time of day and time of year sampled. Location should not have a strong role, as the sampling sites are quite close together and share the same source water.

Dataset Description 294 discrete water samples – 147 from each site named by location (K or M), time of day (M, N, or E) and sampling date Ex: K1.M.040110 5 independent variables Sea Surface Temperature, SST (℃) Salinity (psu) pH Total Alkalinity (TA), µmol kg-1 Dissolved Inorganic Carbon (DIC), µmol kg-1

Dataset Processing Row/Column Summary -1 < skewness < 1 0% empty cells Initial Outlier Analysis Relative Euclidean, + 2 SD 13 samples identified None were omitted (all within 3 SD) Relativization General relativization (p = 1) was used to give all variables equal weight Final Outlier Analysis Euclidean Distance, + 3 SD 7 samples identified None were omitted (in order to maintain an equal sample size across groups) Results of the final outlier analysis, after data relativization

Dataset Exploration Cross-Correlations pH strongly negatively correlated with TA and DIC TA and DIC strongly correlated with each other Temperature is moderately correlated with pH, TA, & DIC Salinity is weakly correlated with the other variables All pair-wise correlations were found to be significant

Dataset Analysis This dataset has a normal distribution (according to the skewness values), no empty cells in the matrix, and a robust sample size. Therefore, it is ideally suited for a Principle Component Analysis (PCA) ordination. Hypothesis Because the independent variables are highly cross-correlated, an ordination analysis should show a distinct gradient based on time of day and time of year sampled. Location should not have a strong role, as the sampling sites are quite close together and share the same source water. Used covariance matrix with Euclidean distance

Results Interpretation

Results Interpretation Stopping Rules p-value : 1 axis average eigenvalue: 1 axis broken-stick eigenvalue: 1 axis Axis 1 vs. Axis 2 – orthogonality = 100% A 1-axis solution provides the best result and explains 81.4% of the variance

Results Interpretation The strongest loadings for both Axis 1 and Axis 2 are from Temperature and DIC Axis 1 – Temperature is stronger and the two variables oppose one another Axis 2 – DIC is stronger and the two variables go together

Results Interpretation

Results Interpretation Correlations with Temperature Correlations with DIC Correlations with temperature: Axis 1 -0.932, Axis 2 0.358 Correlations with DIC: Axis 1 0.869, Axis 2 0.496

Discussion - The Method The results of this PCA suggest that temperature has a strong influence on the various parameters of the carbonate chemistry system. The gradient along Axis 1 shows a clear distinction between samples taken in the morning (M), at solar noon (N), and in the evening (E). Grouping the ordination results by location confirmed that sampling site did not play a major role in the gradient distribution. As expected based on the cross-correlations, the eigenvectors for TA and DIC were very similar and pH showed an opposing trend. This method was helpful in supporting suspected patterns within the dataset.

Discussion – Next Steps For a re-analysis, I propose using a PerMANOVA to quantify the effect size of the different categorical variables (location, time of day, season of year). PerMANOVA is the best way to compare the between group dissimilarity to the within group dissimilarity via the F statistic using a distance-based approach. For a PerMANOVA analysis, the groups must be balanced (which is one reason no outliers were omitted from the original PCA). The next steps for this study should include determining if temperature is a true driving force in the carbonate system of coral reefs, or if other influences like light availability and water residence time more strongly affect coral reef metabolic processes.

Questions?