Multivariate Analysis on Stenella Longirostris Pathology Reports in the Main Hawaiian Islands Haley Boyd.

Slides:



Advertisements
Similar presentations
Step three: statistical analyses to test biological hypotheses General protocol continued.
Advertisements

ANOVA and Linear Models. Data Data is from the University of York project on variation in British liquids. Data is from the University of York project.
Module 36: Correlation Pitfalls Effect Size and Correlations Larger sample sizes require a smaller correlation coefficient to reach statistical significance.
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
Chapter 9 Correlational Research Designs
T-tests and ANOVA Statistical analysis of group differences.
Describing Relationships: Scatter Plots and Correlation ● The world is an indivisible whole (butterfly effect and chaos theory; quantum entanglement, etc.)
Leedy and Ormrod Ch. 11 Gray Ch. 14
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.
AM Recitation 2/10/11.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Understanding Multivariate Research Berry & Sanders.
Statistics 1 The Basics Sherril M. Stone, Ph.D. Department of Family Medicine OSU-College of Osteopathic Medicine.
Section 12.1 Scatter Plots and Correlation HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems,
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Introductory Topics PSY Scientific Method.
Examining Relationships in Quantitative Research
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
QUANTITATIVE RESEARCH AND BASIC STATISTICS. TODAYS AGENDA Progress, challenges and support needed Response to TAP Check-in, Warm-up responses and TAP.
Project Presentation Template (May 6)  Make a 12 minute presentation of your results (14 students ~ 132 mins for the entire class) NOTE: send ppt by mid-night.
Exam Review Day 6 Chapters 2 and 3 Statistics of One Variable and Statistics of Two Variable.
Lab 9: Two Group Comparisons. Today’s Activities - Evaluating and interpreting differences across groups – Effect sizes Gender differences examples Class.
1 Association  Variables –Response – an outcome variable whose values exhibit variability. –Explanatory – a variable that we use to try to explain the.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.
Developing a Hiring System Measuring Applicant Qualifications or Statistics Can Be Your Friend!
Copyright © 2014 by Nelson Education Limited Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured.
Principal Component Analysis
Appendix I A Refresher on some Statistical Terms and Tests.
Chapter 11 Summarizing & Reporting Descriptive Data.
Inferential Statistics
Correlation and Linear Regression
Scatter Plots and Correlation
Inferential Statistics
APPROACHES TO QUANTITATIVE DATA ANALYSIS
SIMPLE LINEAR REGRESSION MODEL
Statistical Analysis of Research
SPSS OUTPUT & INTERPRETATION
Research methods Lesson 2.
Chapter 9 Correlational Research Designs
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Stats Club Marnie Brennan
SPSS OUTPUT & INTERPRETATION
Multivarite Analysis Goals
Linear transformations
2. Find the equation of line of regression
Central tendency and spread
Multivariate community analysis
Multivariate Analysis of Trace Elements from Coral Cores
Project Presentation Template
Descriptive Statistics vs. Factor Analysis
Stranding Patterns in Stenella spp.
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Fish Communities Before and After a Bleaching Event
Ordination for Body Condition and Cause of Death in Adult Bonin Petrels (Pterodroma hypoleuca) Goal: 1) Develop Body Condition Index (BCI) to describe.
Multivariate Analysis of a Carbonate Chemistry Time-Series Study
Principal Component Analysis
PCA of Waimea Wave Climate
Dataset: Time-depth-recorder (TDR) raw data 1. Date 2
Analysis of Complex Designs
Multiple Regression – Split Sample Validation
Correlation A measure of the strength of the linear association between two numerical variables.
Correlation & Trend Lines
Association between 2 variables
Standard DA-2 Indicator 2
InferentIal StatIstIcs
Multivariate analysis of community structure data
Presentation transcript:

Multivariate Analysis on Stenella Longirostris Pathology Reports in the Main Hawaiian Islands Haley Boyd

Objective Marine Mammal Response Program completes necropsies on stranded marine cetaceans and sends samples of internal organs to pathologists on the mainland. The pathology reports contain information that identify any diseases that could have lead to the cause of death of each individual animal Through analysis of pathology report data the goal of this project is to determine what environmental factors make Stenella Longirostris most vulnerable to disease. Marine cetaceans are sentinel species that are indicators of ocean health and disease information can be used to identify possible threats to humans or other marine mammal species. The information that can be gathered from applying multivariate statistical tests Stenella longirostris data can help determine any differences within the populations amongst locations, between males and females, and at different age classes.

Data Description Species Data Environmental Data Samples: 30, Stenella Longirostirs Individuals Variables:14, Pathology Report Findings Septicemia Mastitis Pneumonia meningitis Cryptococcus Gttiigi Ulcers Etc. Samples: 30, Stenella Longirostirs Indivduals Variables: 3 Location: Big Island, Maui, Oahu, Kauai Sex: Male, Female Age: Fetus, Neonate, Calf, Subadult, Adult

Data Processing Species Data Discarded Environmental Data Discarded No outliers Identified in Outlier Analysis Rule Used: If only saw disease in one individual was thrown out of analysis Mastitis Morbillivirus Toxoplasmosis Brucella Net Entanglement Using Jaccard Distance Measure 2 samples were discarded because they had > SD

Data Exploration All of the data that I am analyzing is categorical and therefore requires a non parametric test for analysis The correlation values represent strength and the distance between two variables. This means that the relationship between location and sex is positive but not very strong which would make sense. The correlation between location and age is also very low and the correlation between sex and age is negative and .02(not strong).

Data Analysis Used NMDS test with Sorensen (Bray- Curtis) because data is non parametric Max axis=6

NMDS Results Number of axes: Criteria 1: Decrease of stress by 5 = 2 axis result Criteria 2: P value <.05 = 2 axis result

Scree Plot Significance Rule Stress reduction rule: At 2 Dimensions the Real Data line remains below the randomized data. Once 3 Dimensions is reached the line intersects the randomized data. Stress reduction rule: Stress is reduced by 5 or greater for a 2D solution For 3D it is less than 5

Results Coefficient of Determination (% Variance): Orthogonality: Measure of Independence of the three axes

Result Interpretation Ordination plot for the NMDS of Pathology Data with environmental vectors scaled to 100% Axis 1 and Axis 2 have the lowest stress and highest coefficient of variation. Clarkes Rules of thumb signify 7.24 stress level as “a good ordination with no real risk of drawing false inferences” All Variables in small solid circles and all species in open circles

Next Step: MRPP Results Location Sex Age A>0 means that the data is more similar within groups. P<.05 for significance

Discussion The results received of a significant mrpp value for environmental variable location, determine that disease presence is different within groups. The next steps would be to complete an ISA to determine WHICH groups are different via a pairwise comparison, what variables are making them most different Lessons Learned: The more data points available the more similar the sample is to the population and the more significance in results For future research, I would add in several other species for comparison to identify true patterns in the data across species