GLM Interaction Terms and Patterns of Change

Slides:



Advertisements
Similar presentations
Step three: statistical analyses to test biological hypotheses General protocol continued.
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Community and gradient analysis: Matrix approaches in macroecology The world comes in fragments.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
PSY 307 – Statistics for the Behavioral Sciences
Generalized Linear Models (GLM)
Lecture 7: Principal component analysis (PCA)
ANOVA notes NR 245 Austin Troy
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
ARE OBSERVATIONS OBTAINED DIFFERENT?. ARE OBSERVATIONS OBTAINED DIFFERENT? You use different statistical tests for different problems. We will examine.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Quantitative Genetics
Lorelei Howard and Nick Wright MfD 2008
Repeated Measures ANOVA Used when the research design contains one factor on which participants are measured more than twice (dependent, or within- groups.
Separate multivariate observations
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Regression and Correlation Methods Judy Zhong Ph.D.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Simple Linear Regression
Some matrix stuff.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
From Micro to Macro: The Evolution of Phenotypic Diversity in Plethodon Salamanders Dean C. Adams Department of Ecology, Evolution and Organismal Biology.
BIOL 582 Lecture Set 11 Bivariate Data Correlation Regression.
BIOL 582 Lecture Set 21 One-Way MANOVA, Part I. So far we have learned two things about multivariate data: 1.That linear models work equally well with.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 14 Repeated Measures and Two Factor Analysis of Variance
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
Mixed ANOVA Models combining between and within. Mixed ANOVA models We have examined One-way and Factorial designs that use: We have examined One-way.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
PCB 3043L - General Ecology Data Analysis.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Tutorial I: Missing Value Analysis
Analysis of Variance 11/6. Comparing Several Groups Do the group means differ? Naive approach – Independent-samples t-tests of all pairs – Each test doesn't.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 18 Part 5 Analysis and Interpretation of Data DIFFERENCES BETWEEN GROUPS AND RELATIONSHIPS.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L12.1 Lecture 12: Generalized Linear Models (GLM) What are they? When do.
Université d’Ottawa / University of Ottawa 2003 Bio 8102A Applied Multivariate Biostatistics L4.1 Lecture 4: Multivariate distance measures l The concept.
Methods of Presenting and Interpreting Information Class 9.
Chapter 11 Analysis of Variance
Factorial Experiments
PCB 3043L - General Ecology Data Analysis.
CH 5: Multivariate Methods
Effect of Mercury on Wall Lizard (Podarcis muralis) Populations: Cranial Morphology and Allometry – Preliminary Results Aleksandar UROŠEVIĆ1, Milena CVIJANOVIĆ1,
12 Inferential Analysis.
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
CHAPTER 29: Multiple Regression*
Statistics review Basic concepts: Variability measures Distributions
Advanced Biostatistics
I. Statistical Tests: Why do we use them? What do they involve?
LEARNING OUTCOMES After studying this chapter, you should be able to
12 Inferential Analysis.
Principal Component Analysis (PCA)
Fixed, Random and Mixed effects
Product moment correlation
Multivariate Statistical Methods
Chapter 10 Introduction to the Analysis of Variance
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

GLM Interaction Terms and Patterns of Change Advanced Biostatistics Dean C. Adams Lecture 7 EEOB 590C

Patterns of Variation GLM models assess patterns of variation and covariation Are groups different from one another? Does Y covary with X? Methods assess clouds of points (‘dots in space’) to look for patterns Group differences: Are clouds separated? Regression: Is cloud elliptical (i.e. covariation)? Often, what we really want to know is about patterns of change, not static patterns of variation SVL 19.58 27.65 35.73 43.80 51.88 headwdth 3.03 3.73 4.44 5.14 5.85

Patterns of Change Many hypotheses in E&E are really interested in patterns of change: How does the phenotype change across environments? (plasticity) How do traits change through evolutionary time? (quantitative genetics) How do traits change through development? (ontogenetics) Are patterns of variation constant across space or time? (e.g., spatial data) GLM method only partially address these questions because they examine static patterns of variation (though, these are the statistical tools we commonly use)

Factorial ANOVA: Interactions Interactions measure the joint effect of main effects A & B Identifies whether response to A dependent on level of B Are VERY common in biology Example: 2 species in 2 environments (Factors A & B), species 1 has higher growth rate in moist environment, while species 2 has higher growth rate in dry environment. This would be identified as an interaction between species & environment Growth rate Wet Dry Species 1 Species 2 Note: The study of trade-offs (reaction norms) in evolutionary ecology is based on the study of interactions

Understanding Interaction Terms Significant interactions identify a joint response of factors (response to Factor B depends on your level in Factor A) Interpreting interactions for univariate data is straightforward Growth rate Wet Dry Species 1 Species 2 1 2 3 4 V1 E1 E2 1 2 3 4 V2 E1 E2 1 2 3 4 V3 E1 E2 1 2 3 4 V3 E1 E2 divergence minor crossing, similar values major crossing, reversal of values effect-no effect Collyer and Adams. (2007). Ecology.

Bivariate Interaction Terms For two traits, more complicated variants are possible Pairwise comparisons do not fully describe pattern (they determine which groups differ, but not how) 1 2 3 4 V2 V1 1 2 3 4 V3 V1 direction change: rank-order direction change 1 2 3 4 V3 V2 1 2 3 4 V2 direction change: crossing effect-no effect V3 Collyer and Adams. (2007). Ecology.

Multivariate Interaction Terms Geometrically, concept extends to higher dimensions 1 2 3 4 5 V3 V1 V2 Collyer and Adams. (2007). Ecology.

Quantifying Patterns of Change Magnitude: amount of change Direction: orientation of change Phenotypic Change Vector Magnitude Direction 5 4 3 V3 2 1 4 3 2 V2 1 Collyer and Adams. (2007). Ecology. Collyer, Sekora, Adams (2015). Heredity. 1 2 3 4 V1

Change Vectors: Hypothesis Tests Patterns of change assessed using residual randomization Protocol Define model Estimate coefficients Estimate LS means Calculate vector attributes and statistics Design matrix with factors A, B, and A×B Design matrix coded to find means Magnitude Direction Collyer and Adams. (2007). Ecology.

Change Vectors: Hypothesis Tests Patterns of change assessed using residual randomization Protocol Define model Estimate coefficients Estimate LS means Calculate vector attributes and statistics Define ‘reduced’ model Estimate values of Y Obtain residuals Design matrix with factors A, B, and A×B Design matrix coded to find means Design matrix with factors A and B only Collyer and Adams. (2007). Ecology. Collyer, Sekora, Adams (2015). Heredity.

Change Vectors: Hypothesis Tests Patterns of change assessed using residual randomization Protocol Repeat many times 9. Randomize residuals i.e., shuffle rows 10. Add randomized residuals to estimated values from reduced model Random value preserved the main effects of the reduced model Repeat steps 1 – 4 to obtain random statistics By creating random (sampling) distributions of the magnitude difference and angle between vectors, P-values for the observed values are described as the percentiles in the distributions. (I.e., the P-value is the probability of finding a greater or equal value by chance) Collyer and Adams. (2007). Ecology.

Change Vectors: Hypothesis Tests Patterns of change assessed using residual randomization Protocol Repeat many times 9. Randomize residuals i.e., shuffle rows 10. Add randomized residuals to estimated values from reduced model Random value preserved the main effects of the reduced model Repeat steps 1 – 4 to obtain random statistics 1.0 2.0 3.0 Observed |d1-d2| x 100 4 8 12 16 20 24 Angle, θ Collyer and Adams. (2007). Ecology.

Example 1: Plethodon Salamanders Ecological character displacement (P. jordani vs. P. teyahalee) Significant species, site, species×site Conclusion: species differ in way they diverge, not how much change they exhibit from allopatry to sympatry Effect Exact F Df P Species 4.59 18,315 < 0.0001 Site 11.46 <0.0001 Species*Site 2.15 0.0047 DJord = 0.087, DTeh = 0.099, P = 0.172 NS  = 47.71, P < 0.0001 Data from Adams. (2004). Ecology. Collyer and Adams. (2007). Ecology.

Example 2: Desert Pupfish Sexual dimorphism in white sands pupfish (C. tularosa) Significant population, sex, population×sex Conclusion: populations display different amounts of sexual dimorphism and different directions of dimorphism 1 2 3 4 11 12 10 9 5 6 7 8 13 DSC = 0.068, DMO = 0.044, P < 0.0001 PC II Male Salt Creek Female. Salt Creek Male. Mound Spring Female. Mound Spring  = 23.88, P < 0.0001 Variance explained = 59.1% PC I Collyer and Adams. (2007). Ecology.

Patterns of Change with Covariates For many hypotheses, we must account for covariate terms while assessing patterns of change Example: Character displacement tests: Dsymp > Dallo If phenotype varies along environmental gradient, must account for it Incorporate covariate in X; rest of protocol remains unchanged Adams and Collyer. (2007). Evolution.

Simulated Examples Character change along a gradient, 3 scenarios: No character displacement (CD) Asymmetric character displacement Symmetric character displacement This approach correctly identifies CD when it is present, and does not identify it when it is not present Adams and Collyer. (2007). Evolution.

Generalizations Method easily generalized for more than 2 groups Must do in pairwise fashion (1 vs. 2, 1 vs. 3, etc.) (e.g,. Hollander, Collyer, Adams, and Johannesson. 2006. J. Evol. Biol. 19:1861-1872.) For > 2 states (e.g., environments) phenotypic change vector is now a TRAJECTORY (later) Adams and Collyer (2009) Evolution.

Trajectories: Concept Values represent sequential states (e.g., developmental stages, temporal points) y3 A data space for three variables Trajectory of multivariate change y1 y2 Adams and Collyer (2009) Evolution.

Attributes of Change Trajectories Magnitude Magnitude Difference in Direction Difference in Magnitude (Path distance) *p = principal eigenvector scaled to unit size Difference in Shape Difference in Magnitude Difference in Direction Z i= matrices that have been scaled to unit size, centered, and rotated to minimize variation among them

Procrustes Trajectory Analysis Scaled and Centered Adams and Collyer (2009) Evolution.

Procrustes Trajectory Analysis Shape difference = square root of summed squared differences between corresponding “landmarks” Scaled and Centered Rotated Adams and Collyer (2009) Evolution.

Attributes of Change Trajectories Magnitude Magnitude Difference in Direction Difference in Magnitude (Path distance) *p = principal eigenvector scaled to unit size Difference in Shape Difference in Magnitude Difference in Direction Can Residual Randomization be used to test null hypotheses for these statistics? Adams and Collyer (2009) Evolution.

Simulated Example A B D C Adams and Collyer (2009) Evolution. 5 10 15 5 10 15 20 V I V II Adams and Collyer (2009) Evolution.

Simulated Example ~ Same Length A B D C Expect PDA = PDC = PDD 5 10 15 20 V I V II ~ Same Length Expect PDA = PDC = PDD Adams and Collyer (2009) Evolution.

Simulated Example ~ Same Direction A B D C Expect QAB = QAD = QBD = 0 5 10 15 20 V I V II ~ Same Direction Expect QAB = QAD = QBD = 0 Adams and Collyer (2009) Evolution.

Simulated Example ~ Same Shape A B D C Expect DAB = DAC = DBC 5 10 15 20 V I V II ~ Same Shape Expect DAB = DAC = DBC Adams and Collyer (2009) Evolution.

Simulated Example: Results B C D 5 10 15 20 V I V II PTA identifies differences when present, and does not when they are not present. Adams and Collyer (2009) Evolution.

Example I: Parallel Evolution in Plethodon Ecological work demonstrates competition prevalent Plethodon biogeography: replicated communities across contact zones Are microevolutionary changes repeatable? Measured head shape from 336 specimens across three mountain transects (allopatrysympatry) Plethodon jordani Plethodon teyahalee Adams 2010. BMC Evol. Biol.

Microevolution Occurs Phenotypic evolution is present Patterns are REPEATABLE Factor DfFactor Pillai’s Trace Approx. F df P Species 1 0.741 48.874 18, 307 < 0.0001 Locality Type 0.794 65.612 Geographic Transect 2 0.783 11.015 36, 616 Species × Locality 0.519 18.373 Species × Transect 0.289 2.888 Locality × Transect 0.338 3.482 Species×Locality×Transect 0.161 1.499 0.0327 Vector Magnitude Vector Orientation A: P. jordani HR KP TC 0.1849 NS 0.3192 NS 0.6074 NS 0.3665 NS 0.01689 0.0309 NS 26.785 0.4071 NS 0.00871 0.02560 31.502 41.545 B: P. teyahalee 0.3363 NS 0.8106 NS 0.7965 NS 0.5579 NS 0.3261 NS 19.506 0.5069 NS 0.00253 0.01224 25.033 34.136 Adams 2010. BMC Evol. Biol.

Repeatable Evolutionary Changes NO difference in magnitude or direction of evolutionary changes among transects within species (i.e. common patterns found) Conclusion: Evolutionary response to competition repeatable in each species: parallel evolution of character displacement P. jordani P. teyahalee Adams 2010. BMC Evol. Biol.

Example II: Snake Ontogeny Measured head shape from 3,107 LIVE SNAKES from 2 species (males and females Collyer and Adams. 2103. Hystrix. Data from Davis (2012) PhD Dissertation, University of Illinois

Example II: Snake Ontogeny Measured head shape from 3,107 LIVE SNAKES from 2 species (males and females Sexual dimorphism MD = 0.0005 C. viridis P = 0. 0005 MD = 0.0060 C. oregnaus P = 0. 0001 Collyer and Adams. 2103. Hystrix. Data from Davis (2012) PhD Dissertation, University of Illinois

Example II: Snake Ontogeny Measured head shape from 3,107 LIVE SNAKES from 2 species (males and females Amount of ontogenetic shape change MD = 0.0119 Females, P = 0. 0069 MD = 0.0184 Males, P = 0. 0005 Collyer and Adams. 2103. Hystrix. Data from Davis (2012) PhD Dissertation, University of Illinois

Example II: Snake Ontogeny Measured head shape from 3,107 LIVE SNAKES from 2 species (males and females Shape of ontogenetic shape change Dp = 0.21 Females, P = 0. 0405 Dp = 0.21 Males, P = 0. 0048 Collyer and Adams. 2103. Hystrix. Data from Davis (2012) PhD Dissertation, University of Illinois

Interaction Terms: Conclusions Significant interactions are the most interesting result biologically Tell us that response to factor A dependent on level of factor B Imply that the change across levels is not consistent Many E&E questions are really interested in change Phenotypic plasticity, ontogenetics, species interactions, local adaptation, adaptive diversification, etc. Significance tests of effects are not sufficient to determine how change has occurred and how patterns of change differ Must quantify attributes of change (magnitude, orientation, shape of change trajectory) and statistically assess these Provides more complete understanding of biological change