Download presentation
Presentation is loading. Please wait.
Published byAmy Walsh Modified over 9 years ago
1
A Posteriori Tests of Phylogeographic Hypotheses Jeet Sukumaran and Allen Rodrigo Duke University
2
A Priori and A Posteriori Phylogeography Hypotheses Previous discussions on a priori and a posteriori phylogeographic hypotheses include: The dichotomy between vicariant and dispersalist hypotheses of biogeography The use of simulations of gene trees using explicit models (e.g., the single-species or multi-species coalescent) incorporating a priori phylogeographic hypotheses Approaches that use the observed data to drive the discovery of phylogeographic hypotheses (e.g., inference keys in NCPA).
3
In those instances where we observe an apparently “meaningful” biogeographic pattern, perform a test of that pattern. We focus on a very particular class of a posteriori inference: A Priori and A Posteriori Phylogeography Hypotheses McGuire et al, 2007. Evolution 61:2879 Great Basin Collared Lizard
4
The Randomization Test Permute the localities of the samples Calculate a test statistic (e.g. Maddison’s and Slatkin’s test, which counts the number of “migration” events) Calculate =0.05 critical value from distribution of test statistics Compare with observed value of test statistic
5
Distribution of Test Statistic
6
An alternative…
7
An Example Imagine a sample of individuals drawn from a population. We can imagine several different hypothetical spatial partitions of the phylogeny of individuals from this population.
8
Problems with A Posteriori Hypothesis Testing We focus on a hypothesis that was obtained after seeing the results. However, we do not take account of alternative apparently “meaningful” patterns that may be obtained by chance. Each of these patterns will cause us to perform a hypothesis test of that particular pattern.
9
Correct Procedure We define post-hoc testing of phylogeographic hypotheses as a test carried out after observing an “interesting” tree, i.e. a genealogy in which a clade-based partition corresponds (loosely) to a spatial-based partition. A classical hypothesis test sets the probability of rejecting a true null hypothesis at . There is a difference between the null hypotheses for a priori and a posteriori tests: A priori H 0 : The particular interesting pattern does not exist. A posteriori H 0 : All interesting patterns do not exisit (i.e., there is no geographic structure). We modify the randomization procedure to allow for any possible “interesting” pattern.
10
Correct Procedure There are two ways to perform the randomization test: Assume a fixed phylogeny, and permute the spatial labels. Assume fixed spatial labels, and simulate phylogenies. For either of these, we can construct a test-statistic, e.g., Fisher’s Exact Test statistic or Maddison’s and Slatkin’s s (monophyletic partition discordance). Identify all possible “interesting” patterns. For each randomized sample, select the smallest value of the statistic amongst all interesting patterns, and construct distribution using this statistic.
11
Calculating the True Null Distribution Replicatef(H 1 )f(H 2 )..f(H k ) 10.010.04..0.40 20.230.10..0.44 30.110.54..0.33 40.620.28..0.12 50.570.04..0.42............ Etc... Uncorrected Crit. CDF H1 (u) = 0.05 CDF H2 (u) = 0.05..CDF Hk (u) = 0.05 Corrected Crit..CDF H1 (v) = 0.05/k CDF H2 (v) = 0.05/k..CDF Hk (v) = 0.05/k f(H1…Hk) 0.01 0.10 0.11 0.12 0.04.. Etc.
12
Results Uncorrected critical values have a Type I error rate close to 0.05 when there is only one partition tested. As the number of partitions increase, though, the probability of finding an “interesting” pattern increases. Similar results obtained for both test statistics used (Fisher’s Exact Test and Maddison and Slatkin’s s). Similar results obtained when taxon localities allowed to vary randomly as well as tree.
13
Applying the Bonferroni Correction Thus for any a posteriori test, where the hypothesis or hypotheses being tested have not been declared a priori, we can use a corrected α ’ value of α /k, where α is the desired true error rate and k is the number of possible partition patterns in the data.
14
Conclusions A posteriori phylogeographic hypotheses tests are carried out after the inferred phylogeny “hints” at a possible spatial partitioning of the data corresponding to genetic structure. These tests are prone to inflated Type I error rates because there are potentially many random patterns that will cause us to conduct a test of the observed pattern only. This can be corrected by applying a Bonferroni-style correction, and adjusting for the possible spatial partitions in the data under the null hypothesis How do we know how many possible spatial patterns there are?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.