A Posteriori Tests of Phylogeographic Hypotheses Jeet Sukumaran and Allen Rodrigo Duke University.

Slides:



Advertisements
Similar presentations
There are two statistical tests for mean: 1) z test – Used for large samples (n ≥ 30) 1) t test – Used for small samples (n < 30)
Advertisements

Friday: Lab 3 & A3 due Mon Oct 1: Exam I  this room, 12 pm Please, no computers or smartphones Mon Oct 1: No grad seminar Next grad seminar: Wednesday,
Statistical POWERAnalysis Statistical POWER Analysis : A Conceptual Tutorial Experimental Psychology Course Dr. Morre Goldsmith March, 2005 Copyright ©
Objectives (BPS chapter 24)
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
Chapter 9 Hypothesis Testing Testing Hypothesis about µ, when the s.t of population is known.
Business Statistics - QBM117
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Differentially expressed genes
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Chapter 9 Hypothesis Testing 9.4 Testing a Hypothesis about a Population Proportion.
Lecture 2: Basic steps in SPSS and some tests of statistical inference
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Probability Population:
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. A sampling error occurs.
Inferential Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
1 Economics 173 Business Statistics Lectures 3 & 4 Summer, 2001 Professor J. Petry.
Hypothesis Testing – Introduction
© 2011 Pearson Prentice Hall, Salkind. Introducing Inferential Statistics.
Descriptive statistics Inferential statistics
Hypothesis Testing.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Claims about a Population Mean when σ is Known Objective: test a claim.
Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in heads half the time (i.e.,
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Overview Basics of Hypothesis Testing
Elementary Statistical Methods André L. Souza, Ph.D. The University of Alabama Lecture 22 Statistical Power.
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
Parametric tests (independent t- test and paired t-test & ANOVA) Dr. Omar Al Jadaan.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Introduction to data analysis: Case studies with iSIKHNAS data Day 3 Beef self-sufficiency 1/ 69.
Tests of Significance: The Basics BPS chapter 15 © 2006 W.H. Freeman and Company.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
CHAPTER OVERVIEW Say Hello to Inferential Statistics The Idea of Statistical Significance Significance Versus Meaningfulness Meta-analysis.
Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.
_ z = X -  XX - Wow! We can use the z-distribution to test a hypothesis.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. Sampling error means.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
PEP-PMMA Training Session Statistical inference Lima, Peru Abdelkrim Araar / Jean-Yves Duclos 9-10 June 2007.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Ex St 801 Statistical Methods Part 2 Inference about a Single Population Mean (HYP)
15 Inferential Statistics.
Chapter 8 Introducing Inferential Statistics.
Chapter 5 STATISTICAL INFERENCE: ESTIMATION AND HYPOTHESES TESTING
P-values.
Review and Preview and Basics of Hypothesis Testing
Hypothesis Testing II: The Two-sample Case
Hypothesis Testing – Introduction
MATH 2311 Section 8.2.
Chapter Review Problems
Inferences on Two Samples Summary
Chapter 9: Hypothesis Tests Based on a Single Sample
Inference as Decision Section 10.4.
Presentation transcript:

A Posteriori Tests of Phylogeographic Hypotheses Jeet Sukumaran and Allen Rodrigo Duke University

A Priori and A Posteriori Phylogeography Hypotheses  Previous discussions on a priori and a posteriori phylogeographic hypotheses include:  The dichotomy between vicariant and dispersalist hypotheses of biogeography  The use of simulations of gene trees using explicit models (e.g., the single-species or multi-species coalescent) incorporating a priori phylogeographic hypotheses  Approaches that use the observed data to drive the discovery of phylogeographic hypotheses (e.g., inference keys in NCPA).

 In those instances where we observe an apparently “meaningful” biogeographic pattern, perform a test of that pattern. We focus on a very particular class of a posteriori inference: A Priori and A Posteriori Phylogeography Hypotheses McGuire et al, Evolution 61:2879 Great Basin Collared Lizard

The Randomization Test  Permute the localities of the samples  Calculate a test statistic (e.g. Maddison’s and Slatkin’s test, which counts the number of “migration” events)  Calculate  =0.05 critical value from distribution of test statistics  Compare with observed value of test statistic

Distribution of Test Statistic

An alternative…

An Example  Imagine a sample of individuals drawn from a population.  We can imagine several different hypothetical spatial partitions of the phylogeny of individuals from this population.

Problems with A Posteriori Hypothesis Testing  We focus on a hypothesis that was obtained after seeing the results.  However, we do not take account of alternative apparently “meaningful” patterns that may be obtained by chance.  Each of these patterns will cause us to perform a hypothesis test of that particular pattern.

Correct Procedure  We define post-hoc testing of phylogeographic hypotheses as a test carried out after observing an “interesting” tree, i.e. a genealogy in which a clade-based partition corresponds (loosely) to a spatial-based partition.  A classical hypothesis test sets the probability of rejecting a true null hypothesis at .  There is a difference between the null hypotheses for a priori and a posteriori tests:  A priori H 0 : The particular interesting pattern does not exist.  A posteriori H 0 : All interesting patterns do not exisit (i.e., there is no geographic structure).  We modify the randomization procedure to allow for any possible “interesting” pattern.

Correct Procedure  There are two ways to perform the randomization test:  Assume a fixed phylogeny, and permute the spatial labels.  Assume fixed spatial labels, and simulate phylogenies.  For either of these, we can construct a test-statistic, e.g., Fisher’s Exact Test statistic or Maddison’s and Slatkin’s s (monophyletic partition discordance).  Identify all possible “interesting” patterns.  For each randomized sample, select the smallest value of the statistic amongst all interesting patterns, and construct distribution using this statistic.

Calculating the True Null Distribution Replicatef(H 1 )f(H 2 )..f(H k ) Etc... Uncorrected Crit. CDF H1 (u) = 0.05 CDF H2 (u) = CDF Hk (u) = 0.05 Corrected Crit..CDF H1 (v) = 0.05/k CDF H2 (v) = 0.05/k..CDF Hk (v) = 0.05/k f(H1…Hk) Etc.

Results Uncorrected critical values have a Type I error rate close to 0.05 when there is only one partition tested. As the number of partitions increase, though, the probability of finding an “interesting” pattern increases. Similar results obtained for both test statistics used (Fisher’s Exact Test and Maddison and Slatkin’s s). Similar results obtained when taxon localities allowed to vary randomly as well as tree.

Applying the Bonferroni Correction  Thus for any a posteriori test, where the hypothesis or hypotheses being tested have not been declared a priori, we can use a corrected α ’ value of α /k, where α is the desired true error rate and k is the number of possible partition patterns in the data.

Conclusions  A posteriori phylogeographic hypotheses tests are carried out after the inferred phylogeny “hints” at a possible spatial partitioning of the data corresponding to genetic structure.  These tests are prone to inflated Type I error rates because there are potentially many random patterns that will cause us to conduct a test of the observed pattern only.  This can be corrected by applying a Bonferroni-style correction, and adjusting for the possible spatial partitions in the data under the null hypothesis  How do we know how many possible spatial patterns there are?