Lecture 15 - Hypothesis Testing A. Competing a priori hypotheses - Paired-Sites Tests Null Hypothesis : There is no difference in support for one tree.

Slides:



Advertisements
Similar presentations
Lecture 2: Null Hypothesis Significance Testing Continued Laura McAvinue School of Psychology Trinity College Dublin.
Advertisements

Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
STAT 135 LAB 14 TA: Dongmei Li. Hypothesis Testing Are the results of experimental data due to just random chance? Significance tests try to discover.
Likelihood ratio tests
Significance Testing Chapter 13 Victor Katch Kinesiology.
Maximum Likelihood. Likelihood The likelihood is the probability of the data given the model.
Hypothesis testing Week 10 Lecture 2.
“Inferring Phylogenies” Joseph Felsenstein Excellent reference
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Course overview Tuesday lecture –Those not presenting turn in short review of a paper using the method being discussed Thursday computer lab –Turn in short.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
T-Tests Lecture: Nov. 6, 2002.
8-2 Basics of Hypothesis Testing
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Test for a Mean. Example A city needs $32,000 in annual revenue from parking fees. Parking is free on weekends and holidays; there are 250 days in which.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Today Concepts underlying inferential statistics
Statistics 270– Lecture 25. Cautions about Z-Tests Data must be a random sample Outliers can distort results Shape of the population distribution matters.
Getting Started with Hypothesis Testing The Single Sample.
Probability Population:
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical hypothesis testing – Inferential statistics I.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
Introduction to Hypothesis Testing
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Descriptive statistics Inferential statistics
Chapter 20: Testing Hypotheses about Proportions
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Phylogeny Estimation: Traditional and Bayesian Approaches Molecular Evolution, 2003
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in heads half the time (i.e.,
Chapter 9 Hypothesis Testing: Single Population
Model Inference and Averaging
Hypothesis Testing for Proportions
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
AP Statistics Section 11.2 A Inference Toolbox for Significance Tests
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Tree Confidence Have we got the true tree? Use known phylogenies Unfortunately, very rare Hillis et al. (1992) created experimental phylogenies using phage.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Confidence intervals and hypothesis testing Petter Mostad
Correlation Assume you have two measurements, x and y, on a set of objects, and would like to know if x and y are related. If they are directly related,
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Not in FPP Bayesian Statistics. The Frequentist paradigm Defines probability as a long-run frequency independent, identical trials Looks at parameters.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
Simple examples of the Bayesian approach For proportions and means.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
INTRODUCTION TO HYPOTHESIS TESTING From R. B. McCall, Fundamental Statistics for Behavioral Sciences, 5th edition, Harcourt Brace Jovanovich Publishers,
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Introduction to Hypothesis Testing
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Bayesian statistics named after the Reverend Mr Bayes based on the concept that you can estimate the statistical properties of a system after measuting.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Hypothesis Testing and Statistical Significance
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Slide Slide 1 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8.
Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.
Lecture 15 - Hypothesis Testing
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Two-sided p-values (1.4) and Theory-based approaches (1.5)
Reasoning in Psychology Using Statistics
CHAPTER 12 Inference for Proportions
CHAPTER 12 Inference for Proportions
Presentation transcript:

Lecture 15 - Hypothesis Testing A. Competing a priori hypotheses - Paired-Sites Tests Null Hypothesis : There is no difference in support for one tree over the other. Test statistic: In Parsimony (or minimum evolution);  = Length A – Length B In Likelihood;  = lnL A – lnL B

Early methods looked at the difference in optimality between the two trees on a site- by-site basis. Thus, these are called paired-sites method. So we would accept the H o that there is no difference in support for either tree. KH-Test: Parsimony – normal distribution Tree # 1 2 Length [So the test statistic is 3 steps] Kishino-Hasegawa test: Tree Length diff s.d.(diff) t P* (best) * Probability of getting a more extreme T-value under the null hypothesis of no difference between the two trees (two-tailed test). Example Normal (mean = 0)

KH-Test: Likelihood – normal distribution Tree 1 2 -ln L [So the test statistic is 28.8 lnL units] Kishino-Hasegawa test: KH test using normal approximation, two-tailed test KH-test Tree -ln L Diff -ln L P * (best) * P < 0.05 Here, the same data and test lead to rejection of H o under a different optimality criterion Same data, but using ML as criterion. KH-Test: Likelihood – RELL bootstrapping Tree 1 2 -ln L Kishino-Hasegawa test: KH test using RELL bootstrap, two-tailed test Number of bootstrap replicates = 1000 KH-test Tree -ln L Diff -ln L P (best)

Same Data, Same Two Trees, Different Inferences

Most applications of these tests have involved comparing a suboptimal tree to the best tree. This results in use of a mismatched null distribution and the tests should not be used in this way. SH Test - corrects for the a priori requirement. Uses a collection of trees (rather than just ML v. Hypothesis). Uses RELL bootstrap to generate an average lnL for the collection. Uses this average to center the null distribution. The test is very conservative, though, and very sensitive to the collection of trees used in centering the null distribution. If there are only two trees in the collection, the ML tree and the tree we’re interested in testing, the SH test is identical to the KH test. Shimodaira has applied multiscale bootstrapping to correct for this bias in the AU (almost unbiased) test

Parametric Bootstrap Tests Generate the null distribution via simulation under the hypothesis being tested. Assuming some hypothesis is true, what is the probability of an observed test statistic? Ancient Fragmentation Hypothesis Northern Dispersal Hypothesis N. Rockies N. Cascades S. Cascades Southern Dispersal Hypothesis N. RockiesS. Cascades N. Cascades PNW Biogeography

Parametric Bootstrap Tests This tree has a likelihood score of lnL =

Parametric Bootstrap Tests Can we reject an ancient fragmentation? Force the tree to fit the predictions of our hypothesis, we get a much worse tree: lnL =  = lnL (uconstrained) – lnL (constrained) = – ( ) = lnL units. If the true tee actually were the constrained tree (i.e., if the hypothesis being examined were actually true), what is the probability that we would see a tree that is lnL units better, simply due to stochasticity? So we simulate many replicate data sets under the hypothesis. We first run an unconstrained search to find the ML tree for the replicate. We then run a search constrained to find the best tree consistent with the hypothesis for each replicate.

Parametric Bootstrap Tests Can we reject an ancient fragmentation? We can reject a southern dispersal hypothesis as well and are left with the northern dispersal hypothesis that the tree suggests.

Bayesian Hypothesis Testing If we run a typical MCMC, we have sample of topologies that represent the posterior distribution of trees. The proportion of the trees in the distribution that are consistent with the topological predictions of each hypothesis provide the posterior probability of that hypothesis. Remember the difference in Bayesian and frequentist perspectives, and the different interpretations of p-values this causes.