Bayesian Hypothesis Testing In Nested Models Harold Jeffreys Jeff Rouder.

Slides:



Advertisements
Similar presentations
Fundamentals of Bayesian Inference
Advertisements

Introductory Mathematics & Statistics for Business
A small taste of inferential statistics
Chapter 10: The t Test For Two Independent Samples
Inferential Statistics and t - tests
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Chapter 11: The t Test for Two Related Samples
Multiple Regression and Model Building
Statistical POWERAnalysis Statistical POWER Analysis : A Conceptual Tutorial Experimental Psychology Course Dr. Morre Goldsmith March, 2005 Copyright ©
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
Chapter 10: Hypothesis Testing
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
QM Spring 2002 Business Statistics Introduction to Inference: Hypothesis Testing.
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Chapter 9: Introduction to the t statistic
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Chapter 8 Introduction to Hypothesis Testing. Hypothesis Testing Hypothesis testing is a statistical procedure Allows researchers to use sample data to.
Lecture Slides Elementary Statistics Twelfth Edition
Bayesian Inference Using JASP
HYPOTHESIS TESTING Dr. Aidah Abu Elsoud Alkaissi
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
 The “4 Steps” of Hypothesis Testing: 1. State the hypothesis 2. Set decision criteria 3. Collect data and compute sample statistic 4. Make a decision.
Chapter 8 Introduction to Hypothesis Testing
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Chapter 8 Introduction to Hypothesis Testing
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Chapter 20 Testing hypotheses about proportions
Hypotheses tests for means
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Week 71 Hypothesis Testing Suppose that we want to assess the evidence in the observed data, concerning the hypothesis. There are two approaches to assessing.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Chapter 3: Statistical Significance Testing Warner (2007). Applied statistics: From bivariate through multivariate. Sage Publications, Inc.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Lecture & tutorial material prepared by: Dr. Shaffi Shaikh Tutorial presented by: Dr. Rufaidah Dabbagh Dr. Nurah Al-Amro.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Simple examples of the Bayesian approach For proportions and means.
Welcome to MM570 Psychological Statistics
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
Testing a Single Mean Module 16. Tests of Significance Confidence intervals are used to estimate a population parameter. Tests of Significance or Hypothesis.
Chapter 14: Analysis of Variance One-way ANOVA Lecture 9a Instructor: Naveen Abedin Date: 24 th November 2015.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Welcome to MM570 Psychological Statistics Unit 5 Introduction to Hypothesis Testing Dr. Ami M. Gates.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Logic of Hypothesis Testing
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
Chapter 11: Introduction to Hypothesis Testing Lecture 5a
PSY 626: Bayesian Statistics for Psychological Science
Reasoning in Psychology Using Statistics
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Bayesian Hypothesis Testing In Nested Models Harold Jeffreys Jeff Rouder

Bayesian Hypothesis Testing: Example  We prepare for you a series of 10 factual true/false questions of equal difficulty.  You answer 9 out of 10 questions correctly.  Have you been guessing?

Bayesian Hypothesis Testing: Example  The Bayesian hypothesis test starts by calculating, for each model, the (marginal) probability of the observed data.  The ratio of these quantities is called the Bayes factor:

Bayesian Hypothesis Testing: Example  BF 01 = p(D|H 0 ) / p(D|H 1 )  When BF 01 = 2, the data are twice as likely under H 0 as under H 1.  When, a priori, H 0 and H 1 are equally likely, this means that the posterior probability in favor of H 0 is 2/3.

Bayesian Hypothesis Testing: Example  The complication is that these so-called marginal probabilities are often difficult to compute.  For this simple model, everything can be done analytically…

Bayesian Hypothesis Testing: Example  BF 01 = p(D|H 0 ) / p(D|H 1 ) =  This means that the data are 1/ = 9.3 times more likely under H 1, the “no-guessing” model.  The associated posterior probability for H 0 is about  For more interesting models, things are never this straightforward!

Savage-Dickey  When the competing models are nested (i.e., one is a more complex version of the other), Savage and Dickey have shown that, in our example,

Savage-Dickey Height of prior distribution at point of interest Height of posterior distribution at point of interest

Height of prior = 1 Height of posterior = 0.107; Therefore, BF 01 = 0.107/1 =

Advantages of Savage-Dickey  In order to obtain the Bayes factor you do not need to integrate out the model parameters, as you would do normally.  Instead, you only needs to work with the more complex model, and study the prior and posterior distributions only for the parameter that is subject to test.

Practical Problem  Dr. John proposes a Seasonal Memory Model (SMM), which quickly becomes popular.  Dr. Smith is skeptical, and wants to test the model.  The model predicts that the increase in recall performance due to the intake of glucose is more pronounced in summer than in winter.  Dr. Smith conducts the experiment…

Practical Problem  And finds the following results:  For these data, t = 0.79, p =.44.  Note that, if anything, the result goes in the direction opposite to that predicted by the model.

Practical Problem  Dr. Smith reports his results in a paper entitled “False Predictions from the SMM: The impact on glucose and the seasons on memory”, which he submits to the Journal of Experimental Psychology: Learning, Memory, and Seasons.  After some time, Dr. Smith receives three reviews, one signed by Dr. John, who invented the SMM. Part of this review reads:

Practical Problem “From a null result, we cannot conclude that no difference exists, merely that we cannot reject the null hypothesis. Although some have argued that with enough data we can argue for the null hypothesis, most agree that this is only a reasonable thing to do in the face of a sizeable amount of data [which] has been collected over many experiments that control for all concerns. These conditions are not met here. Thus, the empirical contribution here does not enable readers to conclude very much, and so is quite weak (...).”

Practical Problem How can Dr. Smith analyze his results and quantify exactly the amount of support in favor of H 0 versus the SMM?

Helping Dr. Smith  Recall, however, that in the case of Dr. Smith the alternative hypothesis was directional; SMM predicted that the increase should be larger in summer than in winter – the opposite pattern was obtained.  We’ll develop a test that can handle directional hypotheses, and some other stuff also.

WSD t-test  WSD stands for WinBUGS Savage Dickey t- test.  We’ll use WinBUGS to get samples from the posterior distribution for parameter δ, which represents effect size.  We’ll then use the Savage-Dickey method to obtain the Bayes factor. Our null hypothesis is always δ = 0.

Graphical Model for the One-Sample t-test

WSD t-test  The t-test can be implemented in WinBUGS.  We get close correspondence to Rouder et al.’s analytical t-test.  The WSD t-test can also be implemented for two-sample tests (in which the two groups can also have different variances).  Here we’ll focus on the problem that still plagues Dr. Smith…

Helping Dr. Smith (This Time for Real)  The Smith data (t = 0.79, p =.44)  This is a within-subject experiment, so we can subtract the recall scores (winter minus summer) and see whether the difference is zero or not.

NB: Rouder’s test also gives BF 01 = 6.08 Dr. Smith Data: Non-directional Test

Dr. Smith Data: SMM Directional Test This means that p(H 0 |D) is about 0.93 (in case H 0 and H 1 are equally likely a priori)

Dr. Smith Data: Directional Test Inconsistent with SMM’s prediction

Helping Dr. Smith (This Time for Real)  According to our t-test, the data are about 14 times more likely under the null hypothesis than under the SMM hypothesis.  This is generally considered strong evidence in favor of the null.  Note that we have quantified evidence in favor of the null, something that is impossible in p-value hypothesis testing.