Randomization Tests Dr. Kari Lock Morgan PSU 016 11/5/14.

Slides:



Advertisements
Similar presentations
Introducing Hypothesis Tests
Advertisements

Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.
Hypothesis Testing: Intervals and Tests
Bootstrap Distributions Or: How do we get a sense of a sampling distribution when we only have ONE sample?
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Hypothesis Testing: Hypotheses
Introduction to Hypothesis Testing AP Statistics Chap 11-1.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Hypothesis testing Week 10 Lecture 2.
8-2 Basics of Hypothesis Testing
Section 4.4 Creating Randomization Distributions.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution.
Using Bootstrap Intervals and Randomization Tests to Enhance Conceptual Understanding in Introductory Statistics Kari Lock Morgan Department of Statistical.
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Synthesis and Review 3/26/12 Multiple Comparisons Review of Concepts Review of Methods - Prezi Essential Synthesis 3 Professor Kari Lock Morgan Duke University.
Building Conceptual Understanding of Statistical Inference Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University
Lecture Slides Elementary Statistics Twelfth Edition
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Tue, Oct 23, 2007.
More Randomization Distributions, Connections
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Understanding the P-value… Really! Kari Lock Morgan Department of Statistical Science, Duke University with Robin Lock, Patti Frazer.
Using Simulation Methods to Introduce Inference Kari Lock Morgan Duke University In collaboration with Robin Lock, Patti Frazer Lock, Eric Lock, Dennis.
The problem of sampling error It is often the case—especially when making point predictions—that what we observe differs from what our theory predicts.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Significance STAT 101 Dr. Kari Lock Morgan 9/27/12 SECTION 4.3 Significance level Statistical.
Chapter 9 Hypothesis Testing: Single Population
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
Overview Basics of Hypothesis Testing
Hypothesis Testing: p-value
Chapter 10 Hypothesis Testing
STA Statistical Inference
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 101 Dr. Kari Lock Morgan 10/18/12 Chapter 5 Normal distribution Central limit theorem.
Using Randomization Methods to Build Conceptual Understanding of Statistical Inference: Day 2 Lock, Lock, Lock Morgan, Lock, and Lock MAA Minicourse- Joint.
Testing Theories: The Problem of Sampling Error. The problem of sampling error It is often the case—especially when making point predictions—that what.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
+ Chi Square Test Homogeneity or Independence( Association)
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
AP Statistics Section 11.1 B More on Significance Tests.
AP STATISTICS LESSON (DAY 1) INFERENCE FOR TWO – WAY TABLES.
Early Inference: Using Randomization to Introduce Hypothesis Tests Kari Lock, Harvard University Eric Lock, UNC Chapel Hill Dennis Lock, Iowa State Joint.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan SECTION 4.1 Hypothesis test Null and alternative.
© Copyright McGraw-Hill 2004
Statistics: Unlocking the Power of Data Lock 5 Section 4.2 Measuring Evidence with p-values.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.
Introduction to Hypothesis Testing
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Section 10.4: Hypothesis Tests for a Population Mean.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
Level of Significance Level of significance Your maximum allowable probability of making a type I error. – Denoted by , the lowercase Greek letter alpha.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 p-value.
Statistics: Unlocking the Power of Data Lock 5 Section 4.1 Introducing Hypothesis Tests.
9.3 Hypothesis Tests for Population Proportions
Randomization Tests PSU /2/14.
Introducing Hypothesis Tests
Measuring Evidence with p-values
Hypothesis Testing: Hypotheses
Two-sided p-values (1.4) and Theory-based approaches (1.5)
Introducing Hypothesis Tests
Using Simulation Methods to Introduce Inference
Using Simulation Methods to Introduce Inference
Comparing Proportions for Multiple Populations
Presentation transcript:

Randomization Tests Dr. Kari Lock Morgan PSU 016 11/5/14

Extrasensory Perception Is there such a thing as extrasensory perception (ESP) or a “sixth sense”? Do you believe in ESP?

Extrasensory Perception One way to test for ESP is with Zener cards: Subjects draw a card at random and telepathically communicate this to someone who then guesses the symbol

Extrasensory Perception Let’s do our own study! Make your own Zener cards: Randomly choose a symbol Find a partner, telepathically communicate your symbol (no auditory or visual clues!), and have them guess your symbol. Switch roles. Did you guess correctly?

Extrasensory Perception There are five cards with five different symbols If there is no such thing as ESP, what proportion of guesses should be correct? Because there are 5 cards, each person has a 1/5 chance of guessing correctly each time, if ESP does not exist. H0: p = 1/5 Ha: p > 1/5

Extrasensory Perception Statistics vary from sample to sample: even if the population proportion is 1/5, not every sample proportion will be exactly 1/5 How do we determine when a sample proportion is far enough above 1/5 to provide evidence of ESP? More general: How do we determine when a sample statistic is far enough away from H0 to be statistically significant?

SIMULATE what would happen if H0 were true! Key Question How unusual is it to see a sample statistic as extreme as that observed, if H0 is true? How do we know how unusual a sample statistic would be if H0 were true? SIMULATE what would happen if H0 were true!

ESP: Simulate! How could we simulate what would happen, just by random chance, if the null hypotheses were true for the ESP experiment? Randomly choose a symbol. Return it to the rest, shuffle, and choose again for the (random) guess. Did you (randomly) get the correct symbol?

www.lock5stat.com/statkey Lots of simulations! We need many more simulations! www.lock5stat.com/statkey

ESP – Random Chance Are our results statistically significant? What can we conclude?

Randomization Distribution A randomization distribution is a collection of statistics from samples simulated assuming the null hypothesis is true

p-value The p-value is the chance of obtaining a sample statistic as extreme as (or more extreme than) the observed sample statistic, if the null hypothesis is true

Calculating a p-value What kinds of statistics would we get, just by random chance, if the null hypothesis were true? (randomization distribution) What proportion of these statistics are as extreme as our original sample statistic? (p-value)

Proportion as extreme as observed statistic ESP p-value Distribution of statistics that would be observed, just by random chance, if H0 true p-value = 0.247 Proportion as extreme as observed statistic p-value If you were all just guessing randomly, the chance of us getting a sample proportion as high as 0.294 is 0.247. observed statistic

Cocaine Addiction In a randomized experiment on treating cocaine addiction, 48 people were randomly assigned to take either Desipramine (a new drug), or Lithium (an existing drug), and then followed to see who relapsed Is Desipramine better than Lithium at treating cocaine addiction? pD, pL: proportion of cocaine 𝑝 𝐷 addicts who relapse after taking Desipramine or Lithium, respectively H0: pD = pL Ha: pD < pL

R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R 1. Randomly assign units to treatment groups Desipramine Lithium R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R

3. Observe relapse counts in each group 2. Conduct experiment 3. Observe relapse counts in each group R = Relapse N = No Relapse 1. Randomly assign units to treatment groups Desipramine Lithium R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R N R N R R R R R R R R R R R R R N N R N N N N N N N N R R R R R R R R R R R R N N N N N N N N N N N N N N N N N N N N N N N N 10 relapse, 14 no relapse 18 relapse, 6 no relapse

Measuring Evidence against H0 To see if a statistic provides evidence against H0, we need to see what kind of sample statistics we would observe, just by random chance, if H0 were true

Cocaine Addiction “by random chance” means by the random assignment to the two treatment groups “if H0 were true” means if the two drugs were equally effective at preventing relapses (equivalently: whether a person relapses or not does not depend on which drug is taken) Simulate what would happen just by random chance, if H0 were true…

R R R R R R R R R R R R R R R R N N R R R R R R N N N N N N R R R R R R N N N N N N N N N N N N 10 relapse, 14 no relapse 18 relapse, 6 no relapse

R R R R R R R R R R R R R R R R N N R R R R R R N N N N N N R R R R R R N N N N N N N N N N N N Simulate another randomization Desipramine Lithium R N R N N N N R R R R R R R N R R N N N R N R R R N N R N R R N R N N N R R R N R R R R 16 relapse, 8 no relapse 12 relapse, 12 no relapse

Simulate another randomization Desipramine Lithium R R R R R R R R R R R R R N R R N N R R R R R R R R N R N R R R R R R R R N R N R R N N N N N N 17 relapse, 7 no relapse 11 relapse, 13 no relapse

Cocaine Addiction Why did you re-deal your cards? Shuffle your cards and deal them into two piles. What is your sample difference in proportions? Why did you re-deal your cards? Why did you leave the outcomes (relapse or no relapse) unchanged on each card? You want to know what would happen by random chance if the null hypothesis is true

www.lock5stat.com/statkey Lots of simulations! We need many more simulations! www.lock5stat.com/statkey

Proportion as extreme as observed statistic www.lock5stat.com/statkey Distribution of statistics that would be observed, just by random chance, if H0 true Proportion as extreme as observed statistic p-value observed statistic If the two drugs are equal regarding cocaine relapse rates, we have a 1.3% chance of seeing a difference in proportions as extreme as that observed.

Randomization Test p-values can be calculated by randomization distributions: Create a randomization distribution by simulating statistics you would see, just by random chance, if H0 were true Find the p-value as the proportion of simulated statistics as extreme as the observed statistic This idea works for any parameter!

Your Turn! Correlation NFL Teams Do NFL teams with more malevolent uniforms get more penalty yards? r = 0.43

p-value and Ha H0:  = 0 Upper-tail Ha:  > 0 (Right Tail) 𝑥 =2 𝑥 =−1 Lower-tail (Left Tail) H0:  = 0 Ha:  ≠ 0 𝑥 =2 Two-tailed

Summary p-values can be calculated by randomization distributions: Create a randomization distribution by simulating statistics you would see, just by random chance, if H0 were true Find the p-value as the proportion of simulated statistics as extreme as the observed statistic