Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution.

Slides:



Advertisements
Similar presentations
Introducing Hypothesis Tests
Advertisements

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 10/23/12 Sections , Single Proportion, p Distribution (6.1)
Hypothesis Testing: Intervals and Tests
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Hypothesis Testing: Hypotheses
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Chapter 8 Introduction to Hypothesis Testing
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
7/2/2015Basics of Significance Testing1 Chapter 15 Tests of Significance: The Basics.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 9 Hypothesis Testing.
Inference for Categorical Variables 2/29/12 Single Proportion, p Distribution Intervals and tests Difference in proportions, p 1 – p 2 One proportion or.
Confidence Intervals and Hypothesis Tests
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Randomization Tests Dr. Kari Lock Morgan PSU /5/14.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Chapter 8 Hypothesis Testing : An Introduction.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
More Randomization Distributions, Connections
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
Hypothesis Testing: p-value
Hypothesis Testing for Proportions
LECTURE 19 THURSDAY, 14 April STA 291 Spring
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Chapter 20 Testing hypotheses about proportions
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Significance Test A claim is made. Is the claim true? Is the claim false?
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics: Unlocking the Power of Data Lock 5 Bootstrap Intervals Dr. Kari Lock Morgan PSU /12/14.
Chapter 20 Testing Hypothesis about proportions
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Rejecting Chance – Testing Hypotheses in Research Thought Questions 1. Want to test a claim about the proportion of a population who have a certain trait.
Chapter 21: More About Tests
Logic and Vocabulary of Hypothesis Tests Chapter 13.
AP Statistics Section 11.1 B More on Significance Tests.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Statistics: Unlocking the Power of Data Lock 5 Section 4.2 Measuring Evidence with p-values.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
Introduction to Hypothesis Testing
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Tests of Significance: The Basics ESS chapter 15 © 2013 W.H. Freeman and Company.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan SECTION 7.1 Testing the distribution of a single categorical variable : 
Section 10.4: Hypothesis Tests for a Population Mean.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 p-value.
Statistics: Unlocking the Power of Data Lock 5 Section 4.1 Introducing Hypothesis Tests.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.
+ Homework 9.1:1-8, 21 & 22 Reading Guide 9.2 Section 9.1 Significance Tests: The Basics.
Randomization Tests PSU /2/14.
Unit 5: Hypothesis Testing
When we free ourselves of desire,
Two-sided p-values (1.4) and Theory-based approaches (1.5)
Making Data-Based Decisions
Significance Tests: The Basics
Presentation transcript:

Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution p-value

Statistics: Unlocking the Power of Data Lock 5 “If “sexy” means having rare qualities that are much in demand, data scientists are already there. They are difficult and expensive to hire and, given the very competitive market for their services, difficult to retain.”

Statistics: Unlocking the Power of Data Lock 5 A Note on Random Samples Why do we take random samples? If we have access to data from the entire population, why would we take a random sample? For Project 1, if you have access to data on the entire population, USE IT! The methods of inference of no longer needed (mention this in your paper and explain why), but do a CI and test anyway just to prove you can…

Statistics: Unlocking the Power of Data Lock 5 Paul the Octopus

Statistics: Unlocking the Power of Data Lock 5 Key Question If it is very unusual, we have statistically significant evidence against the null hypothesis Today’s Question: How do we measure how unusual a sample statistic is, if H 0 is true? How unusual is it to see a sample statistic as extreme as that observed, if H 0 is true?

Statistics: Unlocking the Power of Data Lock 5 To see if a statistic provides evidence against H 0, we need to see what kind of sample statistics we would observe, just by random chance, if H 0 were true Measuring Evidence against H 0

Statistics: Unlocking the Power of Data Lock 5 Paul the Octopus We need to know what kinds of statistics we would observe just by random chance, if the null hypothesis were true How could we figure this out??? Simulate many samples of size n = 8 with p = 0.5

Statistics: Unlocking the Power of Data Lock 5 Simulate! We can simulate this with a coin! Each coin flip = a guess between two teams (Heads = correct, Tails = incorrect) Flip a coin 8 times, count the number of heads, and calculate the sample proportion of heads Come to the board to add your sample proportion to a class dotplot How extreme is Paul’s sample proportion of 1?

Statistics: Unlocking the Power of Data Lock 5 Paul the Octopus a) Yes b) No

Statistics: Unlocking the Power of Data Lock 5 Randomization Distribution A randomization distribution is a collection of statistics from samples simulated assuming the null hypothesis is true The randomization distribution shows what types of statistics would be observed, just by random chance, if the null hypothesis were true

Statistics: Unlocking the Power of Data Lock 5 Lots of simulations! For a better randomization distribution, we need many more simulations!

Statistics: Unlocking the Power of Data Lock 5 Randomization Distribution

Statistics: Unlocking the Power of Data Lock 5 Paul the Octopus a) Yes b) No

Statistics: Unlocking the Power of Data Lock 5 Key Question A randomization distribution tells us what kinds of statistics we would see just by random chance, if the null hypothesis is true This makes it straightforward to assess how extreme the observed statistic is! How unusual is it to see a sample statistic as extreme as that observed, if H 0 is true?

Statistics: Unlocking the Power of Data Lock 5 What about ESP? How could we simulate what would happen, just by random chance, if the null hypotheses were true for the ESP experiment? Roll a die. 1 = “correct letter” 2-5 = “wrong letter” 6 = roll again Did you get the correct letter? (a) Yes (b) No

Statistics: Unlocking the Power of Data Lock 5 ESP Randomization Distribution

Statistics: Unlocking the Power of Data Lock 5 ESP Based on the randomization distribution, do you think the results of our class ESP experiment (24/85 = 0.29) are statistically significant? a) Yes b) No

Statistics: Unlocking the Power of Data Lock 5 ESP What does this imply about ESP? a) Evidence that ESP exists b) Evidence that ESP does not exist c) Impossible to tell

Statistics: Unlocking the Power of Data Lock 5 Quantifying Evidence We need a way to quantify evidence against the null…

Statistics: Unlocking the Power of Data Lock 5 p-value The p-value is the chance of obtaining a sample statistic as extreme (or more extreme) than the observed sample statistic, if the null hypothesis is true The p-value can be calculated as the proportion of statistics in a randomization distribution that are as extreme (or more extreme) than the observed sample statistic

Statistics: Unlocking the Power of Data Lock 5 p-value

Statistics: Unlocking the Power of Data Lock Simulations p-value = If Paul is just guessing, the chance of him getting all 8 correct is p-value Proportion as extreme as observed statistic observed statistic

Statistics: Unlocking the Power of Data Lock 5 1. What kinds of statistics would we get, just by random chance, if the null hypothesis were true? (randomization distribution) 2. What proportion of these statistics are as extreme as our original sample statistic? (p-value) Calculating a p-value

Statistics: Unlocking the Power of Data Lock 5 p-value

Statistics: Unlocking the Power of Data Lock 5 ESP p-value p-value = If you were all just guessing randomly, the chance of us getting a sample proportion of is p-value observed statistic Proportion as extreme as observed statistic

Statistics: Unlocking the Power of Data Lock 5 Death Penalty A random sample of people were asked “Are you in favor of the death penalty for a person convicted of murder?” Did the proportion of Americans who favor the death penalty decrease from 1980 to 2010? YesNo “Death Penalty,” Gallup,

Statistics: Unlocking the Power of Data Lock 5 Death Penalty How extreme is 0.02, if p 1980 = p 2010 ? p 1980, p 2010 : proportion of Americans who favor the death penalty in 1980, 2010 H 0 : p 1980 = p 2010 H a : p 1980 > p 2010 StatKey YesNo

Statistics: Unlocking the Power of Data Lock 5 Death Penalty p – value = If proportion supporting the death penalty has not changed from 1980 to 2010, we would see differences this extreme about 16% of the time.

Statistics: Unlocking the Power of Data Lock 5 p-value Using the randomization distribution below to test H 0 :  = 0 vs H a :  > 0 Match the sample statistics: r = 0.1, r = 0.3, and r = 0.5 With the p-values: 0.005, 0.15, and 0.35 Which sample statistic goes with which p-value?

Statistics: Unlocking the Power of Data Lock 5 A one-sided alternative contains either > or < A two-sided alternative contains ≠ The p-value is the proportion in the tail in the direction specified by H a For a two-sided alternative, the p-value is twice the proportion in the smallest tail Alternative Hypothesis

Statistics: Unlocking the Power of Data Lock 5 p-value and H a Upper-tail (Right Tail) Lower-tail (Left Tail) Two-tailed

Statistics: Unlocking the Power of Data Lock 5 Sleep versus Caffeine Recall the sleep versus caffeine experiment from last class  s and  c are the mean number of words recalled after sleeping and after caffeine. H 0 :  s =  c H a :  s ≠  c Let’s find the p-value! Two-tailed alternative

Statistics: Unlocking the Power of Data Lock 5 Sleep or Caffeine for Memory? p-value = 2 × = 0.044

Statistics: Unlocking the Power of Data Lock 5 p-value Using the randomization distribution below to test H 0 :  = 0 vs H a :  > 0 Which sample statistic shows the most evidence for the alternative hypothesis? r = 0.1, r = 0.3, or r = 0.5 Therefore, which p-value shows the most evidence for the alternative hypothesis? 0.35, 0.15, or 0.005

Statistics: Unlocking the Power of Data Lock 5 p-value and H 0 If the p-value is small, then a statistic as extreme as that observed would be unlikely if the null hypothesis were true, providing significant evidence against H 0 The smaller the p-value, the stronger the evidence against the null hypothesis and in favor of the alternative

Statistics: Unlocking the Power of Data Lock 5 The smaller the p-value, the stronger the evidence against H o. p-value and H 0

Statistics: Unlocking the Power of Data Lock 5 Which of the following p-values gives the strongest evidence against H 0 ? a) b) 0.1 c) 0.32 d) 0.56 e) 0.94 p-value and H 0 The lower the p-value, the stronger the evidence against the null hypothesis.

Statistics: Unlocking the Power of Data Lock 5 Which of the following p-values gives the strongest evidence against H 0 ? a) 0.22 b) 0.45 c) 0.03 d) 0.8 e) 0.71 p-value and H 0 The lower the p-value, the stronger the evidence against the null hypothesis.

Statistics: Unlocking the Power of Data Lock 5 Two different studies obtain two different p- values. Study A obtained a p-value of and Study B obtained a p-value of 0.2. Which study obtained stronger evidence against the null hypothesis? a) Study A b) Study B p-value and H 0 The lower the p-value, the stronger the evidence against the null hypothesis.

Statistics: Unlocking the Power of Data Lock 5 Summary The randomization distribution shows what types of statistics would be observed, just by random chance, if the null hypothesis were true A p-value is the chance of getting a statistic as extreme as that observed, if H 0 is true A p-value can be calculated as the proportion of statistics in the randomization distribution as extreme as the observed sample statistic The smaller the p-value, the greater the evidence against H 0

Statistics: Unlocking the Power of Data Lock 5 To Do Read Section 4.2 Idea and data for Project 1 (proposal due 9/27)Project 1proposal Do Homework 4 (due Thursday, 10/4)Homework 4