Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.

Slides:



Advertisements
Similar presentations
Statistics Hypothesis Testing.
Advertisements

Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Hypothesis Testing: Intervals and Tests
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Inference Sampling distributions Hypothesis testing.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Significance STAT 101 Dr. Kari Lock Morgan SECTION 4.3, 4.5 Significance level (4.3)
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Chapter 9 Hypothesis Testing.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Significance STAT 250 Dr. Kari Lock Morgan SECTION 4.3 Significance level (4.3) Statistical.
Determining Statistical Significance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution.
Statistical hypothesis testing – Inferential statistics I.
Statistical Inference Decision Making (Hypothesis Testing) Decision Making (Hypothesis Testing) A formal method for decision making in the presence of.
Overview Definition Hypothesis
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Hypothesis testing is used to make decisions concerning the value of a parameter.
Testing Hypotheses About Proportions
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 20 Testing Hypotheses About Proportions.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Significance STAT 101 Dr. Kari Lock Morgan 9/27/12 SECTION 4.3 Significance level Statistical.
Apr. 8 Stat 100. To do Read Chapter 21, try problems 1-6 Skim Chapter 22.
Statistical Inference Decision Making (Hypothesis Testing) Decision Making (Hypothesis Testing) A formal method for decision making in the presence of.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis testing Chapter 9. Introduction to Statistical Tests.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
Chapter 11 Testing Hypotheses about Proportions © 2010 Pearson Education 1.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. CHAPTER 12 Significance Tests About Hypotheses TESTING HYPOTHESES ABOUT PROPORTIONS.
Chapter 20 Testing hypotheses about proportions
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Hypothesis Testing State the hypotheses. Formulate an analysis plan. Analyze sample data. Interpret the results.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 20 Testing Hypothesis about proportions
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
CHAPTER 9 Testing a Claim
Chapter 21: More About Tests
Logic and Vocabulary of Hypothesis Tests Chapter 13.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
Statistics: Unlocking the Power of Data Lock 5 Section 4.2 Measuring Evidence with p-values.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
6.2 Large Sample Significance Tests for a Mean “The reason students have trouble understanding hypothesis testing may be that they are trying to think.”
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Chapter 20 Testing Hypotheses About Proportions. confidence intervals and hypothesis tests go hand in hand:  A confidence interval shows us the range.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 p-value.
Hypothesis Tests Hypothesis Tests Large Sample 1- Proportion z-test.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Statistics 20 Testing Hypothesis and Proportions.
+ Homework 9.1:1-8, 21 & 22 Reading Guide 9.2 Section 9.1 Significance Tests: The Basics.
Module 10 Hypothesis Tests for One Population Mean
STA 291 Spring 2010 Lecture 18 Dustin Lueker.
Testing Hypotheses About Proportions
Testing Hypotheses About Proportions
STA 291 Spring 2008 Lecture 18 Dustin Lueker.
STA 291 Summer 2008 Lecture 18 Dustin Lueker.
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Today: Hypothesis testing p-value

Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is this evidence that Paul’s chance of guessing correctly, p, is really greater than 50%? What are the null and alternative hypotheses? a) H 0 : p ≠ 0.5, H 1 : p = 0.5 b) H 0 : p = 0.5, H 1 : p ≠ 0.5 c) H 0 : p = 0.5, H 1 : p > 0.5 d) H 0 : p > 0.5, H 1 : p = 0.5

For testing H 0 : p = 0.5, H 1 : p > 0.5. With sample size of n. Which of these sample statistics do you think is statistically significant against the null hypothesis? Statistical Evidence

For testing H 0 : p = 0.5, H 1 : p > 0.5. With sample size of n. Which of these sample statistics do you think is statistically significant against the null hypothesis? Statistical Evidence

If it is very unusual, we have statistically significant evidence against the null hypothesis Today’s Question: How do we measure how unusual a sample statistic is, if H 0 is true? How unusual is it to see a sample statistic as extreme as that observed, if H 0 is true?

To see if a statistic provides evidence against H 0, we need to see what kind of sample statistics we would observe, just by random chance, if H 0 were true Measuring Evidence against H 0

We need to know what kinds of statistics we would observe just by random chance, if the null hypothesis were true How could we figure this out??? Simulate many samples of size n = 8 with p = 0.5 Example: Paul the Octopus

Simulate! We can simulate this with a coin! Each coin flip = a guess between two teams (Heads = correct, Tails = incorrect) Flip a coin 8 times, count the number of heads, and calculate the sample proportion of heads Did you get all 8 heads (correct)? (a) Yes(b) No How extreme is Paul’s sample proportion of 1?

Simulate the statistics we would observe just, if the null hypothesis were true

Quantifying Evidence We need a way to quantify evidence against the null…

p-value The p-value is the chance of obtaining a sample statistic as extreme (or more extreme) than the observed sample statistic, if the null hypothesis is true The p-value can be calculated as the proportion of statistics in the simulated distribution that are as extreme (or more extreme) than the observed sample statistic

p-value: Paul the Octopus

1000 Simulations p-value = If Paul is just guessing, the chance of him getting all 8 correct is p-value Proportion as extreme as observed statistic observed statistic

1.What kinds of statistics would we get, just by random chance, if the null hypothesis were true? (We looked at the simulated distribution) 2.What proportion of these statistics are as extreme as our original sample statistic? (p-value) Calculating a p-value

A one-sided alternative contains either > or < A two-sided alternative contains ≠ The p-value is the proportion in the tail in the direction specified by H 1 For a two-sided alternative, the p-value is twice the proportion in a single tail Alternative Hypothesis

p-value and H 1 Upper-tail (Right Tail) Lower-tail (Left Tail) Two-tailed

p-value and H 0 If the p-value is small, then a statistic as extreme as that observed would be unlikely if the null hypothesis were true, providing significant evidence against H 0 The smaller the p-value, the stronger the evidence against the null hypothesis and in favor of the alternative

The smaller the p-value, the stronger the evidence against H o. p-value and H 0

Formal Decisions If the p-value is small: REJECT H 0 the sample would be extreme if H 0 were true the results are statistically significant we have evidence for H 1 If the p-value is not small: DO NOT REJECT H 0 the sample would not be too extreme if H 0 were true the results are not statistically significant the test is inconclusive; either H 0 or H 1 may be true

H 0 : X is not guilty H 1 : X is guilty If the evidence proves the defendant guilty beyond a reasonable doubt, the verdict is 'guilty'. If the evidence is not enough to prove the defendant guilty, a 'not guilty' verdict does not mean the judge or jury concluded that the defendant is innocent -- it just means that the evidence was not strong enough to persuade the judge or jury that the defendant was guilty. Guilty or not Guilty

“For the logical fallacy of believing that a hypothesis has been proved to be true, merely because it is not contradicted by the available facts, has no more right to insinuate itself in statistical than in other kinds of scientific reasoning…” -Sir R. A. Fisher Never Accept H 0 “Do not reject H 0 ” is not the same as “accept H 0 ”! Lack of evidence against H 0 is NOT the same as evidence for H 0 !

Formal Decisions A formal hypothesis test has only two possible conclusions: 1.The p-value is small: reject the null hypothesis in favor of the alternative 2.The p-value is not small: do not reject the null hypothesis How small?

Significance Level The significance level, , the probability that type I error occurs Is the threshold below which the p-value is deemed small enough to reject the null hypothesis p-value <   Reject H 0 p-value >   Do not Reject H 0

Type I Error and Type II Error “Guility or not Guilty” example: Type I error : innocent person go to jail Type II error : unpunished criminals H o : X is not guilty H 1 : X is guilty

Choosing α By default, usually α = 0.05 If a Type I error (rejecting a true null) is much worse than a Type II error, we may choose a smaller α, like α = 0.01 If a Type II error (not rejecting a false null) is much worse than a Type I error, we may choose a larger α, like α = 0.10

Class Project Find an a question that you would like to investigate. (You may find an idea from the news or the web) Answer the following question in 3-5 pages: 1.What is the back ground of the study. 2.Find existing data or collect data. 3.Perform exploratory data analysis 4.Construct your hypothesis regarding the research question 5.Perform a hypothesis test 6.Conclusion

Project Proposal due Oct 30th Submit half-page document: Describe the question that you want to investigate. i.e population parameter of interest. Describe the data that you obtained. If found online, provide the link