Inference about Population Parameters: Hypothesis Testing

Slides:



Advertisements
Similar presentations
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Advertisements

Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Inference Sampling distributions Hypothesis testing.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Chapter 10: Hypothesis Testing
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Stat Day 16 Observations (Topic 16 and Topic 14)
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
AM Recitation 2/10/11.
Chapter 10 Hypothesis Testing
Overview Definition Hypothesis
Confidence Intervals and Hypothesis Testing - II
Fundamentals of Hypothesis Testing: One-Sample Tests
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.
Lesson Carrying Out Significance Tests. Vocabulary Hypothesis – a statement or claim regarding a characteristic of one or more populations Hypothesis.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
Essential Statistics Chapter 131 Introduction to Inference.
INTRODUCTION TO INFERENCE BPS - 5th Ed. Chapter 14 1.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Statistical Inference
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Chapter 10.2 TESTS OF SIGNIFICANCE.
Confidence intervals and hypothesis testing Petter Mostad
CHAPTER 9 Testing a Claim
The z test statistic & two-sided tests Section
Chapter 221 What Is a Test of Significance?. Chapter 222 Thought Question 1 The defendant in a court case is either guilty or innocent. Which of these.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
CHAPTER 9 Testing a Claim
Unit 8 Section 8-3 – Day : P-Value Method for Hypothesis Testing  Instead of giving an α value, some statistical situations might alternatively.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
© Copyright McGraw-Hill 2004
Chapter 9 Day 2 Tests About a Population Proportion.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.2 Tests About a Population.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Section 9.1 First Day The idea of a significance test What is a p-value?
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
Chapter Nine Hypothesis Testing.
CHAPTER 9 Testing a Claim
Lecture Nine - Twelve Tests of Significance.
CHAPTER 9 Testing a Claim
STA 291 Spring 2010 Lecture 18 Dustin Lueker.
Statistical inference: distribution, hypothesis testing
Chapter 9 Hypothesis Testing.
Daniela Stan Raicu School of CTI, DePaul University
Essential Statistics Introduction to Inference
Daniela Stan Raicu School of CTI, DePaul University
STA 291 Spring 2008 Lecture 18 Dustin Lueker.
Intro to Confidence Intervals Introduction to Inference
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
STA 291 Summer 2008 Lecture 18 Dustin Lueker.
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

Inference about Population Parameters: Hypothesis Testing Sampling distributions can be used to infer about population parameters: Confidence intervals (interval estimates of the parameter) Hypothesis testing (today) Hypothesis testing involves assessing the evidence in the sample for some claim (hypothesis) about the value(s) of an unknown population parameter(s). General logic: What's the probability of observing the sample data, if the hypothesis were true? If this probability is very small, we reject the hypothesis. Otherwise we fail to reject it. e.g. A playground basketball player claims he can make 80% of his free throws. “Show me”, you say. He shoots 20 throws and makes 8 of them. Do you believe in his 80% claim? What's the intuition behind your decision?

The Null and Alternative Hypotheses The Null Hypothesis, H0 : The claim about the population parameter(s) we want to test. Examples of H0: The population proportion/mean equals certain value No difference in the means of two groups No relationship between two variables The “alternative hypothesis”, Ha, is the alternative statement about the population parameter(s) instead of H0. Often Ha is what we suspect or hope is true.

Elements of Hypothesis Testing Determine the two hypotheses Compute the observed value of the test statistic, which is some sample statistic (such as sample proportion or sample mean) that has a known sampling distribution under the null hypothesis. Using this sampling distribution, determine how (un)likely it is to observe the test statistic taking values as extreme as observed or more so, if the null hypothesis were true (i.e., find the “p-value”) Make a decision/conclusion based on the p-value: based on the observed sample statistic, shall we reject the null hypothesis? (Or, do we consider the computed p-value “small” enough?) e.g. The basketball case (testing hypothesis about a proportion) You may want to explore http://www.whfreeman.com/scc7e

Testing Hypotheses about a Population Proportion: Stating the hypotheses Null: H0: p=p0 One sided alternatives (for “one tailed test”) Ha: p>p0 Ha: p<p0 Two sided alternative (for “two tailed test”) Ha: pp0

Testing Hypotheses about a Population Proportion: Computing the test statistic and determining its sampling distribution under the null hypothesis Quite intuitively, our test statistic for testing the population proportion is the sample proportion, Recall the sampling distribution of the sample proportion: approximately normal with mean p and standard deviation Under the null hypothesis (i.e., assuming the null hypothesis is true), p=p0, so the sample proportion follows a normal distribution with p replaced by p0 Which means:

Testing Hypotheses about a Population Proportion: Finding the p-value of the test statistic Using the sampling distribution of the sample proportion under the null hypothesis, we can find out the probability of observing a sample proportion value as extreme as in our data or more so. A small p-value indicates that the observed data (or relationship) is unlikely to have occurred if the null hypothesis were actually true The p-value tends to be small when there is evidence in the data against the null hypothesis

Finding the P-value for Testing Proportions For one tailed test, the p-value is the area in the right tail if the alternative hypothesis takes the form Ha: p>p0 i.e., The p- value is the probability of getting a value as large or larger than the observed test statistic value. (Left tail if Ha: p<p0 ) For two tailed test, the p-value is the sum of the areas in both tails. i.e., The p-value is the probability of getting a value as far away (or farther) from the mean as the absolute value of the observed test statistic (z) value. (Doubles the p-value from one tailed test). Using Stata: Statistics-->Summaries, tables, and tests --> classical tests of hypotheses, or “prtest”

P-value for Testing Proportions: H0: p=p0; Ha: p>p0 Example: p0=0.5, n=50, # success=28 (or 36)

P-value for Testing Proportions: H0: p=p0; Ha: pp0 Example: p0=0.5, n=4040, # success=2048

Testing Hypotheses about a Population Proportion: Making a decision based on the p-value If we think the P-value is too low for the observed test statistic to lend support/evidence for the null hypothesis, then we would reject the null hypothesis; otherwise, we fail to reject the null hypothesis. When the null hypothesis takes the form of having no relationship or no difference, our rejection of it would mean that we found evidence that there is a statistically significant relationship/difference. In general, we say that:

Typical Cut-off for the P-value So how small is “small”? Typically, a p-value less than 0.05 is considered to be small enough to lead us to reject the null hypothesis. Some researchers use 0.10 or 0.01 as the cut-off instead of 0.05. This “cut-off” value is typically referred to as the significance level  of the test.  is also referred to as the “type I error” of the test. It is the probability of rejecting the null hypothesis when it is true. If we decide not to reject the null, we risk the “type II error”, which is the probability of failing to reject the null when it is false (its size depends on a, as well as the “true” parameter value and sample size.) (1-type II error) is defined as the power of the test. Recommendation: always report the actual p-value, in addition to (or even instead of) reporting “significance at the .05 level”.

Testing Hypotheses about a Population Mean The idea is exactly the same as the proportion case. The test statistic now involves the sample mean, which has a slightly different form of sampling distribution from the proportion, as we have seen last time. That's all. Same logic, same procedure e.g. National Assessment of Educational Progress (NAEP) quantitative test scores for young men aged 21-25. (Score can range from 0-500). Says a score of 275 or higher reflects the skills needed to balance a check book. A random sample of 840 men 21-25 yrs of age has a sample mean of 272, and sample s.d. Of 59. Do we have enough evidence to say that the mean score in the population of all 9.5 million young men in this age group is less than 275? H0: m=275; Ha: m<275 Under H0, the sample mean follows N(275, 59/sqrt(840))=N(275,2.036). Z=(272-275)/2.036=-1.47

Testing Hypotheses about a Population Mean The shaded area is about .07, which is the p-value. Would you reject the null hypothesis?

Testing Hypotheses about a Population Mean: a two tailed test example Executives' blood pressure different from the national mean of 128? H0: m=128; Ha: m128. Sample mean is 126.1, s=15.2, n=72. Z=-1.06. The shaded area is about .27, which is the p-value. Would you reject the null hypothesis?

Comparing two Proportions (or Means) The logic and procedure are exactly the same as the single parameter case, only that now we need to make use of the sampling distribution of the difference between two proportions (or means), which turns out also to be normal, with mean being the difference of the population proportions (or means), and standard deviation computable through a simple formula Under the null hypothesis that the two population proportions are the same, the mean of the sampling distribution is 0. Thus, upon observing two sample proportions/means, one can take the difference of them, and use the sampling distribution to find out the p-value under the null hypothesis, then make a decision to reject the null or not. In Stata, choose “two sample” tests. Or use by() option for “prtest” or “ttest” e.g., gss2002.dta, prtest vote00, by(born)