Chapter 5 Hypothesis Testing
5.1 - Introduction Definition Hypothesis testing is a formal approach for determining if data from a sample support a claim about a population. 1.State the null and alternative hypotheses 2.Calculate the test statistic 3.Find the critical value (or calculate the P-value) 4.State the technical conclusion 5.State the final conclusion
The Process Claim: More than half the students at a particular university are from out of state Step 1: State the null and alternative hypotheses – Define the parameter p = The proportion of all students at the university who are from out of state – State the hypotheses H 0 : p = 0.50 H 1 : p > 0.50
The Process
Step 3: Find the critical value – At the 95% confidence level,
The Process Step 4: State the technical conclusion – One of two statements: Reject H 0 or Do not reject H 0 – Reject H 0 if the test statistic falls into the critical region and do not reject H 0 otherwise – z = 2.36, critical region: [1.645, ∞) Reject H 0
The Process Step 5: State the final conclusion – The data support the claim
P-value Method Criteria 1.Reject H 0 if P-value ≤ α 2.Do not reject H 0 if P-value > α
P-value Method
5.2 – Testing Claims about a Proportion
1-Proportion Z-Test The test statistic is The critical value is a z-score and the P-value is an area under the standard normal density curve.
1-Proportion Z-Test
Example A student claims that less than 25% of plain M&M candies are red. A random sample of 195 candies contains 37 red candies. Use this data to test the claim at the 0.05 significance level. 1.The parameter about which the claim is made is p = The proportion of all M&M candies that are red
Example 5.2.1
Types of Errors
5.3 – Testing Claims about a Mean
T-Test Requirements 1.The sample is random 2.Either the population is normally distributed or n > 30
Example 5.3.3
5.4 – Comparing Two Proportions
2-Proportion Z-Test Requirements 1.Both samples are random and independent 2.In both samples, the conditions for a binomial distribution are satisfied 3.In both samples, there are at least 5 successes and 5 failures
Example In a survey of voters in the 2008 Texas Democrat primary, 54% of the 1167 females voted for Hillary Clinton while 47% of the 881 males voted for Clinton (data from March 5, 2008). Use this data to test the claim that of the voters in the 2008 Texas Democrat primary, the proportion of females who voted for Clinton is higher than the proportionof males who voted for Clinton.
Example 5.4.1
5.5 – Comparing Two Variances
F-Test
Example 5.5.1
1.The parameters about which the claim is made are 2.Test statistic:
Example 5.5.1
5.6 – Comparing Two Means
Equal Population Variances
Unequal Population Variances Test statistic: The critical value is a t-score with r degrees of freedom where r is If r is not an integer, then round it down to the nearest whole number
Requirements 1.Both samples are random 2.The samples are independent 3.Both populations are normally distributed or both sample sizes are greater than 30
Example 5.6.1
1.The parameters about which the claim is made are 2.Assume equal population variances. Test statistic:
Example 5.6.1
Paired T-Test
Example At a large university, freshman students are required to take an introduction to writing class. Students are given a survey on their attitudes toward writing at the beginning and end of the class. Each student receives a score between 0 and 100 (the higher the score, the more favorable his or her attitude toward writing). Test the claim that the scores improve from the beginning to the end.
Example 5.6.3
4.Critical region: [1.860, ∞) – Reject H 0 5.Final conclusion: The data support the claim
5.7 – Goodness-of-fit Test
Chi-square Goodness-of-fit Test
Example A student simulated dandelions in a lawn by randomly placing 300 dots on a piece of paper with an area of 100 in 2. He then randomly chose 75 different 1 in 2 sections of paper and counted the number of dots in each section.
Example 5.7.3
5.8 – Test of Independence Two students want to determine if their university men’s basketball team benefits from home-court advantage. They randomly select 205 games played by the team and record if each one was played at home or away and if the team won or lost (data collected by Emily Hudgins and Courtney Santistevan, 2009). “Contingency Table”
Chi-square Test of Independence
Test of Independence
Example
5.9 – One-way ANOVA A seed company plants four types of new corn seed on several plots of land and records the yield (in bushels/acre) of each plot as shown below. Test the claim that the four types of seed produce the same mean yield.
One-way ANOVA
Example 5.9.3
5.10 – Two-way ANOVA Randomized Block Design A statistics professor is comparing four different delivery methods for her introduction to statistics class: face-to- face, online, hybrid, and video (called the treatments). – She divides the population of students into three groups according to their overall GPA: high, middle, and low. (called blocks) – She randomly chooses four students from each block and randomly assigns each one to a class using one of the delivery methods – At the end of the semester she records each student’s overall grade
Two-way ANOVA Two questions: 1.Do the four treatments have the same population mean? 2.Do the blocks have any affect on the scores?
Two-way ANOVA
ANOVA Table P-value = – Do not reject H 0 – There is not a statistically significant difference between the treatment means P-value = – Reject H B – There is a statistically significant difference in the block effects
Factorial Experiment The statistics professors randomly chooses 16 students from each GPA level, randomly assigns four to each delivery method, and records their scores at the end of the semester. Three questions: 1.Is there any difference between the population mean scores of the delivery methods? 2.Does the GPA level affect the scores? 3.Does the interaction of the delivery method and GPA level affect the scores?
Factorial Experiment
ANOVA Table P-value = – Do not reject H B – There is not a statistically significant difference between the means of the delivery methods P-value = – Reject H A – There is a statistically significant difference in the effects of the GPA levels
ANOVA Table P-value = – Reject H AB – There is a statistically significant difference in the interaction effects Overall – There is not a “best” method – Consider certain delivery methods to certain GPA levels