BPS - 5th Ed. Chapter 18: Two-Sample Problems



Two-Sample Problems
• The goal of inference is to compare the responses to two treatments or to compare the characteristics of two populations.
• We have a separate sample from each treatment or each population.
  – Each sample is separate. The units are not matched, and the samples can be of differing sizes.

Case Study: Exercise and Pulse Rates
A study is performed to compare the mean resting pulse rate of adult subjects who regularly exercise to the mean resting pulse rate of those who do not regularly exercise. This is an example of when to use the two-sample t procedures.
[Table: n, mean, and std. dev. for Exercisers and Nonexercisers; values not recoverable]

Conditions for Comparing Two Means
• We have two independent SRSs, from two distinct populations.
  – That is, one sample has no influence on the other. Matching violates independence.
  – We measure the same variable for both samples.
• Both populations are Normally distributed.
  – The means and standard deviations of the populations are unknown.
  – In practice, it is enough that the distributions have similar shapes and that the data have no strong outliers.

Two-Sample t Procedures
In order to perform inference on the difference of two means ($\mu_1 - \mu_2$), we'll need the standard deviation of the observed difference $\bar{x}_1 - \bar{x}_2$:
$$\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$

Two-Sample t Procedures
• Problem: We don't know the population standard deviations $\sigma_1$ and $\sigma_2$.
• Solution: Estimate them with $s_1$ and $s_2$. The result is called the standard error, or estimated standard deviation, of the difference in the sample means:
$$SE = \sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}$$
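As a minimal sketch of the standard error of the difference in sample means, the function below computes it from summary statistics. The function name and the example numbers are assumptions made for illustration; they are not taken from the slides.

```python
import math

def standard_error_diff(s1, n1, s2, n2):
    """Estimated standard deviation of (xbar1 - xbar2) for two independent samples."""
    return math.sqrt(s1**2 / n1 + s2**2 / n2)

# Hypothetical summary statistics, assumed purely for illustration.
se = standard_error_diff(s1=9.0, n1=31, s2=8.6, n2=29)
print(f"standard error of the difference: {se:.3f}")
```

Note that the two variances are added, not subtracted, even though the quantity of interest is a difference of means.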

Two-Sample t Confidence Interval
Draw an SRS of size $n_1$ from a Normal population with unknown mean $\mu_1$, and draw an independent SRS of size $n_2$ from another Normal population with unknown mean $\mu_2$. A confidence interval for $\mu_1 - \mu_2$ is:
$$(\bar{x}_1 - \bar{x}_2) \pm t^* \sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}$$
  – Here $t^*$ is the critical value for confidence level C for the t density curve. The degrees of freedom are equal to the smaller of $n_1 - 1$ and $n_2 - 1$.

Case Study: Exercise and Pulse Rates
Find a 95% confidence interval for the difference in population means (nonexercisers minus exercisers).
We are 95% confident that the difference in mean resting pulse rates (nonexercisers minus exercisers) is between 4.35 and […] beats per minute.
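The two-sample t interval can be sketched in a few lines. The sample sizes 31 and 29 come from the case study, but the means and standard deviations below are assumed illustrative values (the original table is not available here), chosen because they are consistent with the lower limit of 4.35 quoted above. The critical value $t^* = 2.048$ is the standard table value for 28 degrees of freedom at 95% confidence and is passed in by hand, since the Python standard library has no t quantile function.

```python
import math

def conservative_df(n1, n2):
    """Conservative degrees of freedom: the smaller of n1 - 1 and n2 - 1."""
    return min(n1 - 1, n2 - 1)

def two_sample_t_interval(xbar1, s1, n1, xbar2, s2, n2, t_star):
    """Confidence interval for mu1 - mu2 given summary statistics and t*."""
    diff = xbar1 - xbar2
    margin = t_star * math.sqrt(s1**2 / n1 + s2**2 / n2)
    return diff - margin, diff + margin

# Assumed summary statistics (nonexercisers first, exercisers second).
df = conservative_df(31, 29)             # 28
low, high = two_sample_t_interval(75.0, 9.0, 31, 66.0, 8.6, 29, t_star=2.048)
print(f"df = {df}, 95% CI: ({low:.2f}, {high:.2f})")
```

Under these assumed inputs the lower limit rounds to 4.35, matching the case study.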

Two-Sample t Significance Tests
Draw an SRS of size $n_1$ from a Normal population with unknown mean $\mu_1$, and draw an independent SRS of size $n_2$ from another Normal population with unknown mean $\mu_2$. To test the hypothesis $H_0: \mu_1 = \mu_2$, the test statistic is:
$$t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}}$$
• Use P-values for the t density curve. The degrees of freedom are equal to the smaller of $n_1 - 1$ and $n_2 - 1$.
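A minimal sketch of the two-sample t statistic follows. The function name is hypothetical, and the means and standard deviations are assumed illustrative values; only the sample sizes 31 and 29 appear in the case study.

```python
import math

def two_sample_t_statistic(xbar1, s1, n1, xbar2, s2, n2):
    """t statistic for H0: mu1 = mu2, without pooling the variances."""
    se = math.sqrt(s1**2 / n1 + s2**2 / n2)
    return (xbar1 - xbar2) / se

# Assumed summary statistics (nonexercisers first, exercisers second).
t = two_sample_t_statistic(75.0, 9.0, 31, 66.0, 8.6, 29)
print(f"t = {t:.3f}")
```

With these assumed inputs the statistic rounds to 3.961, the value used in the case study's P-value calculation.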

P-value for Testing Two Means
• $H_a: \mu_1 > \mu_2$
  – P-value is the probability of getting a value as large as or larger than the observed test statistic (t) value.
• $H_a: \mu_1 < \mu_2$
  – P-value is the probability of getting a value as small as or smaller than the observed test statistic (t) value.
• $H_a: \mu_1 \neq \mu_2$
  – P-value is two times the probability of getting a value as large as or larger than the absolute value of the observed test statistic (t) value.
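The three cases can be sketched with the standard library alone by integrating the t density numerically. This is an illustrative approach under stated assumptions, not what statistical software does internally; the function names and the use of Simpson's rule are choices made here.

```python
import math

def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c) * (1 + x * x / df) ** (-(df + 1) / 2)

def upper_tail(t, df, steps=2000):
    """P(T >= t), using Simpson's rule on [0, |t|] and symmetry about 0."""
    a = abs(t)
    h = a / steps                      # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    tail = 0.5 - total * h / 3         # area beyond |t| on one side
    return tail if t >= 0 else 1 - tail

def p_value(t, df, alternative):
    """P-value for Ha: 'greater', 'less', or 'two-sided'."""
    if alternative == "greater":
        return upper_tail(t, df)
    if alternative == "less":
        return 1 - upper_tail(t, df)
    if alternative == "two-sided":
        return 2 * upper_tail(abs(t), df)
    raise ValueError("alternative must be 'greater', 'less', or 'two-sided'")

print(p_value(3.961, 28, "two-sided"))
```

For the two-sided case the absolute value of t is used, so the sign of the observed statistic does not matter.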

Case Study: Exercise and Pulse Rates
Is the mean resting pulse rate of adult subjects who regularly exercise different from the mean resting pulse rate of those who do not regularly exercise?
• Null: The mean resting pulse rate of adult subjects who regularly exercise is the same as the mean resting pulse rate of those who do not regularly exercise. [$H_0: \mu_1 = \mu_2$]
• Alt: The mean resting pulse rate of adult subjects who regularly exercise is different from the mean resting pulse rate of those who do not regularly exercise. [$H_a: \mu_1 \neq \mu_2$]
Degrees of freedom = 28 (smaller of 31 – 1 and 29 – 1).

Case Study
1. Hypotheses: $H_0: \mu_1 = \mu_2$, $H_a: \mu_1 \neq \mu_2$
2. Test Statistic: $t = 3.961$
3. P-value: P-value = $2P(T > 3.961)$ = […] (using a computer). Using Table C, the P-value is smaller than 2(0.0005) = 0.001, since t = 3.961 is greater than t* = 3.674 (upper tail area = 0.0005).
4. Conclusion: Since the P-value is smaller than $\alpha = 0.001$, there is very strong evidence that the mean resting pulse rates are different for the two populations (nonexercisers and exercisers).

Robustness of t Procedures
• The two-sample t procedures are more robust than the one-sample t methods, particularly when the distributions are not symmetric.
• When the two populations have similar distribution shapes, the probability values from the t table are quite accurate, even when the sample sizes are as small as $n_1 = n_2 = 5$.
• When the two populations have different distribution shapes, larger samples are needed.
• In planning a two-sample study, it is best to choose equal sample sizes. In this case, the probability values are most accurate.

Using the t Procedures
• Except in the case of small samples, the assumption that each sample is an independent SRS from the population of interest is more important than the assumption that the two population distributions are Normal.
• Small sample sizes ($n_1 + n_2 < 15$): Use t procedures if each data set appears close to Normal (symmetric, single peak, no outliers). If a data set is skewed or if outliers are present, do not use t.
• Medium sample sizes ($n_1 + n_2 \geq 15$): The t procedures can be used except in the presence of outliers or strong skewness in a data set.
• Large samples: The t procedures can be used even for clearly skewed distributions when the sample sizes are large, roughly $n_1 + n_2 \geq 40$.
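The sample-size guidelines above can be encoded as a simple lookup. This is only a sketch: the function name and the wording of the returned messages are hypothetical, and the thresholds are rules of thumb that do not replace plotting the data.

```python
def t_procedure_guidance(n1, n2):
    """Return the rule-of-thumb guidance for a combined sample size n1 + n2."""
    total = n1 + n2
    if total < 15:
        return ("Small: use t only if each data set looks close to Normal "
                "(symmetric, single peak, no outliers).")
    if total < 40:
        return ("Medium: t is usable unless there are outliers "
                "or strong skewness.")
    return "Large: t is usable even for clearly skewed distributions."

for sizes in [(5, 5), (10, 12), (31, 29)]:
    print(sizes, "->", t_procedure_guidance(*sizes))
```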

Details of t Degrees of Freedom
• Using the smaller of $n_1 - 1$ and $n_2 - 1$ as the degrees of freedom is only a rough approximation to the actual degrees of freedom for the two-sample t procedures.
• A better approximation, used by software, computes the degrees of freedom df as a function of the sample sizes and sample standard deviations.
• Using df from the software calculation gives more accurate results than simply using the smaller of $n_1 - 1$ and $n_2 - 1$.

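Details of t Degrees of Freedom
$$df = \frac{\left(\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}\right)^2}{\frac{1}{n_1 - 1}\left(\frac{s_1^2}{n_1}\right)^2 + \frac{1}{n_2 - 1}\left(\frac{s_2^2}{n_2}\right)^2}$$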

Case Study: Exercise and Pulse Rates
Compute the degrees of freedom df used by software to analyze these data using two-sample t procedures. This is the degrees of freedom value used by software when computing critical values and P-values.
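A sketch of the software-style degrees of freedom is shown below, assuming the usual Welch-Satterthwaite approximation, which is what most statistical packages use for the unpooled two-sample t procedures. The function name is hypothetical, and the standard deviations are assumed illustrative values; only the sample sizes 31 and 29 come from the case study.

```python
def welch_df(s1, n1, s2, n2):
    """Welch-Satterthwaite approximate degrees of freedom (assumed formula)."""
    v1 = s1**2 / n1                    # estimated variance of xbar1
    v2 = s2**2 / n2                    # estimated variance of xbar2
    return (v1 + v2) ** 2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))

# Assumed standard deviations; sample sizes from the case study.
df = welch_df(9.0, 31, 8.6, 29)
print(f"software df = {df:.2f}, conservative df = {min(31, 29) - 1}")
```

The result always lies between the smaller of $n_1 - 1$ and $n_2 - 1$ and $n_1 + n_2 - 2$, which is why the conservative choice gives wider intervals and larger P-values.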

Avoid Inference About Standard Deviations
• There are methods for inference about the standard deviations of Normal populations.
• Most software packages have methods for comparing the standard deviations.
• However, these methods are extremely sensitive to non-Normal distributions, and this lack of robustness does not improve in large samples.
• Hence it is not recommended that one do inference about population standard deviations in basic statistical practice.