Chapter 10 Inferences from Two Samples

Slides:



Advertisements
Similar presentations
Testing a Claim about a Proportion Assumptions 1.The sample was a simple random sample 2.The conditions for a binomial distribution are satisfied 3.Both.
Advertisements

Section 9.3 Inferences About Two Means (Independent)
STATISTICS ELEMENTARY MARIO F. TRIOLA
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Confidence Interval and Hypothesis Testing for:
Chapter 9: Inferences for Two –Samples
Two Sample Hypothesis Testing for Proportions
© 2010 Pearson Prentice Hall. All rights reserved Hypothesis Testing Using a Single Sample.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
Inferences about two proportions Assumptions 1.We have proportions from two simple random samples that are independent (not paired) 2.For both samples,
8-4 Testing a Claim About a Mean
8-3 Testing a Claim about a Proportion
8-5 Testing a Claim About a Standard Deviation or Variance This section introduces methods for testing a claim made about a population standard deviation.
Chapter 9 Hypothesis Testing.
Lecture Slides Elementary Statistics Twelfth Edition
1 Chapter 9 Inferences from Two Samples In this chapter we will deal with two samples from two populations. The general goal is to compare the parameters.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Slide 1 Copyright © 2004 Pearson Education, Inc..
Two Sample Tests Ho Ho Ha Ha TEST FOR EQUAL VARIANCES
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 9-2 Inferences About Two Proportions.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Objective Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means.
Hypothesis Testing for Variance and Standard Deviation
381 Hypothesis Testing (Testing with Two Samples-III) QSCI 381 – Lecture 32 (Larson and Farber, Sects 8.3 – 8.4)
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Pengujian Hipotesis Dua Populasi By. Nurvita Arumsari, Ssi, MSi.
Chapter 9 Inferences from Two Samples
Created by Erin Hodgess, Houston, Texas Section 8-5 Comparing Variation in Two Samples.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
1 Chapter 9 Inferences from Two Samples 9.2 Inferences About Two Proportions 9.3 Inferences About Two Means (Independent) 9.4 Inferences About Two Means.
Chapter 23 Inference for One- Sample Means. Steps for doing a confidence interval: 1)State the parameter 2)Conditions 1) The sample should be chosen randomly.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 9-1 Review and Preview.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
to accompany Introduction to Business Statistics
1 Objective Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means.
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
Lecture Slides Elementary Statistics Twelfth Edition
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.Copyright © 2010 Pearson Education Section 9-4 Inferences from Matched.
1 Pertemuan 09 & 10 Pengujian Hipotesis Mata kuliah : A Statistik Ekonomi Tahun: 2010.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.Copyright © 2010 Pearson Education Section 9-3 Inferences About Two Means:
AP Statistics Chapter 24 Comparing Means. Objectives: Two-sample t methods Two-Sample t Interval for the Difference Between Means Two-Sample t Test for.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
Chapter 9 Hypothesis Testing.
Testing the Difference Between Two Means
Inferences About Two Means: Independent Samples
Inferential Statistics Inferences from Two Samples
Exercises #8.74, 8.78 on page 403 #8.110, on page 416
Lecture Slides Elementary Statistics Twelfth Edition
Estimation & Hypothesis Testing for Two Population Parameters
Lecture Slides Elementary Statistics Twelfth Edition
Math 4030 – 10a Tests for Population Mean(s)
Chapter 8 Hypothesis Testing with Two Samples.
Lecture Slides Elementary Statistics Twelfth Edition
Elementary Statistics
Elementary Statistics
Lesson Comparing Two Means.
Independent Samples: Comparing Means
Hypothesis Tests for a Standard Deviation
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 8 Inferences from Two Samples
Inferences from Matched Pairs
Chapter 9 Lecture 4 Section: 9.4.
Chapter 8 Inferences from Two Samples
Presentation transcript:

Chapter 10 Inferences from Two Samples Inferences about Two Means: Independent and Large Samples Inferences about Two Means: Independent  and Small Samples Inferences about Two Means: Matched Pairs Inferences about Two Proportions

Overview There are many important and meaningful situations in which it becomes necessary to compare two sets of sample data. page 438 of text Examples in the discussion

10-2 Inferences about Two Means: Independent and Large Samples

Two Samples: Independent Definitions Two Samples: Independent The sample values selected from one population are not related or somehow paired with the sample values selected from the other population. If the values in one sample are related to the values in the other sample, the samples are dependent. Such samples are often referred to as matched pairs or paired samples. Text will use the wording ‘matched pairs’. Example at bottom of page 438- 439

Assumptions 1. The two samples are independent. 2. The two sample sizes are large. That is, n1 > 30 and n2 > 30. 3. Both samples are simple random samples. page 439

Characteristics of the Sampling Distribution XA-XB The mean of the sampling distribution of all possible XA-XB is µA- µB. The standard deviation of the sampling distribution of all the possible values of

Test Statistic for Two Means: Independent and Large Samples Hypothesis Tests Test Statistic for Two Means: Independent and Large Samples

Test Statistic for Two Means: Independent and Large Samples Hypothesis Tests Test Statistic for Two Means: Independent and Large Samples (x1 - x2) - (µ1 - µ2) z = 1. 2 2 2 + n1 n2

Test Statistic for Two Means: Independent and Large Samples Hypothesis Tests Test Statistic for Two Means: Independent and Large Samples  and  If and are not known, use s1 and s2 in their places. provided that both samples are large. P-value: Use the computed value of the test statistic z, and find the P-value . Critical values: Based on the significance level , find critical values .

Coke Versus Pepsi Sample statistics are shown. Use the 0.01 significance level to test the claim that the mean weight of regular Coke is different from the mean weight of regular Pepsi. Example on page 440 of text

Coke Versus Pepsi Sample statistics are shown. Use the 0.01 significance level to test the claim that the mean weight of regular Coke is different from the mean weight of regular Pepsi. Regular Coke Regular Pepsi n 36 36 x 0.81682 0.82410 s 0.007507 0.005701

Coke Versus Pepsi

Coke Versus Pepsi Claim: 1  2 Ho : 1 = 2 H1 : 1  2  = 0.01 Reject H0 Fail to reject H0 Reject H0 Z = - 2.575 Z = 2.575 1 -  = 0 or Z = 0

Test Statistic for Two Means: Independent and Large Samples Coke Versus Pepsi Test Statistic for Two Means: Independent and Large Samples 2 (x1 - x2) - (µ1 - µ2) z = n1 n2 + 1. 2

Test Statistic for Two Means: Independent and Large Samples Coke Versus Pepsi Test Statistic for Two Means: Independent and Large Samples z = (0.81682 - 0.82410) - 0 0.0075707 2 0.005701 2 + 36 36 = - 4.63

Coke Versus Pepsi Claim: 1  2 Ho : 1 = 2 H1 : 1  2  = 0.01 Reject H0 Fail to reject H0 Reject H0 Z = - 2.575 Z = 2.575 sample data: z = - 4.63 1 -  = 0 or Z = 0

Coke Versus Pepsi Claim: 1  2 Ho : 1 = 2 H1 : 1  2  = 0.01 There is significant evidence to support the claim that there is a difference between the mean weight of Coke and the mean weight of Pepsi. Reject H0 Fail to reject H0 Reject H0 Further explanation of interpretation is given in text. The magnitude of the difference is the weights is not anything that consumers would notice. Also this test simply indicates the Coke ingredients weigh less which does not indicate that there is less volume of the product. Reject Null Z = - 2.575 Z = 2.575 sample data: z = - 4.63 1 -  = 0 or Z = 0

Confidence Intervals Begins at bottom of page 441.

Confidence Intervals (x1 - x2) - E < (µ1 - µ2) < (x1 - x2) + E page 442 of text

Confidence Intervals (x1 - x2) - E < (µ1 - µ2) < (x1 - x2) + E 1 2 2 2 where E = z + n1 n2

Confidence Intervals (x1 - x2) - E < (µ1 - µ2) < (x1 - x2) + E where E = z n1 n2 + 1 2 2 Find an 80% confidence interval for the difference for Coke and Pepsi. For Coke versus Pepsi, x1 - x2 = .00728, and z = 1.28 .00728 + 1.28(.00157) = (.00527, .00929)

t = t-Distribution Model (x1 - x2) - (µ1 - µ2) n1 n2 1. 2 + The degrees of freedom are nA+ nB – 2 A pooled variance is the weighted mean of the sample variances. and is used if the the data is not normally distributed. 2 (x1 - x2) - (µ1 - µ2) t = n1 n2 + 1. 2

Two groups were tested to see whether calcium reduces blood pressure. The following data was collected. Is there evidence at the .1 level that calcium reduces blood pressure? Group 1 (calcium) 7 -4 18 17 -3 -5 1 10 11 –2 Group 2 (placebo) -1 12 -1 -3 3 -5 5 2 -11 -1 -3 Group Treatment n x s 1 Calcium 10 5.00 8.743 2 Placebo 11 -.273 5.901 HO: µ1 - µ2 > 0 HA: µ1 - µ2 < 0 3. tcritical = -1.328 One tail t test, n < 30 11 + 10 – 2 = 19 d.f.

5. There is not enough evidence at the .1 level that calcium reduces blood pressure. -1.328 1.604

Inferences about Two Proportions Assumptions 1. We have proportions from two     independent simple random samples. 2. For both samples, the conditions np  5 and nq  5 are satisfied. page 458 of text

Notation for Two Proportions For population 1, we let: p1 = population proportion n1 = size of the sample x1 = number of successes in the sample page 459 of text The value of x1 is sometimes given, but sometimes must be calculated from the information in the problem. See example below definition box on this page. Computed values of x1 should be a whole number with rounding possibly necessary.

Notation for Two Proportions For population 1, we let: p1 = population proportion n1 = size of the sample x1 = number of successes in the sample ^ p1 = x1/n1 (the sample proportion)

Notation for Two Proportions For population 1, we let: p1 = population proportion n1 = size of the sample x1 = number of successes in the sample ^ p1 = x1/n1 (the sample proportion) q1 = 1 - p1 ^ ^

Notation for Two Proportions For population 1, we let: π1 = population proportion n1 = size of the sample x1 = number of successes in the sample p1 = x1/n1 (the sample proportion) q1 = 1 - p1 The corresponding meanings are attached to π2, n2 , x2 , p2. and q2 , which come from population 2.

Test Statistic for Two Proportions For H0: π1 = π2 , H0: π1  π2 , H0: π1 π2 HA:π1  π2 , HA: π1 < π2 , HA: π 1> π2

Test Statistic for Two Proportions For H0: p1 = p2 , H0: p1  p2 , H0: p1 p2 H1: p1  p2 , H1: p1 < p2 , H1: p 1> p2 where π1 - π 2 = 0 (assumed in the null hypothesis)

Test Statistic for Two Proportions For H0: p1 = p2 , H0: p1  p2 , H0: p1 p2 H1: p1  p2 , H1: p1 < p2 , H1: p 1> p2 where p1 - p 2 = 0 (assumed in the null hypothesis) x1 x2 p1 p2 and = = n1 n2 Example given at the bottom of page 460-462.

Confidence Interval Estimate of π1 - π2 (p1 - p2 ) - E < (π1 - π2 ) < (p1 - p2 ) + E If 0 is not in the interval, one may be C% confident that the two population proportions are different.

A sample of households in urban and rural homes displayed the following data for preference of artificial or natural Christmas trees: Population n X p = X/n 1(urban) 261 89 .341 2(rural 160 64 .400 Is there a difference in preference between urban and rural homes? with a confidence interval of 90%?

(-.139, .021) We are 90% confident that the difference in proportions is between -.14 and .02. Because the interval contains 0, we are not confident that either group has a stronger preference for natural trees than the other group.

Assumptions 1. The sample data consist of matched pairs. 2. The samples are simple random samples. 3. If the number of pairs of sample data is small (n  30), then the population of differences in the paired values must be approximately normally distributed. page 449 of text

Notation for Matched Pairs µd = mean value of the differences d for the population of paired data page 450 of text Use Table 8-1 on page 449 to point out that it will be the differences between the measured pairs of data that will be investigated. Looking at the individual sample means would waste important information about the paired data.

Notation for Matched Pairs µd = mean value of the differences d for the population of paired data d = mean value of the differences d for the paired sample data (equal to the mean of the x - y values) sd = standard deviation of the differences d for the paired sample data n = number of pairs of data.

Test Statistic for Matched Pairs of Sample Data

Test Statistic for Matched Pairs of Sample Data d - µd t = sd n page 450 of text where degrees of freedom = n - 1

Critical Values If n  30, critical values are found in Table A-4 (t-distribution). If n > 30, critical values are found in Table A- 2 (normal distribution). page 451 of text Hypothesis example given on this page

Confidence Intervals page 452 of text

Confidence Intervals d - ME < µd < d + ME

Confidence Intervals d - ME < µd < d + ME sd where ME = t sd n degrees of freedom = n -1

How Much Do Male Statistics Students Exaggerate Their Heights? Using the sample data from Table 8-1 with the outlier excluded, construct a 95% confidence interval estimate of d, which is the mean of the differences between reported heights and measured heights of male statistics students. Example on page 453 of text

Reported and Measured Heights (in inches) of Male Statistics Students Student A B C D E F G H I J K L Reported 68 74 82.25 66.5 69 68 71 70 70 67 68 70 Height Measured 66.8 73.9 74.3 66.1 67.2 67.9 69.4 69.9 68.6 67.9 67.6 68.8 Difference 1.2 0.1 7.95 0.4 1.8 0.1 1.6 0.1 1.4 -0.9 0.4 1.2 Table found on page 449 of text. Discussion of outlier possibilities below text. The outlier data pair should be discarded before starting development of confidence interval.

How Much Do Male Statistics Students Exaggerate Their Heights? t = 2.201 (found from Table A-3 with 11 degrees of freedom and 0.05 in two tails)

How Much Do Male Statistics Students Exaggerate Their Heights? sd n E = t E = (2.201)( ) 2.244 12 = 1.426

How Much Do Male Statistics Students Exaggerate Their Heights? In the long run, 95% o f such samples will lead to confidence intervals that actually do contain the true population mean of the differences. Since the interval does contain 0, the true value of µd is not significantly different from 0. There is not sufficient evidence to support the claim that there is a difference between the reported heights and the measured heights of male statistics students. Some students will need the following explanation: If the measurements are not different, then the differences between the two measurements should have an average (mean) value of 0. Since the 95% confidence interval of these mean differences does not contain 0, the differences are significantly different from 0. Consequently, it would appear that male statistics students do exaggerate their heights.