June 25, 2008Stat 111 - Lecture 14 - Two Means1 Comparing Means from Two Samples Statistics 111 – Lecture 14 One-Sample Inference for Proportions and.

Slides:



Advertisements
Similar presentations
Testing a Claim about a Proportion Assumptions 1.The sample was a simple random sample 2.The conditions for a binomial distribution are satisfied 3.Both.
Advertisements

Lecture 6 Outline – Thur. Jan. 29
Section 9.3 Inferences About Two Means (Independent)
Confidence Interval and Hypothesis Testing for:
The Normal Distribution. n = 20,290  =  = Population.
Lecture 4 Chapter 11 wrap-up
Lecture 6 Outline: Tue, Sept 23 Review chapter 2.2 –Confidence Intervals Chapter 2.3 –Case Study –Two sample t-test –Confidence Intervals Testing.
Chapter Goals After completing this chapter, you should be able to:
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
A Decision-Making Approach
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
T-Tests Lecture: Nov. 6, 2002.
Horng-Chyi HorngStatistics II41 Inference on the Mean of a Population - Variance Known H 0 :  =  0 H 0 :  =  0 H 1 :    0, where  0 is a specified.
Chapter 11: Inference for Distributions
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
5-3 Inference on the Means of Two Populations, Variances Unknown
Hypothesis Testing Using The One-Sample t-Test
Comparing Population Parameters (Z-test, t-tests and Chi-Square test) Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director,
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.
June 23, 2008Stat Lecture 13 - One Mean1 Inference for a Population Mean Confidence Intervals and Tests with unknown variance and Two- sample Tests.
7.1 Lecture 10/29.
Statistical Inference for Two Samples
June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18.
Two Sample Tests Ho Ho Ha Ha TEST FOR EQUAL VARIANCES
June 19, 2008Stat Lecture 12 - Testing 21 Introduction to Inference More on Hypothesis Tests Statistics Lecture 12.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
More About Significance Tests
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
June 26, 2008Stat Lecture 151 Two-Sample Inference for Proportions Statistics Lecture 15.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Mid-Term Review Final Review Statistical for Business (1)(2)
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
Two-Sample Hypothesis Testing. Suppose you want to know if two populations have the same mean or, equivalently, if the difference between the population.
Week 8 October Three Mini-Lectures QMM 510 Fall 2014.
AP Statistics Chapter 24 Comparing Means.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
© Copyright McGraw-Hill 2004
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Chapter 10 Statistical Inference for Two Samples More than one but less than three! Chapter 10B < X
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Sample Size Needed to Achieve High Confidence (Means)
Chapter 7 Inference Concerning Populations (Numeric Responses)
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Review Statistical inference and test of significance.
AP Statistics Chapter 24 Comparing Means. Objectives: Two-sample t methods Two-Sample t Interval for the Difference Between Means Two-Sample t Test for.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
Chapter 9 Introduction to the t Statistic
Lecture #24 Thursday, November 10, 2016 Textbook: 13.1 through 13.6
Comparing Means from Two Samples
Inferences on Two Samples Summary
Elementary Statistics
Chapter 6 Confidence Intervals.
Presentation transcript:

June 25, 2008Stat Lecture 14 - Two Means1 Comparing Means from Two Samples Statistics 111 – Lecture 14 One-Sample Inference for Proportions and

June 25, 2008Stat Lecture 14 - Two Means2 Administrative Notes Homework 5 is posted on website Due Wednesday, July 1 st

June 25, 2008Stat Lecture 14 - Two Means3 Outline Two Sample Z-test (known variance) Two Sample t-test (unknown variance) Matched Pair Test and Examples Tests and Intervals for Proportions (Chapter 8)

June 25, 2008Stat Lecture 14 - Means4 Comparing Two Samples Up to now, we have looked at inference for one sample of continuous data Our next focus in this course is comparing the data from two different samples For now, we will assume that these two different samples are independent of each other and come from two distinct populations Population 1:  1,  1 Sample 1:, s 1 Population 2:  2,  2 Sample 2:, s 2

June 25, 2008Stat Lecture 14 - Means5 Blackout Baby Boom Revisited Nine months (Monday, August 8th) after Nov 1965 blackout, NY Times claimed an increased birth rate Already looked at single two-week sample: found no significant difference from usual rate (430 births/day) What if we instead look at difference between weekends and weekdays? SunMonTueWedThuFriSat WeekdaysWeekends

June 25, 2008Stat Lecture 14 - Means6 Two-Sample Z test We want to test the null hypothesis that the two populations have different means H 0 :  1 =  2 or equivalently,  1 -  2 = 0 Two-sided alternative hypothesis:  1 -  2  0 If we assume our population SDs  1 and  2 are known, we can calculate a two-sample Z statistic: We can then calculate a p-value from this Z statistic using the standard normal distribution

June 25, 2008Stat Lecture 14 - Two Means7 Two-Sample Z test for Blackout Data To use Z test, we need to assume that our pop. SDs are known:  1 = s 1 = 21.7 and  2 = s 2 = 24.5 From normal table, P(Z > 7.5) is less than , so our p-value = 2  P(Z > 7.5) is less than Conclusion here is a significant difference between birth rates on weekends and weekdays We don’t usually know the population SDs, so we need a method for unknown  1 and  2

June 25, 2008Stat Lecture 14 - Two Means8 Two-Sample t test We still want to test the null hypothesis that the two populations have equal means (H 0 :  1 -  2 = 0) If  1 and  2 are unknown, then we need to use the sample SDs s 1 and s 2 instead, which gives us the two-sample T statistic: The p-value is calculated using the t distribution, but what degrees of freedom do we use? df can be complicated and often is calculated by software Simpler and more conservative: set degrees of freedom equal to the smaller of (n 1 -1) or (n 2 -1)

June 25, 2008Stat Lecture 14 - Two Means9 Two-Sample t test for Blackout Data To use t test, we need to use our sample standard deviations s 1 = 21.7 and s 2 = 24.5 We need to look up the tail probabilities using the t distribution Degrees of freedom is the smaller of n 1 -1 = 22 or n 2 -1 = 7

June 25, 2008Stat Lecture 14 - Two Means10

June 25, 2008Stat Lecture 14 - Two Means11 Two-Sample t test for Blackout Data From t-table with df = 7, we see that P(T > 7.5) < If our alternative hypothesis is two-sided, then we know that our p-value < 2  = We reject the null hypothesis at  -level of 0.05 and conclude there is a significant difference between birth rates on weekends and weekdays Same result as Z-test, but we are a little more conservative

June 25, 2008Stat Lecture 14 - Two Means12 Two-Sample Confidence Intervals In addition to two sample t-tests, we can also use the t distribution to construct confidence intervals for the mean difference When  1 and  2 are unknown, we can form the following 100·C% confidence interval for the mean difference  1 -  2 : The critical value t k * is calculated from a t distribution with degrees of freedom k k is equal to the smaller of (n 1 -1) and (n 2 -1)

June 25, 2008Stat Lecture 14 - Two Means13 Confidence Interval for Blackout Data We can calculate a 95% confidence interval for the mean difference between birth rates on weekdays and weekends: We get our critical value t k * = is calculated from a t distribution with 7 degrees of freedom, so our 95% confidence interval is: Since zero is not contained in this interval, we know the difference is statistically significant!

June 25, 2008Stat Lecture 14 - Two Means14 Matched Pairs Sometimes the two samples that are being compared are matched pairs (not independent) Example: Sentences for crack versus powder cocaine We could test for the mean difference between X 1 = crack sentences and X 2 = powder sentences However, we realize that these data are paired: each row of sentences have a matching quantity of cocaine Our t-test for two independent samples ignores this relationship

June 25, 2008Stat Lecture 14 - Two Means15 Matched Pairs Test First, calculate the difference d = X 1 - X 2 for each pair Then, calculate the mean and SD of the differences d Quantity Sentences Crack X 1 Powder X 2 Difference d = X 1 - X

June 25, 2008Stat Lecture 14 - Two Means16 Instead of a two-sample test for the difference between X 1 and X 2, we do a one-sample test on the difference d Null hypothesis: mean difference between the two samples is equal to zero H 0 :  d = 0 versus H a :  d  0 Usual test statistic when population SD is unknown: p-value calculated from t-distribution with df = 8 P(T > 5.24) < so p-value < Difference between crack and powder sentences is statistically significant at  -level of 0.05 Matched Pairs Test

June 25, 2008Stat Lecture 14 - Two Means17 We can also construct a confidence interval for the mean difference  d of matched pairs We can just use the confidence intervals we learned for the one-sample, unknown  case Example: 95% confidence interval for mean difference between crack and powder sentences: Matched Pairs Confidence Interval

June 25, 2008Stat Lecture 14 - Two Means18 Summary of Two-Sample Tests Two independent samples with known  1 and  2 We use two-sample Z-test with p-values calculated using the standard normal distribution Two independent samples with unknown  1 and  2 We use two-sample t-test with p-values calculated using the t distribution with degrees of freedom equal to the smaller of n 1 -1 and n 2 -1 Also can make confidence intervals using t distribution Two samples that are matched pairs We first calculate the differences for each pair, and then use our usual one-sample t-test on these differences

June 25, 2008Stat Lecture 14 - Two Means19 One-Sample Inference for Proportions

June 25, 2008Stat Lecture 14- One- Sample Proportions 20 Revisiting Count Data Chapter 6 and 7 covered inference for the population mean of continuous data We now return to count data: Example: Opinion Polls X i = 1 if you support Obama, X i = 0 if not We call p the population proportion for X i = 1 What is the proportion of people who support the war? What is the proportion of Red Sox fans at Penn?

June 25, 2008Stat Lecture 14- One- Sample Proportions 21 Inference for population proportion p We will use sample proportion as our best estimate of the unknown population proportion p where Y = sample count Tool 1: use our sample statistic as the center of an entire confidence interval of likely values for our population parameter Confidence Interval : Estimate ± Margin of Error Tool 2: Use the data to for a specific hypothesis test 1.Formulate your null and alternative hypotheses 2.Calculate the test statistic 3.Find the p-value for the test statistic

June 25, 2008Stat Lecture 14- One- Sample Proportions 22 Distribution of Sample Proportion In Chapter 5, we learned that the sample proportion technically has a binomial distribution However, we also learned that if the sample size is large, the sample proportion approximately follows a Normal distribution with mean and standard deviation: We will essentially use this approximation throughout chapter 8, so we can make probability calculations using the standard normal table

June 25, 2008Stat Lecture 14- One- Sample Proportions 23 Confidence Interval for a Proportion We could use our sample proportion as the center of a confidence interval of likely values for the population parameter p: The width of the interval is a multiple of the standard deviation of the sample proportion The multiple Z * is calculated from a normal distribution and depends on the confidence level

June 25, 2008Stat Lecture 14- One- Sample Proportions 24 Confidence Interval for a Proportion One Problem: this margin of error involves the population proportion p, which we don’t actually know! Solution: substitute in the sample proportion for the population proportion p, which gives us the interval:

June 25, 2008Stat Lecture 14- One- Sample Proportions 25 Example: Red Sox fans at Penn What proportion of Penn students are Red Sox fans? Use Stat 111 class survey as sample Y = 25 out of n = 192 students are Red Sox fans so 95% confidence interval for the population proportion: Proportion of Red Sox fans at Penn is probably between 8% and 18%

June 25, 2008Stat Lecture 14- One- Sample Proportions 26 Hypothesis Test for a Proportion Suppose that we are now interested in using our count data to test a hypothesized population proportion p 0 Example: an older study says that the proportion of Red Sox fans at Penn is Does our sample show a significantly different proportion? First Step: Null and alternative hypotheses H 0 : p = 0.10 vs. H a : p  0.10 Second Step: Test Statistic

June 25, 2008Stat Lecture 14- One- Sample Proportions 27 Hypothesis Test for a Proportion Problem: test statistic involves population proportion p For confidence intervals, we plugged in sample proportion but for test statistics, we plug in the hypothesized proportion p 0 : Example: test statistic for Red Sox example

June 25, 2008Stat Lecture 14- One- Sample Proportions 28 Hypothesis Test for a Proportion Third step: need to calculate a p-value for our test statistic using the standard normal distribution Red Sox Example: Test statistic Z = 1.39 What is the probability of getting a test statistic as extreme or more extreme than Z = 1.39? ie. P(Z > 1.39) = ? Two-sided alternative, so p-value = 2  P(Z>1.39) = 0.16 We don’t reject H 0 at a  =0.05 level, and conclude that Red Sox proportion is not significantly different from p 0 =0.10 Z = 1.39 prob = 0.082

June 25, 2008Stat Lecture 14- One- Sample Proportions 29 Another Example Mass ESP experiment in 1977 Sunday Mirror (UK) Psychic hired to send readers a mental message about a particular color (out of 5 choices). Readers then mailed back the color that they “received” from psychic Newspaper declared the experiment a success because, out of 2355 responses, they received 521 correct ones ( ) Is the proportion of correct answers statistically different than we would expect by chance (p 0 = 0.2) ? H 0 : p= 0.2 vs. H a : p  0.2

June 25, 2008Stat Lecture 14- One- Sample Proportions 30 Mass ESP Example Calculate a p-value using the standard normal distribution Two-sided alternative, so p-value = 2  P(Z>2.43) = We reject H 0 at a  =0.05 level, and conclude that the survey proportion is significantly different from p 0 =0.20 We could also calculate a 95% confidence interval for p: Z = 2.43 prob = Interval doesn’t contain 0.20

June 25, 2008Stat Lecture 14- One- Sample Proportions 31 Margin of Error Confidence intervals for proportion p is centered at the sample proportion and has a margin of error: Before the study begins, we can calculate the sample size needed for a desired margin of error Problem: don’t know sample prop. before study begins! Solution: use which gives us the maximum m So, if we want a margin of error less than m, we need

June 25, 2008Stat Lecture 14- One- Sample Proportions 32 Margin of Error Examples Red Sox Example: how many students should I poll in order to have a margin of error less than 5% in a 95% confidence interval? We would need a sample size of 385 students ESP example: how many responses must newspaper receive to have a margin of error less than 1% in a 95% confidence interval?

June 25, 2008Stat Lecture 14- One- Sample Proportions 33 Next Class - Lecture 15 Two-Sample Inference for Proportions Moore, McCabe and Craig: Section 8.2