Lecture 6 Comparing Proportions (II)

Slides:



Advertisements
Similar presentations
Categorical Data Analysis
Advertisements

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Two Sample Hypothesis Testing for Proportions
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
BCOR 1020 Business Statistics Lecture 22 – April 10, 2008.
Differences Between Group Means
BCOR 1020 Business Statistics Lecture 28 – May 1, 2008.
Correlation Patterns. Correlation Coefficient A statistical measure of the covariation or association between two variables. Are dollar sales.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
5-3 Inference on the Means of Two Populations, Variances Unknown
Hypothesis Testing :The Difference between two population mean :
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
1 G Lect 6b G Lecture 6b Generalizing from tests of quantitative variables to tests of categorical variables Testing a hypothesis about a.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
STATISTICAL INFERENCE PART VI HYPOTHESIS TESTING 1.
CHI SQUARE TESTS.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Inferential Statistics 4 Maarten Buis 18/01/2006.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Inferences Concerning the Difference in Population Proportions (9.4) Previous sections (9.1,2,3): We compared the difference in the means (  1 -  2 )
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
More inferential statistics. Chi square tests compare observed frequency distributions, either to theoretical expectations or to other observed frequency.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Hypothesis Testing.
Inference concerning two population variances
Final exam practice questions (answers at the end)
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
Lecture #24 Thursday, November 10, 2016 Textbook: 13.1 through 13.6
Chapter 4. Inference about Process Quality
8-1 of 23.
Hypothesis Tests: One Sample
Hypothesis Testing: Hypotheses
Inferences About Means from Two Groups
Lecture 2. The Binomial Distribution
Chapter 9 Hypothesis Testing.
Lecture 4. The Multinomial Distribution (II)
CONCEPTS OF ESTIMATION
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Chapter 9 Hypothesis Testing.
Day 67 Agenda: Submit THQ #6 Answers.
Lecture 7 The Odds/ Log Odds Ratios
Discrete Event Simulation - 4
Hypothesis Testing.
Lecture 5, Goodness of Fit Test
Hypothesis Tests for Proportions
Lecture 10 Comparing 2xk Tables
Tutorial 4 For the Seat Belt Data, the Death Penalty Data, and the University Admission Data, (1). Identify the response variable and the explanatory.
Lecture 3. The Multinomial Distribution
Chapter 10: One- and Two-Sample Tests of Hypotheses:
Analyzing the Association Between Categorical Variables
SA3202, Solution for Tutorial 4
Confidence Interval.
Copyright © Cengage Learning. All rights reserved.
Hypothesis Testing S.M.JOSHI COLLEGE ,HADAPSAR
Chapter Fifteen Frequency Distribution, Cross-Tabulation, and
Marketing Research and Consumer Behavior Insights
Last Update 12th May 2011 SESSION 41 & 42 Hypothesis Testing.
Summary Table of Influence Procedures for a Single Sample (I)
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Presentation transcript:

Lecture 6 Comparing Proportions (II) Outline of Today: Testing Equality of Proportions The Chi-squared Test 9/19/2018 SA3202, Lecture 6

Testing Equality of Proportions Hypothesis Testing A basic hypothesis of interest is H0: p1=p2 means H0: “C no effect on R” means H0: “C and R are independent” This test is also known as “Independence Test”. The alternative may be one-sided “H1: p1<p2” Or “H1: p1>p2”, or two sided , “ H1: p1 not equal to p2”. Let =p1-p2 be the population proportion difference. Its estimator is the sample proportion difference ~A N( , ) where the variance is The null hypothesis H0: p1=p2 is equivalent o H0: =0. The latter may be tested using Z= ~ AN(0, 1) under H0 9/19/2018 SA3202, Lecture 6

Testing Equality of the Proportions Estimating the Common Variance Since the variance is unknown, we need to estimate it under H0. Let p denote the common value of p1 and p2 under H0 : p1=p2=p. Then Var( ||H0)=p1(1-p1)/n1+p2(1-p2)/n2=(1/n1+1/n2) p(1-p). The best estimator of p is the Pooled estimator (X1+X2)/(n1+n2) So the standard error is s.e( |H0)= . 9/19/2018 SA3202, Lecture 6

Testing Equality of the Proportions The Z- Test Statistic is then constructed as: Z= /s.e( | H0) ~AN(0,1) under H0 Testing Procedure: H0: p1=p2 is tested by comparing Z with the percentage points of N(0,1) if against H1: p1<p2, reject H0 if Z is too small, against H1: p1>p2, reject H0 if Z is too large, against H1: p1 not equal p2, reject H0 if |Z| is too large 9/19/2018 SA3202, Lecture 6

Example For the Vitamin C Data, consider testing H0: Vitamin C has no effect (p1=p2) against the alternative H1: it is effective in preventing cold (p1>p2). Placebo Vitamin C Cold 31 17 Not Cold 109 122 Total 140 139 Now the sample proportion difference =.0991, the pooled sample proportion =(X1+X2)/(n1+n2)=(31+17) /(140+139)=.1720 s.e. ( |H0)=.0448 Z= /s.e.( |H0 )=.0991/.0448=2.19, At 5% level, Reject H0: p1=p2 in favor H1: p1>p2 since Z>1.645 . This means that Vitamin C seems to have some effect. This is consistent with the conclusion made previously using the CI procedure. 9/19/2018 SA3202, Lecture 6

The Chi-squared Test The H0: p1=p2 can also be tested by a chi-squared test: compare the observed frequencies with the expected frequencies using the Pearson’s Goodness of test of the Wilk’s Likelihood ratio test. Response Group 1 Group 2 Positive X1 X2 Negative n1-X1 n2-X2 Total n1 n2 Under H0: p1=p2=p, the expected frequencies are Response Group 1 Group 2 Positive n1p n2p Negative n1(1-p) n2(1-p) Total n1 n2 9/19/2018 SA3202, Lecture 6

The Chi-squared Test Replacing the p by its pooled sample proportion , we get the contingency table of the estimated expected frequencies. Response Group 1 Group 2 Positive Negative Total n1 n2 Thus we can constructed the Pearson’s Goodness of fit Test statistic or the Wilk’s Likelihood Ratio Test statistic The d.f. =4-3=1, with 3 constraints: 9/19/2018 SA3202, Lecture 6

Example For the Vitamin C Data, consider testing H0: Vitamin C has no effect (p1=p2) against the alternative H1: it is effective. Placebo Vitamin C Cold 31 17 Not Cold 109 122 Total 140 139 Under H0, the pooled sample proportion =(X1+X2)/(n1+n2)=(31+17) /(140+139)=.1720 Under H0, the estimated expected frequencies are Cold 24.1 23.9 e.g 23.9=139*.172 Not Cold 115.9 115.1 The Pearson’s Goodness of Fit Test statistic is T=(31-24.1)^2/24.1+(109-115.9)^2/115.9+(17-24.1)^2/24.1+(122-115.1)^2/115.1=4.61 The 95% table value with 1 df= 3.84. Therefore, H0 is rejected . That is the effect of Vitamin is significant at 5% level. Note that the Chi-squared Test is applicable only for two-sided alternatives. 9/19/2018 SA3202, Lecture 6