John Loucks St. Edward’s University . SLIDES . BY.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2003 South-Western /Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Uji Kebaikan Suai (Uji Kecocokan) Pertemuan 23
Inference about the Difference Between the
1 1 Slide Mátgæði Kafli 11 í Newbold Snjólfur Ólafsson + Slides Prepared by John Loucks © 1999 ITP/South-Western College Publishing.
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter Goals After completing this chapter, you should be able to:
1 Pertemuan 09 Pengujian Hipotesis Proporsi dan Data Katagorik Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Chapter 11a: Comparisons Involving Proportions and a Test of Independence Inference about the Difference between the Proportions of Two Populations Hypothesis.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
Goodness of Fit Test for Proportions of Multinomial Population Chi-square distribution Hypotheses test/Goodness of fit test.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
QMS 6351 Statistics and Research Methods Regression Analysis: Testing for Significance Chapter 14 ( ) Chapter 15 (15.5) Prof. Vera Adamchik.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1/71 Statistics Tests of Goodness of Fit and Independence.
Chapter Outline Goodness of Fit test Test of Independence.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 23 日 第六週:配適度與獨立性檢定.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 Pertemuan 24 Uji Kebaikan Suai Matakuliah: I0134 – Metoda Statistika Tahun: 2005 Versi: Revisi.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Comparing Counts Chi Square Tests Independence.
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
St. Edward’s University
Chapter 9: Non-parametric Tests
Chi-square test or c2 test
Chapter 12 Chi-Square Tests.
Chapter 11 Chi-Square Tests.
Testing a Claim About a Mean:  Not Known
John Loucks St. Edward’s University . SLIDES . BY.
Statistics for Business and Economics (13e)
Chapter 11 Inferences About Population Variances
Elementary Statistics
Chi Square Two-way Tables
Econ 3790: Business and Economics Statistics
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
Overview and Chi-Square
Chapter 11 Chi-Square Tests.
Inference on Categorical Data
CHI SQUARE TEST OF INDEPENDENCE
Analyzing the Association Between Categorical Variables
Hypothesis Tests for a Standard Deviation
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11 Chi-Square Tests.
St. Edward’s University
Presentation transcript:

John Loucks St. Edward’s University . SLIDES . BY

Chapter 12 Comparing Multiple Proportions, Test of Independence and Goodness of Fit Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test

Comparing Multiple Proportions, Test of Independence and Goodness of Fit In this chapter we introduce three additional hypothesis-testing procedures. The test statistic and the distribution used are based on the chi-square (c2) distribution. In all cases, the data are categorical.

Testing the Equality of Population Proportions for Three or More Populations Using the notation p1 = population proportion for population 1 p2 = population proportion for population 2 and pk = population proportion for population k The hypotheses for the equality of population proportions for k > 3 populations are as follows: H0: p1 = p2 = . . . = pk Ha: Not all population proportions are equal

Testing the Equality of Population Proportions for Three or More Populations If H0 cannot be rejected, we cannot detect a difference among the k population proportions. If H0 can be rejected, we can conclude that not all k population proportions are equal. Further analyses can be done to conclude which population proportions are significantly different from others.

Testing the Equality of Population Proportions for Three or More Populations Example: Finger Lakes Homes Finger Lakes Homes manufactures three models of prefabricated homes, a two-story colonial, a log cabin, and an A-frame. To help in product-line planning, management would like to compare the customer satisfaction with the three home styles. p1 = proportion likely to repurchase a Colonial for the population of Colonial owners p2 = proportion likely to repurchase a Log Cabin for the population of Log Cabin owners p3 = proportion likely to repurchase an A-Frame for the population of A-Frame owners

Testing the Equality of Population Proportions for Three or More Populations We begin by taking a sample of owners from each of the three populations. Each sample contains categorical data indicating whether the respondents are likely or not likely to repurchase the home.

Testing the Equality of Population Proportions for Three or More Populations Observed Frequencies (sample results) Home Owner Colonial Log A-Frame Total Likely to Yes 100 81 83 264 Repurchase No 35 20 41 96 Total 135 101 124 360

Testing the Equality of Population Proportions for Three or More Populations Next, we determine the expected frequencies under the assumption H0 is correct. Expected Frequencies Under the Assumption H0 is True If a significant difference exists between the observed and expected frequencies, H0 can be rejected.

Testing the Equality of Population Proportions for Three or More Populations Expected Frequencies (computed) Home Owner Colonial Log A-Frame Total Likely to Yes 99.00 74.07 90.93 264 Repurchase No 36.00 26.93 33.07 96 Total 135 101 124 360

Testing the Equality of Population Proportions for Three or More Populations Next, compute the value of the chi-square test statistic. where: fij = observed frequency for the cell in row i and column j eij = expected frequency for the cell in row i and column j under the assumption H0 is true Note: The test statistic has a chi-square distribution with k – 1 degrees of freedom, provided the expected frequency is 5 or more for each cell.

Testing the Equality of Population Proportions for Three or More Populations Computation of the Chi-Square Test Statistic.   Obs. Exp. Sqd. Sqd. Diff. / Likely to Home Freq. Diff. Exp. Freq. Repurch. Owner fij eij (fij - eij) (fij - eij)2 (fij - eij)2/eij Yes Colonial 97 97.50 -0.50 0.2500 0.0026 Log Cab. 83 72.94 10.06 101.1142 1.3862 A-Frame 80 89.56 -9.56 91.3086 1.0196 No 38 37.50 0.50 0.0067 18 28.06 -10.06 3.6041 44 34.44 9.56 2.6509 Total 360 c2 = 8.6700

Testing the Equality of Population Proportions for Three or More Populations Rejection Rule p-value approach: Reject H0 if p-value < a Critical value approach: Reject H0 if where  is the significance level and there are k - 1 degrees of freedom

Testing the Equality of Population Proportions for Three or More Populations Rejection Rule (using a = .05) Reject H0 if p-value < .05 or c2 > 5.991 With  = .05 and k - 1 = 3 - 1 = 2 degrees of freedom Do Not Reject H0 Reject H0 2 5.991

Testing the Equality of Population Proportions for Three or More Populations Conclusion Using the p-Value Approach Area in Upper Tail .10 .05 .025 .01 .005 c2 Value (df = 2) 4.605 5.991 7.378 9.210 10.597 Because c2 = 8.670 is between 9.210 and 7.378, the area in the upper tail of the distribution is between .01 and .025. The p-value < a . We can reject the null hypothesis. Actual p-value is .0131

Testing the Equality of Population Proportions for Three or More Populations We have concluded that the population proportions for the three populations of home owners are not equal. To identify where the differences between population proportions exist, we will rely on a multiple comparisons procedure.

Multiple Comparisons Procedure We begin by computing the three sample proportions. We will use a multiple comparison procedure known as the Marascuillo procedure.

Multiple Comparisons Procedure Marascuillo Procedure We compute the absolute value of the pairwise difference between sample proportions. Colonial and Log Cabin: Colonial and A-Frame: Log Cabin and A-Frame:

Multiple Comparisons Procedure Critical Values for the Marascuillo Pairwise Comparison For each pairwise comparison compute a critical value as follows: For a = .05 and k = 3: c2 = 5.991

Multiple Comparisons Procedure Pairwise Comparison Tests CVij Significant if Pairwise Comparison > CVij Colonial vs. Log Cabin Colonial vs. A-Frame Log Cabin vs. A-Frame .061 .072 .133 .0923 .0971 .1034 Not Significant Significant

Test of Independence 1. Set up the null and alternative hypotheses. H0: The column variable is independent of the row variable Ha: The column variable is not independent of the row variable 2. Select a random sample and record the observed frequency, fij , for each cell of the contingency table. 3. Compute the expected frequency, eij , for each cell.

4. Compute the test statistic. Test of Independence 4. Compute the test statistic. 5. Determine the rejection rule. Reject H0 if p -value < a or . where  is the significance level and, with n rows and m columns, there are (n - 1)(m - 1) degrees of freedom.

Test of Independence Example: Finger Lakes Homes (B) Each home sold by Finger Lakes Homes can be classified according to price and to style. Finger Lakes’ manager would like to determine if the price of the home and the style of the home are independent variables.

Test of Independence Example: Finger Lakes Homes (B) The number of homes sold for each model and price for the past two years is shown below. For convenience, the price of the home is listed as either $99,000 or less or more than $99,000. Price Colonial Log Split-Level A-Frame < $99,000 18 6 19 12 > $99,000 12 14 16 3

Test of Independence Hypotheses H0: Price of the home is independent of the style of the home that is purchased Ha: Price of the home is not independent of the style of the home that is purchased

Test of Independence Expected Frequencies Price Colonial Log Split-Level A-Frame Total < $99K > $99K Total 18 6 19 12 55 12 14 16 3 45 30 20 35 15 100

Test of Independence Rejection Rule With  = .05 and (2 - 1)(4 - 1) = 3 d.f., Reject H0 if p-value < .05 or 2 > 7.815 Test Statistic = .1364 + 2.2727 + . . . + 2.0833 = 9.149

Test of Independence Conclusion Using the p-Value Approach Area in Upper Tail .10 .05 .025 .01 .005 c2 Value (df = 3) 6.251 7.815 9.348 11.345 12.838 Because c2 = 9.145 is between 7.815 and 9.348, the area in the upper tail of the distribution is between .05 and .025. The p-value < a . We can reject the null hypothesis. Actual p-value is .0274

Test of Independence Conclusion Using the Critical Value Approach We reject, at the .05 level of significance, the assumption that the price of the home is independent of the style of home that is purchased.

Goodness of Fit Test: Multinomial Probability Distribution 1. State the null and alternative hypotheses. H0: The population follows a multinomial distribution with specified probabilities for each of the k categories Ha: The population does not follow a multinomial distribution with specified probabilities for each of the k categories

Goodness of Fit Test: Multinomial Probability Distribution 2. Select a random sample and record the observed frequency, fi , for each of the k categories. 3. Assuming H0 is true, compute the expected frequency, ei , in each category by multiplying the category probability by the sample size.

Goodness of Fit Test: Multinomial Probability Distribution 4. Compute the value of the test statistic. where: fi = observed frequency for category i ei = expected frequency for category i k = number of categories Note: The test statistic has a chi-square distribution with k – 1 df provided that the expected frequencies are 5 or more for all categories.

Goodness of Fit Test: Multinomial Probability Distribution 5. Rejection rule: p-value approach: Reject H0 if p-value < a Critical value approach: Reject H0 if where  is the significance level and there are k - 1 degrees of freedom

Multinomial Distribution Goodness of Fit Test Example: Finger Lakes Homes (A) Finger Lakes Homes manufactures four models of prefabricated homes, a two-story colonial, a log cabin, a split-level, and an A-frame. To help in production planning, management would like to determine if previous customer purchases indicate that there is a preference in the style selected.

Multinomial Distribution Goodness of Fit Test Example: Finger Lakes Homes (A) The number of homes sold of each model for 100 sales over the past two years is shown below. Split- A- Model Colonial Log Level Frame # Sold 30 20 35 15

Multinomial Distribution Goodness of Fit Test Hypotheses H0: pC = pL = pS = pA = .25 Ha: The population proportions are not pC = .25, pL = .25, pS = .25, and pA = .25 where: pC = population proportion that purchase a colonial pL = population proportion that purchase a log cabin pS = population proportion that purchase a split-level pA = population proportion that purchase an A-frame

Multinomial Distribution Goodness of Fit Test Rejection Rule Reject H0 if p-value < .05 or c2 > 7.815. With  = .05 and k - 1 = 4 - 1 = 3 degrees of freedom Do Not Reject H0 Reject H0 2 7.815

Multinomial Distribution Goodness of Fit Test Expected Frequencies Test Statistic e1 = .25(100) = 25 e2 = .25(100) = 25 e3 = .25(100) = 25 e4 = .25(100) = 25 = 1 + 1 + 4 + 4 = 10

Multinomial Distribution Goodness of Fit Test Conclusion Using the p-Value Approach Area in Upper Tail .10 .05 .025 .01 .005 c2 Value (df = 3) 6.251 7.815 9.348 11.345 12.838 Because c2 = 10 is between 9.348 and 11.345, the area in the upper tail of the distribution is between .025 and .01. The p-value < a . We can reject the null hypothesis.

Multinomial Distribution Goodness of Fit Test Conclusion Using the Critical Value Approach c2 = 10 > 7.815 We reject, at the .05 level of significance, the assumption that there is no home style preference.

Goodness of Fit Test: Normal Distribution 1. State the null and alternative hypotheses. H0: The population has a normal distribution Ha: The population does not have a normal distribution 2. Select a random sample and a. Compute the mean and standard deviation. b. Define intervals of values so that the expected frequency is at least 5 for each interval. c. For each interval, record the observed frequencies Compute the expected frequency, ei , for each interval. (Multiply the sample size by the probability of a normal random variable being in the interval.

Goodness of Fit Test: Normal Distribution 4. Compute the value of the test statistic. 5. Reject H0 if (where  is the significance level and there are k - 3 degrees of freedom).

Goodness of Fit Test: Normal Distribution Example: IQ Computers IQ Computers (one better than HP?) manufactures and sells a general purpose microcomputer. As part of a study to evaluate sales personnel, management wants to determine, at a .05 significance level, if the annual sales volume (number of units sold by a salesperson) follows a normal probability distribution.

Goodness of Fit Test: Normal Distribution Example: IQ Computers A simple random sample of 30 of the salespeople was taken and their numbers of units sold are listed below. 33 43 44 45 52 52 56 58 63 64 64 65 66 68 70 72 73 73 74 75 83 84 85 86 91 92 94 98 102 105 (mean = 71, standard deviation = 18.54)

Goodness of Fit Test: Normal Distribution Hypotheses H0: The population of number of units sold has a normal distribution with mean 71 and standard deviation 18.54. Ha: The population of number of units sold does not have a normal distribution with mean 71 and standard deviation 18.54.

Goodness of Fit Test: Normal Distribution Interval Definition To satisfy the requirement of an expected frequency of at least 5 in each interval we will divide the normal distribution into 30/5 = 6 equal probability intervals.

Goodness of Fit Test: Normal Distribution Interval Definition Areas = 1.00/6 = .1667 53.02 71 88.98 = 71 + .97(18.54) 71 - .43(18.54) = 63.03 78.97

Goodness of Fit Test: Normal Distribution Observed and Expected Frequencies i fi ei fi - ei Less than 53.02 53.02 to 63.03 63.03 to 71.00 71.00 to 78.97 78.97 to 88.98 More than 88.98 6 3 5 4 30 5 30 1 -2 -1 Total

Goodness of Fit Test: Normal Distribution Rejection Rule With  = .05 and k - p - 1 = 6 - 2 - 1 = 3 d.f. (where k = number of categories and p = number of population parameters estimated), Reject H0 if p-value < .05 or 2 > 7.815. Test Statistic

Goodness of Fit Test: Normal Distribution Conclusion Using the p-Value Approach Area in Upper Tail .90 .10 .05 .025 .01 c2 Value (df = 3) .584 6.251 7.815 9.348 11.345 Because c2 = 1.600 is between .584 and 6.251 in the Chi-Square Distribution Table, the area in the upper tail of the distribution is between .90 and .10. The p-value > a . We cannot reject the null hypothesis. There is little evidence to support rejecting the assumption the population is normally distributed with  = 71 and  = 18.54. Actual p-value is .6594

End of Chapter 12