Published by Emma Strickland. Modified over 9 years ago.
1
Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Slides by John Loucks, St. Edward's University
2
Chapter 12: Tests of Goodness of Fit and Independence
- Goodness of Fit Test: A Multinomial Population
- Goodness of Fit Test: Poisson and Normal Distributions
- Test of Independence
3
Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population
1. State the null and alternative hypotheses.
   $H_0$: The population follows a multinomial distribution with specified probabilities for each of the $k$ categories
   $H_a$: The population does not follow a multinomial distribution with specified probabilities for each of the $k$ categories
4
Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population
2. Select a random sample and record the observed frequency, $f_i$, for each of the $k$ categories.
3. Assuming $H_0$ is true, compute the expected frequency, $e_i$, in each category by multiplying the category probability by the sample size.
5
Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population
4. Compute the value of the test statistic.
   $$\chi^2 = \sum_{i=1}^{k} \frac{(f_i - e_i)^2}{e_i}$$
   where:
   $f_i$ = observed frequency for category $i$
   $e_i$ = expected frequency for category $i$
   $k$ = number of categories
   Note: The test statistic has a chi-square distribution with $k - 1$ df provided that the expected frequencies are 5 or more for all categories.
6
Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population
5. Rejection rule:
   p-value approach: Reject $H_0$ if p-value $\le \alpha$
   Critical value approach: Reject $H_0$ if $\chi^2 \ge \chi^2_\alpha$
   where $\alpha$ is the significance level and there are $k - 1$ degrees of freedom
7
Multinomial Distribution Goodness of Fit Test
- Example: Finger Lakes Homes (A)
Finger Lakes Homes manufactures four models of prefabricated homes: a two-story colonial, a log cabin, a split-level, and an A-frame. To help in production planning, management would like to determine if previous customer purchases indicate that there is a preference in the style selected.
8
Multinomial Distribution Goodness of Fit Test
- Example: Finger Lakes Homes (A)
The number of homes sold of each model for 100 sales over the past two years is shown below.
Model    Colonial  Log  Split-Level  A-Frame
# Sold   30        20   35           15
9
Multinomial Distribution Goodness of Fit Test
- Hypotheses
$H_0$: $p_C = p_L = p_S = p_A = .25$
$H_a$: The population proportions are not $p_C = .25$, $p_L = .25$, $p_S = .25$, and $p_A = .25$
where:
$p_C$ = population proportion that purchase a colonial
$p_L$ = population proportion that purchase a log cabin
$p_S$ = population proportion that purchase a split-level
$p_A$ = population proportion that purchase an A-frame
10
Multinomial Distribution Goodness of Fit Test
- Rejection Rule
With $\alpha = .05$ and $k - 1 = 4 - 1 = 3$ degrees of freedom, reject $H_0$ if p-value $\le .05$ or $\chi^2 \ge 7.815$.
[Figure: chi-square distribution with the critical value 7.815 marked; "Do Not Reject $H_0$" to its left, "Reject $H_0$" to its right]
11
Multinomial Distribution Goodness of Fit Test
- Expected Frequencies
$e_1 = .25(100) = 25$   $e_2 = .25(100) = 25$   $e_3 = .25(100) = 25$   $e_4 = .25(100) = 25$
- Test Statistic
$$\chi^2 = \frac{(30 - 25)^2}{25} + \frac{(20 - 25)^2}{25} + \frac{(35 - 25)^2}{25} + \frac{(15 - 25)^2}{25} = 1 + 1 + 4 + 4 = 10$$
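The expected-frequency and test-statistic arithmetic for Finger Lakes Homes (A) can be checked with a short Python sketch. The function name `chi_square_statistic` is illustrative and not part of the slides; the critical value 7.815 is the tabled chi-square value for a .05 significance level and 3 degrees of freedom.

```python
def chi_square_statistic(observed, expected):
    """Return the sum of (f_i - e_i)^2 / e_i over all categories."""
    return sum((f - e) ** 2 / e for f, e in zip(observed, expected))


# Observed sales: Colonial, Log, Split-Level, A-Frame
observed = [30, 20, 35, 15]
n = sum(observed)

# Under H0 every style has probability .25
hypothesized = [0.25, 0.25, 0.25, 0.25]
expected = [p * n for p in hypothesized]

stat = chi_square_statistic(observed, expected)

# Tabled critical value for alpha = .05 with k - 1 = 3 degrees of freedom
critical_value = 7.815
reject_h0 = stat >= critical_value

print(expected, stat, reject_h0)
```

With these inputs each expected frequency is 25 and the statistic is 10, so the critical value approach rejects the null hypothesis.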
12
Multinomial Distribution Goodness of Fit Test
- Conclusion Using the p-Value Approach
Area in Upper Tail        .10    .05    .025   .01     .005
$\chi^2$ Value (df = 3)   6.251  7.815  9.348  11.345  12.838
Because $\chi^2 = 10$ is between 9.348 and 11.345, the area in the upper tail of the distribution is between .025 and .01.
The p-value $< \alpha$. We can reject the null hypothesis.
13
Multinomial Distribution Goodness of Fit Test
- Conclusion Using the Critical Value Approach
$\chi^2 = 10 > 7.815$
We reject, at the .05 level of significance, the assumption that there is no home style preference.
14
Multinomial Distribution Goodness of Fit Test
- Excel Worksheet (showing data)
Note: Rows 13-101 are not shown.
15
Multinomial Distribution Goodness of Fit Test
- Excel Formula Worksheet
Note: Columns A-B and rows 13-101 are not shown.
Row  C        D      E                          F         G       H      I
1             Hyp.   Observed                   Expect.           Sq'd.  Sq.Diff./
2    Categ.   Prop.  Frequency                  Freq.     Diff.   Diff.  Exp.Freq.
3    Col.     0.25   =COUNTIF(B2:B101,"Col")    =D3*$E$7  =E3-F3  =G3^2  =H3/F3
4    Log      0.25   =COUNTIF(B2:B101,"Log")    =D4*$E$7  =E4-F4  =G4^2  =H4/F4
5    Split-L  0.25   =COUNTIF(B2:B101,"Spl")    =D5*$E$7  =E5-F5  =G5^2  =H5/F5
6    A-Fr.    0.25   =COUNTIF(B2:B101,"Afr")    =D6*$E$7  =E6-F6  =G6^2  =H6/F6
7             Total  =SUM(E3:E6)                                         =SUM(I3:I6)
8
9    Categories      4
10   Test Statistic  =I7
11   Degr. of Free.  =E9-1
12   p-Value         =CHISQ.DIST.RT(E10,E11)
16
Multinomial Distribution Goodness of Fit Test
- Excel Value Worksheet
Note: Columns A-B and rows 13-101 are not shown.
Row  C        D      E          F        G      H      I
1             Hyp.   Observed   Expect.         Sq'd.  Sq.Diff./
2    Categ.   Prop.  Frequency  Freq.    Diff.  Diff.  Exp.Freq.
3    Col.     0.25   30         25       5      25     1
4    Log      0.25   20         25       -5     25     1
5    Split-L  0.25   35         25       10     100    4
6    A-Fr.    0.25   15         25       -10    100    4
7             Total  100                               10
8
9    Categories      4
10   Test Statistic  10
11   Degr. of Free.  3
12   p-Value         0.0186
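The worksheet's p-value of 0.0186 comes from Excel's `CHISQ.DIST.RT`. As a rough cross-check without Excel, the sketch below computes the chi-square upper-tail area for integer degrees of freedom using the standard recurrence for the regularized upper incomplete gamma function. The function name `chi_square_upper_tail` is an assumption made for illustration; it is not an Excel or library function.

```python
import math


def chi_square_upper_tail(x, df):
    """Upper-tail area P(X >= x) for a chi-square variable, integer df >= 1.

    Uses Q(1/2, y) = erfc(sqrt(y)), Q(1, y) = exp(-y), and the recurrence
    Q(a + 1, y) = Q(a, y) + y^a * exp(-y) / Gamma(a + 1), with y = x / 2.
    """
    if x <= 0:
        return 1.0
    y = x / 2.0
    if df % 2 == 0:
        a, tail = 1.0, math.exp(-y)
    else:
        a, tail = 0.5, math.erfc(math.sqrt(y))
    while a < df / 2.0:
        tail += math.exp(a * math.log(y) - y - math.lgamma(a + 1.0))
        a += 1.0
    return tail


# Finger Lakes Homes (A): test statistic 10 with 3 degrees of freedom
p_value = chi_square_upper_tail(10.0, 3)
print(round(p_value, 4))
```

The same function can be applied to the tabled value 7.815 with 3 degrees of freedom, which should give an upper-tail area close to .05.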
17
Test of Independence: Contingency Tables
1. Set up the null and alternative hypotheses.
   $H_0$: The column variable is independent of the row variable
   $H_a$: The column variable is not independent of the row variable
2. Select a random sample and record the observed frequency, $f_{ij}$, for each cell of the contingency table.
3. Compute the expected frequency, $e_{ij}$, for each cell.
18
Test of Independence: Contingency Tables
4. Compute the test statistic.
   $$\chi^2 = \sum_{i} \sum_{j} \frac{(f_{ij} - e_{ij})^2}{e_{ij}}$$
5. Determine the rejection rule.
   Reject $H_0$ if p-value $\le \alpha$ or $\chi^2 \ge \chi^2_\alpha$
   where $\alpha$ is the significance level and, with $n$ rows and $m$ columns, there are $(n - 1)(m - 1)$ degrees of freedom.
19
Contingency Table (Independence) Test
- Example: Finger Lakes Homes (B)
Each home sold by Finger Lakes Homes can be classified according to price and to style. Finger Lakes' manager would like to determine if the price of the home and the style of the home are independent variables.
20
Contingency Table (Independence) Test
- Example: Finger Lakes Homes (B)
The number of homes sold for each model and price for the past two years is shown below. For convenience, the price of the home is listed as either $200,000 or less or more than $200,000.
Price         Colonial  Log  Split-Level  A-Frame
> $200,000    12        14   16           3
<= $200,000   18        6    19           12
21
Contingency Table (Independence) Test
- Hypotheses
$H_0$: Price of the home is independent of the style of the home that is purchased
$H_a$: Price of the home is not independent of the style of the home that is purchased
22
Contingency Table (Independence) Test
- Expected Frequencies
Observed frequencies with row and column totals:
Price     Colonial  Log  Split-Level  A-Frame  Total
<= $200K  18        6    19           12       55
> $200K   12        14   16           3        45
Total     30        20   35           15       100
23
Contingency Table (Independence) Test
- Rejection Rule
With $\alpha = .05$ and $(2 - 1)(4 - 1) = 3$ d.f., reject $H_0$ if p-value $\le .05$ or $\chi^2 \ge 7.815$.
- Test Statistic
$\chi^2 = .1364 + 2.2727 + \ldots + 2.0833 = 9.149$
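The expected frequencies and the test statistic for Finger Lakes Homes (B) can be reproduced with the sketch below. It assumes each expected cell count is (row total)(column total)/(sample size), which is the calculation the deck's Excel formula worksheet performs; the variable names are illustrative only.

```python
# Observed counts by price (rows) and style (columns:
# Colonial, Log, Split-Level, A-Frame)
observed = {
    "<=200K": [18, 6, 19, 12],
    ">200K": [12, 14, 16, 3],
}

rows = list(observed.values())
row_totals = [sum(row) for row in rows]
col_totals = [sum(col) for col in zip(*rows)]
n = sum(row_totals)

# Expected count under independence: row total * column total / n
expected = [[rt * ct / n for ct in col_totals] for rt in row_totals]

# Chi-square statistic summed over every cell of the table
stat = sum(
    (f - e) ** 2 / e
    for obs_row, exp_row in zip(rows, expected)
    for f, e in zip(obs_row, exp_row)
)

# (rows - 1)(columns - 1) degrees of freedom
df = (len(rows) - 1) * (len(col_totals) - 1)

print(expected)
print(round(stat, 3), df)
```

The first and last cell contributions are .1364 and 2.0833, matching the terms shown on the slide, and the total rounds to 9.149 with 3 degrees of freedom.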
24
Contingency Table (Independence) Test
- Conclusion Using the p-Value Approach
Area in Upper Tail        .10    .05    .025   .01     .005
$\chi^2$ Value (df = 3)   6.251  7.815  9.348  11.345  12.838
Because $\chi^2 = 9.149$ is between 7.815 and 9.348, the area in the upper tail of the distribution is between .05 and .025.
The p-value $< \alpha$. We can reject the null hypothesis.
25
Contingency Table (Independence) Test
- Conclusion Using the Critical Value Approach
$\chi^2 = 9.149 > 7.815$
We reject, at the .05 level of significance, the assumption that the price of the home is independent of the style of home that is purchased.
26
Contingency Table (Independence) Test
- Excel Worksheet (showing data)
Note: Rows 11-101 are not shown.
Row  A     B          C
1    Home  Price ($)  Style
2    1     >200K      Colonial
3    2     <=200K     Log
4    3     >200K      Log
5    4     <=200K     A-Frame
6    5     <=200K     Colonial
7    6     <=200K     Split-Level
8    7     >200K      A-Frame
9    8     >200K      Colonial
10   9     <=200K     A-Frame
27
Contingency Table (Independence) Test
- Excel Worksheet (showing Pivot Table)
Note: Columns A-D are not shown.
Row  E              F           G    H           I        J
1    Count of Home  Preference
2    Price ($)      Colonial    Log  Split-Lev.  A-Frame  Grand Tot.
3    <=200K         18          6    19          12       55
4    >200K          12          14   16          3        45
5    Grand Total    30          20   35          15       100
28
Contingency Table (Independence) Test
- Excel Formula Worksheet
Note: Columns A-D are not shown.
Row  E              F           G          H           I          J
1    Count of Home  Preference
2    Price ($)      Colonial    Log        Split-Lev.  A-Frame    Grand Tot.
3    <=200K         18          6          19          12         55
4    >200K          12          14         16          3          45
5    Grand Total    30          20         35          15         100
6
7    Expected Frequencies
8                   Colonial    Log        Split-Lev.  A-Frame
9    <=200K         =F5*J3/J5   =G5*J3/J5  =H5*J3/J5   =I5*J3/J5
10   >200K          =F5*J4/J5   =G5*J4/J5  =H5*J4/J5   =I5*J4/J5
11
12   p-Value        =CHISQ.TEST(F3:I4,F9:I10)
29
Contingency Table (Independence) Test
- Excel Value Worksheet
Note: Columns A-D are not shown.
Row  E              F           G      H           I        J
1    Count of Home  Preference
2    Price ($)      Colonial    Log    Split-Lev.  A-Frame  Grand Tot.
3    <=200K         18          6      19          12       55
4    >200K          12          14     16          3        45
5    Grand Total    30          20     35          15       100
6
7    Expected Frequencies
8                   Colonial    Log    Split-Lev.  A-Frame
9    <=200K         16.50       11.00  19.25       8.25
10   >200K          13.50       9.00   15.75       6.75
11
12   p-Value        0.0274
30
Goodness of Fit Test: Poisson Distribution
1. State the null and alternative hypotheses.
   $H_0$: The population has a Poisson distribution
   $H_a$: The population does not have a Poisson distribution
2. Select a random sample and
   a. Record the observed frequency $f_i$ for each value of the Poisson random variable.
   b. Compute the mean number of occurrences $\mu$.
31
Goodness of Fit Test: Poisson Distribution
3. Compute the expected frequency of occurrences $e_i$ for each value of the Poisson random variable.
   - Multiply the sample size by the Poisson probability of occurrence for each value of the Poisson random variable.
   - If there are fewer than five expected occurrences for some values, combine adjacent values and reduce the number of categories as necessary.
32
Goodness of Fit Test: Poisson Distribution
4. Compute the value of the test statistic.
   $$\chi^2 = \sum_{i=1}^{k} \frac{(f_i - e_i)^2}{e_i}$$
   where:
   $f_i$ = observed frequency for category $i$
   $e_i$ = expected frequency for category $i$
   $k$ = number of categories
33
Goodness of Fit Test: Poisson Distribution
5. Rejection rule:
   p-value approach: Reject $H_0$ if p-value $\le \alpha$
   Critical value approach: Reject $H_0$ if $\chi^2 \ge \chi^2_\alpha$
   where $\alpha$ is the significance level and there are $k - 2$ degrees of freedom
34
Goodness of Fit Test: Poisson Distribution
- Example: Troy Parking Garage
In studying the need for an additional entrance to a city parking garage, a consultant has recommended an analysis approach that is applicable only in situations where the number of cars entering during a specified time period follows a Poisson distribution.
35
Goodness of Fit Test: Poisson Distribution
- Example: Troy Parking Garage
A random sample of 100 one-minute time intervals resulted in the customer arrivals listed below. A statistical test must be conducted to see if the assumption of a Poisson distribution is reasonable.
# Arrivals  0  1  2  3   4   5   6   7   8  9  10  11  12
Frequency   0  1  4  10  14  20  12  12  9  8  6   3   1
36
Goodness of Fit Test: Poisson Distribution
- Hypotheses
$H_0$: Number of cars entering the garage during a one-minute interval is Poisson distributed
$H_a$: Number of cars entering the garage during a one-minute interval is not Poisson distributed
37
Goodness of Fit Test: Poisson Distribution
- Estimate of Poisson Probability Function
Total Arrivals = 0(0) + 1(1) + 2(4) + ... + 12(1) = 600
Total Time Periods = 100
Estimate of $\mu$ = 600/100 = 6
Hence, $f(x) = \dfrac{6^x e^{-6}}{x!}$
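The estimate of the Poisson mean can be verified directly from the arrival counts in the Troy Parking Garage sample. The sketch below is a minimal check; the helper `poisson_pmf` is an illustrative name for the standard Poisson probability function and is not taken from the slides.

```python
import math


def poisson_pmf(x, mu):
    """Poisson probability of exactly x occurrences when the mean is mu."""
    return mu ** x * math.exp(-mu) / math.factorial(x)


# Number of arrivals per one-minute interval and how often each was observed
arrivals = list(range(13))
frequency = [0, 1, 4, 10, 14, 20, 12, 12, 9, 8, 6, 3, 1]

total_periods = sum(frequency)
total_arrivals = sum(x * f for x, f in zip(arrivals, frequency))
mean_estimate = total_arrivals / total_periods

print(total_arrivals, total_periods, mean_estimate)
print(round(poisson_pmf(5, mean_estimate), 4))
```

The totals are 600 arrivals over 100 periods, giving an estimated mean of 6, and the probability of exactly 5 arrivals under that mean is about .1606.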
38
Goodness of Fit Test: Poisson Distribution
- Expected Frequencies
x      f(x)    n f(x)
0      .0025   .25
1      .0149   1.49
2      .0446   4.46
3      .0892   8.92
4      .1339   13.39
5      .1606   16.06
6      .1606   16.06
7      .1377   13.77
8      .1033   10.33
9      .0688   6.88
10     .0413   4.13
11     .0225   2.25
12+    .0201   2.01
Total  1.0000  100.00
39
Goodness of Fit Test: Poisson Distribution
- Observed and Expected Frequencies
i            f_i  e_i    f_i - e_i
0 or 1 or 2  5    6.20   -1.20
3            10   8.92   1.08
4            14   13.39  0.61
5            20   16.06  3.94
6            12   16.06  -4.06
7            12   13.77  -1.77
8            9    10.33  -1.33
9            8    6.88   1.12
10 or more   10   8.39   1.61
40
Goodness of Fit Test: Poisson Distribution
- Rejection Rule
With $\alpha = .05$ and $k - p - 1 = 9 - 1 - 1 = 7$ d.f. (where $k$ = number of categories and $p$ = number of population parameters estimated), reject $H_0$ if p-value $\le .05$ or $\chi^2 \ge 14.067$.
- Test Statistic
$$\chi^2 = \frac{(-1.20)^2}{6.20} + \frac{(1.08)^2}{8.92} + \cdots + \frac{(1.61)^2}{8.39} = 3.268$$
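The grouping of categories and the resulting test statistic for the Troy Parking Garage example can be sketched as follows. The code assumes the grouping used in the deck (0 to 2 arrivals combined, 10 or more combined) and an estimated mean of 6; the names are illustrative, and `poisson_pmf` is simply the standard Poisson probability function.

```python
import math


def poisson_pmf(x, mu):
    """Poisson probability of exactly x occurrences when the mean is mu."""
    return mu ** x * math.exp(-mu) / math.factorial(x)


n, mu = 100, 6.0

# Observed counts for the categories: 0-2, 3, 4, 5, 6, 7, 8, 9, 10 or more
observed = [5, 10, 14, 20, 12, 12, 9, 8, 10]

# Category probabilities; the last one is the remaining upper tail
probs = [sum(poisson_pmf(x, mu) for x in range(3))]
probs += [poisson_pmf(x, mu) for x in range(3, 10)]
probs.append(1.0 - sum(probs))

expected = [n * p for p in probs]
stat = sum((f - e) ** 2 / e for f, e in zip(observed, expected))

# k - p - 1 degrees of freedom, with p = 1 estimated parameter (the mean)
df = len(observed) - 1 - 1

print([round(e, 2) for e in expected])
print(round(stat, 3), df)
```

Combining the sparse categories keeps every expected frequency at 5 or more, which is the condition the procedure requires before the chi-square approximation is used.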
41
Goodness of Fit Test: Poisson Distribution
- Conclusion Using the p-Value Approach
Area in Upper Tail        .90    .10     .05     .025    .01
$\chi^2$ Value (df = 7)   2.833  12.017  14.067  16.013  18.475
Because $\chi^2 = 3.268$ is between 2.833 and 12.017 in the Chi-Square Distribution Table, the area in the upper tail of the distribution is between .90 and .10.
The p-value $> \alpha$. We cannot reject the null hypothesis. There is no reason to doubt the assumption of a Poisson distribution.
42
Goodness of Fit Test: Poisson Distribution
- Excel Formula Worksheet
Row  A           B             C                           D         E        F
1    Number      Observed      Expected                              Sq'd.    Sq.Diff./
2    of Arriv.   Frequency                                           Differ.  Exp.Freq.
3    0, 1, or 2  5             =POISSON(2,6,TRUE)*B12      =B3-C3    =D3^2    =E3/C3
4    3           10            =POISSON(3,6,FALSE)*B12     =B4-C4    =D4^2    =E4/C4
5    4           14            =POISSON(4,6,FALSE)*B12     =B5-C5    =D5^2    =E5/C5
6    5           20            =POISSON(5,6,FALSE)*B12     =B6-C6    =D6^2    =E6/C6
7    6           12            =POISSON(6,6,FALSE)*B12     =B7-C7    =D7^2    =E7/C7
8    7           12            =POISSON(7,6,FALSE)*B12     =B8-C8    =D8^2    =E8/C8
9    8           9             =POISSON(8,6,FALSE)*B12     =B9-C9    =D9^2    =E9/C9
10   9           8             =POISSON(9,6,FALSE)*B12     =B10-C10  =D10^2   =E10/C10
11   10 or more  10            =(1-POISSON(9,6,TRUE))*B12  =B11-C11  =D11^2   =E11/C11
12   Total       =SUM(B3:B11)                                                 =SUM(F3:F11)
13   Categories                9
14   Test Statistic            =F12
15   Degrees of Freedom        =C13-2
16   p-Value                   =CHISQ.DIST.RT(C14,C15)
43
Goodness of Fit Test: Poisson Distribution
- Excel Value Worksheet
44
Goodness of Fit Test: Normal Distribution
1. State the null and alternative hypotheses.
   $H_0$: The population has a normal distribution
   $H_a$: The population does not have a normal distribution
2. Select a random sample and
   a. Compute the mean and standard deviation.
   b. Define intervals of values so that the expected frequency is at least 5 for each interval.
   c. For each interval, record the observed frequencies.
3. Compute the expected frequency, $e_i$, for each interval. (Multiply the sample size by the probability of a normal random variable being in the interval.)
45
Goodness of Fit Test: Normal Distribution
4. Compute the value of the test statistic.
   $$\chi^2 = \sum_{i=1}^{k} \frac{(f_i - e_i)^2}{e_i}$$
5. Reject $H_0$ if $\chi^2 \ge \chi^2_\alpha$ (where $\alpha$ is the significance level and there are $k - 3$ degrees of freedom).
46
Goodness of Fit Test: Normal Distribution
- Example: IQ Computers
IQ Computers (one better than HP?) manufactures and sells a general purpose microcomputer. As part of a study to evaluate sales personnel, management wants to determine, at a .05 significance level, if the annual sales volume (number of units sold by a salesperson) follows a normal probability distribution.
47
Goodness of Fit Test: Normal Distribution
- Example: IQ Computers
A simple random sample of 30 of the salespeople was taken and their numbers of units sold are listed below.
33  43  44  45  52  52  56  58  63   64
64  65  66  68  70  72  73  73  74   75
83  84  85  86  91  92  94  98  102  105
(mean = 71, standard deviation = 18.54)
48
Goodness of Fit Test: Normal Distribution
- Hypotheses
$H_0$: The population of number of units sold has a normal distribution with mean 71 and standard deviation 18.54.
$H_a$: The population of number of units sold does not have a normal distribution with mean 71 and standard deviation 18.54.
49
Goodness of Fit Test: Normal Distribution
- Interval Definition
To satisfy the requirement of an expected frequency of at least 5 in each interval, we will divide the normal distribution into 30/5 = 6 equal probability intervals.
50
Goodness of Fit Test: Normal Distribution
- Interval Definition
[Figure: normal curve centered at 71, divided into six intervals, each with area 1.00/6 = .1667]
Interval boundaries: 53.02, 63.03 = 71 - .43(18.54), 71, 78.97, 88.98 = 71 + .97(18.54)
51
Goodness of Fit Test: Normal Distribution
- Observed and Expected Frequencies
i                  f_i  e_i  f_i - e_i
Less than 53.02    6    5    1
53.02 to 63.03     3    5    -2
63.03 to 71.00     6    5    1
71.00 to 78.97     5    5    0
78.97 to 88.98     4    5    -1
More than 88.98    6    5    1
Total              30   30
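The interval construction and frequency count for the IQ Computers example can be sketched with Python's standard `statistics.NormalDist`. This is an illustration under the deck's assumptions (mean 71, standard deviation 18.54, six equal-probability intervals); because it uses exact normal quantiles rather than z values rounded to two decimals, the boundaries differ from the slide's in the second decimal place, while the observed counts come out the same.

```python
from bisect import bisect_right
from statistics import NormalDist

units_sold = [
    33, 43, 44, 45, 52, 52, 56, 58, 63, 64,
    64, 65, 66, 68, 70, 72, 73, 73, 74, 75,
    83, 84, 85, 86, 91, 92, 94, 98, 102, 105,
]

n = len(units_sold)
k = 6  # number of equal-probability intervals (30 / 5)

# Hypothesized normal distribution with the sample mean and std. deviation
dist = NormalDist(mu=71, sigma=18.54)

# Interior boundaries at cumulative probabilities 1/6, 2/6, ..., 5/6
boundaries = [dist.inv_cdf(i / k) for i in range(1, k)]

# Count how many observations fall in each interval
observed = [0] * k
for x in units_sold:
    observed[bisect_right(boundaries, x)] += 1

expected = n / k  # same expected frequency in every interval
stat = sum((f - expected) ** 2 / expected for f in observed)

print([round(b, 2) for b in boundaries])
print(observed, expected, round(stat, 3))
```

Equal-probability intervals make every expected frequency the same (here 5), so the test statistic reduces to the sum of squared differences divided by 5.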
52
Goodness of Fit Test: Normal Distribution
- Rejection Rule
With $\alpha = .05$ and $k - p - 1 = 6 - 2 - 1 = 3$ d.f. (where $k$ = number of categories and $p$ = number of population parameters estimated), reject $H_0$ if p-value $\le .05$ or $\chi^2 \ge 7.815$.
- Test Statistic
$$\chi^2 = \frac{(1)^2}{5} + \frac{(-2)^2}{5} + \frac{(1)^2}{5} + \frac{(0)^2}{5} + \frac{(-1)^2}{5} + \frac{(1)^2}{5} = 1.600$$
53
Goodness of Fit Test: Normal Distribution
- Conclusion Using the p-Value Approach
Area in Upper Tail        .90   .10    .05    .025   .01
$\chi^2$ Value (df = 3)   .584  6.251  7.815  9.348  11.345
Because $\chi^2 = 1.600$ is between .584 and 6.251 in the Chi-Square Distribution Table, the area in the upper tail of the distribution is between .90 and .10.
The p-value $> \alpha$. We cannot reject the null hypothesis. There is little evidence to support rejecting the assumption that the population is normally distributed with $\mu = 71$ and $\sigma = 18.54$.
54
Goodness of Fit Test: Normal Distribution
- Excel Formula Worksheet
Row  A                B             C         D           E
1    Number of        Observed      Expected  Sq'd.       Sq.Differ./
2    Units Sold       Frequency               Differ.     Exp.Freq.
3    Less than 53.02  6             5         =(B3-C3)^2  =D3/C3
4    53.02 to 63.03   3             5         =(B4-C4)^2  =D4/C4
5    63.03 to 71.00   6             5         =(B5-C5)^2  =D5/C5
6    71.00 to 78.97   5             5         =(B6-C6)^2  =D6/C6
7    78.97 to 88.98   4             5         =(B7-C7)^2  =D7/C7
8    88.98 and Over   6             5         =(B8-C8)^2  =D8/C8
9    Total            =SUM(B3:B8)                         =SUM(E3:E8)
10   Categories                     6
11   Test Statistic                 =E9
12   Degrees of Freedom             =C10-3
13   p-Value                        =CHISQ.DIST.RT(C11,C12)
55
Goodness of Fit Test: Normal Distribution
- Excel Value Worksheet
56
End of Chapter 12