The Chi-Square Distribution
Preliminary Idea Sum of n values of a random variable
Sum of Squares of random numbers
The distribution 1.It is called the chi-square distribution. 2.“Chi” rhymes with “High” – and the “ch” is pronounced like “k”. 3.It is a continuous random variable. 4.It has n – 1 degrees of freedom 5.It’s values are non-negative (i.e. ≥ 0) 6.It is always skewed to the right. 7.It becomes more symmetrical as n increases 8.It approximates a normal distribution for large values of n
Two Chi-square distributions
The sample variance s 2 follows a chi-square distribution
Standardizing the Test Statistic In a test of hypothesis for a population variance σ 2, the test statistic is the sample variance s 2. The standardized test statistic is denoted by and is defined by: Note: The standardized values are found in the standard chi-square tables on page 7 in the Formulas and Tables handout.
Chi-square table characteristics The chi-square tables are not symmetrical. Therefore lower-tail values and upper-tail values must be listed separately. In the extract of the chi-square tables shown in the next slide, lower-tail areas are shaded in yellow, upper tail areas are shaded in blue.
Chi-square table (Page 7 in Formulas & Tables) df
Chi-square table Examples
Two-Tail Test of Hypothesis
Lower Tail Test of Hypothesis
Upper Tail Test of Hypothesis
Example
The test of hypothesis
The F distribution
Comparison of Two Population variances We want to test the hypothesis that two population variances are equal, i.e. We need to rewrite the null and alternative hypotheses so that we can use a single value to represent the test statistic.
Ratio of Variances The null and alternative hypotheses are converted to the following form.
The Test Statistic A natural candidate to be the test statistic for the ratio of two population variances is the ratio of the corresponding sample variances
The F-distribution Statisticians have shown that the ratio of two chi-square variables follows a new distribution known as the F-distribution.
Extract of F-tables (1-α=.95) The F-distribution with 1 - α =.95 Denominatornumerator df df
F-distribution examples F(.95;4,9) = 3.63 F(.95;8,3) = 8.85 F(.99;15,20) = 3.09 F(.99;40,30) = 2.30
Ratio of Variances We have already seen that for a sample of size n the sample variance has a χ 2 distribution with n - 1 degrees of freedom. It follows that the ratio of two variances
Test of Hypothesis for two variances
One-Tail Tests For Lower Tail Tests: A = F( ; n 1 - 1; n 2 - 1) For Upper Tail Tests: A = F(1 - ; n 1 - 1; n 2 - 1).
Formula for Lower Tail F-values Since the lower tail F-values are not given in the table we must use the formulas:
Examples of Lower tail F-values F(.05;5,9)= 1/F(.95;9,5) = 1/4.77 = F(.05;7,4)= 1/F(.95;4,7) = 1/4.12 =
EXAMPLE The production manager of a textile company wants to test the hypothesis that the mean cost of producing a polyester fabric is the same for two different production processes. Assume that production costs are normally distributed for both processes. Random samples of production costs for several production runs using the two different production processes are as follows: Test the hypothesis that the two population variances are equal with a 2% level of significance. Process I $20$15$20$23$24$21 Process II $27$19$41$30$16
Sample Data Pop 1Pop 2 Sample size n 1 = 6n 2 = 5 Mean Variance
Testing the Hypothesis