Download presentation
Presentation is loading. Please wait.
Published byChester O’Neal’ Modified over 9 years ago
1
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on nominal scales — that is, the data are in the form of frequencies for different categories (e.g., How many consumers studied preferred each of the five most popular brands of toothpaste in the U.S.? How many professors are female and how many are male in 10 major academic fields in China?). You would need a multinomial distribution to determine the exact probabilities associated with random choices when dealing with more than two categories. It is much easier to use an approximation, instead. A chi-square statistic is calculated, which follows, approximately, the well-known chi-square distribution, when the null hypothesis is true. Chapter 20: Chi-Square Tests
2
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 2 The Chi-Square (χ 2 ) Distribution –This distribution can be used to represent the null hypothesis, and is therefore used to find the appropriate critical values for chi-square tests. –Its shape depends on the number of degrees of freedom associated with the table of data. –Because χ 2 can’t be less than 0, all chi-square distributions are positively skewed, though the skewing decreases as the df get larger. –With infinite df, the chi-square distribution becomes identical to the normal distribution. Similar to the way the F distribution is used for ANOVA, only one tail of the chi-square distribution is used to determine the statistical significance of a chi-square test. –Differences larger than expected by chance lead to χ 2 values in the positive tail; unusually small values are in the smaller, negative tail, near zero. –Only large discrepancies from chance expectations are inconsistent with the null hypothesis.
3
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 3 One-Way Chi-Square Tests “One way” means that all categories are considered levels of the same factor. The df equal one less than the number of categories (i.e., k – 1) Sometimes referred to as a “goodness of fit” test Types of One-Way Χ 2 Tests 1.Expected frequencies are hypothesized to be equal Games of chance (with equal frequencies of outcomes), or Testing for equal preferences among a set of categories The expected frequency for each category is the total number of obtained responses (N) divided by the number of categories (k)
4
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 4 Types of One-Way Chi- Square Tests (cont.) 2. Population proportions are known Sometimes we have good estimates of population proportions. –# of voters registered in different political parties. –% of population at different income levels. 3.The shape of a distribution is being tested E.g., the distribution of annual income in a sample. The expected frequencies are generated by the appropriate normal distribution. Rejection of the null implies, if the sample were truly random, that the underlying population is not normally distributed. If the population were known to be normal, rejection of the null would imply that either: –The sample is not a random one, or –The sample has not been drawn from the hypothesized population
5
Chapter 20For Explaining Psychological Statistics, 4th ed. by B. Cohen 5 Types of One-Way Chi- Square Tests (cont.) 4.A theoretical model is being tested Sometimes a well-formulated theory can make quantitative predictions in terms of frequencies expected in different categories. In such a case, you will want your observed frequencies to be as close as possible to the theoretically expected frequencies. For this type of chi-square test, you do not want to reject the null hypothesis, because the expected frequencies come from your research hypothesis.
6
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 6 Try This Example… Is one of the following three energy drinks preferred in the U.S. more often than chance? N = 90: –df = k – 1; where k is the # of categories –χ 2 crit increases as alpha decreases –χ 2 crit increases as df increases Red BullStarbucks Espresso 5 Hour Energy fofo 382824 fefe 30
7
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 7 Answer to the Example: The df = k – 1 = 3 – 1 = 2. The critical value is Х 2.05 (2) = 5.99. 3.467 < 5.99, so we cannot reject the null hypothesis of equal prefer- ence in the population at the.05 level.
8
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 8 The Relationship Between the Binomial Test and the Chi-Square Test With Two Categories –With two categories, the null hypothesis can be tested by using the binomial distribution, yielding a z score, or by performing a one-way χ 2 Test. The χ 2 statistic will equal the square of the z score. –Squaring the normal distribution yields a chi-square distribution with one degree of freedom.
9
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 9 Two-Way Chi-Square Tests The most interesting psychological questions involve the relationship between at least two variables rather than the distribution of just one variable. The two-way chi-square test is appropriate for quantifying the relationship between two categorical variables. It is often referred to as Pearson’s Chi-Square Test of Association (or Independence). The Null Hypothesis (H 0 ) for the two-way test is: There is no association between the two variables; that is, the way one of the variables is distributed into categories does not change at different levels of the second variable. The Alternative Hypothesis (H A ) is simply: The null hypothesis is not true; that is, the two variables are not independent.
10
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 10 Contingency Tables –The data for a two-way chi-square test are usually arranged in a two-way contingency table, also known as a cross-classification table. (Therefore, the data are often referred to as cross- classified categorical data.) –The df for a two-way chi-square test are equal to: (R – 1)(C – 1), where R = the number of rows in the table, and C = the number of columns. The following is an example of a two-way conting- ency table for a study investigating the relationship between musical ability (high or low), and confidence (high, medium, or low) in 10-year-old children. HighMediumLow MusicalHigh 17 32 1160 AbilityLow 13 43 3490 3075 45150
11
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 11 Fill in the expected frequencies in the empty parentheses in the table below for the musical ability / confidence example, using the formula shown below the table (where N is the total of all the frequencies in the table). After all the f e ’s are found, the two-way chi-square statistic is calculated using the same formula as in the one-way case, applied to each cell of the table. Confidence HighMediumLow High17 ( )32 ( )11 ( )60 Low13 ( )43 ( )34 ( )90 307545150 Try This Example…
12
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 12 The df = (3 – 1) (2 – 1) = 2 The critical value is Х 2.05 (2) = 5.99. 8.23 > 5.99, so we can reject the null hypothesis that musical ability is unrelated to levels of confidence in 10-year-old children. Confidence HighMediumLow High7 (12)32 (30)11 (18)60 Low13 (18)43 (45)34 (27)90 307545150 Answer to the Example:
13
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 13 Simplified Formula for the 2 x 2 Case where the frequencies of the 2 x 2 table are labeled as: AB CD Strength of Association –Can also be called effect size For a 2 x 2 contingency table, it is appropriate to calculate Pearson’s r for the two variables, but first you have to assign arbitrary values (e.g., 1 and 2) to the two levels of each variable. Pearson’s r for two dichotomous variables is called the phi coefficient, and it can be obtained directly from the chi-square statistic, as shown next.
14
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 14 Phi (φ) Coefficient (can also be called the fourfold point correlation) Measure of Effect Size for the 2 x 2 case only: –Because the square root is positive, the value of φ ranges between 0 and +1.0. –The square of φ is a measure of the proportion of variance accounted for in one variable by the other. Cramer’s V or phi (φ C ) –For contingency tables larger than 2 x 2 –L = # of rows or columns, whichever is smaller, or either if both are the same –Ranges from 0 to +1.0
15
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 15 Assumptions of the χ 2 Test 1. The categories are mutually exclusive and exhaustive (all cases fall into one and only one category). 2. Independence of observations: This assumption is usually violated when the same subject is categorized more than once. A violation of this assumption seriously undermines the validity of the test. 3. Minimal size of expected frequencies: The use of the chi-square distribution as an approximation becomes inaccurate if the f e ’s are too low. Strict rule: f e for each cell should be at least 5 when df > 1; 10 when df = 1 Less strict: no f e < 1, and no more than 20% of the f e s < 5
16
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 16 Some Uses for the χ 2 Test for Independence –It can measure the strength of the relationship between two categorical variables. –It can be used for a study with a quantitative DV, originally designed to be analyzed with a t test or ANOVA. If the distribution of the DV is very far from normal, and N is not very large, it can be desirable to transform the DV to a few distinct categories (e.g., annual income $80K). Some power is lost by throwing away most of the quantitative information, but the validity of the chi-square test does not depend on making an assumption about the distribution of the DV.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.