8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.

Slides:



Advertisements
Similar presentations
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Bivariate Analysis Cross-tabulation and chi-square.
One-sample T-Test of a Population Mean
Statistical Inference for Frequency Data Chapter 16.
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Assumption of normality
Chapter 13: The Chi-Square Test
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Test of Independence.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Chi-square Test of Independence
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Multiple Regression – Basic Relationships
8/2/2015Slide 1 SPSS does not calculate confidence intervals for proportions. The Excel spreadsheet that I used to calculate the proportions can be downloaded.
Assumption of Homoscedasticity
SW388R6 Data Analysis and Computers I Slide 1 One-sample T-test of a Population Mean Confidence Intervals for a Population Mean.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
SW388R6 Data Analysis and Computers I Slide 1 Chi-square Test of Goodness-of-Fit Key Points for the Statistical Test Sample Homework Problem Solving the.
Stepwise Binary Logistic Regression
Sampling Distribution of the Mean Problem - 1
8/20/2015Slide 1 SOLVING THE PROBLEM The two-sample t-test compare the means for two groups on a single variable. the The paired t-test compares the means.
8/23/2015Slide 1 The introductory statement in the question indicates: The data set to use: GSS2000R.SAV The task to accomplish: a one-sample test of a.
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
Chapter 11(1e), Ch. 10 (2/3e) Hypothesis Testing Using the Chi Square ( χ 2 ) Distribution.
Inferential Statistics: SPSS
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of Homoscedasticity Homoscedasticity (aka homogeneity or uniformity of variance) Transformations.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
9/23/2015Slide 1 Published reports of research usually contain a section which describes key characteristics of the sample included in the study. The “key”
Chi-Square Test of Independence Practice Problem – 1
Chi-square Test of Independence Steps in Testing Chi-square Test of Independence Hypotheses.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. 1.. Section 11-2 Goodness of Fit.
Two Way Tables and the Chi-Square Test ● Here we study relationships between two categorical variables. – The data can be displayed in a two way table.
Chapter 16 The Chi-Square Statistic
As shown in Table 1, the groups differed in terms of language skills and the type of job last held. The intake form asked the client to indicate languages.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
6/2/2016Slide 1 To extend the comparison of population means beyond the two groups tested by the independent samples t-test, we use a one-way analysis.
SW388R6 Data Analysis and Computers I Slide 1 Independent Samples T-Test of Population Means Key Points about Statistical Test Sample Homework Problem.
6/4/2016Slide 1 The one sample t-test compares two values for the population mean of a single variable. The two-sample t-test of population means (aka.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
11/16/2015Slide 1 We will use a two-sample test of proportions to test whether or not there are group differences in the proportions of cases that have.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
11/19/2015Slide 1 We can test the relationship between a quantitative dependent variable and two categorical independent variables with a two-factor analysis.
Chi-square Test of Independence
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
SW388R6 Data Analysis and Computers I Slide 1 Percentiles and Standard Scores Sample Percentile Homework Problem Solving the Percentile Problem with SPSS.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
1/5/2016Slide 1 We will use a one-sample test of proportions to test whether or not our sample proportion supports the population proportion from which.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Leftover Slides from Week Five. Steps in Hypothesis Testing Specify the research hypothesis and corresponding null hypothesis Compute the value of a test.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi-Square Analyses.
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a goodness-of-fit.
Hypothesis Testing Review
Hypothesis Testing Using the Chi Square (χ2) Distribution
Chapter 10 Analyzing the Association Between Categorical Variables
Presentation transcript:

8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases that fall in each category, and work with those counts as numbers, percentages, probabilities, or odds. Hypothesis tests for categorical variables have the same limitation – they can only compare counts, percentages, probabilities, or odds. The chi-square test of goodness-of-fit is used to test whether or not the distribution of a categorical variable fits a particular pattern. Specifically, the goodness-of-fit test compares the actual or observed frequency for a categorical variable to a hypothesized distribution, referred to as the expected frequencies. The pattern of expected frequencies can be based on: a distribution from previous research, e. g., does the breakdown of the variable in my study match what is found in previous studies; a conceptual distribution of the way we think events should occur, e.g. are the number of referrals to an agency consistent across months; a theoretical distribution, e.g. are the frequencies in my distribution equivalent to what we would expected for a variable that is normally distributed. The comparison of actual to expected frequencies assumes there will be differences due to sampling variation. The statistical question asks whether the differences are large enough to make it unlikely that they represent the same distribution in the population.

8/15/2015Slide 2 Goodness of fit defines the degree to which two distributions are similar. The null hypothesis for the test is: the actual frequencies from our sample are equal to the expected frequencies. The alternative hypothesis specifies a difference similar to a two-tailed research hypothesis: the actual frequencies are not equal to the expected frequencies. To compute the expected frequency counts, we need a percentage breakdown of the categories in the distribution we want to compare to, e. g. 20% in category 1, 30% in category 2, 15% in category 3, etc. The expected frequencies are the percentage in each category multiplied by the total number of valid cases for the variable. For example, if I had 50 cases in my sample to compare to the percentage breakdown above, my expected frequencies would be 50 x 20% = 10 for category 1, 50 x 30% = 15 for category 2, 50 x 15% = 7.5 for category 3, etc. Note that expected frequencies can be decimal fractions, even though actual counts would always be whole numbers. The chi-square statistic is based on the difference between actual and expected counts divided by a measure of standard error, which is this case is the expected frequency. The statistic is computed by summing across all categories.

8/15/2015Slide 3 The chi-square statistic goodness-of-fit test derives its name from the chi-square distribution, which is the theoretical distribution used to derive probabilities. The chi-square distribution can take on different shapes according to the degrees of freedom in the analysis. Fortunately, SPSS can easily compute both the degrees of freedom and the probability as the p-value or sig value that we use for making statistical decisions. The chi-square distribution provides accurate probabilities providing the distribution meets the minimal requirement that the expected frequency for each cell is 5 or more. There are more complicated versions of this rule (e.g. expected frequencies of 5 or more cases in 80% of the cells and no cells with an expected frequency less than 1) but we will apply the rule of expected frequencies of 5 or more per cell, or category, for the frequency distribution. The chi-square distribution is also affected by very large samples. Since the statistic is based on differences in counts, doubling the sample size doubles the size of the statistic. Since degrees of freedom is based on the number of categories in the distribution, the probability will be smaller (more significant) as the sample size increases. This is demonstrated on the next slide.

8/15/2015Slide 4 Finally, we note that, similar to our other hypothesis tests, the interpretation of the results of the statistical test is applied to the population from which our sample is drawn. The sample size is doubled from the first table to the second, and from the second table to the third, though each table contains the same percentage breakdown in the three categories. The size of the chi-square statistic in the second table is double that for the first table, and the chi-square statistic in the third table is double the size of the statistic for the second table.

8/15/2015Slide 5 The introductory statement in the question indicates: The data set to use: GSS2000R.SAV The task to accomplish: a chi-square test of goodness-of-fit The variable to use in the analysis respondent's degree of religious fundamentalism [fund] The percentage breakdown from previous research: 21.3% in the category 'fundamentalist', 30.8% in the category 'moderate' and 47.8% in the category 'liberal' The alpha level of significance for the hypothesis test: 0.05

8/15/2015Slide 6 Since the SPSS chi-square goodness-of-fit test requires that the breakdown be entered by count rather than percentage, the second paragraph of the directions gives you the counts to use for each category.

8/15/2015Slide 7 The first statement asks about the level of measurement. The chi-square test of goodness- of-fit requires that the variable be categorical.

8/15/2015Slide 8 "Respondent's degree of religious fundamentalism" [fund] is ordinal level, which can be treated as a categorical variable. Mark the statement as correct.

8/15/2015Slide 9 The next statement asks about the sample size. The chi-square test of goodness-of-fit requires that all cells have an expected frequency of five or more. To answer this question, we need to compute the chi-square test of goodness-of-fit.

8/15/2015Slide 10 To compute the chi-square test of goodness-of-fit, select the Nonparametric Tests > Chi-Square from the Analyze menu.

8/15/2015Slide 11 First, move the variable fund to the Test Variable List. Second, click on the Values option button to signal that we will enter the expected frequencies in the dialog box. Third, type the count for the first category, 54, in the text box. The counts must be entered in the correct order or the test will produce incorrect results. Fourth, click on the Add button to add the 54 to the list of expected values.

8/15/2015Slide 12 First, type the count for the second category, 78, in the text box. Second, click on the Add button to add the 78 to the list of expected values.

8/15/2015Slide 13 First, add the count for the third category, 121, to the list of expected values. Second, click on the OK button to produce the output.

8/15/2015Slide 14 We can use the Expected N column of the frequency table to verify that we entered the expected frequencies correctly. The critical piece of information about sample size is found in the footnote to the Test Statistics table. 0 cells had an expected frequency less than 5.

8/15/2015Slide 15 The expected frequency for all cells was equal to or greater than 5. The sample size requirement is satisfied. Mark the check box for a correct answer.

8/15/2015Slide 16 The next statement asks us what the null hypothesis for the test states.

8/15/2015Slide 17 The null hypothesis for the test is stated correctly: the actual frequencies are equal to the expected frequencies. Mark the check box as correct.

8/15/2015Slide 18 The next statement asks us to relate the chi-square test to the data in our problem.

8/15/2015Slide 19 Mark the statement as correct. The chi-square statistic is based on the differences between the actual frequencies and the expected frequencies relative to the expected frequencies.

8/15/2015Slide 20 The next statement asks us about the probability (p-value or sig. value) for the chi-square test of goodness-of-fit.

8/15/2015Slide 21 The probability that the actual frequencies are equal to the expected frequencies was p <.001.

8/15/2015Slide 22 The probability that the actual frequencies are equal to the expected frequencies was p <.001. Mark the check box for a correct answer.

8/15/2015Slide 23 The next statement asks about the statistical decision or conclusion that we would make based on the p-value.

8/15/2015Slide 24 When the p-value for the statistical test is less than or equal to alpha, we reject the null hypothesis and interpret the results of the test. If the p-value is greater than alpha, we fail to reject the null hypothesis and do not interpret the result. The p-value for this test (p <.001) is less than or equal to the alpha level of significance (p =.050) supporting the conclusion to reject the null hypothesis. Mark the check box for a correct answer.

8/15/2015Slide 25 The final statement asks us to interpret the results of the statistical test.

8/15/2015Slide 26 Since we rejected the null hypothesis, we know that the breakdown of cases in our sample is different from that found in previous research. We can interpret this difference as indicating that the sample used in this analysis is probably not from the sample population reported in previous research, or some event has altered the breakdown of the cases. We mark the check box for a correct answer. When we do not reject the null hypothesis, we do not interpret the results.

8/15/2015Slide 27 Variable is categorical? Yes Do not mark check box. Mark statement check box. No Mark only “None of the above.” Stop. Compute chi-square goodness-of-fit test in SPSS All cells have expected frequency of 5+? Do not mark check box. No Stop. Yes Mark statement check box.

8/15/2015Slide 28 Yes Do not mark check box. No Mark statement check box. H 0 : actual frequencies = expected frequencies Chi-square test accurately described? Yes Do not mark check box. No Mark statement check box.

8/15/2015Slide 29 Yes P-value (sig.) stated correctly? Yes Do not mark check box. Mark statement check box. No Do not mark check box. No Mark statement check box. Reject H 0 is correct decision (p ≤ alpha)? Stop. We interpret results only if we reject null hypothesis.

8/15/2015Slide 30 Interpretation is stated correctly? Yes Do not mark check box. Mark statement check box. No