Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and 2000 17-1 l Chapter 17 l Chi-Squared Analysis: Testing for Patterns in Qualitative Data.

Slides:



Advertisements
Similar presentations
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 16 l Nonparametrics: Testing with Ordinal Data or Nonnormal Distributions.
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Inference about the Difference Between the
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Test of Homogeneity.
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Chapter 26: Comparing Counts
Chi-square Test of Independence
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Goodness of Fit Test for Proportions of Multinomial Population Chi-square distribution Hypotheses test/Goodness of fit test.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 15 The.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 15 Inference for Counts:
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chi-square test or c2 test
Two Way Tables and the Chi-Square Test ● Here we study relationships between two categorical variables. – The data can be displayed in a two way table.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Non-parametric tests (chi-square test) Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Copyright © 2010 Pearson Education, Inc. Slide
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Methods and Applications CHAPTER 15 ANOVA : Testing for Differences among Many Samples, and Much.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Statistics 300: Elementary Statistics Section 11-2.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Statistics 300: Elementary Statistics Section 11-3.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Qualitative data – tests of association
Chi Square Two-way Tables
Econ 3790: Business and Economics Statistics
Chapter 10 Analyzing the Association Between Categorical Variables
Overview and Chi-Square
Analyzing the Association Between Categorical Variables
Inference for Two Way Tables
Chapter Outline Goodness of Fit test Test of Independence.
Presentation transcript:

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 17 l Chi-Squared Analysis: Testing for Patterns in Qualitative Data

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Chi-Squared Tests  Hypothesis Tests for Qualitative Data Categories instead of numbers Based on counts the number of sampled items falling into each category Chi-squared statistic Measures the difference between Actual counts and Expected counts (as expected under the null hypothesis H 0 ) where the sum extends over all categories or combinations of categories Significant if the chi-squared statistic is large enough

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Summarizing Qualitative Data  Use Counts and Percentages, for Example: How many (of people sampled) prefer your product? What percentage of sophisticated product users said they would like to purchase an upgrade? What percentage of products produced today are both designed for export and imperfect

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Testing Population Percentages  Testing if Population Percentages are Equal to Known Reference Values The chi-squared test for equality of percentages Could a table of observed counts have reasonably come from a population with known percentages (the reference values)? The data: A table indicating the count for each category for a single qualitative variable The hypotheses: H 0 : The population percentages are equal to a set of known, fixed reference values H 1 : The population percentages are not equal to a set of known, fixed reference values

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Testing Percentages (continued) The expected counts: For each category, multiply the population reference proportion by the sample size, n The assumptions: 1.Data set is a random sample from the population of interest 2.At least 5 counts are expected in each category The chi-squared statistic: The degrees of freedom: Number of categories minus 1 The test result: Significant if the chi-squared statistic is larger than the critical value from the table

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Independence  Two Qualitative Variables are Independent if: knowledge about the value (i.e., category) of one variable does not help you predict the other variable  For Example: Background Sales Information The two variables are where the customer lives, and the customer’s favorite product Independence would say that customers tend to have the same pattern (distribution) of favorite products, regardless of where they live, and where customers live is not associated with which product is their favorite

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Testing for Association  The chi-squared test for independence The data: A table indicating the counts for each combination of categories for two qualitative variables The hypotheses: H 0 : The two variables are independent of one another H 1 : The two variables are associated; they are not independent The expected table: Tells you what the counts would have been, on average, if the variables were independent and there were no randomness

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Testing Association (continued) The assumptions: 1.Data set is a random sample from the population of interest 2.At least 5 counts are expected in each combination of categories The chi-squared statistic: where the sum extends over all combinations of categories The degrees of freedom: The test result: Significant if the chi-squared statistic is larger than the critical value from the table

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Example: Market Segmentation  Data: Rowing Machine Purchases Which model rowing machine was purchased? Basic, Designer, or Complete Which type of customer purchased it? Practical or Impulsive Basic Designer Complete Total Practical Impulsive Total Observed Counts Table

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Example (continued)  Overall Percentages Divide each count by the overall total, 221 e.g., 10.0% = 22/221 were practical customers who purchased basic machines Note: impulsive customers bought most designer machines, while practical customers bought most complete machines Basic Designer Complete Total Practical 10.0% 5.9% 24.4% 40.3% Impulsive 11.3% 39.8% 8.6% 59.7% Total 21.3% 45.7% 33.0% 100.0% Overall Percentages Table

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Example (continued)  Percentages by Model Divide each count by the total for its model type e.g., 22/47 = 46.8% of the 47 basic machines were purchased by practical customers Note: Practical customers make up 40.3% of all customers, but represent 74.0% of complete-machine purchasers Basic Designer Complete Total Practical 46.8% 12.9% 74.0% 40.3% Impulsive 53.2% 87.1% 26.0% 59.7% Total 100.0% Percentages by Model Table

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Example (continued)  Percentages by Customer Type Divide each count by the total for its customer type e.g., 22/89 = 24.7% of the 89 practical customers purchased basic machines Note: Of all machines, 33.0% are complete; looking only at practical- customer purchases, 60.7% are complete Basic Designer Complete Total Practical 24.7% 14.6% 60.7% 100.0% Impulsive 18.9% 66.7% 14.4% 100.0% Total 21.3% 45.7% 33.0% 100.0% Percentages by Customer Type Table

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Example (continued)  Expected Counts Multiply row total by column total, then divide by the overall total of 221 e.g., if there were no association between type of customer and type of machine, we would have expected to find 89  47/221 = basic machines purchased by practical customers Basic Designer Complete Total Practical Impulsive Total Expected Counts Table

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Example (continued)  Chi-Squared Statistic (do not use the total row or column) 66.8  Degrees of freedom (Rows – 1)(Column – 1) = (3 – 1)(2 – 1) = 2  Result If there were no association, such a large chi-squared value would be highly unlikely The association between customer type and model purchased is very highly significant (p < 0.001)