Analyze Improve Define Measure Control L EAN S IX S IGMA L EAN S IX S IGMA Chi-Square Analysis Chi-Square Analysis Chi-Square Training for Attribute Data.

Slides:



Advertisements
Similar presentations
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Advertisements

CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Hypothesis Testing IV Chi Square.
Chapter 11 Inference for Distributions of Categorical Data
Chapter 13: Inference for Distributions of Categorical Data
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Chi-square Test of Independence
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Presentation 12 Chi-Square test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Lesson Inference for Two-Way Tables. Vocabulary Statistical Inference – provides methods for drawing conclusions about a population parameter from.
AM Recitation 2/10/11.
The Chi-Square Distribution 1. The student will be able to  Perform a Goodness of Fit hypothesis test  Perform a Test of Independence hypothesis test.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 15 Inference for Counts:
Goodness-of-Fit Tests and Categorical Data Analysis
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 10 Inferring Population Means.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Chapter 26 Chi-Square Testing
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
+ Chi Square Test Homogeneity or Independence( Association)
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
CHI SQUARE TESTS.
Copyright © 2010 Pearson Education, Inc. Slide
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter Outline Goodness of Fit test Test of Independence.
Dan Piett STAT West Virginia University Lecture 12.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Lesson Inference for Two-Way Tables. Knowledge Objectives Explain what is mean by a two-way table. Define the chi-square (χ 2 ) statistic. Identify.
© Copyright McGraw-Hill 2004
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a goodness-of-fit.
Presentation 12 Chi-Square test.
Vocabulary Statistical Inference – provides methods for drawing conclusions about a population parameter from sample data Expected Values– row total *
Chapter 12 Tests with Qualitative Data
Data Analysis for Two-Way Tables
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Analyzing the Association Between Categorical Variables
Presentation transcript:

Analyze Improve Define Measure Control L EAN S IX S IGMA L EAN S IX S IGMA Chi-Square Analysis Chi-Square Analysis Chi-Square Training for Attribute Data

Lean Six Sigma Bonacorsi Consulting 2 What Is A Chi-Square Test?  What is a Chi-Square?  The probability density curve of a chi-square distribution is asymmetric curve stretching over the positive side of the line and having a long right tail.  The form of the curve depends on the value of the degrees of freedom.

Lean Six Sigma Bonacorsi Consulting 3 Types of Chi-Square Tests?  Types of Chi-Square Analysis:  Chi-square Test for Association is a (non-parametric, therefore can be used for nominal data) test of statistical significance widely used bivariate tabular association analysis. Typically, the hypothesis is whether or not two different populations are different enough in some characteristic or aspect of their behavior based on two random samples. This test procedure is also known as the Pearson chi-square test.  Chi-square Goodness-of-fit Test is used to test if an observed distribution conforms to any particular distribution. Calculation of this goodness of fit test is by comparison of observed data with data expected based on the particular distribution.

Lean Six Sigma Bonacorsi Consulting 4 The Basics When to apply a Chi-Squared Test:  Chi-Squared test is used to determine if there is a statistically significant difference in the proportions for different groups. To accomplish this, it breaks all outcomes into groups. What the Chi-Squared Test does:  It starts by determining how many defects, for example, would be “expected” in each group involved.  It does this by assuming that all groups have the same defect rate (which Minitab approximates from the data provided).  Minitab then compares the expected counts with what was actually observed.  If the numbers are different by a large enough amount, Chi-Square determines that the groups do not have the same proportion.

Lean Six Sigma Bonacorsi Consulting 5 The Basics (Cont.) Chi-Square Requirements:  Data is typically attribute (discrete). At the very least, all data must be able to be categorized as being in some category or another).  Expected cell counts should not be low (definitely not less than 1 and preferable not less than 5) as this could lead to a false positive indication that there is a difference when, in fact, none exists. Chi-Square Hypotheses:  Ho: The null hypotheses (P-Value > 0.05) means the populations have the same proportions.  Ha: The alternate hypotheses (P-Value <= 0.05) means the populations do NOT have the same proportions.

Lean Six Sigma Bonacorsi Consulting 6 Using Minitab – Step 1 Enter Category Names Enter the Defect Qty for the Data Collection Period Enter the Success Qty for the Data Collection Period Step 1: Enter data into Minitab

Lean Six Sigma Bonacorsi Consulting 7 Using Minitab – Step 2 Step 2: Choose Stat, then Tables, then Chi-Squared Test (Two-Way Table in Worksheet…)

Lean Six Sigma Bonacorsi Consulting 8 Using Minitab – Step 3 Step 3: Enter the subgroup data to compare Determine the subgroups you want to compare for variation. In this example we examine the variation between success (C2) & Defect (C3) for each Engineering Change Proposal (ECP) Enter category names you want to compare Click help for Minitab help on Chi-Square Choose Ok to run Chi- Squared Test Enter category names you want to compare

Lean Six Sigma Bonacorsi Consulting 9 The Session Window Results Observed counts Expected counts Sum of each category (Machine) Sum of all observed counts Degree of Freedom (df = n-1) The Chi-Square test calculates using the formula ( ) 2 / = Since the P-Value is <= 0.05, there is a 95% chance that there IS a statistically significant difference, in other words the groups (Machines) involved do NOT have the same defect rate. Machine #5

Lean Six Sigma Bonacorsi Consulting 10 Chi-Square Statistic The series of numbers are added together for the overall chi- square statistic. When the value of the Chi-square statistic is high enough, the differences between the groups are concluded to be too large to be mere chance and the groups are determined to be truly different.

Lean Six Sigma Bonacorsi Consulting 11 Errors in the Chi-Square Note: if the expected cell counts are below 5, Minitab will print a warning. The warning is generated because of the fact that with the expected count in the denominator, a small value potentially creates an artificially large chi- square statistic. This is particularly troublesome if more than 20% of the cells have expected counts less than 5 and the contribution to the overall chi-square statistic is considerable. Additionally, if any of the expected cell counts are below 1, Minitab will not even produce a p-value since the chi-square statistic is sure to be artificially inflated. In either of these cases, the binomial distribution (Minitab: Stat/ ANOVA/ Analysis of Means) may be able to be used. Lastly: Attribute Gage R&R (AR&R) or Kappa Test is needed with an acceptable level of measurement system error prior to running a Chi-Square Analysis

Lean Six Sigma Bonacorsi Consulting 12 Lets See How You Do Example test questions for Analyze 1. Consider a certain operation which produced 100 documents on line 1, 200 documents on line 2, and 300 documents on line 3. Additionally, it is known that group 1 produced 17 defective documents, group 2 produced 8 defective documents, and group 3 produced 11 defective documents. If a chi-square test were done on this data, what would be the expected number of defects seen by group 3 if the null hypothesis were completely true? a) 24 b) 18 c) 12 d) 6 2. Using the same data set, is this data statistically significant? Yes or No Enter data in to Minitab like this: Group Defective Acceptable (group 1)1783 ( group 2) ( group 3) Note: Answers are located in the note section of this slide

Lean Six Sigma Bonacorsi Consulting 13 Tips and Tricks Tips: Determine the subgroups and categories to be tested for variation (differences in proportions) as part of your data collection plan. Define the operational definitions for success/defect, the stratifications layers (subgroups) and the Cause & Effect diagram (fishbone) to pre-determine where the team believes differences in proportions may exist. Continuous (Variable) data can usually be converted into Discrete (Attribute) data by using categories (Example: cycle time (continuous 1 hr, 1.5 hr, 2 hr) can be categorized into Cycle Time Met = 1 where success is cycle time 8 hrs.) Tricks An (MSA) Attribute R&R (Kappa Analysis) for discrete data or Gage R&R for continuous (variable) data is used prior to calculating the Chi-Square Test to ensure that the measurement variation 10% then the variation you will see in the Chi-Square Test is not valid as too much of the variation seen is coming from your measurement system (10% MSA error) and not your process variation.

Analyze Improve Define Measure Control L EAN S IX S IGMA L EAN S IX S IGMA Chi-Square Analysis Chi-Square Analysis Appendix A Backup Slides

Lean Six Sigma Bonacorsi Consulting 15 Chi-Square (X2) Distribution Row for 95% confidence Degree of Freedom (df) = n-1 Machine example for Success or Defect per Month where we had n = 10 (Machines) Df = 10 – 1 = 9. Machine Example Summary Chi-Sq = = DF = 9, P-Value = The sum of the Chi-Squared Test ( would need to be > where df = 9 To be 95% confident of a difference, the P- Value <= 0.05 Chi Squared Distribution table is used by Minitab to calculate the P-Value using the Degree of Freedom (df) and the variation between each category’s data sets being compared for differences. The sum of the Chi- Squared Test ( would need to be > where df = 9) for Ho Degree of Freedom (df) = n-1

Lean Six Sigma Bonacorsi Consulting 16 Chi-Square (for Two-Way) * For expected frequencies less than 5 the calculated chi-square value changes dramatically as f e changes. Therefore the calculated value becomes less reliable and should be interpreted cautiously. H o : Independent (There is no relationship between the populations) H a : Dependent (There is a relationship between the populations) Rejection Criteria: Fail to reject H o when p  0.05; Accept H a when p < 0.05 Compare a calculated value to a critical values from a  2 table. Rejection Criteria: Fail to reject H o when p  0.05; Accept H a when p < 0.05 Compare a calculated value to a critical values from a  2 table. Chi-Square equation for two-way tables: Where: f 0 = observed frequency f e = expected frequency r = number of rows c = number of columns g = number of groups = r * c degrees of freedom = (r-1) * (c-1) All f e ’s should be > 5 * e  = (f e j = 1 g 2 f o f )  2

Lean Six Sigma Bonacorsi Consulting 17 Bonacorsi Consulting Steven Bonacorsi, President Lean Six Sigma Master Black Belt 14 Clinton Street Salem NH