Lecture 8 Chi-Square STAT 3120 Statistical Methods I.

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

An explanation of the Chi-Square Test for Independence Jeffrey Marks Bhavisha Talsania California State University San Marcos.
Essentials of Applied Quantitative Methods for Health Services Managers Class Slides.
Chi Square Example A researcher wants to determine if there is a relationship between gender and the type of training received. The gender question is.
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
Chubaka Producciones Presenta :.
2012 JANUARY Sun Mon Tue Wed Thu Fri Sat
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Chi Square Test Dealing with categorical dependant variable.
Chi-square Test of Independence
Crosstabs and Chi Squares Computer Applications in Psychology.
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
Analyzing Data: Bivariate Relationships Chapter 7.
Presentation 12 Chi-Square test.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Lecture 8 Chi-Square STAT 3120 Statistical Methods I.
Using SPSS for Chi Square UDP 520 Lab 5 Lin November 8 th, 2007.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.7.
Lecture 6 Correlation and Regression STAT 3120 Statistical Methods I.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Quantitative Methods Partly based on materials by Sherry O’Sullivan Part 3 Chi - Squared Statistic.
Two Way Tables and the Chi-Square Test ● Here we study relationships between two categorical variables. – The data can be displayed in a two way table.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
WORD JUMBLE. Months of the year Word in jumbled form e r r f b u y a Word in jumbled form e r r f b u y a february Click for the answer Next Question.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chi Square Classifying yourself as studious or not. YesNoTotal Are they significantly different? YesNoTotal Read ahead Yes.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Practice You collect data from 53 females and find the correlation between candy and depression is Determine if this value is significantly different.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Remember Playing perfect black jack – the probability of winning a hand is.498 What is the probability that you will win 8 of the next 10 games of blackjack?
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
Dan Piett STAT West Virginia University Lecture 12.
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
ContentFurther guidance  Hypothesis testing involves making a conjecture (assumption) about some facet of our world, collecting data from a sample,
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
2011 Calendar Important Dates/Events/Homework. SunSatFriThursWedTuesMon January
Chi-Square INCM Chi Square When presented with categorical data, one common method of analysis is the “Contingency Table” or “Cross Tab”. This is.
Lesson 12 - R Chapter 12 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chapter 16: Analysis of Categorical Data. LO1Use the chi-square goodness-of-fit test to analyze probabilities of multinomial distribution trials along.
Chapter 12 Chi-Square Tests and Nonparametric Tests.
July 2007 SundayMondayTuesdayWednesdayThursdayFridaySaturday
Statistics 300: Elementary Statistics Section 11-3.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Business Statistics, 4e by Ken Black
Chi-square test or c2 test
Hypothesis Testing Review
Qualitative data – tests of association
Bivariate Testing (Chi Square)
The Chi-Square Distribution and Test for Independence
Bivariate Testing (Chi Square)
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Chi Square Two-way Tables
Statistical Analysis Chi-Square.
Data Analysis Module: Chi Square
Chapter 26 Comparing Counts.
CLASS 6 CLASS 7 Tutorial 2 (EXCEL version)
2015 January February March April May June July August September
Presentation transcript:

Lecture 8 Chi-Square STAT 3120 Statistical Methods I

STAT3120 – Chi Square Dependent Variable Independent (predictor) Variable Statistical Test Comments QuantitativeCategoricalT-TEST (one, two or paired sample) Determines if categorical variable (factor) affects dependent variable; typically used for experimental or planned change studies Quantitative Correlation /Regression Analysis Test establishes a regression model; used to explain, predict or control dependent variable Categorical Chi-SquareTests if variables are statistically independent (i.e. are they related or not?)

STAT3120 – Chi Square When presented with categorical data, one common method of analysis is the “Contingency Table” or “Cross Tab”. This is a great way to display frequencies - For example, lets say that a firm has the following data: 120 male and 80 female employees 40 males and 10 females have been promoted

STAT3120 – Chi Square Using this data, we could create the following 2x2 matrix: PromotedNot PromotedTotal Male Female Total

STAT3120 – Chi Square Now, a few questions… 1)From the data, what is the probability of being promoted? 2)Given that you are MALE, what is the probability of being promoted? 3)Given that you are promoted, what is the probability that you are MALE? 4)Given that you are FEMALE, what is the probability of being promoted? 5)Given that you are promoted, what is the probability that you are female?

STAT3120 – Chi Square The answers to these questions help us start to understand if promotion status and gender are related. Specifically, we could test this relationship using a Chi- Square. This is the test used to determine if two variables are related. The relevant hypothesis statements for a Chi-Square test are: H0: Variable 1 and Variable 2 are NOT Related Ha: Variable 1 and Variable 2 ARE Related Develop the appropriate hypothesis statements and testing matrix for the gender/promotion data.

STAT3120 – Chi Square The Chi-Square Test uses the Χ 2 test statistic, which has a distribution that is skewed to the right (it approaches normality as the number of obs increases). You can see an example of the distribution on pg 641. The Χ 2 test statistic calculation can be found on page 640. The observed counts are provided in the dataset. The expected counts are the counts which would be expected if there was NO relationship between the two variables.

STAT3120 – Chi Square PromotedNot PromotedTotal Male Female Total Going back to our example, the data provided is “observed”: What would the matrix look like if there was no relationship between promotion status and gender? The resulting matrix would be “expected”…

STAT3120 – Chi Square From the data, 25% of all employees were promoted. Therefore, if gender plays no role, then we should see 25% of the males promoted (75% not promoted) and 25% of the females promoted… PromotedNot PromotedTotal Male 120*.25 = 30120*.75 = Female 80*.25 = 2080*.75 = Total Notice that the marginal values did not change…only the interior values changed.

STAT3120 – Chi Square Now, calculate the X 2 statistic using the observed and the expected matrices: ((40-30) 2 /30)+((80-90) 2 /90)+((10-20) 2 /20)+((70- 60) 2 /60) = = This is conceptually equivalent to a t-statistic or a z-score.

To determine if this is in the rejection region, we must determine the df and then use the table on page 732. Df = (r-1)*(c-1)… In the current example, we have two rows and two columns. So the df = 1*1 = 1. At alpha =.05 and 1df, the critical value is 3.84…our value of is clearly in the reject region…so what does this mean? STAT3120 – Chi Square

From the book Outliers, Malcolm Glidewell makes the point that the month in which a boy is born will determine his probability of playing in the NHL. The months of birth for players in the NHL are on the next page… (data taken from ge=merron/081208)

January51 February46 March61 April49 May46 June49 July36 August41 September36 October34 November33 December30 STAT3120 – Chi Square Now, if there is NO relationship between birth month and playing hockey, what SHOULD the distribution of months look like? Lets do this one in EXCEL… Note that this is technically referred to as a “goodness of fit” test – where we are assessing if the actual distribution “fits” what would be expected.

STAT3120 – Chi Square Practice Problems for Chi-Square: For all of these, identify the hypothesis statements, the testing matrix, and the decision.