Bivariate Testing (Chi Square)

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

Chi Square Example A researcher wants to determine if there is a relationship between gender and the type of training received. The gender question is.
Lecture 8 Chi-Square STAT 3120 Statistical Methods I.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Lecture 8 Chi-Square STAT 3120 Statistical Methods I.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Reasoning in Psychology Using Statistics Psychology
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Paired Samples Lecture 39 Section 11.3 Tue, Nov 15, 2005.
Chi-Square INCM Chi Square When presented with categorical data, one common method of analysis is the “Contingency Table” or “Cross Tab”. This is.
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
26134 Business Statistics Autumn 2017
Introduction to Marketing Research
Chi-square test.
Data Analysis Module: One Way Analysis of Variance (ANOVA)
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
Data Analysis Module: Bivariate Testing
Bivariate Testing (ttests and proportion tests)
Chapter 9: Non-parametric Tests
Presentation 12 Chi-Square test.
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
Review 1. Describing variables.
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Data Analysis Module: Correlation and Regression
LEVELS of DATA.
Bivariate Testing (ANOVA)
Confidence Intervals and Hypothesis Tests for Variances for One Sample
Hypothesis Testing Review
STAT 4030 – Programming in R STATISTICS MODULE: Multiple Regression
Bivariate Testing (ttests and proportion tests)
Correlation and Regression Basics
Qualitative data – tests of association
STAT 4030 – Programming in R STATISTICS MODULE: Confidence Intervals
Correlation and Regression Basics
Data Analysis Module: Basic Visualizations
Bivariate Testing (ANOVA)
HMI 7530– Programming in R STATISTICS MODULE: Multiple Regression
Data Analysis for Two-Way Tables
Bivariate Testing (Chi Square)
The Chi-Square Distribution and Test for Independence
HMI 7530– Programming in R STATISTICS MODULE: Confidence Intervals
HMI 7530– Programming in R STATISTICS MODULE: Basic Data Analysis
Bivariate Testing (ttests and proportion tests)
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Day 67 Agenda: Submit THQ #6 Answers.
Reasoning in Psychology Using Statistics
Hypothesis testing. Chi-square test
Warmup Which part- time jobs employed 10 or more of the students?
Contingency Tables (cross tabs)
Statistical Analysis Chi-Square.
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Chi-square test or c2 test
Data Analysis Module: Chi Square
Reasoning in Psychology Using Statistics
Parametric versus Nonparametric (Chi-square)
Chapter 26 Comparing Counts.
Reasoning in Psychology Using Statistics
Inference for Two Way Tables
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
Contingency Tables (cross tabs)
Chi Square Test of Homogeneity
CHI SQUARE (χ2) Dangerous Curves Ahead!.
What is Chi-Square and its used in Hypothesis? Kinza malik 1.
Presentation transcript:

Bivariate Testing (Chi Square) STAT 4030 – Programming in R STATISTICS MODULE: Bivariate Testing (Chi Square) Jennifer Lewis Priestley, Ph.D. Kennesaw State University 1

STATISTICS MODULE Basic Descriptive Statistics and Confidence Intervals Basic Visualizations Histograms Pie Charts Bar Charts Scatterplots Ttests One Sample Paired Independent Two Sample Proportion Testing ANOVA Chi Square and Odds Regression Basics 2 2 2

STATISTICS MODULE: Chi Square When presented with categorical data, one common method of analysis is the “Contingency Table” or “Cross Tab”. This is a great way to display frequencies - For example, lets say that a firm has the following data: 120 male and 80 female employees 40 males and 10 females have been promoted 3

STATISTICS MODULE: Chi Square Using this data, we could create the following 2x2 matrix: Promoted Not Promoted Total Male 40 80 120 Female 10 70 50 150 200 4

STATISTICS MODULE: Chi Square What is the probability of: Selecting a female? Selecting someone who was promoted? Selecting a female GIVEN that the individual was promoted? Selecting someone who was promoted GIVEN that the individual was female? 5

STATISTICS MODULE: Chi Square The answers to these questions help us start to understand if promotion status and gender are related. Specifically, we could test this relationship using a Chi-Square. This is the test used to determine if two categorical variables are related. The relevant hypothesis statements for a Chi-Square test are: H0: Variable 1 and Variable 2 are NOT Related Ha: Variable 1 and Variable 2 ARE Related 6

STATISTICS MODULE: Chi Square The Chi-Square Test uses the Χ2 test statistic, which has a distribution that is skewed to the right (it approaches normality as the number of obs increases). The observed counts are provided in the dataset. The expected counts are the counts which would be expected if there was NO relationship between the two variables. 7

STATISTICS MODULE: Chi Square Going back to our example, the data provided is “observed”: Promoted Not Promoted Total Male 40 80 120 Female 10 70 50 150 200 What would the matrix look like if there was no relationship between promotion status and gender? The resulting matrix would be “expected”… 8

STATISTICS MODULE: Chi Square From the data, 25% of all employees were promoted. Therefore, if gender plays no role, then we should see 25% of the males promoted (75% not promoted) and 25% of the females promoted… Promoted Not Promoted Total Male 120*.25 = 30 120*.75 = 90 120 Female 80*.25 = 20 80*.75 = 60 80 50 150 200 Notice that the marginal values did not change…only the interior values changed. 9

STATISTICS MODULE: Chi Square Now, calculate the X2 statistic using the observed and the expected matrices: ((40-30)2/30)+((80-90)2/90)+((10-20)2/20)+((70-60)2/60) = 3.33+1.11+5+1.67 = 11.11 This is conceptually equivalent to a t-statistic or a z-score. 10

STATISTICS MODULE: Chi Square To determine if this is in the rejection region, we must determine the df. Df = (r-1)*(c-1)… In the current example, we have two rows and two columns. So the df = 1*1 = 1. At alpha = .05 and 1df, the critical value is 3.84…our value of 11.11 is clearly in the reject region…so what does this mean? 11

STATISTICS MODULE: Chi Square #here, the code is pretty simple…first install the “prettyR” package. Then, you can run an xtab: Xtab(var1~var2, data=data) Then a Chi Squared test: chisq.test(var1, var2, correct=FALSE) 12