Chi Square & Correlation

Slides:



Advertisements
Similar presentations
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
S519: Evaluation of Information Systems
PSY 307 – Statistics for the Behavioral Sciences
CJ 526 Statistical Analysis in Criminal Justice
Chi Square Test Dealing with categorical dependant variable.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 14 Inferential Data Analysis
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
Lecture 5 Correlation and Regression
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
Chapter 11(1e), Ch. 10 (2/3e) Hypothesis Testing Using the Chi Square ( χ 2 ) Distribution.
AM Recitation 2/10/11.
Selecting the Correct Statistical Test
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
CJ 526 Statistical Analysis in Criminal Justice
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
1 Chi-Square Heibatollah Baghi, and Mastee Badii.
Chi-Square. All the tests we’ve learned so far assume that our data is normally distributed z-test t-test We test hypotheses about parameters of these.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
URBP 204A QUANTITATIVE METHODS I Statistical Analysis Lecture IV Gregory Newmark San Jose State University (This lecture is based on Chapters 5,12,13,
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
Statistical test for Non continuous variables. Dr L.M.M. Nunn.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Chapter 11, 12, 13, 14 and 16 Association at Nominal and Ordinal Level The Procedure in Steps.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Correlation. Correlation Analysis Correlations tell us to the degree that two variables are similar or associated with each other. It is a measure of.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chapter 13. The Chi Square Test ( ) : is a nonparametric test of significance - used with nominal data -it makes no assumptions about the shape of the.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 18 Part 5 Analysis and Interpretation of Data DIFFERENCES BETWEEN GROUPS AND RELATIONSHIPS.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
I. ANOVA revisited & reviewed
Chi-square Basics.
Chapter 9: Non-parametric Tests
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Hypothesis Testing Using the Chi Square (χ2) Distribution
The Chi-Square Distribution and Test for Independence
Different Scales, Different Measures of Association
Chapter 10 Analyzing the Association Between Categorical Variables
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Parametric versus Nonparametric (Chi-square)
CLASS 6 CLASS 7 Tutorial 2 (EXCEL version)
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Presentation transcript:

Chi Square & Correlation

Nonparametric Test of Chi2 Used when too many assumptions are violated in T-Tests: Sample size too small to reflect population Data are not continuous and thus not appropriate for parametric tests based on normal distributions. χ2 is another way of showing that some pattern in data is not created randomly by chance. X2 can be one or two dimensional. X2 deals with the question of whether what we observed is different from what is expected

Calculating X2 What would a contingency table look like if no relationship exists between gender and voting for Bush? (i.e. statistical independence) Male Female Voted for Bush 25 50 Voted for Kerry 50 50 50 100 NOTE: INDEPENDENT VARIABLES ON COLUMS AND DEPENDENT ON ROWS

Calculating X2 What would a contingency table look like if a perfect relationship exists between gender and voting for Bush? Male Female Voted for Bush 50 Voted for Kerry

Calculating the expected value The expected frequency of the cell in the ith row and jth column Fi = The total in the ith row marginal Fj = The total in the jth column marginal N = The grand total, or sample size for the entire table Expected Voted for Bush = 50x50 / 100 = 25

Nonparametric Test of Chi2 Again, the basic question is what you are observing in some given data created by chance or through some systematic process? O= Observed frequency E= Expected frequency

Nonparametric Test of Chi2 The null hypothesis we are testing here is that the proportion of occurrences in each category are equal to each other (Ho: B=K). Our research hypothesis is that they are not equal (Ha: B =K). Given the sample size, how many cases could we expect in each category (n/#categories)? The obtained/critical value estimation will provide a coefficient and a Pr. that the results are random.

Let’s do a X2 50 (50-25)2/25=25 (0 - 25)2 /25=25 (50-25)2 /25=25 Male Female Voted for Bush 50 Voted For Kerry What would X2 be when there is statistical independence?

Let’s corroborate with SPSS

Testing for significance How do we know if the relationship is statistically significant? We need to know the df (df= (R-1) (C-1) ) (2-1)(2-1)= 1 We go to the X2 distribution to look for the critical value (CV= 3.84) We conclude that the relationship gender and voting is statistically significant. Male Female Voted for Bush 20 30 Voted for Kerry X2= 4

When is X2 appropriate to use? X2 is perhaps the most widely used statistical technique to analyze nominal and ordinal data Nominal X nominal (gender and voting preferences) Nominal and ordinal (gender and opinion for W)

X2 can also be used with larger tables Opinion of Bush MALE FEMALE Favorable 40 5 Indifferent 10 20 Unfavorable 15 55 45 (19.4) (15.8) 30 (.88) (.72) 70 (8.6) (6.9) 65 80 145 X2=52.3 Do we reject the null hypothesis?

Correlation (Does not mean causation) We want to know how two variables are related to each other Does eating doughnuts affect weight? Does spending more hours studying increase test scores? Correlation means how much two variables overlap with each other

Types of Correlations X (cause) Y (effect) Correlation Values Increases Positive 0 to1 Decreases 0 to 1 Negative -1 to 0 Increase Does not change Independent

Conceptualizing Correlation Measuring Development Strong Weak GPD POP WEIGHT GDP EDUCATION Correlation will be associated with what type of validity?

Correlation Coefficient

Home Value & Square footage Log value Log sqft value2 sqft2 Val * sqft 5.13 4.02 26.3169 16.1604 20.6226 5.2 4.54 27.04 20.6116 23.608 4.53 3.53 20.5209 12.4609 15.9909 4.79 3.8 22.9441 14.44 18.202 4.78 3.86 22.8484 14.8996 18.4508 4.72 4.17 22.2784 17.3889 19.6824 29.15 23.92 141.95 95.96 116.56

Correlation Coefficient

Rules of Thumb Size of correlation coefficient General Interpretation .8 - 1.0 Very Strong .6 - .8 Strong .4 - .6 Moderate .2 - .4 Weak .0 - .2 Very Weak or no relationship

Multiple Correlation Coefficients

Limitation of correlation coefficients They tell us how strong two variables are related However, r coefficients are limited because they cannot tell anything about: Causation between X and Y Marginal impact of X on Y What percentage of the variation of Y is explained by X Forecasting Because of the above Ordinary Least Square (OLS) is most useful

Do you have the BLUES? B for Best (Minimum error) L for Linear (The form of the relationship) U for Un-bias (does the parameter truly reflect the effect?) E for Estimator

Home value and sq. Feet Does the above line meet the BLUE criteria?