1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Chi Square Tests Chapter 17.
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
CJ 526 Statistical Analysis in Criminal Justice
Chi-square Test of Independence
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
T-Tests Lecture: Nov. 6, 2002.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Hypothesis Testing Using The One-Sample t-Test
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Hypothesis Testing :The Difference between two population mean :
Richard M. Jacobs, OSA, Ph.D.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Inferential Statistics: SPSS
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
ANOVA Greg C Elvers.
CJ 526 Statistical Analysis in Criminal Justice
Single Sample Inferences
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Ordinally Scale Variables
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
CHI SQUARE TESTS.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Chapter Outline Goodness of Fit test Test of Independence.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
© Copyright McGraw-Hill 2004
Chi Square & Correlation
1 Chi-square Test Dr. T. T. Kachwala. Using the Chi-Square Test 2 The following are the two Applications: 1. Chi square as a test of Independence 2.Chi.
Chapter 13. The Chi Square Test ( ) : is a nonparametric test of significance - used with nominal data -it makes no assumptions about the shape of the.
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Inferential Statistics Introduction to Hypothesis Testing.
Chapter 9 Introduction to the t Statistic
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 9: Non-parametric Tests
Qualitative data – tests of association
Data Analysis for Two-Way Tables
Is a persons’ size related to if they were bullied
Chapter 10 Analyzing the Association Between Categorical Variables
Inference on Categorical Data
Analyzing the Association Between Categorical Variables
Presentation transcript:

1 Nominal Data Greg C Elvers

2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain assumptions about how the data are distributed Typically, they assume that the data are distributed normally

3 Nonparametric Statistics Nonparametric statistics do not make assumptions about the underlying distribution of the data Thus, nonparametric statistics are useful when the data are not normally distributed Because nominally scaled variables cannot be normally distributed, nonparametric statistics should be used with them

4 Parametric vs Nonparametric Tests When you have a choice, you should use parametric statistics because they have greater statistical power than the corresponding nonparametric tests That is, parametric statistics are more likely to correctly reject H 0 than nonparametric statistics

5 Binomial Test The binomial test is a type of nonparametric statistic The binomial test is used when the DV is nominal, and it has only two categories or classes It is used to answer the question: In a sample, is the proportion of observations in one category different than a given proportion?

6 Binomial Test A researcher wants to know if the proportion of ailurophiles in a group of 20 librarians is greater than that found in the general population,.40 There are 9 ailurophiles in the group of 20 librarians

7 Binomial Test Write H 0 and H 1 : H 0 : P .40 H 1 : P >.40 Is the hypothesis one-tailed or two-tailed? Directional, one-tailed Determine the statistical test The librarians can either be or not be ailurophiles, thus we have a dichotomous, nominally scaled variable Use the binomial test

8 Binomial Test Determine the critical value from a table of critical binomial values Find the column that corresponds to the p value (in this case.40) Find the row that corresponds to the sample size (N = 20) and  (.05) The critical value is 13

9 Binomial Test If the observed number of ailurophiles (9) is greater than or equal to the critical value (13), you can reject H 0 We fail to reject H 0 ; there is insufficient evidence to conclude that the percentage of librarians who are ailurophiles is probably greater than that of the general population

10 Normal Approximation to the Binomial Test When the sample size is greater than or equal to 50, then a normal approximation (i.e. a z-test) can be used in place of the binomial test When the product of the sample size (N), p, and 1 - p is greater than or equal to 9, then the normal approximation can be use

11 Normal Approximation to the Binomial Test The normal approximation to the binomial test is defined as: x = number of observations in the category N = sample size P = probability in question

12 Normal Approximation to the Binomial Test A researcher wants to know if the proportion of ailurophiles in a group of 100 librarians is greater than that found in the general population,.40 There are 43 ailurophiles in the group of 100 librarians

13 Normal Approximation to the Binomial Test Write H 0 and H 1 : H 0 : P .40 H 1 : P >.40 Is the hypothesis one-tailed or two-tailed? Directional, one-tailed Determine the statistical test The librarians can either be or not be ailurophiles, thus we have a dichotomous, nominally scaled variable Use the z test, because n  50

14 Normal Approximation to the Binomial Test Calculate the z-score

15 Normal Approximation to the Binomial Test Determine the critical value from a table of area under the normal curve Find the z-score that corresponds to an area of.05 above the z-score That value is 1.65 Compare the calculated z-score to the critical z-score If |z calculated |  z critical, then reject H < 1.65; fail to reject H 0

16  2 -- One Variable When you have nominal data that has more than two categories, the binomial test is not appropriate The  2 (chi squared) test is appropriate in such instances The  2 test answers the following question: Is the observed number of items in each category different from a theoretically expected number of observations in the categories?

17  2 -- One Variable At a recent GRE test, each of 28 students took one of 5 subject tests Was there an equal number of test takers for each test?

18  2 -- One Variable Write H 0 and H 1 : H 0 :  (O - E) 2 = 0 H 1 :  (O - E) 2  0 O = observed frequencies E = expected frequencies Specify   =.05 Calculate the  2 statistic  2 =  [(O i -E i ) 2 /E i ]

19  2 Calculations

20  2 Calculations  2 =  [(O i -E i ) 2 /E i ]  2 = =10.83 Calculate the degrees of freedom: df = number of groups - 1 = = 4 Determine the critical value from a table of critical  2 values df = 4,  =.05 Critical  2  =.05 (4) =9.488

21  2 Decision If the observed / calculated value of  2 is greater than or equal to the critical value of  2, then you can reject H 0 that there is no difference between the observed and expected frequencies Because the observed  2 = is larger than the critical  2 =9.488, we can reject H 0 that the observed and expected frequencies are the same

22  2 Test of Independence  2 can also be used to determine if two variables are independent of each other E.g., is being an ailurophile independent of whether you are male or female? Write H 0 and H 1 : H 0 :  (O - E) 2 = 0 H 1 :  (O - E) 2  0 Specify   =.05

23  2 Test of Independence The procedure for answering such questions is virtually identical to the one variable  2 procedure, except that we have no theoretical basis for the expected frequencies The expected frequencies are derived from the data

24  2 Test of Independence The expected frequencies are given by the formula to the right:

25  2 Test of Independence

26  2 Test of Independence Calculate the observed value of  2

27  2 Test of Independence First, determine the degrees of freedom: df = (r - 1) * ( c - 1) In this example, the number of rows (r) is 2, and the number of columns (c) is 2, so the degrees of freedom are (2 - 1) * (2 - 1) = 1 Determine the critical value of  2 from a table of critical  2 values Critical  2  =.05 (1)=3.841

28  2 Test of Independence Make the decision If the observed /calculated value of  2 is greater than or equal to the critical value of  2, then you can reject H 0 that the expected and observed frequencies are equal If this example, the observed  2 = is not greater than or equal to the critical  2 = 3.841, so we fail to reject H 0

29 Requirements for the Use of  2 Even though  2 makes no assumptions about the underlying distribution, it does make some assumptions that needs to be met prior to use Assumption of independence Frequencies must be used, not percentages Sufficiently large sample size

30 Assumption of Independence Each observation must be unique; that is an individual cannot be contained in more than one category, or counted in one category more than once When this assumption is violated, the probability of making a Type-I error is greatly enhanced

31 Frequencies The data must correspond to frequencies in the categories; percentages are not appropriate as data

32 Sufficient Sample Size Different people have different recommendation about how large the sample should be, and what the minimum expected frequency in each cell should be Good, Grover, and Mitchell (1977) suggest that the expected frequencies can be as low as 0.33 without increasing the likelihood of making a Type-I error Small samples reduce power