Non-Parametric Statistics

Slides:



Advertisements
Similar presentations
What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
Advertisements

© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
CJ 526 Statistical Analysis in Criminal Justice
Parametric Tests 1) Assumption of population normality 2) homogeneity of variance Parametric more powerful than nonparametric.
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
Presented By: ang ling poh ong mei yean soo pei zhi
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
CJ 526 Statistical Analysis in Criminal Justice
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
CHAPTER 5 INTRODUCTORY CHI-SQUARE TEST This chapter introduces a new probability distribution called the chi-square distribution. This chi-square distribution.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
CHI SQUARE TESTS.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
CHAPTER INTRODUCTORY CHI-SQUARE TEST Objectives:- Concerning with the methods of analyzing the categorical data In chi-square test, there are 2 methods.
Chapter Outline Goodness of Fit test Test of Independence.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Chi Square & Correlation
CHAPTER INTRODUCTORY CHI-SQUARE TEST Objectives:- Concerning with the methods of analyzing the categorical data In chi-square test, there are 3 methods.
Chapter Fifteen Chi-Square and Other Nonparametric Procedures.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 10 l Non-Parametric Statistics 10.1 The Chi-Square Tests: Goodness-of-fit Test.
CHAPT 18 Resampling and Nonparametric Approaches to Data &
Analysis of Variance l Chapter 8 l 8.1 One way ANOVA
Comparing Counts Chi Square Tests Independence.
Basic Statistics The Chi Square Test of Independence.
NONPARAMETRIC STATISTICS
Hypothesis Tests l Chapter 7 l 7.1 Developing Null and Alternative
Chapter 12 Chi-Square Tests and Nonparametric Tests
NONPARAMETRIC STATISTICS
Presentation 12 Chi-Square test.
Done by : Mohammad Da’as Special thanks to Dana Rida and her slides 
Chi-square test or c2 test
5.1 INTRODUCTORY CHI-SQUARE TEST
Chi-Square Applications
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Chapter Fifteen McGraw-Hill/Irwin
Hypothesis testing. Chi-square test
Nonparametric Methods: Goodness-of-Fit Tests
Nonparametric Statistical Methods: Overview and Examples
Hypothesis Testing Using the Chi Square (χ2) Distribution
Nonparametric Statistical Methods: Overview and Examples
Elementary Statistics
The Analysis of Categorical Data and Chi-Square Procedures
Nonparametric Statistical Methods: Overview and Examples
Different Scales, Different Measures of Association
Testing for Independence
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Nonparametric Statistical Methods: Overview and Examples
Chapter 11: Inference for Distributions of Categorical Data
Nonparametric Statistical Methods: Overview and Examples
Statistics in SPSS Lecture 9
Contingency Tables: Independence and Homogeneity
Chi-square test or c2 test
Inference on Categorical Data
The Analysis of Categorical Data and Goodness of Fit Tests
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 13: Inference for Distributions of Categorical Data
Parametric versus Nonparametric (Chi-square)
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 26 Comparing Counts.
Section 11-1 Review and Preview
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 18: The Chi-Square Statistic
UNIT-4.
Chapter Outline Goodness of Fit test Test of Independence.
CLASS 6 CLASS 7 Tutorial 2 (EXCEL version)
InferentIal StatIstIcs
Chapter 11 Lecture 2 Section: 11.3.
Presentation transcript:

Non-Parametric Statistics l Chapter 10 l Non-Parametric Statistics 10.1 The Chi-Square Tests: 10.1.1 Goodness-of-fit Test 10.1.2 Independence Test 10.1.3 Homogeneity Test 10.2 Nonparametric Statistics: 10.2.1 Sign Test 10.2.2 Mann-Whitney Test 10.2.3 Kruskal Wallis Test

10.0 Non-Parametric Statistics When assumptions of parametric tests were violated, nonparametric techniques can be used. These tests are less powerful than their parametric counterparts.

10.1 The Chi-Square Tests This subchapter develop two statistical techniques that involve nominal data. Chi-square distribution will be used in carrying out hypothesis to analyze whether: A sample could have come from a given type of POPULATION DISTRIBUTION. Two nominal variable/categorical variable could be INDEPENDENT and HOMOGENEOUS of each other. There are two main types of chi-square test: Goodness-of-fit Test (To analyze single categorical variable) The Chi-square Test For Homogeneity and Independence (To analyze the relationship between two categorical variables)

10.1 The Chi-Square Tests Assumption testing for nonparametric techniques is not as critical as for parametric methods. However, a number of generic assumptions apply. There are three assumptions you need to address before conducting chi-square tests: Random sampling – observation should be randomly sampled from the population. Independence of observations - observation should be generated by a different subject and no subject should be counted twice. Size of expected frequencies – when the number of cells is less than 10 and has small sample size, the lowest expected frequency required is five. However, the observed frequency can be any value.

Attitude towards US military bases in Australia 10.1.1 Goodness-of-fit Test The following table outlines the attitudes of 60 people towards US military bases in Australia. Attitude towards US military bases in Australia Frequency of response In favour 8 Against 20 Undecided 32

10.1.1 Goodness-of-fit Test In SPSS (Data view) In SPSS (Variable view)

10.1.1 Goodness-of-fit Test Analysis (for the case of even expected frequency) 2 3 1

10.1.1 Goodness-of-fit Test Analysis (for the case of even expected frequency) 4 5

10.1.1 Goodness-of-fit Test Output (for the case of even expected frequency) : There are no differences in the frequency of attitudes towards military bases in Australia. : There are significant differences in the frequency of attitudes towards P-value = 0.001< α = 0.05 Reject There are significant differences in the frequency of attitudes towards military bases in Australia. Assumption no. 3

10.1.1 Goodness-of-fit Test Sometimes the expected frequencies are not evenly balanced. Across categories. For example, the following table outlines the attitudes of 60 people towards US military bases in Australia with their respected expected frequency of response. Attitude towards US military bases in Australia Frequency of response Expected frequency of response In favour 8 15 Against 20 Undecided 32 30

10.1.1 Goodness-of-fit Test Analysis (for the case of uneven expected frequency) 1 2

10.1.1 Goodness-of-fit Test Output (for the case of uneven expected frequency) : There are no differences between the observed and expected frequency of attitudes towards military bases in Australia. : There are differences between the observed military bases in Australia for at least one attitude. P-value = 0.079> α = 0.05 Fail to reject There are no differences between the observed and expected frequency of attitudes towards military bases in Australia.

10.1.1 Goodness-of-fit Test Exercise A coffee company is about to launch a range of speciality tea and would like to know whether some varieties are likely to sell better than others. Four of its new varieties are placed on a supermarket shelf for one week. The number of boxes sold for each variety is as follows: Determine whether the distribution of sales suggests that some varieties of tea are more popular than others. Speciality tea Frequency Black tea 40 Green tea 20 Oolong tea 30 White tea 10

10.1.2 Independence Test The arrangement of and are fixed and should not be arrange upside down. Hypothesis statement for Independence test: : Row and column variable are independent : Row and column variable are not independent Need to be changed according to question

The Australian Financial Review 10.1.2 Independence Test A magazine publisher wishes to determine whether preference for certain publications depends on the geographic location of the reader. In a survey, the distribution of those preferring The Australian Financial review and Newsweek are as follows: The Australian Financial Review Newsweek Urban 32 28 Rural 24 4

10.1.2 Independence Test In SPSS (Data view) In SPSS (Variable view)

10.1.2 Independence Test Analysis 3 2 1

10.1.2 Independence Test Analysis 5 4

10.1.2 Independence Test Analysis 6 7

10.1.2 Independence Test Output : Preference for certain publications and the geographic location of the reader are independent. reader are not independent. P-value = 0.003 < α = 0.05 Reject Preference for certain publications and the geographic location of the reader are not independent.

10.1.2 Independence Test Exercise A researcher is interested in determining whether drinking preference of coffee and tea is gender related. The data given as follows: Determine whether the preference is gender related. Preferred tea Preferred coffee Male 16 31 Female 21 19

10.1.3 Homogeneity Test Just as Independence test, and for this test should also be in a correct position before conducting hypothesis testing. Hypothesis statement for Homogeneity test: : The proportion of row variables are same with column variable :The proportion of row variables are not same with column variable Need to be changed according to question

10.1.3 Homogeneity Test 200 female owners and 200 male owners of Proton cars selected at random and the colour of their cars are noted. The following data shows the results: Test whether the proportions of colour preference are the same for female and male. Car Colour Black Dull Bright Male 40 110 50 Female 20 80 100

10.1.3 Homogeneity Test As the SPSS analysis for this test is the same as Independence test, we will skip the process and focus on the output. : The proportions of colour preference are the same for female and male. : The proportions of colour preference are not the same for female and male. P-value ≈ 0.000 < α = 0.05 Reject The proportions of colour preference are not the same for female and male.

10.1.3 Homogeneity Test Exercise Four machines manufacture cylindrical steel pins. The pins are subjected to a diameter specification. A pin may meet the specification or it may be too thin or too thick. Pins are sampled from each machine and the number of pins in each category is counted. Determine whether the proportion of pins that are too thin, OK, or too thick is the same for all machines. Too thin OK Too Thick Machine 1 10 102 8 Machine 2 34 161 5 Machine 3 12 79 9 Machine 4 60

10.2 Nonparametric Statistics This subchapter introduce statistical techniques that deal with ordinal data. Shown below are the list of nonparametric techniques that can be used as an alternative to their respective parametric counterpart. Nonparametric test Parametric counterpart 10.2.1 Sign test One sample t-test Paired sample t-test 10.2.2 Mann-Whitney Test Two sample independent t-test 10.2.3 Kruskal Wallis Test One way ANOVA

10.2.2 Mann-Whitney Test The productivity level of two factories, A and B, are to be compared. The monthly output, in kilogram, is recorded for two consecutive months: The data has been identified violating the parametric assumptions. Factory A B 14000 16000 17000 10000 13000 8000 19000 11000 15000 12000

10.2.2 Mann-Whitney Test In SPSS (Data view) In SPSS (Variable view)

10.2.2 Mann-Whitney Test Analysis 2 1 3

10.2.2 Mann-Whitney Test Output : The productivity level of two factories, A and B, are the same. factories, A and B, are different. P-value = 0.354 > α = 0.05 Fail to reject The productivity level of two factories, A and B, are the same.

10.2.2 Mann-Whitney Test Exercise Data below show the marks obtained by electrical engineering students in an examination: Can we conclude the achievements of male and female students identical?