Analysis of Discrete Variables

Slides:



Advertisements
Similar presentations
Contingency Table Analysis Mary Whiteside, Ph.D..
Advertisements

Chi Square Example A researcher wants to determine if there is a relationship between gender and the type of training received. The gender question is.
Statistical Inference for Frequency Data Chapter 16.
Chapter 13: The Chi-Square Test
Chi-Square Tests and the F-Distribution
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
Phi Coefficient Example A researcher wishes to determine if a significant relationship exists between the gender of the worker and if they experience pain.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Quantitative Methods Partly based on materials by Sherry O’Sullivan Part 3 Chi - Squared Statistic.
Chapter 16 The Chi-Square Statistic
Other Chi-Square Tests
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Chi Square Test Dr. Asif Rehman.
Introduction to Marketing Research
Other Chi-Square Tests
T-test calculations.
Chi-Square hypothesis testing
Chi-square Basics.
Chapter 9: Non-parametric Tests
Presentation 12 Chi-Square test.
Lecture8 Test forcomparison of proportion
5.1 INTRODUCTORY CHI-SQUARE TEST
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Chapter 11 Chi-Square Tests.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Extra Brownie Points! Lottery To Win: choose the 5 winnings numbers from 1 to 49 AND Choose the "Powerball" number from 1 to 42 What is the probability.
Chapter Fifteen McGraw-Hill/Irwin
Hypothesis Testing Review
Qualitative data – tests of association
Difference Between Means Test (“t” statistic)
Data Analysis for Two-Way Tables
Warm-Up: Chi Square Practice!!!
The Chi-Square Distribution and Test for Independence
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Testing for Independence
Day 67 Agenda: Submit THQ #6 Answers.
Is a persons’ size related to if they were bullied
Statistical Inference about Regression
Chapter 10 Analyzing the Association Between Categorical Variables
Statistical Analysis Chi-Square.
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Chi-square test or c2 test
Chapter 11 Chi-Square Tests.
Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
SENIORS: Final transcript request must be made by Friday.
Chapter Outline Goodness of Fit test Test of Independence.
CLASS 6 CLASS 7 Tutorial 2 (EXCEL version)
Chapter 11 Chi-Square Tests.
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
What is Chi-Square and its used in Hypothesis? Kinza malik 1.
Presentation transcript:

Analysis of Discrete Variables Gender (x1 = male, x2 = female) Educ. level (x1 = low, x2 = middle, x3 = high) 5-point scales (x1 = 1, x2 = 2, ..., x5 = 5) Diagnosis (x1 = Neurosis, x2 = Schizophrenia, ...)

Distribution of Discrete Variables: The General Case x x x .... x 1 2 3 k p p p .... p 1 2 3 k

An Example of Discrete Distribution xi: 1 2 3 pi: 0.20 0.35 0.40 0.05

Analysis of 1 discrete variable in 1 population

Distribution Fitting Hypothetical example: Which do you prefer best: Coke (x1), Pepsi (x2), or Fanta (x3)? Null hypothesis: H0: P(x1) = P(x2) = P(x3) = 1/3 Obtained frequencies (ni): Out of 150 Ss n1 = 80, n2 = 50, n3 = 20 Expected frequencies (ni): If H0 were true, one would expect 50-50-50 for each ni.

Distribution Fitting with 2-test The greater the difference between obtained (ni) and expected (ni) frequencies, the greater the likelihood that H0 is false. A possible measure of the difference: c2 = (n1 - n1)2/n1 + (n2 - n2)2/n2 + ... + (ng - ng)2/ng If H0 is true, the distribution of c2 is approximately chi-square, with df = g - 1.

Calculations ni: 80 50 20 S=150 ni: 50 50 50 S=150 18 + 0 + 18 = 36 > 9.210 = c20.01 (df = 2) Thus we reject H0 and say: ‘The three proportions differ significantly.’

2-test c2 < c20.05 c2 ³ c20.05 Keep H0 Condition: ni ³ 5 H0: P(x1) = p1, P(x2) = p2 , ... , P(xg) = pg X-sample 0,6 c2 (df=1) 0,4 0,2 (df =g - 1) 0.95 0.05 c2 1 2 3 0.05 c2 < c20.05 c2 ³ c20.05 Keep H0 HA: For at least one i: P(xi) ¹ pi

Comparing 2 Populations by means of 1 Discrete Variable Example: Is there a difference between males and females with respect to education level (EL)? H0: The distribution of EL is the same among males and females P(xi|Males) = P(xi|Females), (i = 1, 2, 3) x1 = Low, x2 = Middle, x3 = High

Two-Way Frequency Table Low Middle High Total Male 16 32 32 n1=80 Female 18 45 27 n2=90 Total 34 77 59 N=170

Two-Way Frequency Table: Row Percentages Low Middle High Total Male 20% 40% 40% 100% Female 20% 50% 30% 100% Total 20% 45.3% 34.7%

2-test for Comparing Groups If H0 is true then Follows c2-distribution with df=(g-1)·(h-1). Decision c2 < c20.05: Keep H0 (p > .05 n. s.). c2 ³ c20.05 : Reject H0 (p < .05 significant).

Comparison of Males and Females Number of rows: g = 2 Number of columns: h = 3 Degrees of freedom: df = (2-1)×(3-1) = 2 Critical values: c20.1 = 4.605; c20.05 = 5.991; c20.01 = 9.210 Computed chi-square value: c2 = 2.155 Decision: Keep H0 (p > .10 n. s.).

General Case Condition: nij ³ 5 df = (g-1)×(h-1) nij= (ni×mj)/N Samples X=x X=x X=x3 ... Total 1 2 Sample 1 n n n n 11 12 13 1 Sample 2 n n n n 21 22 23 2 nij= (ni×mj)/N ... Total m m m N 1 2 3 df = (g-1)×(h-1) Condition: nij ³ 5

Comparing the Distribution of 2 Variables in 1 Population Example: Lecture about the disadvantages of smoking. Outcome: 8 of 36 students give up smoking, 3 start smoking. Any effect? H0: The proportion of smokers does not change. Indicatior of change: x1= positive change, x2 = negative change H0: P(x1) = P(x2)

Computation: McNemar’s test FrequencyTable: Smoking Time 2: No Time 2: Yes Time 1: No a b = 8 Time 1: Yes c = 3 d Computation: McNemar’s test Condition: (b+c)/2 ³ 5, that is b+c ³ 10

More General Cases X is arbitrary, two related samples: Bowker’s test X is dichotomous, h related samples: Cochran’s Q test

Relationship of 2 Discrete Variables Girls at 15 Makes friends easily Test of independence = Comparisons

Table of Row Percentages Girls at 15 Makes friends easily

Table of Column Percentages Girls at 15 Makes friends easily

Strength of Relationship Cramér’s contingency coefficient: Ordinally scaled variables: Kendall’s G Dichotomous variables: G= Yule’s Q