1 Inference for Categorical Data William P. Wattles, Ph. D. Francis Marion University.

Slides:



Advertisements
Similar presentations
Chapter 11 Other Chi-Squared Tests
Advertisements

CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
Making Inferences for Associations Between Categorical Variables: Chi Square Chapter 12 Reading Assignment pp ; 485.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Lecture 11 Psyc 300A. Null Hypothesis Testing Null hypothesis: the statistical hypothesis that there is no relationship between the variables you are.
Chi-square Test of Independence
Statistics 303 Chapter 9 Two-Way Tables. Relationships Between Two Categorical Variables Relationships between two categorical variables –Depending on.
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Descriptive Statistics
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Inferential Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Presentation 12 Chi-Square test.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Hypothesis Testing:.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Chi-Square Test of Independence Practice Problem – 1
Statistical Inference Two Statistical Tasks 1. Description 2. Inference.
Chi-square Test of Independence Steps in Testing Chi-square Test of Independence Hypotheses.
Tests of Significance June 11, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Analyzing and Interpreting Quantitative Data
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Chi-square Test of Independence
Statistical Significance for a two-way table Inference for a two-way table We often gather data and arrange them in a two-way table to see if two categorical.
 Some variables are inherently categorical, for example:  Sex  Race  Occupation  Other categorical variables are created by grouping values of a.
Reasoning in Psychology Using Statistics Psychology
Inference about a population proportion. 1. Paper due March 29 Last day for consultation with me March 22 2.
Chapter 6: Analyzing and Interpreting Quantitative Data
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
© Copyright McGraw-Hill 2004
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
Quantitative Methods in the Behavioral Sciences PSY 302
Inference about a population proportion.
Extra Brownie Points! Lottery To Win: choose the 5 winnings numbers from 1 to 49 AND Choose the "Powerball" number from 1 to 42 What is the probability.
Introduction to Inferential Statistics
Data Analysis for Two-Way Tables
Is a persons’ size related to if they were bullied
Inference for Categorical Data
Is a persons’ size related to if they were bullied
Reasoning in Psychology Using Statistics
Chapter 10 Analyzing the Association Between Categorical Variables
Hypothesis Testing.
Reasoning in Psychology Using Statistics
Reasoning in Psychology Using Statistics
Inference for Two Way Tables
Presentation transcript:

1 Inference for Categorical Data William P. Wattles, Ph. D. Francis Marion University

2 Continuous vs. Categorical Continuous (measurement) variables have many values Categorical variables have only certain values representing different categories Ordinal-a type of categorical with a natural order (e.g., year of college) Nominal-a type of categorical with no order (e.g., brand of cola)

3 Categorical Data Tells which category an individual is in rather than telling how much. Sex, race, occupation naturally categorical A quantitative variable can be grouped to form a categorical variable. Analyze with counts or percents.

4 Describing relationships in categorical data No single graph portrays the relationship Also no similar number summarizes the relationship Convert counts to proportions or percents

5 5 Prediction

6 6

7 Moving from descriptive to Inferential Chi Square Inference involves a test of independence. If variable are independent, knowledge of one variable tells you nothing about the other.

8 Moving from descriptive to Inferential Inference involves expected counts. –Expected count=The count that would occur if the variables are independent

9 Inference for two-way tables Chi Square test of independence. For more than two groups Cannot compare multiple groups one at a time.

10 To Analyze Categorical Data First obtain counts In Excel can do this with a pivot table Put data in a Matrix or two-way table

11 Matrix or two-way table

12 Inference for two-way tables Expected count The count that would occur if the variables are independent

13 Matrix or two-way table Rows Columns Distribution: how often each outcome occurred Marginal distribution: Count for all entries in a row or column

14 Row and column totals

15

16 Expected counts 37% of all subjects are Republicans If independent 37% of females should be Republican (expected value) 37% of 80= 29 37% of 75 = 28

17 Expected counts rounded

18 Observed vs. Expected

19 Chi-Square Chi-square A measure of how far the observed counts are from the expected counts

20 Chi-square test of independence

21 Chi Square test of independence with SPSS

22 Chi Square test of independence with SPSS

23 Chi Square

24 Chi-square test of independence Degrees of Freedom df=number of rows-1 times number of columns -1 compare the observed and expected counts. P-value comes from comparing the Chi- square statistic with critical values for a chi- square distribution

25 Example Have the percent of majors changed by school?

26 Data collection /2005 Fall 2004 Graduates by Major

27

28

29 Chi Square

30 Marital Status, page 543

31 Marital Status, page 543

32 Olive Oil, page 578

33 Olive Oil, page 578

34 Business Majors, page 563

35 Business Majors, page 563

36 Exam Three 37 multiple choice questions, 4 short answer T-tests and chi square on Excel General questions about analyzing categorical data and t-tests Review from earlier this term

37 Inference as a decision We must decide if the null hypothesis is true. We cannot know for sure. We choose an arbitrary standard that is conservative and set alpha at.05 Our decision will be either correct or incorrect.

38 Type I and Type II errors

39 Type I error If we reject H o when in fact H o is true, this is a Type I error Statistical procedures are designed to minimize the probability of a Type I error, because they are more serious for science. With a Type I error we erroneously conclude that an independent variable works.

40 Type II error If we accept H o when in fact H o is false this is a Type II error. A type two error is serious to the researcher. The Power of a test is the probability that H o will be rejected when it is, in fact, false.

41 Probability

42 Power The goal of any scientific research is to reject H o when H o is false. To increase power: –a. increase sample size –b. increase alpha –c. decrease sample variability –d. increase the difference between the means

43 Categorical data example African-American students more likely to register via the web.

44 Table

45 Web Registration by Race 34% 25% 44% 29% 0% 10% 20% 30% 40% 50% 60% Year White African-American

46 Categorical Data Example African-American students university-wide (44%) were more likely that white students (34%) to use web registration, X 2 (1, N = 1963) = 20.7, p <.001.

47

48 Smoking among French Men Do these data show a relationship between education and smoking in French men?

49

50

51 The End

52 Benford’s Law page 550 Faking data?

53 Problem 20.14

54

55

56 Significance test

57 Example Survey2 Berk & Carey page 261