1 Chapter 14 Preprocessing the Data, And Cross-Tabs © 2005 Thomson/South-Western.

Slides:



Advertisements
Similar presentations
The Analysis of Categorical Data and Goodness of Fit Tests
Advertisements

CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Crosstabs and Chi Squares Computer Applications in Psychology.
Dr. Michael R. Hyman, NMSU Cross-tabulations and Banners.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Cross-Tabulations.
Hypothesis Testing :The Difference between two population mean :
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Copyright © 2012 by Nelson Education Limited. Chapter 8 Hypothesis Testing II: The Two-Sample Case 8-1.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Chapter SixteenChapter Sixteen. Figure 16.1 Relationship of Frequency Distribution, Hypothesis Testing and Cross-Tabulation to the Previous Chapters and.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 11: Inference for Distributions of Categorical Data.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Basic Data Analysis Chapter 14. Overview  Descriptive Analysis.
Correlation Patterns.
Chapter 16 The Chi-Square Statistic
Chapter 11 Hypothesis Testing IV (Chi Square). Chapter Outline  Introduction  Bivariate Tables  The Logic of Chi Square  The Computation of Chi Square.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Other Chi-Square Tests
Chi-Square Test.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chi square analysis Just when you thought statistics was over!!
Warm up On slide.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
AGENDA:. AP STAT Ch. 14.: X 2 Tests Goodness of Fit Homogeniety Independence EQ: What are expected values and how are they used to calculate Chi-Square?
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
1 Chapter 15 Data Analysis: Basic Questions © 2005 Thomson/South-Western.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi-Square Test (χ 2 ) χ – greek symbol “chi”. Chi-Square Test (χ 2 ) When is the Chi-Square Test used? The chi-square test is used to determine whether.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
Statistics 300: Elementary Statistics Section 11-3.
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Other Chi-Square Tests
Test of independence: Contingency Table
Chapter 9: Non-parametric Tests
Chapter 11 Chi-Square Tests.
Qualitative data – tests of association
Chi-Square Test.
Chi-Square Test.
Statistical Inference about Regression
Contingency Tables (cross tabs)
Chapter 11 Chi-Square Tests.
Chi-Square Test.
Assistant prof. Dr. Mayasah A. Sadiq FICMS-FM
Chapter 11 Chi-Square Tests.
Presentation transcript:

1 Chapter 14 Preprocessing the Data, And Cross-Tabs © 2005 Thomson/South-Western

2 Figure 1: Histogram and Frequency Polygon of Incomes of Families in Car Ownership Study

3 Figure 2: Cumulative Distribution of Incomes of Families in Car Ownership Study

4 Family Income and Number of Cars Family Owns Number of Cars Income Less than $37,500 More than $37,500 TOTAL 1 or None 2 or More Total

5 Number of Cars by Family Income Number of Cars Income Less than $37,500 More than $37,500 1 or None 2 or More Total 89% 59% 11% 41% 100% # of Cases 54 46

6 Family Income by Number of Cars Number of Cars Income Less than $37,500 More than $37,500 Total 1 or None 2 or More 64% 36% 100% (75) 24% 76% 100% (25) (Number of Cases)

7 Number of Cars and Size of Family Number of Cars Size of Family 4 or Less 5 or More Total 1 or None 2 or More Total

8 Number of Cars by Size of Family Number of Cars Size of Family 4 or Less 5 or More 1 or None 2 or More Total 90% 23% 10% 77% 100% # of Cases (78) (22)

9 Number of Cars by Income and Size of Family Income Less than $37,500 More than $37,500 TOTAL or None 2 or MoreTotal or None 2 or MoreTotal or None 2 or MoreTotal Four Members or Less: Total Number of Cars Number of Cars Five Members or More:

10 Number of Cars by Income and Size of Family Income Less than $37,500 More than $37,500 96% 81% 4% 19% 100% (46) 100% (32) 1 or None 2 or MoreTotal 50% 7% 50% 93% 100% (8) 100% (14) 1 or None 2 or More Total 89% 59% 11% 41% 100% (54) 100% (46) 1 or None 2 or MoreTotal Four Members or Less: Total Number of Cars Number of Cars Five Members or More:

11 Car Ownership for Small, Below Average Income Families Number of Cars Income Less than $37,500 1 or None 2 or More Total 96%4%100% (46)

12 Percentage of Families Owning Two or More Cars by Income Number of Cars Income Less than $37,500 4 or Less 5 or More Total 4%50%11% (6) More than $37,500 19%93%41% (19)

13 Conditions That Can Arise with the Introduction of an Additional Variable into a Cross Tabulation With the Additional Variable Initial Situation Change Conclusion Retain Conclusion Some Relationship Refine Explanation Reveal Spurious Explanation Provide Limiting Conditions A. B. C. II IV No Relationship I III

14 The Researcher’s Dilemma True Situation Researcher’s Conclusion No Relationship Some Relationship No Relationship Some Relationship Correct Decision Spurious Correlation Correct Decision if Concluded Relationship is of Proper Form Spurious Noncorrelation

15 Source: Appendix 14A Chi-Square Tests

16 Measures of Association for Nominal Data Measures Appropriate for Nominal Data * Contingency Table (Chi-Square) * Contingency Coefficient * Index of Predictive Association

17 Family Size: 4 or less 5 or more #Cars: 0 or Frequencies of Combinations of Row (i) and Column (j) Cross Tabulations

18 H 0 : Row variable independent of column variable; No association between family size & #cars analogous to: “no correlation” Cross-Tabs & Chi-Squares Family Size: 4 or less 5 or more #Cars: 0 or % 25% 78 78% 22 22% 100

19 We’d EXPECT frequencies to be distributed “randomly”; i.e., in proportion to the margins If Family Size & #Cars are Independent: Family Size: 4 or less 5 or more #Cars: 0 or % 25% 78 78% 22 22%

20 If A & B are independent: P(A 1 B 1 ) = P(A 1 )P(B 1 ) Using the Statistical Definition of “Independence” to Calculate the Expected Frequencies e 11 = nP(A 1 B 1 ) = 100 (78/100) (75/100) = (78 x 75) 100

21  Chi-square measures how much our data differ from what we’d expect (given the hypothesis of independence)  Are the row and column variables associated ? Chi-Square Formula

22 X 2 = ( ) 2 + ( ) 2 + ( ) 2 + ( ) = = Is this large? Chi-Square for Our Data df= degrees of freedom = ( r-1) ( c-1) For our 2x2 table, df=1 critical value for X 2 with 1 df = 3.84 (.05) X 2 = exceeds 3.84.

23  Three-way table: Example: Family size x #Cars x household income  Log Linear Models Extension Beyond 2-Way Tables

24  Equation:  Degrees of Freedom:( r-1 )  When would you use this statistic?  e.g., compare sample to population characteristics, or to previous study’s benchmark  to investigate the great M&M caper: One-Way Chi-Square

25 PLAINPEANUT e i ’so i ’se i ’so i ’s  blue  brown  green  orange  red  yellow  critical chi-square on 5 df = The Case of the Blue M&M’s: