Chapter 10 The Analysis of Frequencies. The expression “cross partition” refers to an abstract process of set theory. When the cross partition idea is.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
The Chi-Square Test for Association
Bivariate Analysis Cross-tabulation and chi-square.
Hypothesis Testing IV Chi Square.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Chapter 4 Probability.
Chapter 9 Principles of Analysis and Interpretation.
Crosstabs and Chi Squares Computer Applications in Psychology.
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Chi-Square Tests and the F-Distribution
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
AM Recitation 2/10/11.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Chi-square Test of Independence Hypotheses Neha Jain Lecturer School of Biotechnology Devi Ahilya University, Indore.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Multinomial Distribution
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 16 The Chi-Square Statistic
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
Chapter 4 Probability ©. Sample Space sample space.S The possible outcomes of a random experiment are called the basic outcomes, and the set of all basic.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Analysis of Qualitative Data Dr Azmi Mohd Tamil Dept of Community Health Universiti Kebangsaan Malaysia FK6163.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Non-parametric tests (chi-square test) Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Copyright © 2010 Pearson Education, Inc. Slide
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
Inferential Statistics. Explore relationships between variables Test hypotheses –Research hypothesis: a statement of the relationship between variables.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent variable.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Chi-square test.
Chapter 11 Chi-Square Tests.
Hypothesis Testing Review
Chapter 12 Tests with Qualitative Data
Active Learning Lecture Slides
Qualitative data – tests of association
Statistics.
Chapter 10 Analyzing the Association Between Categorical Variables
Chapter 11 Chi-Square Tests.
Inference on Categorical Data
Chapter 11 Chi-Square Tests.
Presentation transcript:

Chapter 10 The Analysis of Frequencies

The expression “cross partition” refers to an abstract process of set theory. When the cross partition idea is applied to the analysis of frequencies to study relations between variables, we call the cross partitions crosstabs or sometimes crossbreaks. This kind of analysis to be shown is called contingency analysis, or contingency table analysis. Table 10.1, 10.2

Data and Variable Terminology A distinction was made between active and attribute variable, the former meaning an experimental or manipulated variable and the latter a measured variables. Remember that a attribute variable is not always a categorical variable or qualitative variable. An attribute is any property of any object, whether the object is measured in an all-or none way or with a set of continuous measures.

Crosstabs: Definitions and Purpose A crosstab is a numerical tabular presentation of data, usually in frequency or percentage form, in which variables are cross partitioned. Crosstabs enable the researcher to determine the nature of the relations between variables, but also have other side purposes: They can be used to organize data in convenient form for statisitcal analysis.

Simple Crosstabs and Rules for Crosstab Construction The rules are (1) categories are set up according to the research hypotheses; (2) categories are independent and mutually exclusive; (3) categories are exhaustive; (4) each category is derived from one and only one classification principle; (5) all categories are on one level of discourse. In general, we will report the levels of the independent variable in column and the outcome responses of the dependent variable as rows in the contingency table.

Calculation of Percentages Percentages are calculated from the independent variable to the dependent variable. Figure Table 10.3 (p.225) Use percentages but not frequencies to highlight the relation of variables in crosstabs. Why? A common base.

Calculation of Percentages In Table The Payette-Clarizio problem is pointed toward the misclassification of children as eligible or ineligible for learning disability treatment. A hypothesis implied by the problem is: Decisionmakers are biased in their decisions about female children. This is a statement of the “If p, then q” kind: If female, then they are most likely eligible for learning disable considerations.

Calculation of Percentages Why not calculate the percentages the other way: from the dependent variable to the independent variable? Why not calculate the percentages over the whole table?

Statistical Significance and the Chi- square Test Look at Table Do they really express a relation between gender and learning disability eligibility? Or could they have happened by chance? Are they one pattern among many patterns of frequencies that one would get picking numbers from a table of random numbers, such selection being limited only by the given marginal frequencies?

Statistical Significance and the Chi- square Test We may say here that “degrees of freedom” defines the latitude of variation continued in a statistical problem. In the problem above, there is one degree of freedom because the total number of cases is fixed, 100, and because as soon as one of the frequencies is given, the other is immediately determined.

Levels of Statistical Significance The 0.05 level means that an obtained result that is significant at the 0.05 level could occur by chance no more than five times in 100 trials. The 0.05 level was originally chosen—and has persisted with researchers—because it is considered a reasonably good gamble. It is neither too high nor too low for most social scientific research.

Levels of Statistical Significance There is a newer trend of thinking that advocates reporting the significance levels of all results. Another school of thought advocates working with what are called “confidence intervals.” Rozeboom (1960) advocates the use of confidence intervals and the reporting of precise probability values of experimental outcomes. However, Brady (1988) states that such precision is generally meaningless in the social and behavioral sciences because of the inaccuracy of measurements.

Levels of Statistical Significance A statistically significant result does not imply personal or practical significance. It is Cramer’s V, a measure of association based on the chi-square value. The formula is:

Types of Crosstabs and Tables There are three types of tables: one- dimensional, two-dimensional, and k- dimensional. Theoretically, there is no limit to the number of variables that can be considered at one time. The only limitations are practical ones: insufficient sample size and difficulty of comprehension of the relations contained in a multidimensional table.

One-dimensional Tables There are two kinds of one-dimensional tables. One is a “true” one-dimensional table; it is of little interest to us because it does not express a relation. Only one variable is used in the table. Social scientists sometimes choose to report their data in tables that look one- dimensional but are really two-dimensional, such as Table 10.7.

Two-dimensional Tables Two-dimensional tables or crosstabs have two variables, each with two or more subclasses. Table 10.8, 10.9, 10.10,

Three- and k-Dimensional Tables The analysis of three or more variables simultaneously has two main purposes. First, is to study the relations among three or more variables. The second purpose is to control one variable while studying the relation between the other two variables.

Specification Specification is a process of describing the conditions under which a relation does or does not exist, or exists to a greater or a lesser extent. In the above analysis (Table 10.13, 10.14), the data were specified: it was shown, by introducing the social-class variable, that the relation between level of aspiration and success in college was stronger in one group (middle class) than in another group (working class).

Specification This is similar to the phenomenon of interaction discussed in chapter 9. Strictly speaking, “interaction” is a term used in experimental research and analysis of variance. The position taken in this book is that interaction is a general phenomenon of great importance occurring in both experimental and nonexperimental research.

Crosstabs, Relations, and Ordered Pairs A relation is a set of ordered pairs. Table 10.15, 10.16, Figure 10.4.

The Odds Ratio Odds are computed as the ratio of the probability that the event will occur to the probability that it will not occur. Odds ratio The chi-square statistic is still the preferred method; however, it is unable to give the type of information that odds ratios can give.

Multivariate Analysis of Frequency Data Many frequency data analysis, however, are of three and more variables. It is so- called “multi-way contingency tables with frequency data,” which can be handled by log-linear model.

Computer Addendum Figure 10.10~10.13