Chapter 12: Measures of Association for Nominal and Ordinal Variables

Slides:



Advertisements
Similar presentations
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Advertisements

Chapter 13 (Ch. 11 in 2nd Can. Ed.)
1. Nominal Measures of Association 2. Ordinal Measure s of Association
Chapter 16: Correlation.
Association Between Two Variables Measured at the Nominal Level
Chapter 7: Measures of Association for Nominal and Ordinal Variables
Measures of Association for contingency tables 4 Figure 8.2 : lambda – association; +-1: strong; near 0: weak Positive association: as value of the independent.
Describing Relationships Using Correlation and Regression
Chapter 13: The Chi-Square Test
Sociology 601 Class 17: October 28, 2009 Review (linear regression) –new terms and concepts –assumptions –reading regression computer outputs Correlation.
Sociology 601 Class 13: October 13, 2009 Measures of association for tables (8.4) –Difference of proportions –Ratios of proportions –the odds ratio Measures.
Correlation CJ 526 Statistical Analysis in Criminal Justice.
Correlation Chapter 9.
PPA 415 – Research Methods in Public Administration Lecture 9 – Bivariate Association.
PPA 501 – Analytical Methods in Administration Lecture 9 – Bivariate Association.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Leon-Guerrero and Frankfort-Nachmias,
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
Chapter 14 Association Between Variables Measured at the Ordinal Level.
Chapter 14 in 1e Ch. 12 in 2/3 Can. Ed. Association Between Variables Measured at the Ordinal Level Using the Statistic Gamma and Conducting a Z-test for.
Measures of Central Tendency
Week 11 Chapter 12 – Association between variables measured at the nominal level.
Chapter 8: Bivariate Regression and Correlation
Chapter 2: Organization of Information: Frequency Distributions Frequency Distributions Proportions and Percentages Percentage Distributions Comparisons.
Significance Testing 10/22/2013. Readings Chapter 3 Proposing Explanations, Framing Hypotheses, and Making Comparisons (Pollock) (pp ) Chapter 5.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 2 BROOKLYN COLLEGE – CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
Association between Variables Measured at the Nominal Level.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
Regression and Correlation. Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends.
Measures of Association. When examining relationships (or the lack thereof) between nominal- and ordinal-level variables, Crosstabs are our instruments.
1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Statistics in Applied Science and Technology Chapter 13, Correlation and Regression Part I, Correlation (Measure of Association)
Chapter 16 The Elaboration Model Key Terms. Descriptive statistics Statistical computations describing either the characteristics of a sample or the relationship.
Chapter 1: The What and the Why of Statistics  The Research Process  Asking a Research Question  The Role of Theory  Formulating the Hypotheses  Independent.
1 Lecture 7: Two Way Tables Graduate School Quantitative Research Methods Gwilym Pryce
Chapter 2: The Organization of Information: Frequency Distributions  Frequency Distributions  Proportions and Percentages  Percentage Distributions.
Chapter 7 – 1 Chapter 12: Measures of Association for Nominal and Ordinal Variables Proportional Reduction of Error (PRE) Degree of Association For Nominal.
Chapter 16 Data Analysis: Testing for Associations.
Contingency Tables – Part II – Getting Past Chi-Square?
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Chapter 7 – 1 Chapter 7 Measures of Association for Nominal and Ordinal Variables Proportional Reduction of Error (PRE) Degree of Association For Nominal.
Chapter 10: Cross-Tabulation Relationships Between Variables  Independent and Dependent Variables  Constructing a Bivariate Table  Computing Percentages.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Practice Problem: Lambda (1)
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
DATA ANALYSIS GRAPHS Graphs are easy to read, and highlight distribution’s shape. The are useful because they show the full range of variation and identity.
Chapter 8 – 1 Regression & Correlation:Extended Treatment Overview The Scatter Diagram Bivariate Linear Regression Prediction Error Coefficient of Determination.
Copyright © 2014 by Nelson Education Limited Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured.
Measures of Association June 25, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
CHAPTER 8: RELATIONSHIPS BETWEEN TWO VARIABLES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Copyright © 2012 by Nelson Education Limited. Chapter 12 Association Between Variables Measured at the Ordinal Level 12-1.
CHAPTER 5: THE NORMAL DISTRIBUTION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 9 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Chapter 1: The What and the Why of Statistics
Association Between Variables Measured at the Ordinal Level
Final Project Reminder
Final Project Reminder
Chapter 14 in 1e Ch. 12 in 2/3 Can. Ed.
Chapter 13 (1e), (Ch. 11 2/3e) Association Between Variables Measured at the Nominal Level: Phi, Cramer’s V, and Lambda.
Association Between Variables Measured at the Ordinal Level
Association Between Variables Measured at Nominal Level
THE PRINCIPLE OF PRE.
1. Nominal Measures of Association 2. Ordinal Measure s of Associaiton
Association Between Variables Measured At Ordinal Level
1. Nominal Measures of Association 2. Ordinal Measure s of Associaiton
Presentation transcript:

Chapter 12: Measures of Association for Nominal and Ordinal Variables Proportional Reduction of Error (PRE) Degree of Association For Nominal Variables Lambda For Ordinal Variables Gamma Using Gamma for Dichotomous Variables Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Measures of Association Measure of association—a single summarizing number that reflects the strength of a relationship, indicates the usefulness of predicting the dependent variable from the independent variable, and often shows the direction of the relationship. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

The most common race/ethnicity for U.S. residents (e.g., the mode)! Take your best guess? If you know nothing else about a person except that he or she lives in United States and I asked you to guess his or her race/ethnicity, what would you guess? The most common race/ethnicity for U.S. residents (e.g., the mode)! Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Take your best guess? Now, if we know that this person lives in San Diego, California, would you change your guess? With quantitative analyses we are generally trying to predict or take our best guess at value of the dependent variable. One way to assess the relationship between two variables is to consider the degree to which the extra information of the independent variable makes your guess better. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Proportional Reduction of Error (PRE) PRE—the concept that underlies the definition and interpretation of several measures of association. PRE measures are derived by comparing the errors made in predicting the dependent variable while ignoring the independent variable with errors made when making predictions that use information about the independent variable. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Proportional Reduction of Error (PRE) where: E1 = errors of prediction made when the independent variable is ignored E2 = errors of prediction made when the prediction is based on the independent variable Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Two PRE Measures: Lambda & Gamma Appropriate for… Lambda NOMINAL variables Gamma ORDINAL & DICHOTOMOUS NOMINAL variables Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Lambda Lambda—An asymmetrical measure of association suitable for use with nominal variables and may range from 0 (meaning the extra information provided by the independent variable does not help prediction) to 1 (meaning use of independent variable results in no prediction errors). It provides us with an indication of the strength of an association between the independent and dependent variables. A lower value represents a weaker association, while a higher value is indicative of a stronger association Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Lambda where: E1= Ntotal - Nmode of dependent variable Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Example 1: 2000 Vote By Abortion Attitudes Vote Yes No Row Total Gore 46 39 85 Bush 41 73 114 Total 87 112 199 Abortion Attitudes (for any reason) Source: General Social Survey, 2002 Step One—Add percentages to the table to get the data in a format that allows you to clearly assess the nature of the relationship. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Example 1: 2000 Vote By Abortion Attitudes Abortion Attitudes (for any reason) Vote Yes No Row Total Gore 52.9% 34.8% 42.7% 46 39 85 Bush 47.1% 65.2% 57.3% 41 73 114 Total 100% 100% 100% 87 112 199 Source: General Social Survey, 2002 Now calculate E1 E1 = Ntotal – Nmode = 199 – 114 = 85 Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Example 1: 2000 Vote By Abortion Attitudes Vote Yes No Row Total Gore 52.9% 34.8% 42.7% 46 39 85 Bush 47.1% 65.2% 57.3% 41 73 114 Total 100% 100% 100% 87 112 199 Abortion Attitudes (for any reason) Source: General Social Survey, 2002 Now calculate E2 E2 = [N(Yes column total) – N(Yes column mode)] + [N(No column total) – N(No column mode)] = [87 – 46] + …

Example 1: 2000 Vote By Abortion Attitudes Vote Yes No Row Total Gore 52.9% 34.8% 42.7% 46 39 85 Bush 47.1% 65.2% 57.3% 41 73 114 Total 100% 100% 100% 87 112 199 Abortion Attitudes (for any reason) Source: General Social Survey, 2002 Now calculate E2 E2 = [N(Yes column total) – N(Yes column mode)] + [N(No column total) – N(No column mode)] = [87 – 46] + [112 – 73]

Example 1: 2000 Vote By Abortion Attitudes Vote Yes No Row Total Gore 52.9% 34.8% 42.7% 46 39 85 Bush 47.1% 65.2% 57.3% 41 73 114 Total 100% 100% 100% 87 112 199 Abortion Attitudes (for any reason) Source: General Social Survey, 2002 Now calculate E2 E2 = [N(Yes column total) – N(Yes column mode)] + [N(No column total) – N(No column mode)] = [87 – 46] + [112 – 73] = 80

Example 1: 2000 Vote By Abortion Attitudes Vote Yes No Row Total Gore 52.9% 34.8% 42.7% 46 39 85 Bush 47.1% 65.2% 57.3% 41 73 114 Total 100% 100% 100% 87 112 199 Abortion Attitudes (for any reason) Source: General Social Survey, 2002 Lambda = [E1– E2] / E1 = [85 – 80] / 85 = .06 Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Example 1: 2000 Vote By Abortion Attitudes Gore 52.9% 34.8% 42.7% 46 39 85 Bush 47.1% 65.2% 57.3% 41 73 114 Total 100% 100% 100% 87 112 199 Abortion Attitudes (for any reason) Vote Yes No Row Total Source: General Social Survey, 2002 Lambda = .06 So, we know that six percent of the errors in predicting 2000 presidential election vote can be reduced by taking into account abortion attitudes. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Example 2: Victim-Offender Relationship and Type of Crime: 1993 Step One—Add percentages to the table to get the data in a format that allows you to clearly assess the nature of the relationship. *Source: Kathleen Maguire and Ann L. Pastore, eds., Sourcebook of Criminal Justice Statistics 1994., U.S. Department of Justice, Bureau of Justice Statistics, Washington, D.C.: USGPO, 1995, p. 343. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Victim-Offender Relationship & Type of Crime: 1993 Now calculate E1 E1 = Ntotal – Nmode = 9,898,980 – 5,045,040 = 4,835,940 Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Victim-Offender Relationship & Type of Crime: 1993 Now calculate E2 E2 = [N(rape/sexual assault column total) – N(rape/sexual assault column mode)] + [N(robbery column total) – N(robbery column mode)] + [N(assault column total) – N(assault column mode)] = [472,760 – 350,670] + …

Victim-Offender Relationship and Type of Crime: 1993 Now calculate E2 E2 = [N(rape/sexual assault column total) – N(rape/sexual assault column mode)] + [N(robbery column total) – N(robbery column mode)] + [N(assault column total) – N(assault column mode)] = [472,760 – 350,670] + [1,161,900 – 930,860] + …

Victim-Offender Relationship and Type of Crime: 1993 Now calculate E2 E2 = [N(rape/sexual assault column total) – N(rape/sexual assault column mode)] + [N(robbery column total) – N(robbery column mode)] + [N(assault column total) – N(assault column mode)] = [472,760 – 350,670] + [1,161,900 – 930,860] + [8,264,320 – 4,272,230] = 4,345,220

Victim-Offender Relationship and Type of Crime: 1993 Lambda = [E1– E2] / E1 = [4,835,940 – 4,345,220] / 4,835,940 = .10 So, we know that ten percent of the errors in predicting the relationship between victim and offender (stranger vs. non-stranger) can be reduced by taking into account the type of crime that was committed.

Asymmetrical Measure of Association A measure whose value may vary depending on which variable is considered the independent variable and which the dependent variable. Lambda is an asymmetrical measure of association. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Symmetrical Measure of Association A measure whose value will be the same when either variable is considered the independent variable or the dependent variable. Gamma is a symmetrical measure of association… Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Before Computing GAMMA: It is necessary to introduce the concept of paired observations. Paired observations – Observations compared in terms of their relative rankings on the independent and dependent variables. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Tied Pairs Same order pair (Ns) – Paired observations that show a positive association; the member of the pair ranked higher on the independent variable is also ranked higher on the dependent variable. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Tied Pairs Inverse order pair (Nd) – Paired observations that show a negative association; the member of the pair ranked higher on the independent variable is ranked lower on the dependent variable. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Gamma Gamma—a symmetrical measure of association suitable for use with ordinal variables or with dichotomous nominal variables. It can vary from 0 (meaning the extra information provided by the independent variable does not help prediction) to 1 (meaning use of independent variable results in no prediction errors) and provides us with an indication of the strength and direction of the association between the variables. When there are more Ns pairs, gamma will be positive; when there are more Nd pairs, gamma will be negative. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Interpreting Gamma The sign depends on the way the variables are coded: + the two “high” values are associated, as are the two “lows” – the “highs” are associated with the “lows” Interpretation….when Gamma = 0.xx, then xx% of the variation in the dependent variable can be accounted for by the variation in the independent variable. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications

Measures of Association Measures of association—a single summarizing number that reflects the strength of the relationship. This statistic shows the magnitude and/or direction of a relationship between variables. Magnitude—the closer to the absolute value of 1, the stronger the association. If the measure equals 0, there is no relationship between the two variables. Direction—the sign on the measure indicates if the relationship is positive or negative. In a positive relationship, when one variable is high, so is the other. In a negative relationship, when one variable is high, the other is low. Frankfort-Nachmias and Leon-Guerrero, Statistics for a Diverse Society, 6e © 2011 SAGE Publications