Bi-variate #1 Cross-Tabulation

Slides:



Advertisements
Similar presentations
POL242 October 9 and 11, 2012 Jennifer Hove. Questions of Causality Recall: Most causal thinking in social sciences is probabilistic, not deterministic:
Advertisements

CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Basic Data Analysis. Tabulation Frequency table Percentages.
Bivariate Analysis Cross-tabulation and chi-square.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
By Wendiann Sethi Spring  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.
Statistics: An Introduction Alan Monroe: Chapter 6.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
PPA 415 – Research Methods in Public Administration Lecture 9 – Bivariate Association.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Quantitative Data Analysis: Univariate (cont’d) & Bivariate Statistics
Inferential Statistics  Hypothesis testing (relationship between 2 or more variables)  We want to make inferences from a sample to a population.  A.
19 May Crawford School 1 Basic Statistics – 1 Semester 1, 2009 POGO8096/8196: Research Methods Crawford School of Economics and Government.
Social Research Methods
PPA 501 – Analytical Methods in Administration Lecture 9 – Bivariate Association.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Crosstabs. When to Use Crosstabs as a Bivariate Data Analysis Technique For examining the relationship of two CATEGORIC variables  For example, do men.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Some Introductory Statistics Terminology. Descriptive Statistics Procedures used to summarize, organize, and simplify data (data being a collection of.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Research Methods Chapter 8 Data Analysis. Two Types of Statistics Descriptive –Allows you to describe relationships between variables Inferential –Allows.
Copyright © 2012 by Nelson Education Limited. Chapter 10 Hypothesis Testing IV: Chi Square 10-1.
Chapter 18 Some Other (Important) Statistical Procedures You Should Know About Part IV Significantly Different: Using Inferential Statistics.
Chapter 16 The Chi-Square Statistic
CHI SQUARE TESTS.
Chi-square Test of Independence
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Chapter Eight: Using Statistics to Answer Questions.
Inferential Statistics. Explore relationships between variables Test hypotheses –Research hypothesis: a statement of the relationship between variables.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi Square & Correlation
1 UNIT 13: DATA ANALYSIS. 2 A. Editing, Coding and Computer Entry Editing in field i.e after completion of each interview/questionnaire. Editing again.
Cross Tabs and Chi-Squared Testing for a Relationship Between Nominal/Ordinal Variables.
Organizing the Data Levin and Fox Elementary Statistics In Social Research Chapter 2.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
Other tests of significance. Independent variables: continuous Dependent variable: continuous Correlation: Relationship between variables Regression:
Introduction to Marketing Research
Data measurement, probability and Spearman’s Rho
Statistical analysis.
Final Project Reminder
Hypothesis Testing.
Final Project Reminder
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Statistical analysis.
Hypothesis Testing Review
Basic Statistics Overview
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Social Research Methods
Types of Bivariate Relationships and Associated Statistics
Summarising and presenting data - Bivariate analysis
NURS 790: Methods for Research and Evidence Based Practice
The Table Categorization
Contingency Tables.
3.2 Pie Charts and Two-Way Tables
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Making Use of Associations Tests
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
Chapter 18: The Chi-Square Statistic
Descriptive Statistics
Regression Part II.
CHI SQUARE (χ2) Dangerous Curves Ahead!.
Presentation transcript:

Bi-variate #1 Cross-Tabulation PS 372

Single Variable “Typical Case” is central tendency (Mean, Mode, Median). Variability or dispersion (Variance). Overall pattern is the distribution (Histogram, Bar Charts, Pie Charts).

Two Variables Cross Tabulation Difference-of-means Analysis of Variance Correlation Analysis Regression Analysis

Steps in Two Variable Analysis Is there a statistical relationship? How strong is the relationship? What is the relationship’s direction? Is the relationship casual or not? If we know the answers we can make predictions (subject to error) if we know the value of one the two variables.

Statistical Relationship A statistical relationship between two variables exists if the values of the observations for one variable are associated or connected to the values of the observations of the other. Without a statistical relationship cannot answer the other three questions!

Statistical Relationship Always a possibility (especially with samples) that an observed relationship is due only to chance and is an inaccurate indication of what we would have observed in the entire population. It is possible to test for “statistical significance”.

Strength of Relationship Strength of relationship indicates how consistently the values of an IV variable are associated with the values of a DV! Weak and Strong relationships. Remember correlation (r).

Strength of Relationship Nil or Weak if proportion of cases across various categories of independent variable are nearly equal across categories/values of the dependent variable!!! Proportion of Men voting/non voting is nearly equal to percentage of women voting/non-voting! i.e. Gender does not explain participation at the ballot box.

Strength of Relationship Strong or Perfect relationship if proportion of cases of various categories of independent variable differ substantially across categories of the dependent variable! Perfect if ALL values for one category of the IV fall into one category of the DV…and values for another IV category fall into another category of the DV!!!

Direction of Relationship The direction or shape of the relationship tells us which values of the independent variable are associated with which values of the dependent variable, rather than simply whether the two are related.

Direction of the Relationship The direction of the relationship tells us how the values of the independent variable are associated with the values of the dependent variable Positive and Negative Relationship between Independent and DV! Need Directionality… Ordinal or Higher!

Direction of the Relationship Positive Relationship – Higher values of the Independent Variable are associated with Higher Values of the DV!!! More Literacy; More Democracy More Education; More Political Knowledge Any More????? Casinos and Crime??? Income and Campaign Contributions???

Direction of the Relationship Negative Relationship – Higher Values of the IV are associated with Lower Values of the DV! And Vice Versa! More Democracy; Less War More African Americans; Less Welfare More Income; Less Liberal Any More???? More negative ads; Less Voting

Causality That a relationship exists between independent and dependent variables does not necessarily imply causality. What do we need to prove causality? Covariance; Time Dependence; Rival Explanations!!!

What is a Crosstabulation? Crosstabulations are appropriate for examining relationships between variables that are nominal, ordinal, or dichotomous. Few Categories!!!

What is a Crosstabulation? Crosstab displays the joint distribution of values of the variables by listing the categories for one of the variables along one side (DV) and the categories for the other variables across the top (IV). Each case is then placed in the cell of the table that represents the combination of values that corresponds to its scores on the variables.

What is a Crosstabulation? Example: We would like to know if presidential vote choice in 2000 was related to race. Dependent variable: presidential vote in 2000. Voted for Bush or Gore (Nominal). Independent variable: race. Non-White and White (Nominal).

What is a Crosstabulation? To show the data we construct a table showing each case’s value for both variables by putting the independent variable across the top and the dependent variable down the side and creating a grid of boxes or cells, one for each combination of the variables.

Are Race and Vote Choice Related? Why?

Crosstab What is important in testing the hypothesis is not which cases have particular values for the independent and dependent variables, but how many have each combination of values. The number or frequency of observations in each CELL is important.

Interpretation Each cell in the table should contain two numbers. The first is the frequency of cases having that particular combination of values. By themselves the frequencies are not especially helpful.

Interpretation The second number is the relative frequency (percentage) of the column. It is important to stress that the percentages add to 100 down the columns (not the rows).

Are Race and Vote Choice Related? Why?

Strength of the Relationships Weak or Strong Relationship? How do we know relationship strength from a cross-tab??? Need to look at how values across categories of IV are distributed across values of DV!!!

Strength of the Relationships A weak relationship would be one in which the differences in the observed values of the dependent variable for different categories of the independent variable are Very Slight The weakest is one in which the joint distribution is identical for all categories of the independent variable.

Direction of the Relationships The direction of the relationship shows which values of the independent variable are associated with which values of the dependent variable. Important if the variable is ordinal. Often a critical part of testing a hypothesis.

Crosstabs Important to check to see if the IV is the columns and not the rows. Want to know proportion of values of IV that fall into different categories of the DV!!! Sometimes the independent variable is the rows to save space and for easier presentation of the table. Important that the percentage adds to 100 in each category of the IV variable (Column %)

Crosstab No matter the number of cases and categories the procedure remains the same: Separate the cases into groups based on their values for the independent variable. Compare the values of the dependent variable for those groups. Decide whether the values for the dependent variable are different for the groups.