Summarizing the Relationship Between Two Variables with Tables and a bit of a review Chapters 6 and 7 Jan 31 and Feb 1, 2012.

Slides:



Advertisements
Similar presentations
Data Analysis for Two-Way Tables
Advertisements

SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Bivariate Analysis Cross-tabulation and chi-square.
AP Statistics Section 4.2 Relationships Between Categorical Variables.
Chapter 13: The Chi-Square Test
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #17.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Section 2.6 Relations in Categorical Variables So far in chapter two we have dealt with data that is quantitative. In this section we consider categorical.
Problem 1: Relationship between Two Variables-1 (1)
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Significance Testing 10/15/2013. Readings Chapter 3 Proposing Explanations, Framing Hypotheses, and Making Comparisons (Pollock) (pp ) Chapter 5.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
Chi-Square Test of Independence Practice Problem – 1
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
CHAPTER 11 SECTION 2 Inference for Relationships.
Section 9-1: Inference for Slope and Correlation Section 9-3: Confidence and Prediction Intervals Visit the Maths Study Centre.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Analysis of Two-Way tables Ch 9
Chapter Twelve Copyright © 2006 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Two-way tables BPS chapter 6 © 2006 W. H. Freeman and Company.
Analysis of two-way tables - Data analysis for two-way tables IPS chapter 2.6 © 2006 W.H. Freeman and Company.
 Some variables are inherently categorical, for example:  Sex  Race  Occupation  Other categorical variables are created by grouping values of a.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In this chapter we will study the relationship between two categorical variables (variables.
Stat1510: Statistical Thinking and Concepts Two Way Tables.
Two-Way Tables Categorical Data. Chapter 4 1.  In this chapter we will study the relationship between two categorical variables (variables whose values.
Aim: How do we analyze data with a two-way table?
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
Chapter 6 Two-Way Tables BPS - 5th Ed.Chapter 61.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Chapter 3: Descriptive Study of Bivariate Data. Univariate Data: data involving a single variable. Multivariate Data: data involving more than one variable.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In prior chapters we studied the relationship between two quantitative variables with.
AP Statistics Section 4.2 Relationships Between Categorical Variables
Chi-Square Analyses.
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Summarizing the Relationship Between Two Variables with Tables Chapter 6.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
CHAPTER 6: Two-Way Tables*
1 ES9 A random sample of registered voters was selected and each was asked his or her opinion on Proposal 129, a property tax reform bill. The distribution.
Scatterplots & Correlations Chapter 4. What we are going to cover Explanatory (Independent) and Response (Dependent) variables Displaying relationships.
Second factor: education
Making Use of Associations Tests
Inferential Statistics
Spearman’s rho Chi-square (χ2)
Analysis of two-way tables - Data analysis for two-way tables
Second factor: education
Summarising and presenting data - Bivariate analysis
Data Analysis for Two-Way Tables
Hypothesis Testing and Comparing Two Proportions
Second factor: education
Contingency Tables (cross tabs)
1.3 Data Recording, Analysis and Presentation
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Section 4-3 Relations in Categorical Data
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Making Use of Associations Tests
Relations in Categorical Data
Contingency Tables (cross tabs)
Presentation transcript:

Summarizing the Relationship Between Two Variables with Tables and a bit of a review Chapters 6 and 7 Jan 31 and Feb 1, 2012

Looking at Tables Tables are useful for examining the relationship between: –variables measured at the nominal or ordinal level, –or variables measured at the interval or ratio level with a small number of discrete values.

Some Terminology and Conventions Two-way tables, Crosstabulations (Crosstabs) Column (explanatory or independent) Row (response or dependent) Although the book does not always do this, there is a soft convention of stating a table title as “Dependent by Independent”. E.g. Table 2: Election Needed Now by Province.

Cells: As in a spread sheet a table is divided into cells aligned along rows and columns. Marginal Distributions: These are the numbers that summarize the rows and the columns at the side and bottom of a table Conditional Distributions: The book gives a very complicated explanation for what this is. In reality it is just the percentage of the cases in a cell or cells. This can be the percentage of cases along the horizontal row or the vertical column.

Here is an example of a Two Way table made with the “crosstab” procedure in SPSS (see my tip sheet for doing them in Excel). Question. Is there a difference between the number of bathrooms that homes in urban and rural Ontario have? Look at row 1. Reading Across: we see 94% of the homes with 1 bathroom are Urban 6% are Rural Look at Column 1. Reading Down: we see 52.2% of urban homes have 1 bath-room, 36.8% have 2 bathrooms 11.0% have three or more bathrooms.

The percentages give us a way to ‘eyeball’ the data and estimate if there is a difference, but to ask if these difference are meaningful, we must go further and calculate some statistics.

The Chi Sq. Test is a common one to use in a table with nominal variables such as this. It measures the difference between the number of cases we expect to see in each cell and the number of cases we actually observe in each cell of our table. The Chi Sq. value itself has little meaning for us. What matters is whether or not the value is significant. In this case it is >.05. Therefore we reject the hypothesis that the results we see are meaningfully different from what we could expect through simple probability Therefore we also reject that there is any meaningful difference between the number of bathrooms in urban and rural homes.

Simpson’s Paradox That lurking variable thing again Example 6.4 in your book gives you a look at a problem called Simpson’s Paradox. An association or comparison that holds for all or several groups can reverse direction when the data are combined to form a single group (Moore pg. 169). As Moore further notes, this is usually the sign of a “lurking” variable.

In order to check for lurking variables we can subdivide tables by a further categorical variable (such as was done in the book where the data was divided into serious and less serious accidents).

The book has two very nice “four step” graphics in Chapter 7

I would probably adjust this a bit. In the final leg I would say: –If the Y (response or dependent) variable is quantitative (measured at the interval or ratio level) a regression line is a good summary. –However, if the Y (response or dependent) variable is ordinal or nominal, then a two way table is needed.

And again the general four step approaches to any statistical problem State: What is the practical question, in the context of the real world setting Plan: What specific statistical operations does this problem call for Solve: Make the graphs and carry out the calculations needed for this problem Conclude: Give your practical conclusion in the setting of the real-world problem

Also keep in mind… I would also remind you that we looked at a couple of things the book did not cover, such as “spearman’s rho” as an alternative measure of correlation when working with a table in which both variables are ordinal and Chi Sq.