MATH 2311-06 Section 5.6. Relations in Categorical Data


MATH 2311 Section 5.6

Relations in Categorical Data
A two-way table organizes the data for two categorical variables. The row and column totals are called marginal distributions because they appear in the margins of the table.
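To make the idea of marginal totals concrete, here is a minimal Python sketch. The counts are hypothetical and chosen only so that they sum to 100; they are not the values from the slide's table, which is not reproduced in this transcript. The function name `marginal_totals` is likewise an illustrative choice.

```python
# Hypothetical counts: rows are pizza toppings, columns are movies.
# These numbers are made up for illustration; they are NOT the slide's table.
MOVIES = ["Star Wars", "Gone with the Wind", "Jurassic Park"]
counts = {
    "Pepperoni": [20, 5, 15],
    "Hamburger": [12, 8, 10],
    "Mushroom": [6, 14, 10],
}


def marginal_totals(table):
    """Return (row_totals, column_totals, grand_total) for a two-way table."""
    # Row totals: add up the cells in each row.
    row_totals = {row: sum(cells) for row, cells in table.items()}
    # Column totals: add up the j-th cell of every row.
    n_cols = len(next(iter(table.values())))
    column_totals = [
        sum(cells[j] for cells in table.values()) for j in range(n_cols)
    ]
    grand_total = sum(row_totals.values())
    return row_totals, column_totals, grand_total


row_totals, column_totals, grand_total = marginal_totals(counts)

print("Marginal distribution of topping (row totals):")
for topping, total in row_totals.items():
    print(f"  {topping:<20}{total:>4}")

print("Marginal distribution of movie (column totals):")
for movie, total in zip(MOVIES, column_totals):
    print(f"  {movie:<20}{total:>4}")

print(f"Sample size: {grand_total}")
```

Both sets of totals add up to the same grand total, which is a quick check that the margins were filled in correctly.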

Example: The following two-way table describes the preferences in movies and pizza toppings for a random sample of 100 people. Enter the marginal distributions in the table.

Draw a Bar Chart to Display the Marginal Distribution of Pizza Topping Preference
Excel is the best option for this. RStudio will also work, but its syntax is considerably harder to use.
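For readers who have neither Excel nor RStudio at hand, the following Python sketch prints a rough text-only bar chart of a marginal distribution. It is an alternative to the tools the slide names, not the course's method, and the topping totals are hypothetical placeholders rather than the slide's data. The helper name `text_bar_chart` is an assumption made for this example.

```python
# Hypothetical marginal totals for topping preference (not the slide's data).
topping_totals = {"Pepperoni": 40, "Hamburger": 30, "Mushroom": 30}


def text_bar_chart(totals, width=40):
    """Return one line per category with a bar scaled to the largest count."""
    largest = max(totals.values())
    lines = []
    for label, count in totals.items():
        # The largest category gets a bar of `width` characters;
        # every other bar is scaled proportionally.
        bar = "#" * round(width * count / largest)
        lines.append(f"{label:<10} {bar} {count}")
    return lines


chart = text_bar_chart(topping_totals)
for line in chart:
    print(line)
```

The bar heights here are counts. Dividing each count by the sample size before plotting would give the same picture in percentages.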

What percent of our sample likes Gone with the Wind? What percent of pepperoni lovers like Star Wars?

A conditional distribution is made up of the percentages that satisfy a given condition.
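As a sketch of how a conditional distribution is computed, the Python below restricts a two-way table to one row (the condition) and converts that row's counts to percentages of the row total. The counts are hypothetical, so the resulting percentages are not the answers to any question in these slides, and `conditional_distribution` is a name chosen for illustration.

```python
# Hypothetical counts: rows are pizza toppings, columns are movies.
# These numbers are made up for illustration; they are NOT the slide's table.
MOVIES = ["Star Wars", "Gone with the Wind", "Jurassic Park"]
counts = {
    "Pepperoni": [20, 5, 15],
    "Hamburger": [12, 8, 10],
    "Mushroom": [6, 14, 10],
}


def conditional_distribution(table, row, labels):
    """Percent of each column category among the individuals in `row`."""
    cells = table[row]
    row_total = sum(cells)  # the denominator is the row total, not the sample size
    return {label: 100 * cell / row_total for label, cell in zip(labels, cells)}


hamburger = conditional_distribution(counts, "Hamburger", MOVIES)
for movie, percent in hamburger.items():
    print(f"{movie:<20}{percent:5.1f}%")
```

The key design point is the denominator. A marginal percentage divides by the whole sample, while a conditional percentage divides only by the number of individuals who meet the condition, so each conditional distribution sums to 100% on its own.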

Popper 16: Compare the conditional distributions of movie preference for hamburger lovers and mushroom lovers. Back up your description with percentages.
1. What percent of hamburger lovers like Star Wars?
   a. 23%  b. 36%  c. 68%  d. 15%
2. What percent of hamburger lovers like Jurassic Park?
   a. 23%  b. 12%  c. 5%  d. 35%
3. What percent of mushroom lovers like Gone with the Wind?
   a. 59%  b. 37%  c. 2%  d. 22%

Simpson’s Paradox
Always be careful when combining data to make a comparison. Simpson’s Paradox is the reversal of the direction of a comparison or an association when data from several groups are combined to form a single group.

This is adapted from Subsection of A. Agresti (2002), Categorical Data Analysis, 2nd ed., Wiley, pp

In a 1991 study by Radelet and Pierce of the effect of race on death-penalty sentences, the following table was obtained, tabulating the death-penalty sentences (Death) and non-death-penalty sentences (No death) in murder convictions in the state of Florida.

Now consider the very same data, stratified by the race of the murder victim. Below is the table.

Here we see that when the cases involving Caucasian victims are considered separately from the cases involving African-American victims, African-American defendants are more likely than Caucasian defendants to receive the death penalty in both groups (22.9% vs. 11.3% in the first group and 2.8% vs. 0.0% in the second).
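The reversal can be checked numerically with a short Python sketch. The tables themselves are not reproduced in this transcript, so the counts below are an assumption: they are the figures commonly reported for this example in Agresti's text, and they do reproduce the four percentages quoted above. The function names are illustrative.

```python
# (death, no_death) counts keyed by (victim_race, defendant_race).
# Assumption: counts as commonly reported for Agresti's table; the transcript
# itself gives only the percentages, which these counts reproduce.
counts = {
    ("Caucasian", "Caucasian"): (53, 414),
    ("Caucasian", "African-American"): (11, 37),
    ("African-American", "Caucasian"): (0, 16),
    ("African-American", "African-American"): (4, 139),
}


def death_rate(death, no_death):
    """Percent of convictions that ended in a death sentence."""
    return 100 * death / (death + no_death)


def stratified_rates(table):
    """Death-sentence rate for each (victim race, defendant race) cell."""
    return {key: death_rate(*cell) for key, cell in table.items()}


def combined_rates(table):
    """Death-sentence rate by defendant race, ignoring victim race."""
    totals = {}
    for (_victim, defendant), (death, no_death) in table.items():
        d, n = totals.get(defendant, (0, 0))
        totals[defendant] = (d + death, n + no_death)
    return {defendant: death_rate(*cell) for defendant, cell in totals.items()}


stratified = stratified_rates(counts)
combined = combined_rates(counts)

for (victim, defendant), rate in stratified.items():
    print(f"victim {victim:<17} defendant {defendant:<17} {rate:5.1f}%")
for defendant, rate in combined.items():
    print(f"all victims combined     defendant {defendant:<17} {rate:5.1f}%")
```

Under these assumed counts, the combined rate is higher for Caucasian defendants even though the rate is higher for African-American defendants within each victim group. The reversal arises because the victim's race is associated both with the defendant's race and with the likelihood of a death sentence.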