Unit 3 Relations in Categorical Data. Looking at Categorical Data Grouping values of quantitative data into specific classes We use counts or percents.

Slides:



Advertisements
Similar presentations
Data Analysis for Two-Way Tables
Advertisements

Displaying and Describing Categorical Data 60 min.
Introduction to Stats Honors Analysis. Data Analysis Individuals: Objects described by a set of data. (Ex: People, animals, things) Variable: Any characteristic.
AP Statistics Section 4.2 Relationships Between Categorical Variables.
Slide Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
The Three Rules of Data Analysis
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
CHAPTER 1 STATISTICS Statistics is a way of reasoning, along with a collection of tools and methods, designed to help us understand the world.
. Chapter 3 Displaying and Describing Categorical Data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
  The three rules of data analysis won’t be difficult to remember: 1. Make a picture—things may be revealed that are not obvious in the raw data. These.
Copyright © 2010 Pearson Education, Inc. Chapter 3 Displaying and Describing Categorical Data.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Do Now Have you: Read Harry Potter and the Deathly Hallows Seen Harry Potter and the Deathly Hallows (part 2)
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Chapter 3 Displaying and Describing Categorical Data
Chapter 3 Addie Molique, Ash Nair Displaying and Describing Categorical Data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Chapter 2 DISPLAYING AND DESCRIBING CATEGORICAL DATA.
CHAPTER 6: Two-Way Tables. Chapter 6 Concepts 2  Two-Way Tables  Row and Column Variables  Marginal Distributions  Conditional Distributions  Simpson’s.
Displaying Categorical Data THINK SHOW TELL What is categorical data? Bar, Segmented Bar, and Pie Charts Frequency vs. Relative Frequency Tables/Charts.
Chapter 3: Displaying and Describing Categorical Data *Data Analysis *Frequency Tables, Bar Charts, Pie Charts Contingency Tables.
Two-way tables BPS chapter 6 © 2006 W. H. Freeman and Company.
Analysis of two-way tables - Data analysis for two-way tables IPS chapter 2.6 © 2006 W.H. Freeman and Company.
Chapter 3: Displaying and Describing Categorical Data Sarah Lovelace and Alison Vicary Period 2.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. - use pie charts, bar graphs, and tables to display data Chapter 3: Displaying and Describing Categorical.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In this chapter we will study the relationship between two categorical variables (variables.
Stat1510: Statistical Thinking and Concepts Two Way Tables.
Two-Way Tables Categorical Data. Chapter 4 1.  In this chapter we will study the relationship between two categorical variables (variables whose values.
Aim: How do we analyze data with a two-way table?
1 Chapter 3 Displaying and Describing Categorical Data.
Chapter 6 Two-Way Tables BPS - 5th Ed.Chapter 61.
Slide 3-1 Copyright © 2004 Pearson Education, Inc.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
Unit 2 Descriptive Statistics Objective: To correctly identify and display sets of data.
AP Statistics Section 4.2 Relationships Between Categorical Variables
4.3 Relations in Categorical Data.  Use categorical data to calculate marginal and conditional proportions  Understand Simpson’s Paradox in context.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
Chapter 3 Displaying and Describing Categorical Data.
Displaying & Describing Categorical Data Chapter 3.
CHAPTER 6: Two-Way Tables*
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Objectives Given a contingency table of counts, construct a marginal distribution. Given a contingency table of counts, create a conditional distribution.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Unit 6, Module 15 – Two Way Tables (Part I) Categorical Data Comparing 2.
Chapter 3 Displaying and Describing Categorical Data Math2200.
1 Copyright © 2014, 2012, 2009 Pearson Education, Inc. Chapter 2 Displaying and Describing Categorical Data.
Copyright © 2009 Pearson Education, Inc. Chapter 3 Displaying and Describing Categorical Data.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Bell Ringer The State Education Department requires local school districts to keep these records on all students: age, race or ethnicity, days absent,
Analysis of two-way tables - Data analysis for two-way tables
Displaying and Describing
Displaying and Describing Categorical Data
Math 153 Stats Starts Here.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Relations in Categorical Data
AP Statistics Chapter 3 Part 2
Announcements 100 Years: Let's celebrate! The National Park Service turns 100 on August 25, 2016, and everyone can take part in the celebration! To honor.
Displaying and Describing Categorical Data
Section 4-3 Relations in Categorical Data
Displaying and Describing Categorical Data
Displaying and Describing Categorical data
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Relations in Categorical Data
Displaying and Describing Categorical Data
Presentation transcript:

Unit 3 Relations in Categorical Data

Looking at Categorical Data Grouping values of quantitative data into specific classes We use counts or percents to describe information falling into various categories

Contingency (Two-Way) Tables A contingency table allows us to look at two categorical variables together. It shows how individuals are distributed along each variable, contingent on the value of the other variable. –Example: we can examine the class of ticket and whether a person survived the Titanic:

Contingency Tables (cont.) The margins of the table, both on the right and on the bottom, give totals and the frequency distributions for each of the variables. Each frequency distribution is called a marginal distribution of its respective variable.

Contingency Tables (cont.) Each cell of the table gives the count for a combination of values of the two values. –For example, the second cell in the crew column tells us that 673 crew members died when the Titanic sunk.

Conditional Distributions A conditional distribution shows the distribution of one variable for just the individuals who satisfy some condition on another variable. –The following is the conditional distribution of ticket Class, conditional on having survived:

Conditional Distributions (cont.) –The following is the conditional distribution of ticket Class, conditional on having perished:

Conditional Distributions (cont.) The conditional distributions tell us that there is a difference in class for those who survived and those who perished. This is better shown with pie charts of the two distributions:

Conditional Distributions (cont.) We see that the distribution of Class for the survivors is different from that of the non-survivors. This leads us to believe that Class and Survival are associated and are not independent. The variables would be considered independent when the distribution of one variable in a contingency table is the same for all categories of the other variable.

Segmented Bar Charts A segmented bar chart displays the same information as a pie chart, but in the form of bars instead of circles. Question: Why a bar chart and not a histogram?

Example A company held a blood pressure screening clinic for its employees. The results are summarized in the table below by age group and blood pressure level. Find the marginal distribution of blood pressure level. Age BP Under Over 50 Low Normal High235972

Example A company held a blood pressure screening clinic for its employees. The results are summarized in the table below by age group and blood pressure level. Find the conditional distribution of blood pressure level for employees under 30. Age BP Under Over 50 Low Normal High235972

Simpson’s Paradox Example: Surgical survival rates –Hospital A and Hospital B both serve your community. Which is better? Compare. Hospital AHospital B Died6316 Survived

Simpson’s Paradox Example: Surgical survival rates –Hospital A and Hospital B both serve your community. Which is better? Compare. Hospital AHospital B Died6316 Survived Hospital AHospital B Died6316 Survived Total

Simpson’s Paradox Example: Surgical survival rates –Hospital A and Hospital B both serve your community. Which is better? Compare. Hospital A loses 3% (63/2100) and Hospital B loses 2% (16/800). Obviously, Hospital B is the better choice. Hospital AHospital B Died6316 Survived Hospital AHospital B Died6316 Survived Total

Simpson’s Paradox Not all surgery cases are equally serious. –Patients classified as “poor” or “good” condition before surgery Good condition Poor condition Only 1% of “good” (6/600) died in Hospital A, while 1.3% (8/600) in B. And, 3.8% of “poor” (57/1500) died in A, while 4% (8/200) died in B. Which is really better? Hospital AHospital B Died68 Survived Total600 Hospital AHospital B Died578 Survived Total

Simpson’s Paradox How can A do better in each group, yet do worse overall? –Original table was misleading because it did not show original patient condition –Hospital A took on more patients in poor condition and thus had the overall higher death rate

Simpson’s Paradox Simpson’s Paradox: –The reversal of the direction of a comparison or an association when data from several groups are combined to form a single group. –We are dealing with lurking categorical variables.