CHAPTER 6: Two-Way Tables*

Slides:



Advertisements
Similar presentations
Data Analysis for Two-Way Tables
Advertisements

Comparitive Graphs.
AP Statistics Section 4.2 Relationships Between Categorical Variables.
CHAPTER 1 Exploring Data 1.1 Analyzing Categorical Data.
Section 2.6 Relations in Categorical Variables So far in chapter two we have dealt with data that is quantitative. In this section we consider categorical.
CHAPTER 1 Exploring Data
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 1 Exploring Data 1.1 Analyzing Categorical.
1 Chapter 5 Two-Way Tables Associations Between Categorical Variables.
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
Chapter 1: Exploring Data
+ The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1: Exploring Data Introduction Data Analysis: Making Sense of Data.
Warm-Up List all of the different types of graphs you can remember from previous years:
4.3 Categorical Data Relationships.
Exploring Data Section 1.1 Analyzing Categorical Data.
Chapter 1: Exploring Data Sec. 1.1 Analyzing Categorical Data.
Statistics: Analyzing 2 Categorical Variables MIDDLE SCHOOL LEVEL  Session #1  Presented by: Dr. Del Ferster.
Unit 3 Relations in Categorical Data. Looking at Categorical Data Grouping values of quantitative data into specific classes We use counts or percents.
CHAPTER 6: Two-Way Tables. Chapter 6 Concepts 2  Two-Way Tables  Row and Column Variables  Marginal Distributions  Conditional Distributions  Simpson’s.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 1 Exploring Data 1.0 Introduction Data Analysis:
Warm Up The number of motor vehicles registered (in millions) in the U.S. has grown as charted in the table. 1)Plot the number of vehicles against time.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Statistics.
Two-way tables BPS chapter 6 © 2006 W. H. Freeman and Company.
Analysis of two-way tables - Data analysis for two-way tables IPS chapter 2.6 © 2006 W.H. Freeman and Company.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Introduction:
 Some variables are inherently categorical, for example:  Sex  Race  Occupation  Other categorical variables are created by grouping values of a.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In this chapter we will study the relationship between two categorical variables (variables.
Stat1510: Statistical Thinking and Concepts Two Way Tables.
Two-Way Tables Categorical Data. Chapter 4 1.  In this chapter we will study the relationship between two categorical variables (variables whose values.
Aim: How do we analyze data with a two-way table?
Warm-up An investigator wants to study the effectiveness of two surgical procedures to correct near-sightedness: Procedure A uses cuts from a scalpel and.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
DO NOW: Oatmeal and cholesterol Does eating oatmeal reduce cholesterol
Chapter 6 Two-Way Tables BPS - 5th Ed.Chapter 61.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
Chapter 1: Exploring Data
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In prior chapters we studied the relationship between two quantitative variables with.
AP Statistics Section 4.2 Relationships Between Categorical Variables
4.3 Relations in Categorical Data.  Use categorical data to calculate marginal and conditional proportions  Understand Simpson’s Paradox in context.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Warm Up Which of these variables are categorical? Which are quantitative?
Chapter 1.1 – Analyzing Categorical Data A categorical variable places individuals into one of several groups of categories. A quantitative variable takes.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Introduction:
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
4.3 Reading Quiz (second half) 1. In a two way table when looking at education given a person is 55+ we refer to it as ____________ distribution. 2. True.
+ Analyzing Categorical Data Categorical Variables place individuals into one of several groups or categories The values of a categorical variable are.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 1: Exploring Data
Second factor: education
CHAPTER 1 Exploring Data
The Practice of Statistics in the Life Sciences Third Edition
AP Statistics Chapter 3 Part 3
Analysis of two-way tables - Data analysis for two-way tables
Second factor: education
Looking at Data - Relationships Data analysis for two-way tables
Chapter 2 Looking at Data— Relationships
The Practice of Statistics in the Life Sciences Fourth Edition
AP STATISTICS LESSON 4 – 3 ( DAY 1 )
CHAPTER 6: Two-Way Tables
Second factor: education
Warmup Which part- time jobs employed 10 or more of the students?
Chapter 2 Looking at Data— Relationships
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Section 4-3 Relations in Categorical Data
Section Way Tables and Marginal Distributions
Relations in Categorical Data
Presentation transcript:

CHAPTER 6: Two-Way Tables* Basic Practice of Statistics - 3rd Edition CHAPTER 6: Two-Way Tables* *This material is important in statistics, but it is needed later in this book only for Chapter 24. You may omit it if you do not plan to read Chapter 24 or delay reading it until you reach Chapter 24. Basic Practice of Statistics 7th Edition Lecture PowerPoint Slides Chapter 5

In Chapter 6, We Cover … Marginal distributions Conditional distributions Simpson’s paradox

Categorical Variables Review: Categorical variables place individuals into one of several groups or categories. The values of a categorical variable are labels for the different categories. The distribution of a categorical variable lists the count or percent of individuals who fall into each category. When a dataset involves two categorical variables, we begin by examining the counts or percents in various categories for one of the variables. Two-way table – Describes two categorical variables, organizing counts according to a row variable and a column variable.

Two-Way Table What are the variables described by this two-way table? Job outside home Stay home No preference Total Women, no college 81 104 10 195 Women, with college 173 115 15 303 Men, no college 92 32 2 126 Men, with college 299 8 388 1012 What are the variables described by this two-way table? How many young adults were surveyed?

Marginal Distribution The marginal distribution of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table. Note: Percents are often more informative than counts, especially when comparing groups of different sizes. To examine a marginal distribution: Use the data in the table to calculate the marginal distribution (in percents) of the row or column totals. Make a graph to display the marginal distribution.

Marginal Distribution Job outside home Stay home No preference Total Women, no college 81 104 10 195 Women, with college 173 115 15 303 Men, no college 92 32 2 126 Men, with college 299 8 388 1012 Examine the marginal distribution of gender/ education. Response Percent Women, no college 195/1012= 19.3% Women, with college 303/1012 = 29.9% Men, no college 126/1012 = 12.5% Men, with college 388/1012 = 38.3%

Conditional Distribution Marginal distributions tell us nothing about the relationship between two variables. A conditional distribution of a variable describes the values of that variable among individuals who have a specific value of another variable. To examine or compare conditional distributions: Select the row(s) or column(s) of interest. Use the data in the table to calculate the conditional distribution (in percents) of the row(s) or column(s). Make a graph to display the conditional distribution. Use a side-by-side bar graph or segmented bar graph to compare distributions.

Conditional Distribution Job outside home Stay home No preference Total Women, no college 81 104 10 195 Women, with college 173 115 15 303 Men, no college 92 32 2 126 Men, with college 299 8 388 1012 Calculate the conditional distribution of job preference for women and men with no college. Response Women no college Job outside home 81/195 = 41.5% Stay home 104/195 = 53.3% No preference 10/195 = 5.1% Men no college 92/126 = 73.0% 32/126 = 25.4% 2/126 = 1.6%

Simpson’s Paradox When studying the relationship between two variables, there may exist a lurking variable that creates a reversal in the direction of the relationship when the lurking variable is ignored, as opposed to the direction of the relationship when the lurking variable is considered. The lurking variable creates subgroups, and failure to take these subgroups into consideration can lead to misleading conclusions regarding the association between the two variables. An association or comparison that holds for all of several groups can reverse direction when the data are combined to form a single group. This reversal is called Simpson’s paradox.

Simpson’s Paradox Consider the survival rates for the following groups of victims who were taken to the hospital, either by helicopter, or by road: A higher percentage of those transported by helicopter died. Does this mean that this (more costly) mode of transportation isn’t helping? Counts Helicopter Road Victim died 64 260 Victim Survived 136 840 Total 200 1100 Percents Died Survived Helicopter 32% 68% Road 24% 76%

Simpson’s Paradox Consider the survival rates when broken down by type of accident. Serious accidents Counts Helicopter Road Died 48 60 Survived 52 40 Total 100 Percents Died Survived Helicopter 48% 52% Road 60% 40% Less-serious accidents Percents Died Survived Helicopter 16% 84% Road 20% 80% Counts Helicopter Road Died 16 200 Survived 84 800 Total 100 1000

Simpson’s Paradox Lurking variable: Accidents were of two sorts—serious (200) and less serious (1100). Helicopter evacuations had a higher survival rate within both types of accidents than did road evacuations. This is not evidence of the inefficacy of helicopter evacuation! This is an example of Simpson’s paradox. When the lurking variable (type of accident: serious or less serious) is ignored, the data seem to suggest road evacuations are safer than helicopter. However, when the type of accident is considered, the association is reversed and suggests helicopter evacuations are, in fact, safer.