Two-Way Tables Categorical Data. Chapter 4 1.  In this chapter we will study the relationship between two categorical variables (variables whose values.

Slides:



Advertisements
Similar presentations
Data Analysis for Two-Way Tables
Advertisements

C HAPTER 1.1 Analyzing Categorical Data. I NDIVIDUALS AND V ARIABLES Individuals are the objects described by a set of data. Individuals may be people,animals,
Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Comparitive Graphs.
AP Statistics Section 4.2 Relationships Between Categorical Variables.
2.4 Cautions about Correlation and Regression. Residuals (again!) Recall our discussion about residuals- what is a residual? The idea for line of best.
BPSChapter 61 Two-Way Tables. BPSChapter 62 To study associations between quantitative variables  correlation & regression (Ch 4 & Ch 5) To study associations.
Section 2.6 Relations in Categorical Variables So far in chapter two we have dealt with data that is quantitative. In this section we consider categorical.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
1 Chapter 5 Two-Way Tables Associations Between Categorical Variables.
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
Displaying & Describing Categorical Data Chapter 3.
BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple.
4.3 Categorical Data Relationships.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
HW#8: Chapter 2.5 page Complete three questions on the last two slides.
Relations and Categorical Data Target Goal: I can describe relationships among categorical data using two way tables. 1.1 cont. Hw: pg 24: 20, 21, 23,
Probability Unit 4 - Statistics What is probability? Proportion of times any outcome of any random phenomenon would occur in a very long series of repetitions.
Chapter 241 Two-Way Tables and the Chi-square Test.
Exploring Data Section 1.1 Analyzing Categorical Data.
Lecture Presentation Slides SEVENTH EDITION STATISTICS Moore / McCabe / Craig Introduction to the Practice of Chapter 2 Looking at Data: Relationships.
Analysis of Two-Way tables Ch 9
Unit 3 Relations in Categorical Data. Looking at Categorical Data Grouping values of quantitative data into specific classes We use counts or percents.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
CHAPTER 6: Two-Way Tables. Chapter 6 Concepts 2  Two-Way Tables  Row and Column Variables  Marginal Distributions  Conditional Distributions  Simpson’s.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Summarizing the Relationship Between Two Variables with Tables and a bit of a review Chapters 6 and 7 Jan 31 and Feb 1, 2012.
Warm Up The number of motor vehicles registered (in millions) in the U.S. has grown as charted in the table. 1)Plot the number of vehicles against time.
Two-way tables BPS chapter 6 © 2006 W. H. Freeman and Company.
Analysis of two-way tables - Data analysis for two-way tables IPS chapter 2.6 © 2006 W.H. Freeman and Company.
 Some variables are inherently categorical, for example:  Sex  Race  Occupation  Other categorical variables are created by grouping values of a.
Chapter 3: Displaying and Describing Categorical Data Sarah Lovelace and Alison Vicary Period 2.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In this chapter we will study the relationship between two categorical variables (variables.
Stat1510: Statistical Thinking and Concepts Two Way Tables.
Aim: How do we analyze data with a two-way table?
Inference about a population proportion. 1. Paper due March 29 Last day for consultation with me March 22 2.
Warm-up An investigator wants to study the effectiveness of two surgical procedures to correct near-sightedness: Procedure A uses cuts from a scalpel and.
DO NOW: Oatmeal and cholesterol Does eating oatmeal reduce cholesterol
Chapter 6 Two-Way Tables BPS - 5th Ed.Chapter 61.
Chapter 6 Sections 1-2. Bell Ringer I CAN: Daily Agenda Bell Ringer Review Bell Ringer I CAN Chapter 6 Sections 1-2.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In prior chapters we studied the relationship between two quantitative variables with.
AP Statistics Section 4.2 Relationships Between Categorical Variables
4.3 Relations in Categorical Data.  Use categorical data to calculate marginal and conditional proportions  Understand Simpson’s Paradox in context.
Chapter 6 EPS Due 11/06/15.
Chapter 1.1 – Analyzing Categorical Data A categorical variable places individuals into one of several groups of categories. A quantitative variable takes.
CHAPTER 6: Two-Way Tables*
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Copyright ©2011 Brooks/Cole, Cengage Learning Turning Data Into Information Use table and/or graph to represent Categorical Data Chapter 2 – Class 11 1.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
4.3 Reading Quiz (second half) 1. In a two way table when looking at education given a person is 55+ we refer to it as ____________ distribution. 2. True.
AP Statistics Chapter 3 Part 2 Displaying and Describing Categorical Data.
Second factor: education
Inference about a population proportion.
The Practice of Statistics in the Life Sciences Third Edition
Essential Statistics Two Categorical Variables: The Chi-Square Test
AP Statistics Chapter 3 Part 3
Analysis of two-way tables - Data analysis for two-way tables
Second factor: education
Looking at Data - Relationships Data analysis for two-way tables
Chapter 2 Looking at Data— Relationships
The Practice of Statistics in the Life Sciences Fourth Edition
AP STATISTICS LESSON 4 – 3 ( DAY 1 )
AP Statistics Chapter 3 Part 2
Second factor: education
Relations in Categorical Data
Section 4-3 Relations in Categorical Data
Section Way Tables and Marginal Distributions
Section Way Tables and Marginal Distributions
Relations in Categorical Data
Chapter 4: More on Two-Variable Data
Presentation transcript:

Two-Way Tables Categorical Data. Chapter 4 1

 In this chapter we will study the relationship between two categorical variables (variables whose values fall in groups or categories).  To analyze categorical data, use the counts or percents of individuals that fall into various categories. BPS - 5th Ed.Chapter 62

 When there are two categorical variables, the data are summarized in a two-way table ◦ each row in the table represents a value of the row variable ◦ each column of the table represents a value of the column variable  The number of observations falling into each combination of categories is entered into each cell of the table BPS - 5th Ed.Chapter 63

 A distribution for a categorical variable tells how often each outcome occurred ◦ totaling the values in each row of the table gives the marginal distribution of the row variable (totals are written in the right margin) ◦ totaling the values in each column of the table gives the marginal distribution of the column variable (totals are written in the bottom margin) BPS - 5th Ed.Chapter 64

 It is usually more informative to display each marginal distribution in terms of percents rather than counts ◦ each marginal total is divided by the table total to give the percents  A bar graph could be used to graphically display marginal distributions for categorical variables BPS - 5th Ed.Chapter 65

BPS - 5th Ed.Chapter 66 Data from the U.S. Census Bureau for the year 2000 on the level of education reached by Americans of different ages. (Statistical Abstract of the United States, 2001) Age and Education

BPS - 5th Ed.Chapter 67 Age and Education Variables Marginal distributions

BPS - 5th Ed.Chapter 68 Age and Education Variables Marginal distributions 21.6% 46.5% 32.0% 15.9% 33.1% 25.4% 25.6%

BPS - 5th Ed.Chapter 69 Age and Education Marginal Distribution for Education Level Not HS grad15.9% HS grad33.1% College 1-3 yrs25.4% College ≥4 yrs25.6%

 Relationships between categorical variables are described by calculating appropriate percents from the counts given in the table ◦ prevents misleading comparisons due to unequal sample sizes for different groups BPS - 5th Ed.Chapter 610

BPS - 5th Ed.Chapter 611 Age and Education Compare the age group to the age group in terms of success in completing at least 4 years of college: Data are in thousands, so we have that 11,071,000 persons in the age group have completed at least 4 years of college, compared to 23,160,000 persons in the age group. The groups appear greatly different, but look at the group totals.

BPS - 5th Ed.Chapter 612 Age and Education Compare the age group to the age group in terms of success in completing at least 4 years of college: Change the counts to percents: Now, with a fairer comparison using percents, the groups appear very similar.

BPS - 5th Ed.Chapter 613 Age and Education If we compute the percent completing at least four years of college for all of the age groups, this would give us the conditional distribution of age, given that the education level is “completed at least 4 years of college”: Age: and over Percent with ≥ 4 yrs college: 29.3%28.4%18.9%

 The conditional distribution of one variable can be calculated for each category of the other variable.  These can be displayed using bar graphs.  If the conditional distributions of the second variable are nearly the same for each category of the first variable, then we say that there is not an association between the two variables.  If there are significant differences in the conditional distributions for each category, then we say that there is an association between the two variables. BPS - 5th Ed.Chapter 614

BPS - 5th Ed.Chapter 615 Age and Education Conditional Distributions of Age for each level of Education:

 When studying the relationship between two variables, there may exist a lurking variable that creates a reversal in the direction of the relationship when the lurking variable is ignored as opposed to the direction of the relationship when the lurking variable is considered.  The lurking variable creates subgroups, and failure to take these subgroups into consideration can lead to misleading conclusions regarding the association between the two variables. BPS - 5th Ed.Chapter 616

Consider the acceptance rates for the following group of men and women who applied to college. BPS - 5th Ed.Chapter 617 counts Accepted Not accepted Total Men Women Total percents Accepted Not accepted Men55%45% Women44%56% A higher percentage of men were accepted: Discrimination?

Lurking variable: Applications were split between the Business School (240) and the Art School (320). BPS - 5th Ed.Chapter 618 counts Accepted Not accepted Total Men Women Total percents Accepted Not accepted Men15%85% Women20%80% A higher percentage of women were accepted in Business BUSINESS SCHOOL

Lurking variable: Applications were split between the Business School (240) and the Art School (320). BPS - 5th Ed.Chapter 619 counts Accepted Not accepted Total Men Women Total percents Accepted Not accepted Men75%25% Women80%20% ART SCHOOL A higher percentage of women were also accepted in Art

 So within each school a higher percentage of women were accepted than men. There is not any discrimination against women!!!  This is an example of Simpson’s Paradox. When the lurking variable (School applied to: Business or Art) is ignored the data seem to suggest discrimination against women. However, when the School is considered the association is reversed and suggests discrimination against men. BPS - 5th Ed.Chapter 620