BPSChapter 61 Two-Way Tables. BPSChapter 62 To study associations between quantitative variables  correlation & regression (Ch 4 & Ch 5) To study associations.

Slides:



Advertisements
Similar presentations
Data Analysis for Two-Way Tables
Advertisements

Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Comparitive Graphs.
AP Statistics Section 4.2 Relationships Between Categorical Variables.
2.4 Cautions about Correlation and Regression. Residuals (again!) Recall our discussion about residuals- what is a residual? The idea for line of best.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
CHAPTER 1 Exploring Data 1.1 Analyzing Categorical Data.
Ch 2 and 9.1 Relationships Between 2 Variables
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
The Practice of Statistics
1 Chapter 5 Two-Way Tables Associations Between Categorical Variables.
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
HW#8: Chapter 2.5 page Complete three questions on the last two slides.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
Chapter 4 More on Two-Variable Data “Each of us is a statistical impossibility around which hover a million other lives that were never destined to be.
Lecture Presentation Slides SEVENTH EDITION STATISTICS Moore / McCabe / Craig Introduction to the Practice of Chapter 2 Looking at Data: Relationships.
CHAPTER 6: Two-Way Tables. Chapter 6 Concepts 2  Two-Way Tables  Row and Column Variables  Marginal Distributions  Conditional Distributions  Simpson’s.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Summarizing the Relationship Between Two Variables with Tables and a bit of a review Chapters 6 and 7 Jan 31 and Feb 1, 2012.
Two-way tables BPS chapter 6 © 2006 W. H. Freeman and Company.
Analysis of two-way tables - Data analysis for two-way tables IPS chapter 2.6 © 2006 W.H. Freeman and Company.
 Some variables are inherently categorical, for example:  Sex  Race  Occupation  Other categorical variables are created by grouping values of a.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In this chapter we will study the relationship between two categorical variables (variables.
Stat1510: Statistical Thinking and Concepts Two Way Tables.
Two-Way Tables Categorical Data. Chapter 4 1.  In this chapter we will study the relationship between two categorical variables (variables whose values.
Aim: How do we analyze data with a two-way table?
Warm-up An investigator wants to study the effectiveness of two surgical procedures to correct near-sightedness: Procedure A uses cuts from a scalpel and.
Chapter 6 Two-Way Tables BPS - 5th Ed.Chapter 61.
Chapter 3: Descriptive Study of Bivariate Data. Univariate Data: data involving a single variable. Multivariate Data: data involving more than one variable.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In prior chapters we studied the relationship between two quantitative variables with.
AP Statistics Section 4.2 Relationships Between Categorical Variables
4.3 Relations in Categorical Data.  Use categorical data to calculate marginal and conditional proportions  Understand Simpson’s Paradox in context.
Summarizing the Relationship Between Two Variables with Tables Chapter 6.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Section 4.4 Contingency Tables and Association. Definitions Contingency Table (Two-Way Table): Relates two categories of data Row Variable: Each row in.
CHAPTER 6: Two-Way Tables*
4.3 Reading Quiz (second half) 1. In a two way table when looking at education given a person is 55+ we refer to it as ____________ distribution. 2. True.
Analyzing Categorical Data
CHAPTER 1 Exploring Data
AP Statistics Chapter 3 Part 3
Analysis of two-way tables - Data analysis for two-way tables
CHAPTER 1 Exploring Data
Second factor: education
Looking at Data - Relationships Data analysis for two-way tables
Chapter 2 Looking at Data— Relationships
AP STATISTICS LESSON 4 – 3 ( DAY 1 )
Second factor: education
Warmup Which part- time jobs employed 10 or more of the students?
Chapter 2 Looking at Data— Relationships
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Section 4-3 Relations in Categorical Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Section Way Tables and Marginal Distributions
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Relations in Categorical Data
CHAPTER 1 Exploring Data
Chapter 4: More on Two-Variable Data
CHAPTER 1 Exploring Data
Presentation transcript:

BPSChapter 61 Two-Way Tables

BPSChapter 62 To study associations between quantitative variables  correlation & regression (Ch 4 & Ch 5) To study associations between categorical variables  cross-tabulate frequencies & calculate conditional percents (this Chapter) Association

BPSChapter 63 Example: Age and Education Variables Marginal distributions “Age groups” is the categorical explanatory variable “Education level” is the categorical response variable

BPSChapter 64 Example: Marginal Totals Variables Marginal totals 37,786 81,435 56,008 27,858 58,077 44,465 44,828

BPSChapter 65 Marginal Distributions Marginal distributions are used as background information only. They do not address association

BPSChapter 66 Marginal Distribution, Row Variable % not completed HS =27,859 / 175,230 × 100% = 15.9% % graduated HS =58,077 / 175,230 × 100% = 33.1% % finished 1-3 yrs col. =44,465 / 175,230 × 100% = 25.4% % finished ≥4 yrs col. =44,828 / 175,230 × 100% = 25.6%

BPSChapter 67 Marginal Distribution, Column Variable % age 25–34 =37,786 / 175,230 × 100% = 21.6% % age 35–54 =81,435 / 175,230 × 100% = 46.5% % 55 and over =56,008 / 175,230 × 100% = 32.0%

BPSChapter 68 Association To determine associations, calculate conditional distributions (conditional percents) Two types of conditional distributions: Conditioned on row variable Conditioned on column variable

BPSChapter 69 Association If explanatory variable is in rows  calculate row percents  analyze row conditional distributions

BPSChapter 610 Association If explanatory variable is in columns  calculate column percents  analyze column conditional distribution

BPSChapter 611 Example: Column Percents Is AGE associated with EDUCATION? AGE is explanatory var.  use column percents

BPSChapter 612 Example: Association Percents completing college by age Age % completed college 29.3%28.4%18.9% As age goes up, % completing college goes down NEGATIVE association between age and education

BPSChapter 613 No association: conditional percents nearly equal at all levels of explanatory variable Positive association: as explanatory variable rises  conditional percentages increase Negative associations: as explanatory variable rises  conditional percentages go down Association

BPSChapter 614 Statement of problem: Is ACCEPTANCE into a graduate program (response variable) predicted by GENDER (explanatory variable)? Example 2: Row Percent AcceptedNot accept.Total Male Female Total Explanatory variable (gender) is in rows  use row percents

BPSChapter 615 Example 2 AcceptedNot acceptTotal Male Female Total Explanatory variable in rows  use row percents Therefore: positive association with “maleness” Statement of problem: Is ACCEPTANCE associated with GENDER?

BPSChapter 616 Simpson’s Paradox In example 2, consider the lurking variable "major” –Business School (240 applicants) –Art School (320 applicants) Does this lurking variable explain the association? To address this potential problem, subdivide the data according to the lurking variable Lurking variables can change or even reverse the direction of an association

BPSChapter 617 Simpson’s Paradox Illustration Business School Applicants SuccessFailureTotal Male Female Total Male proportion = 18 / 120 = 0.15 Female prop. = 24 / 120 = 0.20 Negative association All Applicants SuccessFailureTotal Male Female Total Art School Applicants SuccessFailureTotal Male Female Total Male proportion = 180 / 240 = 0.75 Female proportion = 64 / 80 = 0.80 Negative association

BPSChapter 618 Overall: higher proportion of men accepted than women Within majors  higher proportion of women accepted than men Reason  Men applied to easier majors  the initial association was an artifact of the lurking variable “MAJOR applied to” Simpson’s Paradox Illustration