Chapter 4: More on Two-Variable Data

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

Data Analysis for Two-Way Tables
Chapter 4 Review: More About Relationship Between Two Variables
Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Agresti/Franklin Statistics, 1 of 52 Chapter 3 Association: Contingency, Correlation, and Regression Learn …. How to examine links between two variables.
AP Statistics Section 4.2 Relationships Between Categorical Variables.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.4 Cautions in Analyzing.
2.4 Cautions about Correlation and Regression. Residuals (again!) Recall our discussion about residuals- what is a residual? The idea for line of best.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
BPSChapter 61 Two-Way Tables. BPSChapter 62 To study associations between quantitative variables  correlation & regression (Ch 4 & Ch 5) To study associations.
Ch 2 and 9.1 Relationships Between 2 Variables
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
4.3: Establishing Causation Both correlation and regression are very useful in describing the relationship between two variables; however, they are first.
Chapter 4 More on Two-Variable Data “Each of us is a statistical impossibility around which hover a million other lives that were never destined to be.
Lecture Presentation Slides SEVENTH EDITION STATISTICS Moore / McCabe / Craig Introduction to the Practice of Chapter 2 Looking at Data: Relationships.
CHAPTER 6: Two-Way Tables. Chapter 6 Concepts 2  Two-Way Tables  Row and Column Variables  Marginal Distributions  Conditional Distributions  Simpson’s.
Chapter 4 More on Two-Variable Data YMS 4.1 Transforming Relationships.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
Two-way tables BPS chapter 6 © 2006 W. H. Freeman and Company.
 Some variables are inherently categorical, for example:  Sex  Race  Occupation  Other categorical variables are created by grouping values of a.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In this chapter we will study the relationship between two categorical variables (variables.
Stat1510: Statistical Thinking and Concepts Two Way Tables.
Two-Way Tables Categorical Data. Chapter 4 1.  In this chapter we will study the relationship between two categorical variables (variables whose values.
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Business Statistics for Managerial Decision Making
Chapter 6 Two-Way Tables BPS - 5th Ed.Chapter 61.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
BPS - 3rd Ed. Chapter 61 Two-Way Tables. BPS - 3rd Ed. Chapter 62 u In prior chapters we studied the relationship between two quantitative variables with.
AP Statistics Section 4.2 Relationships Between Categorical Variables
4.3 Relations in Categorical Data.  Use categorical data to calculate marginal and conditional proportions  Understand Simpson’s Paradox in context.
Cautions About Correlation and Regression Section 4.2.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
CHAPTER 6: Two-Way Tables*
2.7 The Question of Causation
Cautions About Correlation and Regression Section 4.2
Analyzing Categorical Data
CHAPTER 1 Exploring Data
Cautions About Correlation and Regression
The Practice of Statistics in the Life Sciences Third Edition
Two-Way Tables and The Chi-Square Test*
Cautions about Correlation and Regression
Chapter 2 Looking at Data— Relationships
Chapter 2: Looking at Data — Relationships
Analysis of two-way tables - Data analysis for two-way tables
Second factor: education
Chapter 2 Looking at Data— Relationships
The Practice of Statistics in the Life Sciences Fourth Edition
AP STATISTICS LESSON 4 – 3 ( DAY 1 )
DRILL Put these correlations in order from strongest to weakest.
CHAPTER 6: Two-Way Tables
Second factor: education
Warmup Which part- time jobs employed 10 or more of the students?
Cautions about Correlation and Regression
Chapter 2 Looking at Data— Relationships
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Least-Squares Regression
Section 4-3 Relations in Categorical Data
4.2 Cautions about Correlation and Regression
Correlation/regression using averages
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Section Way Tables and Marginal Distributions
Elementary Statistics: Looking at the Big Picture
Relations in Categorical Data
Correlation/regression using averages
Presentation transcript:

Chapter 4: More on Two-Variable Data

4.2: Cautions about Correlation & Regression Correlation and Regression Describe only linear relationships Are not resistant One influential observation or incorrectly entered data point can greatly change these measures. Extrapolation Use of a regression line for prediction far outside the domain Such predictions are not generally accurate

Cautions about Correlation & Regression cont. Lurking Variable Variable not among the explanatory or response variables May influence the interpretation of relationships among those variables. Confounding Variable Two variables are confounded when their effects on a response variable cannot be distinguished from each other May be either explanatory variables or lurking variables

Cautions about Correlation & Regression cont. Studies Using Averaged Data Resist (don’t) applying results to individuals Correlations usually too high when applied to individuals Causation (cause & effect) Even a very strong association between two variables is not by itself good evidence that there is a cause-and-effect link High correlation DOES NOT imply causation Establishing Causation Conduct a carefully designed experiment Control effects of possible lurking variables

Section 4.2 Complete Homework: #’s 27, 28, 33, 36 Any questions on pg. 25-28 in additional notes packet

Section 4.3: Relations in Categorical Data Categorical Variables Use counts or percentages that fall into various categories Organized into two-way tables Two-way tables describe two categorical variables Rows make up one variable; columns make up the other One Variable with its categories Second variable Cat 1 Cat 2 Cat 3 Cat 4 Total X1-1 X1-2 X1-3 X1-4 Marginal Distributions X2-1 X2-2 X2-3 X2-4 Conditional Distributions will be created by taking these values and dividing by either one of the Marginal Distributions

(a) How many students do these data describe? (b) What percent of these students smoke? (c) Give the marginal distribution of parents’ smoking behavior, both in counts and in percents.

26.7% 41.7% 31.6% (a) How many students do these data describe? (b) What percent of these students smoke? (c) Give the marginal distribution of parents’ smoking behavior, both in counts and in percents. (d) Give the conditional distribution of parents’ smoking behavior given the student does not smoke.

Cold Neutral Hot Hatched 16 38 75 Did not hatch 9 18 29 64% 36% 68% 32% 72% 28%

Simpson’s Paradox: Refers to the reversal of the direction of a comparison or an association when data from several groups are combined to form a single group. The principle behind this reversal is that of weighted averages. Using a higher weight in one category for one group while the second group uses a second category to be its higher weight.

An Example of Simpson’s Paradox Upper Wabash Tech has two professional schools, business and law. Here are two-way tables of applicants to both schools, categorized by gender and admission decision. Combine the data to make a single two-way table of gender by admission decision. Business Law Admit Deny Male 480 120 10 90 Female 180 20 100 200

Cont. Business Law Admit Deny Male 480 120 10 90 Female 180 20 100 200 What percent of males are admitted? What percent of females are admitted? Which percent is higher? Now compute the percent males and females admitted by each of the separate schools. Which percents are higher? Percent Percent 80% 10% 90% 34% Combined Admit Deny Male Female Percent 490 210 70% 280 220 56%

Cont. Why is this occurring? Explain how it can happen that Wabash appears to favor males when each school individually favors females. Ans: The business school accepts a higher percentage of students all together than the law school does. So when 86% of the males applied to the business school while only 40% of the females applied to the business school, more males got into Wabash university all together. This gives the appearance that Wabash favors males. This reversal of the direction of an association when data from several groups are combined to form a single group is know as Simpson’s Paradox.

Section 4.3 Complete Homework: #’s 55, 59, 61, 62, Any questions on pg. 29-32 in additional notes packet

Chapter Review

Chapter 4 Complete Homework: #’s 79, 81, 83 Any questions on pg. 33-36 in additional notes packet