Displaying and Describing Categorical Data

Slides:



Advertisements
Similar presentations
Displaying & Describing Categorical Data Chapter 3.
Advertisements

Area Principle  The area occupied by a part of the graph should correspond to the magnitude of the value it represents.
Exploring Two Categorical Variables: Contingency Tables
Slide Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
The Three Rules of Data Analysis
CHAPTER 1 STATISTICS Statistics is a way of reasoning, along with a collection of tools and methods, designed to help us understand the world.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
  The three rules of data analysis won’t be difficult to remember: 1. Make a picture—things may be revealed that are not obvious in the raw data. These.
Copyright © 2010 Pearson Education, Inc. Chapter 3 Displaying and Describing Categorical Data.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Do Now Have you: Read Harry Potter and the Deathly Hallows Seen Harry Potter and the Deathly Hallows (part 2)
Displaying & Describing Categorical Data Chapter 3.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Chapter 3 Displaying and Describing Categorical Data
Chapters 1 and 2 Week 1, Monday. Chapter 1: Stats Starts Here What is Statistics? “Statistics is a way of reasoning, along with a collection of tools.
Chapter 3 Addie Molique, Ash Nair Displaying and Describing Categorical Data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Chapter 2 DISPLAYING AND DESCRIBING CATEGORICAL DATA.
Unit 3 Relations in Categorical Data. Looking at Categorical Data Grouping values of quantitative data into specific classes We use counts or percents.
Chapter 3: Displaying and Describing Categorical Data Sarah Lovelace and Alison Vicary Period 2.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. - use pie charts, bar graphs, and tables to display data Chapter 3: Displaying and Describing Categorical.
Statistics Day 4 Displaying Categorical Data. Do Now Act question ACT #9,10,13,14.
1 Chapter 3 Displaying and Describing Categorical Data.
Slide 3-1 Copyright © 2004 Pearson Education, Inc.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
Unit 2 Descriptive Statistics Objective: To correctly identify and display sets of data.
Lesson 2 9/4/12.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
Displaying & Describing Categorical Data Chapter 3.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Unit 6, Module 15 – Two Way Tables (Part I) Categorical Data Comparing 2.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data Chapter 3.
August 25,  Passengers on the Titanic by class of ticket. ClassCount 1 st nd rd th 885.
CATEGORICAL DATA CHAPTER 3 GET A CALCULATOR!. Slide 3- 2 THE THREE RULES OF DATA ANALYSIS won’t be difficult to remember: 1. Make a picture — things may.
Sections TAKE OUT YOUR NOTES, Book & Do Page 8 #7-8
Smart Start In June 2003, Consumer Reports published an article on some sport-utility vehicles they had tested recently. They had reported some basic.
Displaying and describing categorical data
Chapter 1: Exploring Data
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Honors Statistics Chapter 3 Part 1
AP Statistics Chapter 3 Part 3
Chapter 3: Displaying and Describing Categorical Data
Bell Ringer The State Education Department requires local school districts to keep these records on all students: age, race or ethnicity, days absent,
CATEGORICAL DATA CHAPTER 3
Displaying and Describing
Displaying and Describing Categorical Data
Math 153 Stats Starts Here.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
AP Statistics Chapter 3 Part 2
Warmup Which part- time jobs employed 10 or more of the students?
Quick review of last time~
The Table Categorization
Displaying and Describing Categorical Data
Stats Starts Here Copyright © 2009 Pearson Education, Inc.
Displaying Distributions with Graphs – Part 1
CHAPTER 1 Exploring Data
4.2 Relationships between Categorical Variables and Simpson’s Paradox
Displaying and Describing Categorical Data
Active Learning Lecture Slides
Displaying and Describing Categorical data
Displaying and Describing Categorical Data
Grab a post it note and place it in the correct bin for where you went to middle school
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Presentation transcript:

Displaying and Describing Categorical Data Chapter 3 Displaying and Describing Categorical Data

Frequency tables are often used to organize categorical data Frequency tables are often used to organize categorical data. Frequency tables display the category names and the counts of the number of data values in each category. Relative frequency tables also display the category names, but they give the percentages rather than the counts for each category.

M & M Activity

A bar chart is often used to display categorical data A bar chart is often used to display categorical data. The height of each bar represents the count for each category. Bars are displayed next to each other for easy comparison. When constructing a bar chart, note that the bars do not touch one another. Categorical variables usually cannot be ordered in a meaningful way; therefore the order in which the bars are displayed is often meaningless. A relative frequency bar chart displays the proportion of counts for each category. The sum of the relative frequencies is 100%.

Create a frequency bar chart and a relative frequency bar chart.

A pie chart is another type of display used to show categorical data A pie chart is another type of display used to show categorical data. Pie charts show parts of a whole. Pie charts are often difficult to construct by hand. A contingency table shows two categorical variables together. The margins give the frequency distributions for each of the variables, also called the marginal distribution.

What percent of the class are girls with liberal political views? Examine the class data about gender and political view – liberal, moderate, conservative. What percent of the class are girls with liberal political views? What percent of the liberals are girls? What percent of the girls are liberals? What is the marginal distribution of gender? What is the marginal distribution of political views? Liberal Moderate Conservative TOTAL Male Female

A conditional distribution shows the distribution of one variable for only the individuals who satisfy some condition on another variable. The conditional distribution of political preference, conditional on being male: Liberal Moderate Conservative TOTAL Male

The conditional distribution of political preference, conditional on being female: Liberal Moderate Conservative TOTAL Female

What is the conditional relative frequency distribution of gender among conservatives? Liberal Moderate Conservative TOTAL Male Female

If the conditional distributions are the same, we can conclude that the variables are not associated. Therefore, they are independent of one another. If the conditional distributions differ, we can conclude that the variables are somehow associated. Therefore, they are not independent of one another. Are gender and political view independent?

A segmented bar chart displays the same information as a pie chart, but in the form of bars instead of circles. Comparing segmented bar charts is a good way to tell if two variables are independent of one another or not. Create a segmented bar chart.

Explain how the graph on the left violates the “area principle.”

Explain what is wrong with the graph below.

Averaging one variable across different levels of a second variable can lead to Simpson’s Paradox. Consider the following example: It’s the last inning of an important game. Your team is a run down with the bases loaded and two outs. The pitcher is due up, so you’ll be sending in a pinch-hitter. There are 2 batters available on the bench. Whom should you send in to bat? Compare A’s batting average to B’s batting average. Which player appears to be the better choice? Player A has a higher batting average (0.320 vs. 0.298), so he looks like the better choice. Player Overall A 33 for 103 B 45 for 151

Does it matter whether the pitcher throws right- or left-handed Does it matter whether the pitcher throws right- or left-handed? Compare A’s batting average vs. a left-handed pitcher to B’s. Compare A’s batting average against a right-handed pitcher. Which player appears to be the better choice? Player Overall vs LHP vs RHP A 33 for 103 28 for 81 5 for 22 B 45 for 151 12 for 32 33 for 119

Player B has a higher batting average against both right- and left-handed pitching, even though his overall average is lower. Player B hits better against both right- and left-handed pitchers. So no matter the pitcher, B is a better choice. So why is his batting “average” lower? Because B sees a lot more right-handed pitchers than A, and (at least for these guys) right-handed pitchers are harder to hit. For some reason, A is used mostly against left-handed pitchers, so A has a higher average.

Pooling the data together loses important information and leads to the wrong conclusion. We always should take into account any factor that might matter.