Chapter 3 Displaying and Describing Categorical Data.

Slides:



Advertisements
Similar presentations
Displaying and Describing Categorical Data 60 min.
Advertisements

Introduction to Stats Honors Analysis. Data Analysis Individuals: Objects described by a set of data. (Ex: People, animals, things) Variable: Any characteristic.
Displaying & Describing Categorical Data Chapter 3.
Area Principle  The area occupied by a part of the graph should correspond to the magnitude of the value it represents.
Exploring Two Categorical Variables: Contingency Tables
You have 15 min. to: Meet me at the table in the middle for HW Questions Should have paper labeled #1-10 with each question answered.
Chapter 3 Graphical and Numerical Summaries of Categorical Data UNIT OBJECTIVES At the conclusion of this unit you should be able to: n 1)Construct graphs.
Chapter 3 Graphical and Numerical Summaries of Qualitative Data UNIT OBJECTIVES At the conclusion of this unit you should be able to: n 1)Construct graphs.
In 2007, deaths of a large number of pet dogs and cats were ultimately traced to contamination of some brands of pet food. The manufacturer NOW claims.
Slide Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
The Three Rules of Data Analysis
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
Chapter 3 Graphical and Numerical Summaries of Qualitative Data UNIT OBJECTIVES At the conclusion of this unit you should be able to: n 1)Construct graphs.
CHAPTER 1 STATISTICS Statistics is a way of reasoning, along with a collection of tools and methods, designed to help us understand the world.
. Chapter 3 Displaying and Describing Categorical Data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
  The three rules of data analysis won’t be difficult to remember: 1. Make a picture—things may be revealed that are not obvious in the raw data. These.
Copyright © 2012 Pearson Education. Chapter 4 Displaying and Describing Categorical Data.
Copyright © 2010 Pearson Education, Inc. Chapter 3 Displaying and Describing Categorical Data.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Do Now Have you: Read Harry Potter and the Deathly Hallows Seen Harry Potter and the Deathly Hallows (part 2)
Displaying & Describing Categorical Data Chapter 3.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Chapter 3 Displaying and Describing Categorical Data
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Chapter 2 DISPLAYING AND DESCRIBING CATEGORICAL DATA.
Unit 3 Relations in Categorical Data. Looking at Categorical Data Grouping values of quantitative data into specific classes We use counts or percents.
Displaying Categorical Data THINK SHOW TELL What is categorical data? Bar, Segmented Bar, and Pie Charts Frequency vs. Relative Frequency Tables/Charts.
Chapter 3: Displaying and Describing Categorical Data *Data Analysis *Frequency Tables, Bar Charts, Pie Charts Contingency Tables.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. - use pie charts, bar graphs, and tables to display data Chapter 3: Displaying and Describing Categorical.
Chapter 2 Displaying and Describing Categorical Data UNIT OBJECTIVES At the conclusion of this unit you should be able to: n 1)Construct graphs that appropriately.
1 Chapter 3 Displaying and Describing Categorical Data.
Slide 3-1 Copyright © 2004 Pearson Education, Inc.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
Unit 2 Descriptive Statistics Objective: To correctly identify and display sets of data.
Lesson 2 9/4/12.
Displaying & Describing Categorical Data Chapter 3.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Objectives Given a contingency table of counts, construct a marginal distribution. Given a contingency table of counts, create a conditional distribution.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Unit 6, Module 15 – Two Way Tables (Part I) Categorical Data Comparing 2.
Chapter 3 Displaying and Describing Categorical Data Math2200.
1 Copyright © 2014, 2012, 2009 Pearson Education, Inc. Chapter 2 Displaying and Describing Categorical Data.
Copyright © 2009 Pearson Education, Inc. Chapter 3 Displaying and Describing Categorical Data.
1.1 ANALYZING CATEGORICAL DATA. FREQUENCY TABLE VS. RELATIVE FREQUENCY TABLE.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data Chapter 3.
August 25,  Passengers on the Titanic by class of ticket. ClassCount 1 st nd rd th 885.
CATEGORICAL DATA CHAPTER 3 GET A CALCULATOR!. Slide 3- 2 THE THREE RULES OF DATA ANALYSIS won’t be difficult to remember: 1. Make a picture — things may.
Sections TAKE OUT YOUR NOTES, Book & Do Page 8 #7-8
Smart Start In June 2003, Consumer Reports published an article on some sport-utility vehicles they had tested recently. They had reported some basic.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
AP Statistics Chapter 3 Part 3
Displaying and Describing Categorical Data
Chapter 3: Displaying and Describing Categorical Data
CATEGORICAL DATA CHAPTER 3
Displaying and Describing
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Quick review of last time~
Displaying and Describing Categorical Data
Stats Starts Here Copyright © 2009 Pearson Education, Inc.
Displaying and Describing Categorical Data
Displaying and Describing Categorical data
Displaying and Describing Categorical Data
Grab a post it note and place it in the correct bin for where you went to middle school
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Presentation transcript:

Chapter 3 Displaying and Describing Categorical Data

Categorical variable There are only finite number of possible values

Categorical variable There are only finite number of possible values –Gender –Car size –Course grade

Example: Titanic At 11:40pm on Apr 14, 1912 Over 1500 passengers and crewmembers died How can we process the information more efficiently? SurvivedAgeSexClass DeadAdultMaleThird DeadAdultMaleCrew DeadAdultMaleThird DeadAdultMaleCrew DeadAdultMaleCrew DeadAdultMaleCrew AliveAdultFemaleFirst DeadAdultMaleThird DeadAdultMaleCrew

Three rules of data analysis 1.Make a picture 2.Make a picture 3.Make a picture

Three rules of data analysis 1.Make a picture A picture can reveal the pattern and relationship hidden in your data 2.Make a picture A picture can show extraordinary data values or unexpected patterns 3.Make a picture Fast for readers to understand

Florence Nightingale Founder of modern nursing First female member of British Statistical Society Used a picture to argue forcefully for better hospital conditions for soldiers

Different colors represent deaths caused by different reasons. The blue area corresponds to death due to preventable or mitigable Zymotic disease. The picture showed that in the Crimean war, far more soldiers died of illness and infection than died of battle wounds. Her campaign succeeded in improving hospital conditions and nursing for soldiers.

Frequency tables Making piles: count the number of cases corresponding to each category and pile them up Titanic Example: by ticket class ClassCount First325 Second285 Third706 Crew885

Relatively frequency table Proportion: divide counts by the total number of cases Percentage: multiply by 100 The frequency table or relative frequency table describe the distribution of a categorical variable

Relatively frequency table Proportion: divide counts by the total number of cases Percentage: multiply by 100 The frequency table or relative frequency table describe the distribution of a categorical variable ClassPercentage First14.77% Second12.95% Third32.08% Crew40.21%

What is your feeling about the proportion of crew members on board?

Why is the picture misleading?

The length of each ship corresponds to the number of people in each category Our eyes tend to be more impressed by the area than by other aspects of the image. Even though the length of the ship is about 3 times, but the area is about 9 times. And that is misleading.

The area principle the area occupied by a part of the graph should correspond to the magnitude of the value it represents.

Bar chart Display of counts of a categorical variable with bars

Bar chart Display of counts of a categorical variable with bars

Pie Charts

When you make a bar chart or pie chart, pay attention to the following

Make sure the variable is indeed categorical

When you make a bar chart or pie chart, pay attention to the following Make sure the variable is indeed categorical Your data are counts or percentages of cases in categories

When you make a bar chart or pie chart, pay attention to the following Make sure the variable is indeed categorical Your data are counts or percentages of cases in categories Make sure that the categories do not overlap

Was there a relationship between the kind of ticket a passenger held and the passenger’s chances of making it into the lifeboat? What table should we make to answer this question?

Contingency table A two-way table The table shows how the subjects are distributed along each variable, contingent on the value of the other variable FirstSecondThirdCrewtotal Alive Dead Total

Add relative frequencies FirstSecondThirdCrewtotal Alive Counts % of Row28.55%16.60%25.04%29.82%100.00% % of Column62.46%41.40%25.21%23.95%32.30% % of Table9.22%5.36%8.09%9.63%32.30% Dead Counts % of Row8.19%11.21%35.44%45.17%100.00% % of Column37.54%58.60%74.79%76.05%67.70% % of Table5.54%7.59%23.99%30.58%67.70% Total Counts % of Row14.77%12.95%32.08%40.21%100.00% % of Column100.00% % of Table14.77%12.95%32.08%40.21%100.00%

A simplified table FirstSecondThirdCrewtotal Alive9.22%5.36%8.09%9.63%32.30% Dead5.54%7.59%23.99%30.58%67.70%

Percent of what? What percent of the survivors were in second class? What percent were second-class passengers who survived? What percent of the second-class passengers survived?

Percent of what? What percent of the survivors were in second class? –118/711 What percent were second-class passengers who survived? –The Who is everyone on board, i.e., 2201 is the denominator –118/2201 What percent of the second-class passengers survived? –118/285

Marginal distribution In the margins of a contingency table, the frequency distribution of one of the variables is called its marginal distribution

Conditional distribution FirstSecondThirdCrewtotal Alive %16.60%25.04%29.82%100.00% Dead %11.21%35.44%45.17%100.00%

Pie chart for conditional distributions

Bar chart for conditional distributions

Segmented Bar Chart

What can go wrong? Do not violate the area principle

What can go wrong? Keep it honest –Pay attention to labels –Whether all percentages add up to 1? Do not confuse similar-sounding percentages –The percentage of passengers who were both in first class and survived –The percentage of the first class passengers who survived –The percentage of the survivors who were in first class

What can go wrong? Do not forget to look at the variables separately, too. –Look at both conditional and marginal distributions Be sure to use enough individuals Do not overstate your case

What can go wrong? Be careful with averages of proportion across several different groups Simpson’s Paradox On-time record for two pilots DayNightOverall Moe90/100=90%10/20=50%100/120=83% Jill19/20=95%75/100=75%94/120=78%