Analyzing Categorical Data & Displaying Quantitative Data Section 1.1 & 1.2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates,

Slides:



Advertisements
Similar presentations
So What Do We Know? Variables can be classified as qualitative/categorical or quantitative. The context of the data we work with is very important. Always.
Advertisements

Analyzing Data (C2-5 BVD) C2-4: Categorical and Quantitative Data.
CHAPTER 1 Exploring Data
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
8.1 Types of Data Displays Remember to Silence Your Cell Phone and Put It In Your Bag!
1.1 Displaying and Describing Categorical & Quantitative Data.
Displaying & Summarizing Quantitative Data
+ Chapter 1 Section 2 By Abby Chopoorian and Morgan Smith.
CHAPTER 1: Picturing Distributions with Graphs
AP STATISTICS Section 1.1: Displaying Distributions.
Definitions Data: A collection of information in context.
Chapter 1 Exploring Data
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Chapter 1 The Role of Statistics. Three Reasons to Study Statistics 1.Being an informed “Information Consumer” Extract information from charts and graphs.
Chapter 1: Exploring Data
Statistics Unit 2: Organizing Data Ms. Hernandez St. Pius X High School
+ Chapter 1: Exploring Data Section 1.2 Displaying Quantitative Data with Graphs The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Bellwork 1. If a distribution is skewed to the right, which of the following is true? a) the mean must be less than the.
Unit 4 Statistical Analysis Data Representations.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Essential Statistics Chapter 11 Picturing Distributions with Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
SWBAT: Construct and interpret dotplots, stemplots, and histograms. Dot Plot: Each data value is shown as a dot above its location on a number line. 1.
The TITANIC In 1912 the luxury liner Titanic, on its first voyage across the Atlantic, struck an iceberg and sank. Some passengers got off the ship in.
Describing Distributions Visually Section Starter Problem Mr. McPeak plays a lot of golf. This summer he got a new driver and kept track of how.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 1 Exploring Data 1.2 Displaying Quantitative.
+ Chapter 1: Exploring Data Section 1.2 Displaying Quantitative Data with Graphs The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
+ Chapter 1: Exploring Data Section 1.2 Displaying Quantitative Data with Graphs The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
The Practice of Statistics Third Edition Chapter 1: Exploring Data Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
1 Take a challenge with time; never let time idles away aimlessly.
Statistics - is the science of collecting, organizing, and interpreting numerical facts we call data. Individuals – objects described by a set of data.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Warm Up.
Unit 4 Statistical Analysis Data Representations
The Practice of Statistics, Fourth Edition.
Laugh, and the world laughs with you. Weep and you weep alone
Sec. 1.1 HW Review Pg. 19 Titanic Data Exploration (Excel File)
recap Individuals Variables (two types) Distribution
Chapter 1 Data Analysis Section 1.2
Topic 5: Exploring Quantitative data
CHAPTER 1 Exploring Data
Analyzing Categorical Data & Displaying Quantitative Data
Sexual Activity and the Lifespan of Male Fruitflies
Do Now: A survey of 1,000 randomly chosen residents of a Minnesota town asked “where do you prefer to purchase your daily coffee?” The two-way table below.
CHAPTER 1 Exploring Data
1.1 Cont’d.
Good Morning AP Stat! Day #2
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Displaying Distributions with Graphs
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Analyzing Categorical Data & Displaying Quantitative Data Section 1.1 & 1.2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore

Starter Problem Antoinette plays a lot of golf. This summer she got a new driver and kept track of how far she hit her tee shots in several rounds. Look at these data (drive lengths in yards) and then write a few sentences that describe the lengths of her drives:

Today’s Objectives Analyze pie charts and bar graphs Two way tables: – Marginal Distribution –Conditional Distribution A Titanic Disaster Analyze Dot Plots Describe CUSS  your new best friend Stem and Leaf Plots: single, and back to back Histograms

Types of Variables Categorical variables record which group or category an individual belongs to. –What color is your hair? –What year are you in school? –What city do you live in? –Did the tee shot land in the fairway? –It does NOT make sense to average the results. Quantitative variables take on numeric values. –How tall is a person? –What score did a person get on the SAT? –How many desks are in a room? –How long was the tee shot? –It DOES make sense to average the results.

Visual Representation of Categorical Variables Categorical variables are typically represented by pie charts (for percents) or bar charts (percents or counts). Married?Count (M)Percent Single Married Widowed Divorced

Dilbert comics Use a pie chart only when you want to emphasize each category’s relation to the whole. Pie charts are awkward to make by hand, but technology will do the job for you.

What Makes a Good Bar Graph? Good –All bars have the same width –X & Y axis’ labeled –Units –Title of Graph Bad –Bars have different widths –Pictures replacing the bars (see example next slide) –No labels

Why Is This A Bad Bar Graph? This ad for DIRECTV has multiple problems. How many can you point out?

Two Way Tables Two – way tables are a visual representation of the possible relationships between two set of categorical data. The categories are labeled at the top and the left side of the table, with the frequency info appearing in the interior cells of the table. The “totals” of each row appear at the right, and the “totals” of each column appear at the bottom.

“If you could have a new vehicle, would you want a sport utility vehicle or a sports car? Entries in the body of the table are called joint frequencies. The cells that contain the sum are called marginal frequencies.

Probability When looking at a relative frequency table the percent or ratio is also the probability of that event happening over the ENTIRE TOTAL. If a random selection was made, What's the probability a male selects an SUV? 21/240 If a random selection was made, What's the probability a female selects an SUV? 135/240 If a random selection was made, What's the probability that a SUV is selected? 156/240

Probability

Conditional probability When we are calculating the probability of an event occurring given that another event has occurred, we are describing conditional probability. Certain conditions have been preselected, and now we much calculate the probability based on that condition already happening. When we have conditional probability our denominator value becomes the column total or the row total depending on which condition is given. Example: What is the probability of selecting a sports car given a male? V.S. What's the probability a male selects an SUV?

Conditional Probability

Comparing Two Different Questions What's the probability a male selects an sports car? What is the probability of selecting a sports car given a male?

Flashback! Titanic Disaster On April 15, 1912, the Titanic struck an iceberg and rapidly sank with only 710 of her 2,204 passengers and crew surviving. Data on survival of passengers are summarized in the table below Survival Status Class of TravelSurvivedDiedTotal First Class Second Class Third Class Total

Conditional Probability Survival Status Class of TravelSurvivedDiedTotal First Class Second Class Third Class Total

Break! - 5 Minutes

Section 1.2: Quantitative Data w/ Graphs Dotplots CUSS Histograms Stemplots

Types of Variables Categorical variables record which group or category an individual belongs to. –What color is your hair? –What year are you in school? –What city do you live in? –Did the tee shot land in the fairway? –It does NOT make sense to average the results. Quantitative variables take on numeric values. –How tall is a person? –What score did a person get on the SAT? –How many desks are in a room? –How long was the tee shot? –It DOES make sense to average the results.

Visual representation of Quantitative Variables: Dotplots The most basic method is a dotplot. –Every data point can be seen on the plot. Construction method: –Draw a horizontal axis with a scale that covers the full range of values for the variable. –Put a dot on (or above) the axis for each data point. –If data duplicate, stack them vertically. Construct a dotplot now of Antoinette’s drives:

Dotplot of Drive Data Based on the dotplot, estimate the center. –We see it around 230 or 240 yards. Estimate the spread. –Roughly from 190 to almost 280, so spread is about 90 yards. Describe the shape. –It appears “mound-shaped” with most of the data clustered at the center and with tails at each end.

C.U.S.S C: Center Median, where is it? –Mean can also describe the center, but is not resistant… U: Unusual data points Outliers! Are there any? We can calculate them…later in 1.3 S: Spread Describe the variability of the graph (largest value – smallest value) S: Shape How many peaks? Is the data clumped in a general location? Is data stretching to the right (skewed right). Is the data stretching to the left (skewed left). LASTLY…Always, ALWAYS C.U.S.S it out when describing graphs of data

Histograms Another important method is a histogram. –Individual data points cannot be seen on the plot. –Many data points are grouped together in vertical bars. Construction method: –Draw a horizontal axis with a scale that covers the full range of values for the variable. –Decide bar width (also called class width) so that 5 to 10 bars will cover the full range of data. –Set borders for bars, count frequencies, draw bars. –Use a vertical axis to show the bar height.

Histogram of Drive Data From a visual examination, estimate the center, unusual points, spread and the shape. (CUSS) –As before, you should see the center around 230 to 240, no unusual points, the spread looks like 90, and the shape still looks like a mound.

Stemplots AKA: Stem & Leaf Plots One way to organize numerical data is to make a stemplot. Lets turn to the board and walk through how to make a stemplot of the following data, found on pg

Stemplots check list Did we make a stemplot? Did we talk about splitting stems – …  upper and lower bounds Did we talk about back to back stemplots? Good…now we can move on

Percent of Population Over 65 by State Note: 4|9 = 4.9%

Today’s Objectives Analyze pie charts and bar graphs Two way tables: – Marginal Distribution –Conditional Distribution A Titanic Disaster Analyze Dot Plots Describe CUSS  your new best friend Stem and Leaf Plots: single, and back to back Histograms California Standard 14.0 Students organize and describe distributions of data by using a number of different methods, including frequency tables, histograms, standard line graphs and bar graphs, stem-and-leaf displays, scatterplots, and box-and-whisker plots.

Homework