Chapter 12: Describing Distributions with Numbers We create graphs to give us a picture of the data. We also need numbers to summarize the center and spread.

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

Chapter 2 Exploring Data with Graphs and Numerical Summaries
Describing Distributions With Numbers
CHAPTER 1 Exploring Data
Describing Quantitative Data with Numbers Part 2
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
CHAPTER 2: Describing Distributions with Numbers
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Statistics: Describing Quantitative Data Box and Whisker Plots.
M08-Numerical Summaries 2 1  Department of ISM, University of Alabama, Lesson Objectives  Learn what percentiles are and how to calculate quartiles.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Chapter 3 Looking at Data: Distributions Chapter Three
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
CHAPTER 1 Exploring Data
Describing Distributions Numerically
Chapter 1: Exploring Data
CHAPTER 2: Describing Distributions with Numbers
Analyzing One-Variable Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Box and Whisker Plots Algebra 2.
DAY 3 Sections 1.2 and 1.3.
Describing Distributions of Data
Quartile Measures DCOVA
CHAPTER 1 Exploring Data
AP Statistics September 9, 2008 CASA
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Measures of Central Tendency
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Day 52 – Box-and-Whisker.
CHAPTER 2: Describing Distributions with Numbers
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Box and Whisker Plots and the 5 number summary
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Compare and contrast histograms to bar graphs
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Chapter 12: Describing Distributions with Numbers We create graphs to give us a picture of the data. We also need numbers to summarize the center and spread of a distribution. Two types of descriptive statistics for categorical variables: 1) Counts (Frequencies) 2) Rates or Proportions (Relative Frequencies) Many statistics available to summarize quantitative variables.

Homeruns in Baseball Question: Who is the best home run hitter ever in major league baseball? Players with high numbers of homeruns in seasons: Babe Ruth Roger Maris Mark McGwire Sammy Sosa Barry Bonds

Median and Quartiles The median (M) is the midpoint of a distribution when the observations are arranged in increasing order. Number such that half the observations are smaller and the other half are larger. (p. 219) List the data in order from smallest to largest If n is odd, the median is the middle value. If n is even, the median is the mean of the middle two values.

M for Sosa and Maris Calculate M for Sosa’s homeruns in a season (8 seasons, to 1999). Data: 15, 10, 33, 25, 36, 40, 36, 66 Calculate M for Maris’s homeruns in a season (11 seasons). Data: 14, 28, 16, 39, 61, 33, 23, 26, 13, 9, 5

Percentiles p×100% percentile – the value of a variable such that p×100% of the values are below it and (1-p)×100% of the values are above it where 0 < p < 1. For the 35 th percentile, p=0.35. Where have you seen percentiles before?

Quartiles First Quartile (Q1): The value such that 25% of the data values lie below Q1 and 75% of the data values lie above Q1. (25 th percentile) Third Quartile (Q3): The value such that 75% of the data values lie below Q3 and 25% of the data values lie above Q3. (75 th percentile) The median is the second quartile (Q2). (50 th percentile)

Calculating percentiles: Let n be the number of data values. Order the n values from largest to smallest. Calculate the product, n×p. –If the product is not an integer (0,1,2,3,…), then round it up to the next integer and take the corresponding ordered value. –If the product is an integer, say k, then average the kth and (k+1)-st ordered values.

5-Number Summary The 5-number summary of a data set consists of the following descriptive statistics (p. 221): Minimum, First Quartile (Q1), Median, Third Quartile (Q3), Maximum Give the 5-number summaries for Sosa and Maris’s homeruns.

Boxplot A boxplot is a graphical representation of the 5-number summary. (p. 221) A central box spans the quartiles (Q1 to Q3) Inter-quartile Range = IQR = Q3 - Q1 A line in the box marks the median Lines (whiskers) extend from box to the minimum and maximum observations.

Constructing Boxplots 1) Compute the 5-number summary. 2) Draw a vertical line at the Q1 and Q3. 3) Draw two horizontal lines to complete the box. 4) Draw a vertical line at the median. 5) Draw “whiskers” to the extremes (Min and Max). Draw boxplots for Sosa and Maris’s homeruns.