Copyright © Cengage Learning. All rights reserved. Averages and Variation 3.

Slides:



Advertisements
Similar presentations
C. D. Toliver AP Statistics
Advertisements

Chapter 2 Exploring Data with Graphs and Numerical Summaries
Descriptive Measures MARE 250 Dr. Jason Turner.
Learn to find measures of variability. Box-and-whisker Plots.
SECTION 3.3 MEASURES OF POSITION Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
The Five-Number Summary and Boxplots
Averages and Variation
Descriptive Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Section 2.5 Measures of Position.
Statistics: Use Graphs to Show Data Box Plots.
Box and Whisker Plots and Quartiles Sixth Grade. Five Statistical Summary When describing a set of data we have seen that we can use measures such as.
Department of Quantitative Methods & Information Systems
Methods for Describing Sets of Data
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Review Measures of central tendency
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Chapter 6 1. Chebychev’s Theorem The portion of any data set lying within k standard deviations (k > 1) of the mean is at least: 2 k = 2: In any data.
Table of Contents 1. Standard Deviation
Percentiles and Box – and – Whisker Plots Measures of central tendency show us the spread of data. Mean and standard deviation are useful with every day.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Section 3.3 Measures of Relative Position HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems,
3.3 – Percentiles & Box-and-Whisker Plots Mean and Standard Deviation deal with exact numbers to measure. Sometimes measuring position is important too.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
7.7 Statistics and Statistical Graphs. Learning Targets  Students should be able to… Use measures of central tendency and measures of dispersion to describe.
Copyright © Cengage Learning. All rights reserved. Averages and Variation 3.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 CHEBYSHEV'S THEOREM For any set of data and for any number k, greater than one, the.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter Descriptive Statistics 2.
Percentiles For any whole number P (between 1 and 99), the Pth percentile of a distribution is a value such that P% of the data fall at or below it. The.
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
Box and Whisker Plots Measures of Central Tendency.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
Summary Statistics: Measures of Location and Dispersion.
Using Measures of Position (rather than value) to Describe Spread? 1.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
LIS 570 Summarising and presenting data - Univariate analysis.
Numerical descriptions of distributions
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
2-6 Box-and-Whisker Plots Indicator  D1 Read, create, and interpret box-and whisker plots Page
Unit 3: Averages and Variations Part 3 Statistics Mr. Evans.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Chapter 4 Measures of Central Tendency Measures of Variation Measures of Position Dot Plots Stem-and-Leaf Histograms.
Copyright © Cengage Learning. All rights reserved. Averages and Variation 3.
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
Exploratory Data Analysis
a graphical presentation of the five-number summary of data
Unit Three Central Tendency.
3 Averages and Variation
Descriptive Statistics
Box and Whisker Plots Algebra 2.
Percentiles and Box-and- Whisker Plots
Numerical Measures: Skewness and Location
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Percentiles and Box-and-Whisker Plots
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
The absolute value of each deviation.
Warm Up # 3: Answer each question to the best of your knowledge.
Measures of Position Section 3.3.
Presentation transcript:

Copyright © Cengage Learning. All rights reserved. Averages and Variation 3

Copyright © Cengage Learning. All rights reserved. Section 3.3 Percentiles and Box-and-Whisker Plots

3 Focus Points Interpret the meaning of percentile scores. Compute the median, quartiles, and five-number summary from raw data. Make a box-and-whisker plot. Interpret the results. Describe how a box-and-whisker plot indicates spread of data about the median.

4 Percentiles and Box-and-Whisker Plots We’ve seen measures of central tendency and spread for a set of data. The arithmetic mean x and the standard deviation s will be very useful in later work. However, because they each utilize every data value, they can be heavily influenced by one or two extreme data values. In cases where our data distributions are heavily skewed or even bimodal, we often get a better summary of the distribution by utilizing relative position of data rather than exact values.

5 Percentiles and Box-and-Whisker Plots We know that the median is an average computed by using relative position of the data. If we are told that 81 is the median score on a biology test, we know that after the data have been ordered, 50% of the data fall at or below the median value of 81. The median is an example of a percentile; in fact, it is the 50th percentile. The general definition of the P th percentile follows.

6 Percentiles and Box-and-Whisker Plots In Figure 3-3, we see the 60th percentile marked on a histogram. We see that 60% of the data lie below the mark and 40% lie above it. A Histogram with the 60th Percentile Shown Figure 3-3

7 Percentiles and Box-and-Whisker Plots There are 99 percentiles, and in an ideal situation, the 99 percentiles divide the data set into 100 equal parts. (See Figure 3-4.) However, if the number of data elements is not exactly divisible by 100, the percentiles will not divide the data into equal parts. Percentiles Figure 3-4

8 Percentiles and Box-and-Whisker Plots There are several widely used conventions for finding percentiles. They lead to slightly different values for different situations, but these values are close together. For all conventions, the data are first ranked or ordered from smallest to largest. A natural way to find the Pth percentile is to then find a value such that P% of the data fall at or below it. This will not always be possible, so we take the nearest value satisfying the criterion. It is at this point that there is a variety of processes to determine the exact value of the percentile.

9 Percentiles and Box-and-Whisker Plots We will not be very concerned about exact procedures for evaluating percentiles in general. However, quartiles are special percentiles used so frequently that we want to adopt a specific procedure for their computation. Quartiles are those percentiles that divide the data into fourths.

10 Percentiles and Box-and-Whisker Plots The first quartile Q 1 is the 25th percentile, the second quartile Q 2 is the median, and the third quartile Q 3 is the 75th percentile. (See Figure 3-5.) Again, several conventions are used for computing quartiles, but the convention on next page utilizes the median and is widely adopted. Quartiles Figure 3-5

11 Percentiles and Box-and-Whisker Plots Procedure

12 Percentiles and Box-and-Whisker Plots In short, all we do to find the quartiles is find three medians. The median, or second quartile, is a popular measure of the center utilizing relative position. A useful measure of data spread utilizing relative position is the interquartile range (IQR). It is simply the difference between the third and first quartiles. Interquartile range = Q 3 – Q 1 The interquartile range tells us the spread of the middle half of the data. Now let’s look at an example to see how to compute all of these quantities.

13 Example 9 – Quartiles In a hurry? On the run? Hungry as well? How about an ice cream bar as a snack? Ice cream bars are popular among all age groups. Consumer Reports did a study of ice cream bars. Twenty-seven bars with taste ratings of at least “fair” were listed, and cost per bar was included in the report. Just how much does an ice cream bar cost? The data, expressed in dollars, appear in Table 3-4. Cost of Ice Cream Bars (in dollars) Table 3-4

14 Example 9 – Quartiles As you can see, the cost varies quite a bit, partly because the bars are not of uniform size. (a) Find the quartiles. Solution: We first order the data from smallest to largest. Table 3-5 shows the data in order. cont’d Ordered Cost of Ice Cream Bars (in dollars) Table 3-5

15 Example 9 – Solution Next, we find the median. Since the number of data values is 27, there are an odd number of data, and the median is simply the center or 14th value. The value is shown boxed in Table 3-5. Median = Q 2 = 0.50 There are 13 values below the median position, and Q 1 is the median of these values. cont’d

16 Example 9 – Solution It is the middle or seventh value and is shaded in Table 3-5. First quartile = Q 1 = 0.33 There are also 13 values above the median position. The median of these is the seventh value from the right end. This value is also shaded in Table 3-5. Third quartile = Q 3 = 1.00 cont’d

17 Example 9 – Quartiles (b) Find the interquartile range. Solution: IQR = Q 3 – Q 1 = 1.00 – 0.33 = 0.67 This means that the middle half of the data has a cost spread of 67¢. cont’d

18 Box-and-Whisker Plots

19 Box-and-Whisker Plots The quartiles together with the low and high data values give us a very useful five-number summary of the data and their spread. We will use these five numbers to create a graphic sketch of the data called a box-and-whisker plot. Box-and-whisker plots provide another useful technique from exploratory data analysis (EDA) for describing data.

20 Box-and-Whisker Plots Procedure The next example demonstrates the process of making a box-and-whisker plot. Box-and-Whisker Plot Figure 3-6

21 Example 10 – Box-and-whisker plot Make a box-and-whisker plot showing the calories in vanilla-flavored ice cream bars. Use the plot to make observations about the distribution of calories. (a) We ordered the data (see Table 3-7) and found the values of the median, Q 1, and Q 3. Ordered Data Table 3-7

22 Example 10 – Box-and-whisker plot From this previous work we have the following five-number summary: low value = 111; Q 1 = 182; median = 221.5; Q 3 = 319; high value = 439 cont’d

23 Example 10 – Box-and-whisker plot (b) We select an appropriate vertical scale and make the plot (Figure 3-7). Box-and-Whisker Plot for Calories in Vanilla-Flavored Ice Cream Bars Figure 3-7 cont’d

24 Example 10 – Box-and-whisker plot (c) Interpretation A quick glance at the box-and-whisker plot reveals the following: (i) The box tells us where the middle half of the data lies, so we see that half of the ice cream bars have between 182 and 319 calories, with an interquartile range of 137 calories. (ii) The median is slightly closer to the lower part of the box. This means that the lower calorie counts are more concentrated. The calorie counts above the median are more spread out, indicating that the distribution is slightly skewed toward the higher values. cont’d

25 Example 10 – Box-and-whisker plot (iii) The upper whisker is longer than the lower, which again emphasizes skewness toward the higher values. cont’d