§ 14.3 Numerical Summaries of Data

Slides:



Advertisements
Similar presentations
C. D. Toliver AP Statistics
Advertisements

Describing Distributions With Numbers
3.3 Measures of Position Measures of location in comparison to the mean. - standard scores - percentiles - deciles - quartiles.
T-6 Five Number Summary. Five Number Summary Includes: Minimum: the lowest value in the data set Lower Quartile (Q1): the 25 th percentile Median: the.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
CHAPTER 1 Exploring Data
Unit 6B Measures of Variation.
SECTION 3.3 MEASURES OF POSITION Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Describing Quantitative Data with Numbers Part 2
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Section 2.5 Measures of Position.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Objectives 1.2 Describing distributions with numbers
Measures of Central Tendencies Week # 2 1. Definitions Mean: or average The sum of a set of data divided by the number of data. (Do not round your answer.
Homework Discussion Read pages 476 – 489 Page 499: 1 – 3, 5, 7 – 11, 19, 21, (for 23, 25, 29, 30, 32, find the mean)
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
Anthony J Greene1 Dispersion Outline What is Dispersion? I Ordinal Variables 1.Range 2.Interquartile Range 3.Semi-Interquartile Range II Ratio/Interval.
Chapter 3 Averages and Variations
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Section 3.1 Measures of Center. Mean (Average) Sample Mean.
1.3: Describing Quantitative Data with Numbers
Objectives Vocabulary
Do Now. Chapter 5 Section E and F Vocabulary Relative frequency- frequency expressed as a fraction of the total frequency Cumulative frequency- sum of.
Review Measures of central tendency
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Table of Contents 1. Standard Deviation
Math I: Unit 2 - Statistics
Section 3.3 Measures of Relative Position HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems,
Describing distributions with numbers
Chapter 1: Exploring Data Lesson 4: Quartiles, Percentiles, and Box Plots Mrs. Parziale.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
 z – Score  Percentiles  Quartiles  A standardized value  A number of standard deviations a given value, x, is above or below the mean  z = (score.
Objectives The student will be able to: find the variance of a data set. find the standard deviation of a data set.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
14.1 Data Sets: Data Sets: Data set: collection of data values.Data set: collection of data values. Frequency: The number of times a data entry occurs.Frequency:
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 CHEBYSHEV'S THEOREM For any set of data and for any number k, greater than one, the.
Percentiles For any whole number P (between 1 and 99), the Pth percentile of a distribution is a value such that P% of the data fall at or below it. The.
Chapter 2 Section 5 Notes Coach Bridges
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Numerical descriptors BPS chapter 2 © 2006 W.H. Freeman and Company.
Created by: Tonya Jagoe. Measures of Central Tendency mean median mode.
Math 15 – Elementary Statistics Chapter 5 – Part 2 Summarizing Data Numerically.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Math 310 Section 8.1 & 8.2 Statistics. Centers and Spread A goal in statistics is to determine how data is centered and spread. There are many different.
Numerical descriptions of distributions
Lesson 25 Finding measures of central tendency and dispersion.
Section 3-4 Measures of Relative Standing and Boxplots.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
What is a box-and-whisker plot? 5-number summary Quartile 1 st, 2 nd, and 3 rd quartiles Interquartile Range Outliers.
Unit 3: Averages and Variations Part 3 Statistics Mr. Evans.
Measures of Central Tendency PS Algebra I. Objectives Given a set of data, be able to find the following: 1) mean* 2) median* 3) mode* 4) range 5) first.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Chap 1: Exploring Data 1.3: Measures of Center 1.4: Quartiles, Percentiles, and Box Plots 1.7: Variance and Standard Deviation.
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
One-Variable Statistics
Objective: Given a data set, compute measures of center and spread.
Calculations with Lists
Do-Now-Day 2 Section 2.2 Find the mean, median, mode, and IQR from the following set of data values: 60, 64, 69, 73, 76, 122 Mean- Median- Mode- InterQuartile.
Measures of central tendency
Box and Whisker Plots Algebra 2.
Measures of central tendency
pencil, red pen, highlighter, GP notebook, graphing calculator
Measures of Position Section 3.3.
pencil, red pen, highlighter, GP notebook, graphing calculator
Warm up Honors Algebra 2 3/14/19
Presentation transcript:

§ 14.3 Numerical Summaries of Data

Numerical Summaries of a Data Set In the last section we looked at ways to graphically represent a data set--today we will look at numerical ways to summarize similar information. The are two major types of numerical summary: 1. Measures of location. 2. Measures of spread.

Numerical Summaries of a Data Set In the last section we looked at ways to graphically represent a data set--today we will look at numerical ways to summarize similar information. The are two major types of numerical summary: 1. Measures of location. 2. Measures of spread. average/mean range median interquartile range quartiles standard deviation

The Average / Mean The average or mean of a data set of size N is found by adding the numbers and dividing by N. Or more formally, if the data set is { x1 , x2 , x3 , . . . , xN } then the mean is given by: x1 + x2 + x3 + . . . + xN N

The Average / Mean What about when we are given a frequency table? Let’s look at the test scores from yesterday: 1 4 8 2 9 13 16 10 6 Frequency 96 76 72 64 60 56 48 44 40 36 32 28 24 Score

The Average / Mean From a Frequency Table Entering Data and Finding the Mean on the TI-83: Hit [Stat] Select “1: Edit…” Enter data into L1. If you are working from a frequency table enter the corresponding frequencies into L2. Go to the “List” menu ([2nd], [Stat]) Select “3: mean( “ You should now be on the ‘main’ screen. Proceed as follows: (a) If you are working from just a list of data, type “L1” ([2nd], [1]) , close the parentheses and hit [Enter]. (b) If you are working from a freq. table type “L1” followed by “,” and “L2” ([2nd], [2]). . Then close the parentheses and hit enter. Step 1: Calculate the total of the data. Total = x1 * f1 + x2 * f2+ x3* f3 + . . . + xk * f1k Step 2: Calculate N. N = f1 + f2 + f3 + . . . + fk Step 3: Calculate the average. Average = Total / N fk f2 . . . f1 Freque ncy xk x2 . . . x1 Data

Example: Average Salary The average salary at a local computer manufacturer with 50 employees is $42,000. The owner draws a yearly salary of $800,000. What is the average salary of the other 49 employees?

Example: 105 Exam Scores Suppose you have averaged a 132 out of 150 on the first 3 exams in Math 105. What score would you need on the fourth exam to have an average of 135?

Percentiles The p th percentile of a data set is the value such that p percent of the numbers fall at or below the value. The rest of the data falls at or above the value. We will call the p th percent of N the locator, and write it as L .

Example: Height Sorting Data on the TI-83: 1. Enter data into L1 as before. 2. Hit [Stat] 3. Select “2: SortA( “ 4. You should now be on the ‘main’ screen. Hit L1. ([2nd], 1) 5. Close the parentheses and hit enter.

Finding the p th Percentile Step 1: Sort the original data set by size. (Suppose {d1 , d2 , d3 , . . . , dN } is the sorted set) Step 2: Compute the value of the locator. L = ( p /100 )( N ) Step 3: The p th percentile is: (a) The average of dL and dL+1 if L is a whole number. (b) dL+ if L is not a whole number. L+ is L rounded up.

Percentiles: The Median and Quartiles The 50th percentile, called the median, is the percentile that is most commonly used. The median will be written M. The other two commonly used percentiles are the quartiles: The first quartile, written as Q1, is the 25th percentile. The third quartile, denoted Q3, is the 75th percentile.

Example: Let’s examine the test scores again. . . 1 4 8 2 9 13 16 10 6 Frequency 96 76 72 64 60 56 48 44 40 36 32 28 24 Score Find the quartiles and the median.

The Five-Number Summary One way to give a nice profile of a data set is the “five-number summary,” which consists of: 1. The lowest value, called the Min. 2. The first quartile, Q1. 3. The median, M. 4. The third quartile, Q3. 5. The highest value, called the Max.

Example: The Five-Number Summary for our test score example would look like this: 1 4 8 2 9 13 16 10 6 Frequency 96 76 72 64 60 56 48 44 40 36 32 28 24 Score

The Five-Number Summary: Box Plots We can also represent the Five- Number Summary graphically in what is called a box plot or a box-and-whiskers plot. Min M Q3 Max Q1

Example: Here is the box plot for our test score example: 1 4 8 2 9 13 16 10 6 Frequency 96 76 72 64 60 56 48 44 40 36 32 28 24 Score Min = 4 M = 44 Q3 = 48 Max = 96 Q1 = 36

§ 14.4 Measures of Spread

Example - Find the average and median of the following data sets:

The Range One way to measure the spread of data is to examine the range, given by R = Max - Min The problem with using the range is that outliers can severely affect it.

Example: Looking again at our ‘test score’ example. . . 1 4 8 2 9 13 16 10 6 Frequency 96 76 72 64 60 56 48 44 40 36 32 28 24 Score We see that the range with the outliers (4 and 96) would be R = 96 - 4 = 92. However, without those pieces of data we would have R = 76 - 24 = 52.

The Interquartile Range In order to eliminate the problems caused by outliers, we could make use of the interquartile range--the difference between the third and first quartile: IQR = Q3 - Q1 This measure tells us where the middle 50% of the data is located.

The IQR for this set of data is: IQR = Q3 - Q1 = 48 - 36 = 12 Example: Your instructor didn’t feel like making a different example. . . 1 4 8 2 9 13 16 10 6 Frequency 96 76 72 64 60 56 48 44 40 36 32 28 24 Score The IQR for this set of data is: IQR = Q3 - Q1 = 48 - 36 = 12

The Standard Deviation The idea: Measure how spread out your data set is by examining how far each piece of information is from some fixed reference point. The reference point we will use is the mean (average).

The Standard Deviation We could try to average the Deviations from the Mean: (Data value - Mean)

Example: Once again, the test score data. . . 1 4 8 2 9 13 16 10 6 Freque ncy 96 76 72 64 60 56 48 44 40 36 32 28 24 Score ( x ) 49.3 9 29.3 9 25.3 9 17.3 9 13.3 9 9.3 9 1.3 9 - 2.6 1 - 6.6 1 - 10.6 1 - 14.6 1 - 18.6 1 - 22.6 1 - 42.6 1 (x - 46.61)

The Standard Deviation We could try to average the Deviations from the Mean: (Data value - Mean) However, negative deviations and positive deviations will cancel each other out--in fact (assuming we don’t round off any of our figures) the average of the deviations from the mean will always be 0!

The Standard Deviation What would happen if we squared the deviations from the mean? The squared deviations are always non-negative, so there would be no canceling. The average of these squared deviations is called the variance, V.

The Standard Deviation Unfortunately, there is a problem with using the variance as well--the units of measure. For instance if we were studying people’s height in inches (in), the variance would appear in units of in2.

The Standard Deviation Unfortunately, there is a problem with using the variance as well--the units of measure. For instance if we were studying people’s height in inches (in), the variance would come be in units of in2. The solution to our dilemma is simple--we will just take the square root of the variance to get the what is called the standard deviation, .

Finding The Standard Deviation Step 1: Find the average/mean of the data set. Call it A. Step 2: For each number x in the data set find the deviation from the mean, x - A. Step 3: Square each of the deviations found in Step 2. Step 4: Find the average of the squared deviations found in step 3. This is the variance, V. Step 5: Take the square root of the variance. This is the standard deviation, .

Finding The Standard Deviation Another way to find the Standard Deviation by hand is to use the following formula: N ∑ ( xi - A )2 i=1 N √  =

Finding The Standard Deviation Finding all of the information from 14.2-14.3 on the TI-83: 1. Enter data as shown previously. Quit to the main screen. 2. Hit [Stat] 3. Move right to the “CALC“ menu. 4. Select “1-Var Stats”. 5. Now on the main screen, type “L1”. (If you are using data from a frequency table also type “,” and “L2”) Hit [Enter]. 6. Interpret the information as follows: x is the mean/average, A; x is the Standard Deviation; n is the size of your data set; If you arrow down the Min, Max, Median and quartiles should all be listed.

Example: Find the standard deviation for the following data set. {1, 6, 14, 19}