Presentation transcript:

Introduction
Measures of center and variability can be used to describe a data set. The mean and median are two measures of center. The mean is the average value of the data. The median is the middle-most value in a data set. These measures are used to generalize data sets and identify common or expected values. Interquartile range and mean absolute deviation describe variability of the data set. Interquartile range is the difference between the third and first quartiles. The first quartile is the median of the lower half of the data set.
Summarizing Numerical Data Sets

Introduction, continued
The third quartile is the median of the upper half of the data set. The mean absolute deviation is the average absolute value of the difference between each data point and the mean. Measures of spread describe the variability of data values (how spread out they are) and identify the diversity of values in a data set. Measures of spread are used to help explain whether data values are very similar or very different.

Key Concepts
Data sets can be compared using measures of center and variability. Measures of center are used to generalize data sets and identify common or expected values. Two measures of center are the mean and median.
Finding the Mean
1. Find the sum of the data values.
2. Divide the sum by the number of data points. This is the mean.
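The two steps for finding the mean can be sketched in Python. This is a minimal illustration, not part of the original slides; the function name `mean` and the sample values are assumptions chosen for the example.

```python
def mean(values):
    """Return the mean: the sum of the values divided by how many there are."""
    if not values:
        raise ValueError("mean requires at least one data value")
    total = sum(values)          # Step 1: find the sum of the data values.
    return total / len(values)   # Step 2: divide by the number of data points.


# Hypothetical data, chosen only for illustration.
sample = [4, 8, 6, 10]
print(mean(sample))  # 7.0
```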

Key Concepts, continued
The mean is useful when data sets do not contain values that vary greatly. The median is a second measure of center.
Finding the Median
1. First arrange the data from least to greatest.
2. Count the number of data points. If there is an even number of data points, the median is the average of the two middle-most values. If there is an odd number of data points, the median is the middle-most value.
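The median procedure, including the odd and even cases, might look like the following in Python. The function name and the sample lists are assumptions made for this sketch.

```python
def median(values):
    """Return the middle-most value of the data after sorting."""
    if not values:
        raise ValueError("median requires at least one data value")
    ordered = sorted(values)   # Step 1: arrange from least to greatest.
    n = len(ordered)           # Step 2: count the data points.
    mid = n // 2
    if n % 2 == 1:
        # Odd count: the single middle-most value is the median.
        return ordered[mid]
    # Even count: average the two middle-most values.
    return (ordered[mid - 1] + ordered[mid]) / 2


# Hypothetical data, chosen only for illustration.
print(median([7, 3, 9]))     # 7
print(median([7, 3, 9, 5]))  # 6.0
```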

Key Concepts, continued
The mean and median are both measures that describe the expected value of a data set. Measures of spread describe the range of data values in a data set. Mean absolute deviation and interquartile range describe variability.

Key Concepts, continued
The mean absolute deviation takes the average distance of the data points from the mean. This summarizes the variability of the data using one number.
Finding the Mean Absolute Deviation
1. Find the mean.
2. Calculate the absolute value of the difference between each data value and the mean.
3. Determine the average of the differences found in step 2. This average is the mean absolute deviation.
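As a rough sketch of the three steps for the mean absolute deviation, the Python below follows them in order. The function name and the sample data are assumptions, not taken from the slides.

```python
def mean_absolute_deviation(values):
    """Return the average distance of the data values from their mean."""
    if not values:
        raise ValueError("at least one data value is required")
    center = sum(values) / len(values)              # Step 1: find the mean.
    distances = [abs(v - center) for v in values]   # Step 2: |data value - mean|.
    return sum(distances) / len(distances)          # Step 3: average the distances.


# Hypothetical data: the mean is 5 and the distances are 3, 1, 1, 3.
print(mean_absolute_deviation([2, 4, 6, 8]))  # 2.0
```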

Key Concepts, continued
Finding the Interquartile Range
1. Arrange the data from least to greatest.
2. Count the number of data points in the set.
3. Find the median of the data set. The median divides the data into two halves: the lower half and the upper half.
4. Find the middle-most value (the median) of the lower half of the data. This value is the first quartile, Q1.
5. Find the middle-most value (the median) of the upper half of the data. This value is the third quartile, Q3.
6. Calculate the difference between the two quartiles, Q3 – Q1. This difference is the interquartile range.
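The interquartile range steps can be sketched as follows. This version assumes the convention used in these slides, where the median itself is left out of both halves when the number of data points is odd. The function names and the sample data are hypothetical.

```python
from statistics import median


def quartiles(values):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    ordered = sorted(values)   # Step 1: arrange from least to greatest.
    n = len(ordered)           # Step 2: count the data points.
    if n < 2:
        raise ValueError("at least two data values are required")
    half = n // 2
    # Step 3: the median splits the data into halves. With an odd count the
    # median value belongs to neither half (assumed convention).
    lower = ordered[:half]
    upper = ordered[half + 1:] if n % 2 == 1 else ordered[half:]
    # Steps 4 and 5: each quartile is the median of its half.
    return median(lower), median(upper)


def interquartile_range(values):
    """Step 6: the interquartile range is Q3 - Q1."""
    q1, q3 = quartiles(values)
    return q3 - q1


# Hypothetical data, chosen only for illustration.
sample = [1, 3, 5, 7, 9, 11, 13]
print(quartiles(sample))            # (3, 11)
print(interquartile_range(sample))  # 8
```

Statistical software often computes quartiles with other conventions, such as interpolating between values, so results from a library routine may differ slightly from this hand method.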

Key Concepts, continued
The interquartile range is the distance between the first and third quartiles, the two values that bound the middle 50% of the data. This summarizes the variability of the data using one number.

Common Errors/Misconceptions
- confusing the terms mean and median, and how to calculate each measure
- forgetting to order data from least to greatest before calculating the median, quartiles, and interquartile range
- incorrectly finding the absolute value of the difference between each data value and the mean

Guided Practice
Example 3
A website captures information about each customer’s order. The total dollar amounts of the last 8 orders are listed below. What is the mean absolute deviation of the data?
Dollar amounts of the 8 orders: 21, 15, 22, 26, 24, 21, 17, 22

Guided Practice: Example 3, continued
1. To find the mean absolute deviation of the data, start by finding the mean of the data set.

Guided Practice: Example 3, continued
2. Find the sum of the data values, and divide the sum by the number of data values.
(21 + 15 + 22 + 26 + 24 + 21 + 17 + 22) / 8 = 168 / 8 = 21
The mean is 21.

Guided Practice: Example 3, continued
3. Find the absolute value of the difference between each data value and the mean: |data value – mean|.
|21 – 21| = 0
|15 – 21| = 6
|22 – 21| = 1
|26 – 21| = 5
|24 – 21| = 3
|21 – 21| = 0
|17 – 21| = 4
|22 – 21| = 1

Guided Practice: Example 3, continued
4. Find the sum of the absolute values of the differences.
0 + 6 + 1 + 5 + 3 + 0 + 4 + 1 = 20

Guided Practice: Example 3, continued
5. Divide the sum of the absolute values of the differences by the number of data values.
20 / 8 = 2.5
The mean absolute deviation of the dollar amounts of the orders is 2.5. This means that, on average, an order differs from the mean order amount by $2.50.
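The arithmetic in Example 3 can be checked with a few lines of Python. The eight amounts are the data values that appear in step 3; the variable names are chosen here for illustration.

```python
# Order amounts as they appear in step 3 of Example 3.
orders = [21, 15, 22, 26, 24, 21, 17, 22]

mean_amount = sum(orders) / len(orders)                # steps 1 and 2
deviations = [abs(x - mean_amount) for x in orders]    # step 3
total_deviation = sum(deviations)                      # step 4
mad = total_deviation / len(orders)                    # step 5

print(mean_amount)      # 21.0
print(deviations)       # [0.0, 6.0, 1.0, 5.0, 3.0, 0.0, 4.0, 1.0]
print(total_deviation)  # 20.0
print(mad)              # 2.5
```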


Guided Practice
Example 4
A company keeps track of the age at which employees retire. It is considered an early retirement if the employee retires before turning 65. The ages of the 11 employees who took early retirement this year are listed in the table below. Are there any striking deviations in the data?

Guided Practice: Example 4, continued
[Table: Employee, Age at early retirement]

Guided Practice: Example 4, continued
1. First find the interquartile range.

Guided Practice: Example 4, continued
2. Order the data set from least to greatest.

Guided Practice: Example 4, continued
3. Find the median of the data set. If there is an odd number of data values, find the middle-most value. If there is an even number of data values, find the average of the two middle-most values. There are 11 data values. The sixth data value is the middle-most value, and therefore is the median. The median of this data set is [value missing from transcript].

Guided Practice: Example 4, continued
4. Find the first quartile. The first quartile is the median of the lower half of the data set, or the values less than the median value. The first five data values are the lower half of the data set: 42, 48, 51, 53, and 55. The median of the first five data values is the middle-most value of these five values. The first quartile is the third value, 51.

Guided Practice: Example 4, continued
5. Find the third quartile. The third quartile is the median of the upper half of the data set, or the values greater than the median value. The last five data values are the upper half of the data set: 56, 58, 59, 60, and 64. The median of the last five data values is the middle-most value of these five values. The third quartile is the third value, 59.

Guided Practice: Example 4, continued
6. Find the difference between the third and first quartiles: third quartile – first quartile, or Q3 – Q1.
59 – 51 = 8
The interquartile range is 8.
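The quartiles in Example 4 can be verified from the two halves listed in steps 4 and 5. In this short sketch the variable names are assumptions; the median of the full data set is not needed because, in this method, it belongs to neither half.

```python
from statistics import median

# Lower and upper halves of the ordered ages, as listed in steps 4 and 5.
lower_half = [42, 48, 51, 53, 55]
upper_half = [56, 58, 59, 60, 64]

q1 = median(lower_half)   # middle-most value of the lower half
q3 = median(upper_half)   # middle-most value of the upper half
iqr = q3 - q1

print(q1, q3, iqr)  # 51 59 8
```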

Guided Practice: Example 4, continued
7. Look for striking deviations in the data. Think about the typical retirement age, which is 65. Also consider the interquartile range, which is 8. Retiring at the age of 42 is young and far away from the mean of 56. The age 42 would be considered a striking deviation because it is far away from the other data values.
