Unit 4 Describing Data Standards: S.ID.1 Represent data on the real number line (dot plots, histograms, and box plots) S.ID.2 Use statistics appropriate.

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

Unit 1.1 Investigating Data 1. Frequency and Histograms CCSS: S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box.
Statistics Unit 6.
Ch 11 – Probability & Statistics
Introduction Data sets can be compared and interpreted in the context of the problem. Data values that are much greater than or much less than the rest.
Unit 4 – Probability and Statistics
Statistics: Use Graphs to Show Data Box Plots.
Introduction Measures of center and variability can be used to describe a data set. The mean and median are two measures of center. The mean is the average.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
4.1.1: Summarizing Numerical Data Sets
Vocabulary box-and-whisker plot quartiles variation
Measures of Central Tendency & Spread
Objectives Vocabulary
Warm Up Problem of the Day Lesson Presentation Lesson Quizzes.
Analyze Data USE MEAN & MEDIAN TO COMPARE THE CENTER OF DATA SETS. IDENTIFY OUTLIERS AND THEIR EFFECT ON DATA SETS.
Statistics: Mean of Absolute Deviation
Warm-Up Define mean, median, mode, and range in your own words. Be ready to discuss.
Unit 4 Day 1 Vocabulary. Mean The average value of a data set, found by summing all values and dividing by the number of data points Example:
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
MIA U2D7 Warmup: Find the mean (rounded to the nearest tenth) and median for the following data: 73, 50, 72, 70, 70, 84, 85, 89, 89, 70, 73, 70, 72, 74.
Summary Statistics and Mean Absolute Deviation MM1D3a. Compare summary statistics (mean, median, quartiles, and interquartile range) from one sample data.
Introduction Measures of center and variability can be used to describe a data set. The mean and median are two measures of center. The mean is the average.
CCGPS Advanced Algebra UNIT QUESTION: How do we use data to draw conclusions about populations? Standard: MCC9-12.S.ID.1-3, 5-9, SP.5 Today’s Question:
Warm Up Simplify each expression
Math 310 Section 8.1 & 8.2 Statistics. Centers and Spread A goal in statistics is to determine how data is centered and spread. There are many different.
Summary Statistics, Center, Spread, Range, Mean, and Median Ms. Daniels Integrated Math 1.
2-6 Box-and-Whisker Plots Indicator  D1 Read, create, and interpret box-and whisker plots Page
Unit 4: Probability Day 4: Measures of Central Tendency and Box and Whisker Plots.
Concept: Comparing Data. Essential Question: How do we make comparisons between data sets? Vocabulary: Spread, variation Skewed left Skewed right Symmetric.
Concept: Representing Data Essential Question: How do we represent data? Vocabulary: Dot plot Box and whisker plot Histogram : Representing Data.
Chapter 4 Measures of Central Tendency Measures of Variation Measures of Position Dot Plots Stem-and-Leaf Histograms.
Measures of Central Tendency, Dispersion, IQR and Standard Deviation How do we describe data using statistical measures? M2 Unit 4: Day 1.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Lesson Topic: The Mean Absolute Deviation (MAD) Lesson Objective: I can…  I can calculate the mean absolute deviation (MAD) for a given data set.  I.
Statistics Review  Mode: the number that occurs most frequently in the data set (could have more than 1)  Median : the value when the data set is listed.
Introduction Data sets can be compared by examining the differences and similarities between measures of center and spread. The mean and median of a data.
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
Holt McDougal Algebra 1 Data Distributions Holt Algebra 1 Warm Up Warm Up Lesson Presentation Lesson Presentation Lesson Quiz Lesson Quiz Holt McDougal.
7 th Grade Math Vocabulary Word, Definition, Model Emery Unit 4.
Statistics Vocab Notes Unit 4. Mean The average value of a data set, found by adding all values and dividing by the number of data points Example: 5 +
Statistics Unit 6.
Notes 13.2 Measures of Center & Spread
Introduction To compare data sets, use the same types of statistics that you use to represent or describe data sets. These statistics include measures.
Chapter 5 : Describing Distributions Numerically I
Statistics Unit Test Review
Measures of Central Tendency & Center of Spread
Measures of Central Tendency & Center of Spread
Unit 4 Statistics Review
Statistics Unit 6.
Warm-up 8/25/14 Compare Data A to Data B using the five number summary, measure of center and measure of spread. A) 18, 33, 18, 87, 12, 23, 93, 34, 71,
Lesson 1: Summarizing and Interpreting Data
The absolute value of each deviation.
Summarizing Numerical Data Sets
11.2 box and whisker plots.
Algebra I Unit 1.
Unit 6: Statistics Mean, Median, Mode, Range Measures of Variation
Describe the spread of the data:
Day 91 Learning Target: Students can use statistics appropriate to the shape of the data distribution to compare center (median, mean) and spread (interquartile.
Exploratory Data Analysis
EOC Review Question of the Day.
Chapter 1: Exploring Data
Statistics Vocabulary Continued
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Warm – Up: Students will take a quiz,
Warm-Up Define mean, median, mode, and range in your own words. Be ready to discuss.
Statistics Vocabulary Continued
Describing Data Coordinate Algebra.
“Day E” April 22, :51 - 8:51 Math 8:53 - 9:53 Science
Warm up Find the Mean Median Mode 0, 23, 12, 5, 8, 5, 13, 5, 15.
Presentation transcript:

Unit 4 Describing Data Standards: S.ID.1 Represent data on the real number line (dot plots, histograms, and box plots) S.ID.2 Use statistics appropriate to the shape of the data distribution to compare center (mean, median) and spread (IQR) of two or more different data sets. S.ID. 3 Interpret differences in shape center and spread in the context of the data sets, accounting for possible effects of extreme data points (outliers) : Summarizing Numerical Data Sets

Concept: Summarize Data Essential Question: How do we summarize data? Vocabulary: Mean, Median Mode, Range IQR, MAD : Summarizing Numerical Data Sets

Key Concepts Data sets can be compared using measures of center and variability. Measures of center are used to generalize data sets and identify common or expected values. Two measures of center are the mean and median : Summarizing Numerical Data Sets Finding the Mean 1.Find the sum of the data values. 2.Divide the sum by the number of data points. This is the mean.

Key Concepts, continued The mean is useful when data sets do not contain values that vary greatly. Median is a second measure of center : Summarizing Numerical Data Sets Finding the Median 1.First arrange the data from least to greatest. 2.Count the number of data points. If there is an even number of data points, the median is the average of the two middle-most values. If there is an odd number of data points, the median is the middle- most value.

Key Concepts, continued The mean and median are both measures that describe the expected value of a data set. Measures of spread describe the range of data values in a data set. Mean absolute deviation and interquartile range describe variability : Summarizing Numerical Data Sets

Key Concepts, continued The mean absolute deviation takes the average distance of the data points from the mean. This summarizes the variability of the data using one number : Summarizing Numerical Data Sets Finding the Median Absolute Deviation 1.Find the mean. 2.Calculate the absolute value of the difference between each data value and the mean. 3.Determine the average of the differences found in step 2. This average is the mean absolute deviation.

Key Concepts, continued : Summarizing Numerical Data Sets Finding the Interquartile Range 1.Arrange the data from least to greatest. 2.Count the number of data points in the set. 3.Find the median of the data set. The median divides the data into two halves: the lower half and the upper half. 4.Find the middle-most value of the lower half of the data. The data to the left represents the first quartile, Q 1. 5.Find the middle-most value of the upper half of the data. The data to the right is the third quartile, Q 3. 6.Calculate the difference between the two quartiles, Q 3 – Q 1. The interquartile range is the difference between the third and first quartiles.

Key Concepts, continued The interquartile range finds the distance between the two data values that represent the middle 50% of the data. This summarizes the variability of the data using one number : Summarizing Numerical Data Sets

Common Errors/Misconceptions confusing the terms mean and median, and how to calculate each measure forgetting to order data from least to greatest before calculating the median, quartiles, and interquartile range incorrectly finding the absolute value of the difference between each data value and the mean : Summarizing Numerical Data Sets

Guided Practice Example 1 A website captures information about each customer’s order. The total dollar amounts of the last 8 orders are listed in the table to the right. What is the mean absolute deviation of the data? : Summarizing Numerical Data Sets OrderDollar amount

Guided Practice: Example 1, continued 1.To find the mean absolute deviation of the data, start by finding the mean of the data set : Summarizing Numerical Data Sets

Guided Practice: Example 1, continued 2.Find the sum of the data values, and divide the sum by the number of data values : Summarizing Numerical Data Sets

Guided Practice: Example 1, continued 3.Find the absolute value of the difference between each data value and the mean: |data value – mean|. |21 – 21| = 0 |15 – 21| = 6 |22 – 21| = 1 |26 – 21| = 5 |24 – 21| = 3 |21 – 21| = 0 |17 – 21| = 4 |22 – 21| = : Summarizing Numerical Data Sets

Guided Practice: Example 1, continued 4.Find the sum of the absolute values of the differences = : Summarizing Numerical Data Sets

Guided Practice: Example 1, continued 5.Divide the sum of the absolute values of the differences by the number of data values. The mean absolute deviation of the dollar amounts of each order set is 2.5. This says that the average cost difference between the orders and the mean order is $ : Summarizing Numerical Data Sets ✔

Guided Practice: Example 1, continued : Summarizing Numerical Data Sets

Guided Practice Example 2 A company keeps track of the age at which employees retire. It is considered an early retirement if the employee retires before turning 65. The age of the 11 employees who took early retirement this year are listed in the table below. Are there any striking deviations in the data? : Summarizing Numerical Data Sets

Guided P ractice: Example 2, continued : Summarizing Numerical Data Sets EmployeeAge at early retirement

Guided Practice: Example 2, continued 1.First find the interquartile range : Summarizing Numerical Data Sets

Guided Practice: Example 2, continued 2.Order the data set from least to greatest : Summarizing Numerical Data Sets

Guided Practice: Example 2, continued 3.Find the median of the data set. If there is an odd number of data values, find the middle-most value. If there is an even number of data values, find the average of the two middle-most values. There are 11 data values. The sixth data value is the middle-most value, and therefore is the median. The median of this data set is : Summarizing Numerical Data Sets median

Guided Practice: Example 2, continued 4.Find the first quartile. The first quartile is the median of the lower half of the data set, or the values less than the median value. The first five data values are the lower half of the data set: 42, 48, 51, 53, and 55. The median of the first five data values is the middle-most value of these five values. The first quartile is the third value, : Summarizing Numerical Data Sets medianQ1Q1

Guided Practice: Example 2, continued 5.Find the third quartile. The third quartile is the median of the upper half of the data set, or the values greater than the median value. The last five data values are the upper half of the data set: 56, 58, 59, 60, and 64. The median of the last five data values is the middle-most value of these five values. The third quartile is the third value, : Summarizing Numerical Data Sets medianQ1Q1 Q3Q3

Guided Practice: Example 2, continued 6.Find the difference between the third and first quartiles: third quartile – first quartile, or Q 3 – Q – 51 = 8 The interquartile range is : Summarizing Numerical Data Sets

Guided Practice: Example 2, continued 7.Look for striking deviations in the data. Think about the typical retirement age, which is 65. Also consider the interquartile range, which is 8. Retiring at the age of 42 is young and far away from the mean of 56. The age 42 would be considered a striking deviation because it is far away from the other data values : Summarizing Numerical Data Sets ✔

Guided Practice: Example 2, continued : Summarizing Numerical Data Sets