Data Distributions.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Section 2.1 Visualizing Distributions: Shape, Center and Spread.
Chapter 1 & 3.
Chapter 1 Introduction Individual: objects described by a set of data (people, animals, or things) Variable: Characteristic of an individual. It can take.
Homework Questions. Quiz! Shhh…. Once you are finished you can work on the warm- up (grab a handout)!
CHAPTER 2: Describing Distributions with Numbers
Chapter 4 Displaying Quantitative Data. Graphs for Quantitative Data.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
INTRODUCTORY STATISTICS Chapter 2 DESCRIPTIVE STATISTICS PowerPoint Image Slideshow.
Warm-Up If the variance of a set of data is 12.4, what is the standard deviation? If the standard deviation of a set of data is 5.7, what is the variance?
AP Stats Chapter 1 Review. Q1: The midpoint of the data MeanMedianMode.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Density Curves and the Normal Distribution.
2.1 Density Curves and the Normal Distribution.  Differentiate between a density curve and a histogram  Understand where mean and median lie on curves.
Analyze Data USE MEAN & MEDIAN TO COMPARE THE CENTER OF DATA SETS. IDENTIFY OUTLIERS AND THEIR EFFECT ON DATA SETS.
Statistics: Mean of Absolute Deviation
Warm up The following graphs show foot sizes of gongshowhockey.com users. What shape are the distributions? Calculate the mean, median and mode for one.
Describing and Displaying Quantitative data. Summarizing continuous data Displaying continuous data Within-subject variability Presentation.
Sampling Distributions. Essential Question: How is the mean of a sampling distribution related to the population mean or proportion?
Summarizing Quantitative Data. We have discussed how to display data in a histogram. Today learn to describe how data is distributed.
Numerical Measures of Variability
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
Measures of Spread Chapter 3.3 – Tools for Analyzing Data Mathematics of Data Management (Nelson) MDM 4U.
Revision Analysing data. Measures of central tendency such as the mean and the median can be used to determine the location of the distribution of data.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
Chapter 5 Describing Distributions Numerically.
SWBAT: Measure standard deviation, identify outliers, and construct a boxplot using the five-number summary Do Now: Explain how this graph could be misleading.
Warm Up Simplify each expression
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
ANALYZING THE SHAPE OF DATA (CC-37) PURPOSE: TO CHOOSE APPROPRIATE STATISTICS BASED ON THE SHAPE OF THE DATA DISTRIBUTION. ADRIAN, KARLA, ALLEN, DENISSE.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Standard Deviation by Hand 9 – Step Process See 4.2 WS.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Unit 1 - Graphs and Distributions. Statistics 4 the science of collecting, analyzing, and drawing conclusions from data.
Analyzing Data Week 1. Types of Graphs Histogram Must be Quantitative Data (measurements) Make “bins”, no overlaps, no gaps. Sort data into the bins.
ALL ABOUT THAT DATA UNIT 6 DATA. LAST PAGE OF BOOK: MEAN MEDIAN MODE RANGE FOLDABLE Mean.
AP Statistics. Chapter 1 Think – Where are you going, and why? Show – Calculate and display. Tell – What have you learned? Without this step, you’re never.
Lesson 11.1 Normal Distributions (Day 1)
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
CHAPTER 1 Exploring Data
Chapter 5 : Describing Distributions Numerically I
Chapter 1 & 3.
Statistical Reasoning
Distributions and Graphical Representations
Unit 1 - Graphs and Distributions
Basics of Statistics.
2.1 Density Curve and the Normal Distributions
Warmup What five numbers need to be mentioned in the complete sentence you write when the data distribution is skewed?
Drill Construct a Histogram to represent the data of test score averages in 20 cities using 5 Bars. Test Averages {62, 68, 72, 58, 83, 91, 70, 82, 68,
Drill {A, B, B, C, C, E, C, C, C, B, A, A, E, E, D, D, A, B, B, C}
Calculating IQR and Identifying Outliers
Chapter 8 Review Showdown.
Quartile Measures DCOVA
SWBAT: Measure center with the mean and median and spread with interquartile range. Do Now:
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
Shape of Distributions
Organizing Data AP Stats Chapter 1.
1.3 Describing Quantitative Data with Numbers
AP Statistics Day 4 Objective: The students will be able to describe distributions with numbers and create and interpret boxplots.
10-5 The normal distribution
Click the mouse button or press the Space Bar to display the answers.
Histograms and Measures of Center vs. Spread
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
The Five-Number Summary
Shape, Center, Spread.
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Data Distributions

Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Data Distribution A set of numerical data that you can graph using a data display that involves a number line. Ex: line plot, histogram, or box plot. The graph will reveal the shape of the distribution.

Seeing the Shape of a Data Distribution Baby Birth Month Birth Weight (kg) Mother’s age 1 5 3.3 28 2 7 3.6 31 3 11 3.5 33 4 3.4 35 10 3.7 39 6 30 29 8 3.2 9 32 12 13 14 15 34 16 3.8 17 18 19 20 Make a line plot for the distribution of birth months. Make a line plot for the distribution of birth weights. Make a line plot for the distribution of mothers’ ages.

Reflect 1a. Describe the shape of the distribution of birth months.

Reflect 1b. Describe the shape of the distribution of birth weights.

Reflect 1c. Describe the shape of the distribution of mothers’ age.

Understanding Shape, Center, and Spread Data distributions can have various shapes. These shapes have names in Statistics.

Uniform Distribution The shape is basically level. It looks like a rectangle.

Normal Distribution Mound in the middle with symmetric tails at each end. Looks bell shaped.

Skewed distribution Mounded by not symmetric because one “tail” is much longer than the other.

Other (Mixed)

Distribution Center and Spread Mean

Distribution Center and Spread Median

Distribution Center and Spread Standard Deviation

Distribution Center and Spread Interquartile Range (IQR) – tells you how spread out the middle or (50%) of data are.

Reflect 2a. Describe the shape of each distribution that you made in the Example, using the vocabulary you just learned.

Reflect 2b. When the center and the spread of a distribution are reported, they are generally given either as the mean and standard deviation or as the median and IQR. Why do these pairings make sense?

Relating Center and Spread to Shape Baby Birth Month Birth Weight (kg) Mothers’ age 1 5 3.3 28 2 7 3.6 31 3 11 3.5 33 4 3.4 35 10 3.7 39 6 30 29 8 3.2 9 32 12 13 14 15 34 16 3.8 17 18 19 20 Calculate the following for both birth weight and mothers’ ages. Mean Median Standard Deviation IQR

Reflect 3a. What do you notice about the mean and median for the symmetric distribution (baby weights) as compared with the mean and median for the skewed distribution (mothers’ ages)? Explain why this happens.

Reflect 3b. One way to compare the spread of two distributions is to find the ratio (expressed as a percent) of the standard deviation to the mean for each distribution. Another way to find the ratio (expressed as a percent) of the IQR to the median. Calculate these ratios, rounding each to the nearest percent if necessary, for the symmetric and the skewed distribution. What do you observe when you compare the corresponding ratios? Why does this make sense?

Reflect 3c. Which measures of center and spread would you report for the symmetric distribution? For the skewed distribution? Explain your reasoning.

Making and Analyzing a Histogram We will use Excel to create a histogram using the data of baby weights.

Reflect 4a. By examining the histogram, determine the percent of the data that fall within 1 Standard Deviation (s=0.14) of the mean ( 𝑥 =3.5). That is, determine the percent of the data in the interval 3.5−0.14<𝑥<3.5+0.14, or 3.36<𝑥<3.64 Explain your reasoning.

Reflect 4b. Suppose one of the baby weights is chosen at random. By examining the histogram, determine the probability that the weight is more than 1 standard deviation (s=0.14) above the mean ( 𝑥 =3.5). That is, determine the probability that the weight is in the interval 𝑥>3.5+0.14, 𝑜𝑟 𝑥>3.64. Explain your reasoning.

Making and Analyzing a Box Plot

Reflect 5a How does the box plot show the distribution is skewed right?

Reflect 5b. Suppose one of the mothers’ ages is chosen at random. Based on the box plot and not the original set of data, what can you say is the approximate probability that the age falls between the median, 30.5 and the third quartile, 32.5? Explain your reasoning.

Making and Analyzing a Box Plot with outliers

Reflect 5c. A data value 𝑥 is considered to be an outlier if 𝑥< 𝑄 1 −1.5 𝐼𝑄𝑅 𝑜𝑟 𝑥> 𝑄 3 +1.5(𝐼𝑄𝑅) Explain why a mother’s age of 39 is an outlier for this data set. Redraw the box plot using the option for showing outliers. How does the box plot change?