Download presentation
Presentation is loading. Please wait.
Published byLinette Terry Modified over 8 years ago
1
Descriptive Statistics
2
Frequency Distributions and Their Graphs What you should learn: How to construct a frequency distribution including midpoints, relative frequencies, and cumulative frequencies. How to construct frequency histograms, frequency polygons, relative frequency histograms and ogives.
3
Frequency Distribution - Vocabulary A table that shows classes or intervals of data entries with a count of the number of entries in each class. Frequency - The number of data entries in the class.
4
Class width- Vocabulary The difference between two consecutive lower class limits in a frequency distribution table. Lower Class Limit - The least number that can belong to a specific class. Upper Class Limit - The greatest number that can belong to a specific class.
5
Guidelines for Constructing a Frequency Distribution Table Decide on the number of classes or intervals. This should be between 5 and 20. Calculate the class width. Round up to the next convenient number. Maximum data entry – Minimum data entry Number of classes Find the class limits. Use the minimum data entry as the lower limit of the first class. To find the remaining lower limits, add the class width to the lower limit of the preceding class.
6
Midpoint of a class - Vocabulary The average of the lower and upper limits of the class. Relative Frequency - The portion or percent of the data that falls in that class. Cumulative Frequency of a class - The sum of the frequency for that class and all previous classes. The cumulative frequency of the last class is equal to the sample size, n.
7
Frequency Histogram- Vocabulary A bar graph that represents the frequency distribution of a data set. Properties of a Histogram The horizontal scale is quantitative and measures the data values. The vertical scale measures the frequencies of the classes. Consecutive bars must touch.
8
Sample Histogram (using midpoints)
9
Sample Histogram (using class boundaries)
10
Histogram Checklist Does your histogram have a title? Does your horizontal axis have a title? Does your vertical axis have a title? Did you number your horizontal axis? Did you number your vertical axis? Did you leave a space between your vertical axis and the first tower? Is your histogram attactive to the eye?
11
Class Boundaries- Vocabulary Numbers that separate classes without forming gaps between them. Since consecutive bars of a histogram must touch, bars must begin and end at class boundaries instead of class limits.
12
Guidelines for Constructing a Frequency Histogram Find the class boundaries by subtracting.5 from the lower class limits and adding.5 to the upper class limits. Use either the class boundaries or the class midpoints as the horizontal scale. Construct rectangular towers to represent the frequency of each class of data.
13
Other data displays based on Frequency Distribution Tables Relative Frequency Histogram Ogive (Cumulative Frequency Graph)
14
What will a Relative Frequency Histogram Look Like? 0 5 10 15 2.5 5.5 8.5 11.5 14.5 17.5 Frequency Histogram 0.09.18.27 2.5 5.5 8.5 11.5 14.5 17.5 Relative Frequency Histogram Just a different vertical scale.
15
Frequency Polygon Like a Line Graph, but used for grouped data. They serve the same purpose as a histogram. More practical for comparing two sets of data.
16
Sample Frequency Polygon (using midpoints)
17
Guidelines for Constructing a Frequency Polygon Use the class midpoints to label the horizontal axis. Plot the frequency of each class against the vertical axis. Be sure to leave space between the first midpoint and the vertical axis. Plot your frequencies. Close your polygon. Be sure that you have even spacing at the beginning and the end.
18
Assignment: pg. 39: 3,4,15,18,22,27,30,32
19
More Graphs and Displays What you should learn: How to graph quantitative data sets using stem-and-leaf plots and dot plots How to graph qualitative data sets using pie charts and Pareto charts How to graph paired data sets using scatter plots and time series charts
20
Stem-and-leaf plot - Vocabulary An alternative method to the histogram of displaying quantitative data Stem - Left most digits of the data Leaf - The right most digit of the data.
22
Advantages of Stem-and-Leaf Plots The graphs contains the original data. Provides an easy way to order the data.
23
Unordered stem-and-leaf plot Ordered stem-and-leaf plot Double rows for each stem Leaves on each side of the stem (Back to Back Stem-and-Leaf) Types of Stem-and-Leaf Plots
24
Ordered Stem-and-Leaf Plot Double Stem Stem-and-Leaf Plot Back to Back Stem-and-Leaf Plot
25
Constructing Stem-and-Leaf Plots Write the stems in a vertical column with the smallest at the top. Draw a vertical line at the right of the above mentioned column Write each leaf in the row to the right of its stem Sort the leaves in increasing order as they move out from the stem.
26
Dot Plot- Vocabulary An alternative method to the histogram of displaying quantitative data whereby each data entry is plotted, using a point, above a horizontal axis. These allow you to see the distribution of the data
27
Sample Dot Plot
29
Constructing Dot Plots Choose an appropriate scale for the horizontal axis. Plot a point above the horizontal axis to represent each data entry.
30
Up until this point, our data displays were all Quantitative. Quantitative Data Displays Histograms Stem-and-leaf Plots Dot Plots
31
Qualitative Data Displays Pie Chart- A circle graph that shows relationships of parts to a whole. Pareto Chart- A vertical bar graph in which the height of each bar represents frequency or relative frequency. The bars are positioned in order of decreasing height, with the tallest bar positioned at the left.
32
Sample Pie Chart
33
Constructing Pie Charts Calculate the fractional part of the data that each category covers. Convert the fractional part to a percent. Round the percent to the nearest whole number. Calculate the number of degrees that the percent would represent in a circle. Using a protractor, measure the number of degrees represented by each category. Create a key and a title for your pie chart.
34
Be sure the 0 lines up on the radius of the circle. Measure the appropriate degree in a counterclockwise direction. Mark your location. Draw another radius from the center to the mark you just created. Realign your protractor so that the center of the protractor is at the center of the circle, but zero degrees is now lined up with the new radius just drawn. Using a Protractor Draw a line from the center of the circle to the edge of the circle (Radius) Line the center of the protractor up with the center of the circle.
35
Assignment: pg. 51: 3-10,15,16,21,22,23
36
Sample Pareto
37
Constructing Pareto Charts After collecting your data, determine the frequency of each data entry. Label your left vertical axis with frequency. Label your right vertical axis with percentages. Construct a bar chart tower for the most frequent piece of data. Construct a second bar chart tower for the next most frequent piece of data. Continue in this manner until all the data is represented. Then plot the cumulative percentage above each tower. Connect the plots. Be sure to title your Pareto Chart and label your towers.
38
Graphing Paired Data Sets One piece of data corresponds with another piece of data. Example: grams of fat vs. calorie count in a particular food product Represent the data using a scatter plot.
39
Scatter Plot- Vocabulary The ordered pairs are graphed as points in a coordinate plane.
40
The scatter plot below shows the women's winning marathon times and the high temperatures on the marathon days. Sample Scatter Plot
41
Time Series Chart Vocabulary A data set that is composed of entries taken at regular intervals over a period of time.
42
Toontown is a web based computer game. The chart below represents the plays over a 24 hour period. Sample Series Time Chart
43
Assignment: pg. 53: 24, 26, 30
44
Measures of Central Tendency What you should learn: How to find the mean, median, and mode of a population and a sample. How to find a weighted mean and the mean of a frequency distribution. How to describe the shape of a distribution as symmetric, uniform, or skewed.
45
Measure of Central Tendency - Vocabulary A value that represents a typical, or central, entry of a data set.
46
Mean - Measures of Central Tendency The sum of the data entries divided by the number of entries, more commonly known as the average. There are two different types of mean depending on the data… Population Mean Sample Mean
47
Formulas Population Mean: = xNxN The lower case Greek letter mu. The upper case Greek letter sigma. It indicates the sum of.
48
Formulas Sample Mean: Read as x bar. The upper case Greek letter sigma. It indicates the sum of. x = xnxn
49
Median - Measures of Central Tendency The middle data entry when the data set is sorted in ascending or descending order. If the data set has an even number of entries, the median is the mean of the two middle data entries.
50
Mode - Measures of Central Tendency The data entry that occurs with the greatest frequency. If no entry is repeated, the data set has no mode. If two entries occur with the same greatest frequency, each entry is a mode and the data set is called BIMODAL.
51
Outlier- Measures of Central Tendency A data entry that is far removed from the other entries in the data set.
52
Mean of a frequency distribution- Mean of Grouped Data x = (x∙f) n x represents the midpoints of the class f represents the frequency of the class.
53
Guidelines for Determining the Mean of a Frequency Distribution Find the midpoint of each class. Find the sum of the products of the midpoints and the frequencies. Find the sum of the frequencies. Find the mean of the frequency distribution.
54
Assignment: pg. 63:13,15,18,27,28,34-35
55
Weighted Mean- Weighted Mean The mean of the data whose entries have varying weights.
56
Weighted Mean x = (x∙w) w w represents the weight of each entry, x.
57
Symmetric Distribution- Shape of Distributions When a vertical line can be drawn through the middle of a graph of the distribution and the resulting halves are approximately mirror images. Example:
58
Uniform or Rectangular Distribution- Shape of Distributions When all entries, or classes, in the distribution have equal frequencies. Also Symmetric! Example:
59
Skewed Distribution- Shape of Distributions When the “tail” of the graph elongates more to one side than to the other. Example: Tail…
60
Also known as negatively skewed. Skewed-Left Distribution- Skewed Distributions When the “tail” of the graph elongates more to the left side. Example: Tail…
61
Also known as positively skewed. Skewed-Right Distribution- Skewed Distributions When the “tail” of the graph elongates more to the right side. Example: Tail…
62
FYI The mean will always fall in the direction the distribution is skewed. For example, when a distribution is skewed left, the mean is to the left of the median.
63
Assignment: pg. 63: 5-12,37,39,45
64
Measures of Variation What you should learn: How to find the range of a data set. How to find the variance and standard deviation of a population and a sample. How to use the Empirical Rule and Chebychev’s Theorem to interpret standard deviation. How to approximate the sample and standard deviation for grouped data.
65
Range - Vocabulary The difference between the maximum and minimum entries in the set. Range = maximum data entry – minimum data entry
66
Deviation - Vocabulary The difference between the entry and the mean, or x, of the data set. Deviation of x = x - µ or Deviation of x = x - x
67
FYI Notice that the sum of the deviations is zero. Therefore, it is pointless to find the average of the deviations. By squaring each deviation, we can make the data useful.
68
Population Variance - Vocabulary The mean of the squares of the deviations. The lower case Greek letter sigma.
69
Standard Deviation - Vocabulary A measure of the typical amount an entry deviates from the mean. The more the data entries are spread out, the greater the standard deviation.
70
Population Standard Deviation - Vocabulary The square root of the population variance.
71
Sample Variance - Vocabulary The mean of the squares of the deviations.
72
Sample Standard Deviation - Vocabulary The square root of the sample variance.
73
Assignment: pg. 78: 1,3-5,8,10-13,22
74
Bell Shaped Distribution
75
AKA 68-95-99.7 Rule Empirical Rule For data with a bell-shaped distribution, the standard deviation has the following characteristics. About 68% of the data lies within 1 standard deviation of the mean. About 95% of the data lies within 2 standard deviations of the mean. About 99.7% of the data lies within 3 standard deviations of the mean.
76
13.5% 34% 13.5% 34% 2.35% Empirical (68-95-99.7) Rule
77
13.5% 34% 13.5% 34% 2.35% Empirical Rule x x+ SD x+ 2SD x+ 3SD x- SD x- 2SD x- 3SD
78
Chebychev’s Theorem The portion of any data set lying within k standard deviations (k > 1) of the mean is at least 1 - 1 k 2
79
Example using Chebychev’s Theorem Chebychev’s Theorem If k = 2: 1 - 1 k 2 1 - 1 2 2 3 4 Interpretation: In any data set, at least ¾ or 75% of the data lies within 2 standard deviations of the mean.
80
Example using Chebychev’s Theorem Chebychev’s Theorem If k = 3: 1 - 1 k 2 1 - 1 3 2 8 9 Interpretation: In any data set, at least 8 / 9 or 88.9% of the data lies within 3 standard deviations of the mean.
81
Example using Chebychev’s Theorem Chebychev’s Theorem If k = 4: 1 - 1 k 2 1 - 1 4 2 15 16 Interpretation: In any data set, at least 15 / 16 or 93.8% of the data lies within 4 standard deviations of the mean.
82
Summary of Chebychev’s Theorem Chebychev’s Theorem At least 75% of the data fall in the interval from - 2 to + 2 At least 88.9% of the data fall in the interval from - 3 to + 3 At least 93.8% of the data fall in the interval from - 4 to + 4
83
34% Chebychev’s Theorem 75% 88.9% 93.8%
84
Chebychev’s Theorem vs. Empirical Rule Empirical Rule only works on BELL-SHAPED curves Chebychev’s Theorem works for ALL distribtuions.
85
Assignment: Worksheet on Empirical Rule & Chebychev’s Theorem
86
Standard Deviation for Grouped Data To use this formula, you need to create a frequency distribution.
87
Standard Deviation for Grouped Data This formula can also be used if the data is grouped into classes. You would then use the midpoint.
88
Assignment: pg. 81: 29-32
89
Measures of Position What you should learn: How to find the first, second, and third quartiles of a data set. How to find the interquartile range of a data set. How to represent a data set graphically using a box-and- whisker plot. How to interpret other fractiles such as percentiles.
90
Quartiles- Vocabulary Numbers that divide an ordered data set into four equal parts.
91
FYI There are 3 quartiles to a data set. About one quarter of the data falls on or below the first quartile (Q1). About one half of the data falls on or below the second quartile (Q2). About three quarters of the data falls on or below the third quartile (Q3).
92
Examples of Quartiles 3, 4, 4, 5, 6, 8, 8 2 nd quartile (Q2) AKA Median 1 st quartile (Q1) AKA Lower Quartile 3 rd quartile (Q3) AKA Upper Quartile
93
Guidelines for Determining the Quartiles of Data Arrange the data in order. Determine the median of the data. Split the data into two halves. The first quartile is the median of the lower half of data. The third quartile is the median of the upper half of data.
94
Interquartile Range (IQR) - Vocabulary The difference between the upper quartile and the lower quartile. IQR = Q3 – Q1
95
Box and Whisker Plots - Vocabulary An exploratory data analysis tool that highlights the important features of a data set. You need 5 key values: The minimum data entry The first quartile (Q1) The median (Q2) The third quartile (Q3) The maximum data entry
96
Components of Box and Whisker Plots
97
Example of Box and Whisker Plots
98
Percentiles- Vocabulary Numbers that divide an ordered data set into one hundred equal parts. Percentiles are used quite often in education and health-related fields. They are used to indicate how one individual compares with others in a group.
99
Percentiles It is important that you understand what a percentile means… If the weight of a 6-month old infant is at the 78 th percentile, it means that the infant’s weight is greater than 78% of all six-month-old infants. It does not mean that the infant weights 78% of some ideal weight.
101
Assignment: pg. 91: 13, 14, 21, 22
102
Review for Test: pg. 97: 1-3, 6-12,15,17- 18,22-26,28,30,32- 34,37-41,44
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.