Download presentation
Presentation is loading. Please wait.
1
Presenting Data in Tables & Charts Organizing Numerical Data
2
Data with 20 or more observations should be organized
3
The Ordered Array: arranges raw data in order from the smallest observation to the largest observation.
4
Raw Data Arranged in an Ordered Array
5
The Ordered Array makes it easy to identify:
6
extreme values typical values range where the majority of values are concentrated
7
Stem and Leaf Display: shows where raw data clusters over a range of observations.
8
EXAMPLE : the following data represent the weekly salary checks earned by a sample of eight secretaries: $555 $490 $648 $832 $710 $590 $576 $623
9
First, put the values in ascending order and then use the 100s column as the stems, use the 10s column as the leaves, and either ignore the units column or round the units column and then use the 10s column as the leaves.
10
$555 $490 $648 $832 $710 $590 $576 $623 4 | 9 5 | 579 6 | 24 7 | 1 8 | 3
11
To further illustrate, how we can organize data to present, analyze and interpret findings,
12
we will study data from a previous QBA questionnaire: 1) USD students’ auto costs 2) USD students’ maximum auto speeds
13
Raw Data from student questionnaire (partial)
15
Stem & Leaf Auto Costs
16
Stem & Leaf MPH
17
And just for fun, let’s look at GPA GPA Stem unit: = 1 24 5 5 6 8 8 9 30 0 0 0 1 1 2 2 3 3 3 3 3 4 4 4 5 5 6 7 8 8 9 40
18
How Else Can We Organize our Data?
19
Numerical Data Frequency Distribution Relative Frequency Distribution Percentage Frequency Distribution Cumulative Frequency Distribution
20
Frequency Distribution
21
Frequency Distribution for Numerical Data (5. Auto Cost ($)) 0FrequencyPercentage 10000724.14% 200001034.48% 30000827.59% 4000000.00% 5000026.90% 6000013.45% 7000013.45%
22
Selecting the Number of Classes There is no “correct” number of classes (K) to use in a frequency distribution. However, the frequency distribution should have at least 5 classes, but no more than 20
23
Caution! If you have too “FEW” classes (K), a large portion of your data, lies in one class. However, if there are a number of empty classes, or too many classes with a frequency of 1 or 2, this may indicate too “MANY” classes (K).
24
Approximate Number of Classes in Frequency Distribution # Observations# Classes Less than 505 – 7 50 – 2007 – 9 200 – 5009 - 10 500 – 1,00010– 11 1,000 – 5,00011- 13 5,000 – 50,00013 – 17 More than 50,00017 - 20
25
What do you gain by organizing your data in a Frequency Distribution?
26
Hint! From pages of raw data
27
Answer Reduce large numbers of data points to a workable number of classes and frequencies. Study the frequency distribution and learn a great deal about the shape of the data set.
28
Raw Data from student questionnaire (partial)
29
Frequency Distribution
30
Frequency Distribution for Numerical Data (5. Auto Cost ($)) 0FrequencyPercentage 10000724.14% 200001034.48% 30000827.59% 4000000.00% 5000026.90% 6000013.45% 7000013.45%
31
Constructing a Frequency Distribution Gather the sample data Arrange data in an Ordered Array Select the number of classes to be used Determine class width: range/ # of classes Determine the class limits for each class so that the distribution is easy to interpret Count the number of data values in each class (the raw frequencies) Determine the Relative Frequencies
32
Relative Frequency = Raw frequency count in each class -------------------------------------- Total number of observations (n)
33
Relative Frequency is essential for comparing the relationship between two data sets.
34
To Convert Relative Frequency to Percent Frequency: Multiply Relative Frequency X 100
35
Example 15. A doctor's office staff has studied the waiting times for patients who arrive at the office with a request for emergency service. The following data were collected over a one-month period (the waiting times are in minutes). 2 5 10 12 4 4 5 17 11 8 9 8 12 21 6 8 7 13 18 3 Use classes of 0 - 4, 5- 9, and so on. a. Show the frequency distribution. b. Show the relative frequency distribution. c. Show the cumulative frequency distribution. d. Show the relative cumulative frequency distribution.
36
How Else Can We Organize our Data?
37
Graphic Techniques to Describe Numerical Data 1) Histogram (continuous data) 2) Polygon 3) Ogive 4) Scattergram
38
Histogram Uni-modal Bi-modal Skewed: i) right or positively skewed ii) left or negatively skewed
39
Histogram Auto Costs
40
Histogram MPH
41
Negative or Left Skewed
42
Positive or Right Skewed
43
Quiz Would incomes of employees in large firms tend to be positively or negatively skewed? Why?
44
Quiz Do exam grades tend to be positively or negatively skewed? Why?
45
A Scatter Diagram Graphs bivariate data to examine whether a relationship exists between two numerical variables.
46
Is there a relationship between the price of their auto and the maximum MPH a USD student has driven?
49
Is there a relationship between the number of alcoholic beverages consumed per week and the number of hours studied per week?
54
Alch bev / wk (#)mph 070 080 490 2195 8100 4 15100 1 7 0 2 6105 12105 3108 50120 0 2 0130 10130 2135
57
Tables & Charts for Categorical Data 1)Summary Table: similar to Frequency Distribution. 2)Contingency Table for Crosstabulation of Bivariate Categorical Data. 3)Bar Chart: graphical representation of frequency of occurrence. 4)Pie Chart: graphical emphasis of proportion 5)Pareto Diagram 6)Side-by-Side Bar Charts: for bivariate categorical data.
58
Summary Table
59
Soft drinkFreq.Relative Freq Coke Classic19.38 or 38% Diet Coke 8.16 or 16% Dr. Pepper 5.10 or 10% Pepsi-Cola 13.26 or 26% Sprite 5.10 or 10% Total 501.00 or 100%
60
Contingency Table for Crosstabulation of Bivariate Categorical Data
61
Gender vs. Number of Alcoholic Drinks per Week
62
Contingency Table for Gender vs. Auto Costs
63
Contingency Table of Gender vs. MPH Cross Tabulation of Categorical DataCross Tabulation of Categorical Data 29. The following data are for 30 observations on two qualitative variables, X, and Y. The categories for x are A, B, and C; the categories for y are 1 and 2. Cross Tabulation of Categorical DataCross Tabulation of Categorical Data Cross Tabulation of Categorical DataCross Tabulation of Categorical Data
64
Contingency Table of Live on/off Campus by Gender
66
The following data are for 30 observations on two qualitative variables, X and Y. The categories for X are A, B, and C; the categories for Y are 1 and 2.
67
Categories for X are A,B, and C. The categories for Y are 1 and 2.
68
Side-by-side Bar Charts
69
Pareto Diagram Separates the “vital few” from the “trivial many”.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.