Presentation is loading. Please wait.

Presentation is loading. Please wait.

FREQUENCY DISTRIBUTIONS Twenty five medtech students were given a blood test to determine their blood type. The data set is: ABBABO OOB B BBOAO AOOO AOBA.

Similar presentations


Presentation on theme: "FREQUENCY DISTRIBUTIONS Twenty five medtech students were given a blood test to determine their blood type. The data set is: ABBABO OOB B BBOAO AOOO AOBA."— Presentation transcript:

1 FREQUENCY DISTRIBUTIONS Twenty five medtech students were given a blood test to determine their blood type. The data set is: ABBABO OOB B BBOAO AOOO AOBA As we can see, this is a little messy, so we decide to organize this into a frequency distribution:

2 ClassFrequency A FREQUENCY DISTRIBUTION is a table that shows the partition of data into classes or intervals and how many data values are in each class. Essentially, a frequency distribution must have at least two columns: one for the classes or data groupings, and another for the frequency or the no. of data values belonging to the respective class. Additional columns can be included when necessary or helpful.

3 Example: (Categorical frequency distribution) Twenty five medtech students were given a blood test to determine their blood type. The data set is: ABBABO OOB B BBOAO AOOO AOBA What is the variable in this study?Blood type of a student. What kind of variable is this?Qualitative. How do we construct a frequency dist. for this data set?

4 (1) Draw a frequency distribution table, including a column for tallies (which is helpful in counting the frequencies). ClassTalliesFrequency (2) Identify the classes. Note that, for qualitative variables, a data value is a category by itself. Therefore, the classes are the data values: A, B, AB, O A B AB O (3) Tally the data: one tick for a class if a particular data value belongs to it; and count the respective frequencies. ||||| ||||| || ||||| |||| |||| 5 7 9 4 This is called a categorical frequency distribution because the variable we are tabulating is categorical (qualitative).

5 (4) Another column for the relative frequencies of the classes is optional, but sometimes necessary. ClassTalliesFrequency A|||||5 B||||| ||7 AB||||| ||||9 O||||4 Rel. freq. The relative frequency of a class is the percentage of the population which the class occupies. 0.20 0.28 0.36 0.16

6 Example: (Grouped frequency distribution) A survey of the 50 wealthiest people by the Forbes Magazine yields the following data on the ages (in years, as of 2012) of these billionaires: 49573873817459766569 54566968786585496961 48816837437882436467 52568177798540855980 60715761696183908774 What is the variable in this study?Age What kind of variable is this?Quantitative How do we construct the frequency table for this data set?

7 (1) Draw a frequency distribution table, including a column for tallies (which is helpful in counting the frequencies). ClassTalliesFrequency The data in this example is quantitative (numerical), so the classes must be intervals. (ex.: 34 – 40, 41 – 47, etc.)

8 For the class labelled n 1 - n 2, n 1 is called the lower class limit, which is the lowest data value that can be tallied in the class n 1 - n 2 ; and n 2 is called the upper class limit, which is the highest data value that can be tallied in the class n 1 - n 2 Also, there must always be 5 to 20 interval classes for a better display of the distribution of data. As a rule, both class limits must have the same number of decimal places (no. of digits after the decimal point) as the data. For example, if the data are: 6.2, 12.8. 10.5, … (one decimal place) the interval classes must be like: 5.3 – 8.3, 8.4 – 11.4 … As a rule, both class limits must have the same number of decimal places (no. of digits after the decimal point) as the data. For example, if the data are: 6.2, 12.8. 10.5, … (one decimal place) the interval classes must be like: 5.3 – 8.3, 8.4 – 11.4 … Always take note of the no. of decimal places in the data!

9 (2) Compute the range of the data. In the example, range = 90 – 37= 53 (3) Choose the desired number of classes (5 – 20) and compute the class width The class width is the “gap” between any one lower class limit to the next lower class limit.

10 Let us choose to have 8 classes. Then the class width is: 53 / 8= 6.625 The class width must also have the same number of decimal places as the data. If the computed class width has a long decimal portion, keep the number of decimal places as that of the data (removing the other decimal places) and add 1 to the last digit. Since our data have no (0) decimal places, the class width must also have no decimal places. So, class width = 6.625  6 (keeping 0 dec. places, removing others)  7 (adding 1 to the last digit ‘6’) So, class width = 7

11 (4) For the first lower class limit, select any number close to the lowest data value, and add the class width the same number of times as the number of classes. These will be the lower limits of the classes. In the example, the smallest data value is 38. We choose the first lower class limit to be 35. Then we add the class width (7) to the first lower class limit (35), until we get 8 classes. ClassTalliesFrequency 35 - 42 - 49 - 56 - 63 - 70 - 77 - 84 -

12 (5) For the corresponding upper class limit, we simply subtract 1 from last digit of the next lower class limit. ClassTalliesFrequency 35 42 49 56 63 70 77 84 - 41 - 48 - 55 - 62 - 69 - 76 - 83 The class limits must be mutually exclusive (i.e., they don’t overlap), so that each data value will belong to exactly one class only. Also, note that the classes are equal in width. - 90

13 (6) Since the classes are intervals, we must insert a second column for the so-called class boundaries. ClassBoundariesTalliesFrequency 35 - 41 42 – 48 49 – 55 56 – 62 63 – 69 70 – 76 77 - 83 84 - 90 As we can see, the classes have a gap of 1 unit (35-41, 42-48). The class boundaries simply connect these classes when we construct the graph of the frequency distribution.

14 The class boundaries are also intervals (n 1 - n 2 ), like the class limits, having one more decimal place than that of the data, connecting the classes halfway between adjacent class limits. The ± 0.0..5 depends on the no. of decimal places in the data. Class boundaries must have one more decimal place, right? If the data has no decimal places, use ± 0.5 If the data has 1 decimal place, use ± 0.05 If the data has 2 decimal places, use ± 0.005

15 For the class 35 – 41, (no decimal places) lower class boundary = 35 – 0.5 = 34.5 upper class boundary = 41 + 0.5 = 41.5 So the corresponding class boundaries is 34.5 – 41.5 ClassBoundariesTalliesFrequency 35 - 4134.5 – 41.5 42 – 48 49 – 55 56 – 62 63 – 69 70 – 76 77 - 83 84 - 90 Computing the class boundaries for the other classes 41.5 – 48.5 48.5 – 55.5 55.5 – 62.5 62.5 – 69.5 69.5 – 76.5 76.5 – 83.5 83.5 – 90.5

16 (7) Tally the data: one tick for a class if a particular data value belongs to it; and count the respective frequencies. Class limits Class Boundaries TallyFrequency 35 – 4134.5 – 41.5|||3 42 – 4841.5 – 48.5|||3 49 – 5548.5 – 55.5||||4 56 – 6255.5 – 62.5||||| 10 63 – 6962.5 – 69.5||||| 10 70 – 7669.5 – 76.5|||||5 77 – 8376.5 – 83.5||||| 10 84 – 9083.5 – 90.5|||||5 This is called a grouped frequency distribution because the classes are not categories but groups or intervals.

17 Example: (Grouped frequency distribution) In a study of one-way commuting distance of FEU students, a random sample of 60 students gives the ff. data (in kms): 13.247.810.53.716.420.117.940.34.52.8 725.3821.419.615.13.217.814.26.3 12.245.81.48.24.116.711.218.523.212.4 62.515.213715.646.212.59.318.7 34.213.541.628.13617.22427.629.59.2 14.626.110.6243731.28.216.812.216 Make a frequency distribution with 6 classes and initial lower class limit = 1.0

18 First, take note that the data has 1 decimal place! (1) Compute the range. range = 47.8 – 1.4 = 46.4 (2) Compute the class width. class width = 46.4/6 = 7.73  7.7 (keeping 1 dec. place, removing others)  7.8 (adding 1 to the last digit ‘7’.) (3) Add the class width (7.8) to the initial lower class limit (1.0) until we get the no. of classes (6). 1.0 1.0 + 7.8 = 8.8 8.8 + 7.8 = 16.6 16.6 + 7.8 = 24.4 24.4 + 7.8 = 32.2 32.2 + 7.8 = 40.0 These numbers will be the lower class limits of the 6 classes.

19 ClassBoundariesTalliesFrequency 1.0 – 8.8 – 16.6 – 24.4 – 32.2 – 40.0 – (4) For the corresponding upper class limit, we simply subtract 1 from last digit of the next lower class limit. 8.7 16.5 24.3 32.1 39.9 47.7 (5) Compute the class boundaries for each class. For the class 1.0 – 8.7, (1 decimal place) lower class boundary = 1 – 0.05 = 0.95 upper class boundary = 8.7 + 0.05 = 8.75 So the corresponding class boundaries is 0.95 – 8.75 0.95 – 8.75 8.75 – 16.55 16.55 – 24.35 24.35 – 32.15 32.15 – 39.95 39.95 – 47.85

20 (6) Tally the data: one tick for a class if a particular data value belongs to it; and count the respective frequencies. ClassBoundariesTalliesFrequency 1.0 – 8.70.95 – 8.7514 8.8 – 16.5 8.75 – 16.5519 16.6 – 24.316.55 – 24.3513 24.4 – 32.124.35 – 32.156 32.2 – 39.932.15 – 39.953 40.0 – 47.839.95 – 47.855


Download ppt "FREQUENCY DISTRIBUTIONS Twenty five medtech students were given a blood test to determine their blood type. The data set is: ABBABO OOB B BBOAO AOOO AOBA."

Similar presentations


Ads by Google