Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 2 Graphs, Charts, and Tables – Describing Your Data Business Statistics Department of Quantitative Methods & Information Systems Dr. Mohammad Zainal.

Similar presentations


Presentation on theme: "Chapter 2 Graphs, Charts, and Tables – Describing Your Data Business Statistics Department of Quantitative Methods & Information Systems Dr. Mohammad Zainal."— Presentation transcript:

1 Chapter 2 Graphs, Charts, and Tables – Describing Your Data Business Statistics Department of Quantitative Methods & Information Systems Dr. Mohammad Zainal QMIS 120

2 Chapter Goals After completing this chapter, you should be able to: Construct a frequency distribution both manually and with a computer Construct and interpret a histogram Create and interpret bar charts, pie charts, and stem-and-leaf diagrams Present and interpret data in line charts and scatter diagrams QMIS 120, by Dr. M. Zainal Chap 2-2

3 Raw data Ages (in years) of 20 students selected from CBA are reported in the way they are collected. The data values are recorded in the following table. Ages of 20 Students 23242230192018241921 25241820192325222120 QMIS 120, by Dr. M. Zainal Chap 2-3

4 Raw data The same students were asked about their status. The responses of the sample are recorded in the following table Status of 20 Students SSJSFJFSFJ SSFJFJSJJF QMIS 120, by Dr. M. Zainal Chap 2-4

5 Frequency Distributions What is a Frequency Distribution? A frequency distribution is a list or a table … containing the values of a variable (or a set of ranges within which the data fall)... and the corresponding frequencies with which each value occurs (or frequencies with which data fall within each range) QMIS 120, by Dr. M. Zainal Chap 2-5

6 Frequency Distributions Weekly Earnings of 100 Employees of a company Number of employees f Weekly Earnings (dollars) 9401 to 600 22601 to 800 39801 to 1000 151001 to 1200 91201 to 1400 61401 to 1600 Variable First class Frequency Of first class Frequency Column Lower limit of the sixth class Upper limit of the sixth class QMIS 120, by Dr. M. Zainal Chap 2-6

7 Why Use Frequency Distributions? A frequency distribution is a way to summarize data The distribution condenses the raw data into a more useful form... and allows for a quick visual interpretation of the data QMIS 120, by Dr. M. Zainal Chap 2-7

8 Frequency Distribution: Discrete Data Discrete data: possible values are countable Example: An advertiser asks 200 customers how many days per week they read the daily newspaper. Row Data 5,6,1,2,4,5,7,2,3,5,1,3,2,5,0,2,2,0,7,7,1,2,4,3,5,6,7,1,1,1,1,2,5,0,0,0,1,20,7,5,3,6,2,1,6,2,1,4,2,4,5,3,1,0,2,3,6,5,7,4,1,2,3,5,6,1,0, 0,0,0,0,1,1,1,2,3,5,1,4…………………….. QMIS 120, by Dr. M. Zainal Chap 2-8

9 Frequency Distribution: Discrete Data Number of days read Frequency 044 124 218 316 420 522 626 730 Total200 It is called “Single-Value” approach QMIS 120, by Dr. M. Zainal Chap 2-9

10 Relative Frequency Relative Frequency: What proportion is in each category? Number of days read Frequency Relative Frequency 044.22 124.12 218.09 316.08 420.10 522.11 626.13 730.15 Total2001.00 22% of the people in the sample report that they read the newspaper 0 days per week QMIS 120, by Dr. M. Zainal Chap 2-10

11 Frequency Distribution: Discrete Data Example: Construct a frequency distribution table for the following data Home RunsTeamHome RunsTeam 139Milwaukee152Anaheim 167Minnesota165Arizona 162Montreal164Atlanta 160New York Mets165Baltimore 223New York Yankees177Boston 205Oakland200Chicago Cubs 165Philadelphia217Chicago White Sox 142Pittsburgh169Cincinnati 175St. Louis192Cleveland 136San Diego152Colorado 198San Francisco124Detroit 152Seattle146Florida 133Tampa Bay167Houston 230Texas140Kansas City 187Toronto155Los Angeles Home Runs Hit by Major League Baseball Teams During the 2002 Season QMIS 120, by Dr. M. Zainal Chap 2-11

12 Frequency Distribution: Discrete Data Number of classes: C = 1 + 3.3 log 30 = 5.87 ≈ 5 or 6 Class width: W = (230 – 124) / 5 = 21.2 ≈ 22 Starting point: 124 + 5 x 22 = 234 checks! Home Runs 124 – 145 146 – 167 168 – 189 190 – 211 212 – 233 Total f 6 13 4 4 3 30 QMIS 120, by Dr. M. Zainal Chap 2-12

13 Frequency Distribution: Continuous Data Continuous Data: may take on any value in some interval Example: A manufacturer of insulation randomly selects 20 winter days and records the daily high temperature 24, 35, 17, 21, 24, 37, 26, 46, 58, 30, 32, 13, 12, 38, 41, 43, 44, 27, 53, 27 Temperature is a continuous variable because it could be measured to any degree of precision desired QMIS 120, by Dr. M. Zainal Chap 2-13

14 Grouping Data by Classes Sort raw data from low to high: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 Find range: 58 - 12 = 46 Select number of classes: 5 (usually between 5 and 20) Compute class width: 10 (46/5 then round off) Determine class boundaries:10, 20, 30, 40, 50 (Sometimes class midpoints are reported: 15, 25, 35, 45, 55) Count the number of values in each class QMIS 120, by Dr. M. Zainal Chap 2-14

15 Frequency Distribution Example Data from low to high: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 Class Frequency 10 but under 20 3.15 20 but under 30 6.30 30 but under 40 5.25 40 but under 50 4.20 50 but under 60 2.10 Total 20 1.00 Relative Frequency Frequency Distribution QMIS 120, by Dr. M. Zainal Chap 2-15

16 Frequency Histograms The classes or intervals are shown on the horizontal axis frequency is measured on the vertical axis Bars of the appropriate heights can be used to represent the number of observations within each class Such a graph is called a histogram QMIS 120, by Dr. M. Zainal Chap 2-16

17 Class Midpoints Histogram Example Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 No gaps between bars, since continuous data 0 10 20 30 40 50 60 Class Endpoints Chap 2-17

18 Questions for Grouping Data into Classes 1.How wide should each interval be? (How many classes should be used?) 2.How should the endpoints of the intervals be determined? Often answered by trial and error, subject to user judgment The goal is to create a distribution that is neither too "jagged" nor too "blocky” Goal is to appropriately show the pattern of variation in the data QMIS 120, by Dr. M. Zainal Chap 2-18

19 How Many Class Intervals? Many (Narrow class intervals) may yield a very jagged distribution with gaps from empty classes Can give a poor indication of how frequency varies across classes Few (Wide class intervals) may compress variation too much and yield a blocky distribution can obscure important patterns of variation. (X axis labels are upper class endpoints) QMIS 120, by Dr. M. Zainal Chap 2-19 (X axis labels are upper class endpoints)

20 General Guidelines Number of Data Points Number of Classes under 50 5 - 7 50 – 100 6 - 10 100 – 250 7 - 12 over 250 10 - 20 Class widths can typically be reduced as the number of observations increases Distributions with numerous observations are more likely to be smooth and have gaps filled since data are plentiful QMIS 120, by Dr. M. Zainal Chap 2-20

21 Class Width The class width is the distance between the lowest possible value and the highest possible value for a frequency class The class width is Largest Value - Smallest Value Number of Classes W = QMIS 120, by Dr. M. Zainal Chap 2-21

22 Histograms in Excel Select “Data” Tab 1 Choose Histogram 3 2 Data Analysis QMIS 120, by Dr. M. Zainal Chap 2-22

23 Histograms in Excel (continued) 4 Input data and bin ranges Select Chart Output QMIS 120, by Dr. M. Zainal Chap 2-23

24 Ogives An Ogive is a graph of the cumulative relative frequencies from a relative frequency distribution Ogives are sometime shown in the same graph as a relative frequency histogram QMIS 120, by Dr. M. Zainal Chap 2-24

25 Ogives 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 Add a cumulative relative frequency column: Class Frequency 10 but under 20 3.15.15 20 but under 30 6.30.45 30 but under 40 5.25.70 40 but under 50 4.20.90 50 but under 60 2.10 1.00 Total 20 1.00 Relative Frequency Frequency Distribution (continued) Cumulative Relative Frequency QMIS 120, by Dr. M. Zainal Chap 2-25

26 Class Midpoints Ogive Example 100 80 60 40 20 0 Cumulative Frequency (%) / Ogive 0 10 20 30 40 50 60 Class Endpoints Chap 2-26

27 Excel will show the Ogive graphically if the “Cumulative Percentage” option is selected in the Histogram dialog box Ogives in Excel QMIS 120, by Dr. M. Zainal Chap 2-27

28 Other Graphical Presentation Tools Categorical Data Bar Chart Stem and Leaf Diagram Pie Charts Quantitative Data QMIS 120, by Dr. M. Zainal Chap 2-28

29 Bar and Pie Charts Bar charts and Pie charts are often used for qualitative (category) data Height of bar or size of pie slice shows the frequency or percentage for each category QMIS 120, by Dr. M. Zainal Chap 2-29

30 Bar Chart Example 1 (Note that bar charts can also be displayed with vertical bars) QMIS 120, by Dr. M. Zainal Chap 2-30

31 Bar Chart Example 2 Number of days read Frequency 044 124 218 316 420 522 626 730 Total200 QMIS 120, by Dr. M. Zainal Chap 2-31

32 Pie Chart Example Percentages are rounded to the nearest percent Current Investment Portfolio Savings 15% CD 14% Bonds 29% Stocks 42% Investment Amount Percentage Type (in thousands $) Stocks 46.5 42.27 Bonds 32.0 29.09 CD 15.5 14.09 Savings 16.0 14.55 Total 110 100 (Variables are Qualitative) QMIS 120, by Dr. M. Zainal Chap 2-32

33 Tabulating and Graphing Multivariate Categorical Data Investment in thousands of dollars Investment Investor A Investor B Investor C Total Category Stocks 46.5 55 27.5 129 Bonds 32.0 44 19.0 95 CD 15.5 20 13.5 49 Savings 16.0 28 7.0 51 Total 110.0 147 67.0 324 QMIS 120, by Dr. M. Zainal Chap 2-33

34 Tabulating and Graphing Multivariate Categorical Data Side by side charts (continued) QMIS 120, by Dr. M. Zainal Chap 2-34

35 Side-by-Side Chart Example Sales by quarter for three sales territories: QMIS 120, by Dr. M. Zainal Chap 2-35

36 Dot Plot A One of the simplest methods for graphing and understanding quantitative data is to create a dot plot. A horizontal axis shows the range of values for the observations. Each data point is represented by a dot placed above the axis. QMIS 120, by Dr. M. Zainal Chap 2-36

37 Dot Plot Dot plots can help us detect outliers (also called extreme values) in a data set. Outliers are the values that are extremely large or extremely small with respect to the rest of the data values. QMIS 120, by Dr. M. Zainal Chap 2-37

38 Dot Plot Example : The following table lists the number of runs batted in (RBIs) during the 2004 Major League Baseball playoffs by members of the Boston Red Sox team with at least one at-bat. Create a dot plot for these data. QMIS 120, by Dr. M. Zainal Chap 2-38

39 Dot Plot Step 1. First we draw a horizontal line that includes the minimum and the maximum values in this data set. Step 2. Place a dot above the value on the numbers line that represents each RBI listed in the table Outlier QMIS 120, by Dr. M. Zainal Chap 2-39

40 Stem and Leaf Diagram Another simple way to see distribution details from qualitative data METHOD 1. Separate the sorted data series into leading digits (the stem) and the trailing digits (the leaves) 2. List all stems in a column from low to high 3. For each stem, list all associated leaves QMIS 120, by Dr. M. Zainal Chap 2-40

41 Example: Here, use the 10’s digit for the stem unit: Data sorted from low to high: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 12 is shown as 35 is shown as Stem Leaf 1 2 3 5 QMIS 120, by Dr. M. Zainal Chap 2-41

42 Example: Completed Stem-and-leaf diagram: Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 28, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 StemLeaves 12 3 7 21 4 4 6 7 8 30 2 5 7 8 41 3 4 6 53 8 QMIS 120, by Dr. M. Zainal Chap 2-42

43 Using other stem units Using the 100’s digit as the stem: Round off the 10’s digit to form the leaves 613 would become 6 1 776 would become 7 8... 1224 becomes 12 2 Stem Leaf QMIS 120, by Dr. M. Zainal Chap 2-43

44 Line charts show values of one variable vs. time Time is traditionally shown on the horizontal axis Scatter Diagrams show points for bivariate data one variable is measured on the vertical axis and the other variable is measured on the horizontal axis A trend line is a line that provides an approximation of that relationship. Line Charts and Scatter Diagrams QMIS 120, by Dr. M. Zainal Chap 2-44

45 Line Chart Example Year Inflation Rate 19853.56 19861.86 19873.65 19884.14 19894.82 19905.40 19914.21 19923.01 19932.99 19942.56 19952.83 19962.95 19972.29 19981.56 19992.21 20003.36 20012.85 20021.59 20032.27 20042.68 20053.39 2006 3.24 QMIS 120, by Dr. M. Zainal Chap 2-45

46 Scatter Diagram Example Volume per day Cost per day 23125 26140 29146 33160 38167 42170 50188 55195 60200 Chap 2-46

47 Types of Relationships Linear Relationships QMIS 120, by Dr. M. Zainal Chap 2-47

48 Curvilinear Relationships Types of Relationships (continued) QMIS 120, by Dr. M. Zainal Chap 2-48

49 No Relationship Types of Relationships (continued) QMIS 120, by Dr. M. Zainal Chap 2-49

50 Cross-tabulation Example: Draw a scatter diagram for the following data which lists the total amount spent in KD by costumers in a restaurant. x (person)1122334455 y (KD)871418202221262933 QMIS 120, by Dr. M. Zainal Chap 2-50

51 Cross-tabulation A cross tab is a tabular summary of data of two variables. They are usually presented in a matrix format. Not like a frequency distribution (one variable). A contingency table describes the distribution of two or more variables simultaneously. Each cell shows the number of respondents that gave a specific combination of responses It can be used with any level of data (What are they?) QMIS 120, by Dr. M. Zainal Chap 2-51

52 Cross-tabulation example Example: In a survey of the quality rating and the meal price conducted by a consumer restaurant review agency, the following table was produced: RestaurantQuality ratingMeal Price 1Good18 2Very Good22 3Good28 4Excellent38 5Very Good33 6Good28... QMIS 120, by Dr. M. Zainal Chap 2-52

53 Cross-tabulation example Quality rating is a qualitative variable with the rating categories of good, very good and excellent Meal price Quality rating10 - 1920 - 2930 - 3940 - 49Total Good42402084 Very Good3464466150 Excellent214282266 Total781187628300 QMIS 120, by Dr. M. Zainal Chap 2-53

54 Cross-tabulation example Also, we can find the row percentage Meal price Quality rating10 - 1920 - 2930 - 3940 - 49Total Good50.047.62.40.0100 Very Good22.742.730.64.0100 Excellent3.021.242.433.4100 QMIS 120, by Dr. M. Zainal Chap 2-54

55 Cross-tabulation example Dividing the totals in the right margin of the cross tab by the grand total provides relative and percentage frequency distribution for the quality rating variable. Quality ratingRelative frequencyPercentage frequency Good0.2828 Very Good0.5050 Excellent0.2222 Total1.00100 QMIS 120, by Dr. M. Zainal Chap 2-55

56 Cross-tabulation example Try it for the meal price (column totals) Meal price Quality rating10 - 1920 - 2930 - 3940 - 49Total Good50.047.62.40.0100 Very Good22.742.730.64.0100 Excellent3.021.242.433.4100 QMIS 120, by Dr. M. Zainal Chap 2-56

57 Cross-tabulation example Example: The following data are for 30 observations involving two qualitative variables x (A, B and C) and y (1 and 2). Obs.xy xy xy 1A111A121C2 2B112B122B1 3B113C223C2 4C214C224A1 5B115C225B1 6C216B226C2 7B117C127C2 8C218B128A1 9A119C129B1 10B120B130B2 1- Construct a cross tabulation for the data 2- Calculate the row percentages QMIS 120, by Dr. M. Zainal Chap 2-57

58 Cross-tabulation example y x12Total A505 B11213 C21012 Total181230 y x12Total A1000 B84.6215.38100 C16.6783.33100 QMIS 120, by Dr. M. Zainal Chap 2-58

59 Cross-tabulation example XRelative frequency Percentage frequency A5/30=.16716.7 B13/30=.43343.3 C12/30=.40040.0 Total1.00100 YRelative frequency Percentage frequency 118/30=.6060 212/30=.4040 Total1.00100 QMIS 120, by Dr. M. Zainal Chap 2-59

60 Chapter Summary Data in raw form are usually not easy to use for decision making -- Some type of organization is needed:  Table  Graph Techniques reviewed in this chapter: Frequency Distributions, Histograms, and Ogives Bar Charts and Pie Charts Stem and Leaf Diagrams Line Charts and Scatter Diagrams QMIS 120, by Dr. M. Zainal Chap 2-60

61 Copyright The materials of this presentation were mostly taken from the PowerPoint files accompanied Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. QMIS 120, by Dr. M. Zainal Chap 2-61


Download ppt "Chapter 2 Graphs, Charts, and Tables – Describing Your Data Business Statistics Department of Quantitative Methods & Information Systems Dr. Mohammad Zainal."

Similar presentations


Ads by Google