Presentation is loading. Please wait.

Presentation is loading. Please wait.

McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.

Similar presentations


Presentation on theme: "McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics."— Presentation transcript:

1 McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics in Business & Economics, 4 th edition David P. Doane and Lori E. Seward Prepared by Lloyd R. Jaisingh

2 3-2 Describing Data Visually Chapter Contents 3.1 Stem-and-Leaf Displays and Dot Plots 3.2 Frequency Distributions and Histograms 3.3 Excel Charts 3.4 Line Charts 3.5 Bar Charts 3.6 Pie Charts 3.7 Scatter Plots 3.8 Tables 3.9 Deceptive Graphs Chapter 3

3 3-3 Chapter Learning Objectives LO3-1: Make a stem-and-leaf or dot plot by hand or by computer. LO3-2: Create a frequency distribution for a data set. LO3-3: Make a histogram with appropriate bins. LO3-4: Identify skewness, modal classes, and outliers in a histogram. LO3-5: Make an effective line chart using Excel. Chapter 3 Describing Data Visually

4 3-4 Chapter Learning Objectives LO3-6: Know the rules for effective bar charts and pie charts. LO3-7: Make and interpret a scatter plot using Excel. LO3-8: Make simple tables and pivot tables. LO3-9: Recognize deceptive graphing techniques. Chapter 3 Describing Data Visually

5 3-5 Methods of organizing, exploring and summarizing data include:Methods of organizing, exploring and summarizing data include: Visual - Visual (charts and graphs) provides insight into characteristics of a data set without using mathematics. Numerical - Numerical (statistics or tables) provides insight into characteristics of a data set using mathematics. Chapter 3 3.1 Stem-and-Leaf Displays and Dot Plots

6 3-6 Begin with univariate data (a set of n observations on one variable) and consider the following:Begin with univariate data (a set of n observations on one variable) and consider the following: Chapter 3 3.1 Stem-and-Leaf Displays and Dot Plots

7 3-7 Measurement Measurement Look at the data and visualize how they were collected and measured. Sorting (Example: Price/Earnings Ratios) Sorting (Example: Price/Earnings Ratios) Sort the data and then summarize in a graphical display. Here are the sorted P/E ratios (values from Table 3.2). Chapter 3 3.1 Stem-and-Leaf Displays and Dot Plots

8 3-8 The type of graph you use to display your data is dependent on the type of data you have. Some charts are better suited for quantitative data, while others are better for displaying categorical data. One simple way to visualize small data sets is a stem-and-leaf plot. The stem-and-leaf plot is a tool of exploratory data analysis (EDA) that seeks to reveal essential data features in an intuitive way. A stem- and-leaf plot is basically a frequency tally, except that we use digits instead of tally marks. For two-digit or three-digit integer data, the stem is the tens digit of the data, and the leaf is the ones digit. Chapter 3 3.1 Stem-and-leaf Displays and Dot Plots Stem-and-Leaf Plot LO3-1: Make a stem-and-leaf or dot plot by hand or by computer. LO3-1

9 3-9 For the 44 P/E ratios, the stem-and-leaf plot is given below. Chapter 3 3.1 Stem-and-Leaf Displays and Dot Plots For example, the data values in the fourth stem are 31, 37, 37, 38. We always use equally spaced stems (even if some stems are empty). The stem-and-leaf can reveal central tendency (24 of the 44 P/E ratios were in the 10–19 stem) as well as dispersion (the range is from 7 to 59). In this illustration, the leaf digits have been sorted, although this is not necessary. The stem-and-leaf has the advantage that we can retrieve the raw data by concatenating a stem digit with each of its leaf digits. For example, the last stem has data values 50 and 59. LO3-1

10 3-10 Steps in Making a Dot PlotSteps in Making a Dot Plot A dot plot is the simplest graphical display of n individual values of numerical data. - Easy to understand. - It reveals dispersion, central tendency, and the shape of the distribution.A dot plot is the simplest graphical display of n individual values of numerical data. - Easy to understand. - It reveals dispersion, central tendency, and the shape of the distribution. 1. Make a scale that covers the data range. 2. Mark the axes and label them. 3Plot each data value as a dot above the scale at its approximate location. 3. Plot each data value as a dot above the scale at its approximate location. Note: If more than one data value lies at about the same axis location, the dots are stacked vertically. Chapter 3 Dot Plots Dot Plots LO3-1 3.1 Stem-and-Leaf Displays and Dot Plots

11 3-11 The range is from 7 to 59. All but a few data values lie between 10 and 25. A typical “middle” data value would be around 17 or 18. The data are not symmetric due to a few large P/E ratios. Chapter 3LO3-1 3.1 Stem-and-Leaf Displays and Dot Plots

12 3-12 Comparing Groups Comparing Groups stacked dot plotA stacked dot plot compares two or more groups using a common X-axis scale. Chapter 3 3.1 Stem-and-Leaf Displays and Dot Plots LO3-1

13 3-13 Bins and Bin Limits frequency distributionA frequency distribution is a table formed by classifying n data values into k classes (bins). Bin limitsBin limits define the values to be included in each bin. Widths must all be the same except when we have open-ended bins. FrequenciesFrequencies are the number of observations within each bin. relative frequencies percentagesExpress as relative frequencies (frequency divided by the total) or percentages (relative frequency times 100). Chapter 3 3.2 Frequency Distributions and Histograms LO3-2 LO3-2: Create a frequency distribution for a data set

14 3-14 - Herbert Sturges proposed the following rule: Constructing a Frequency Distribution Constructing a Frequency Distribution Chapter 3 3.2 Frequency Distributions and Histograms LO3-2

15 3-15 Chapter 3LO3-2 3.2 Frequency Distributions and Histograms

16 3-16 Histograms Histograms A histogram is a graphical representation of a frequency distributionA histogram is a graphical representation of a frequency distribution. A histogram is a bar chart.A histogram is a bar chart. Y-axis shows frequency within each bin Y-axis shows frequency within each bin. X-axis ticks shows end points of each bin. X-axis ticks shows end points of each bin. Chapter 3LO3-2 3.2 Frequency Distributions and Histograms

17 3-17 Consider 3 histograms for the P/E ratio data with different bin widths. What do they tell you? Chapter 3 3.2 Frequency Distributions and Histograms LO3-3 LO3-3: Make a histogram with appropriate bins.

18 3-18 Choosing the number of bins and bin limits in creating histograms requires judgment. One can use software programs to create histograms with different bins. These include software such as:One can use software programs to create histograms with different bins. These include software such as: ExcelExcel MegaStatMegaStat MinitabMinitab Chapter 3 LO3-3: Make a histogram with appropriate bins. 3.2 Frequency Distributions and Histograms LO3-3

19 3-19 Modal Class Modal Class A histogram bar that is higher than those on either side.A histogram bar that is higher than those on either side. Unimodal – a single modal class.Unimodal – a single modal class. Bimodal – two modal classes.Bimodal – two modal classes. Multimodal – more than two modal classes.Multimodal – more than two modal classes. Modal classes may be artifacts of the way bin limits are chosen.Modal classes may be artifacts of the way bin limits are chosen. Chapter 3 3.2 Frequency Distributions and Histograms LO3-3

20 3-20 Shape A histogram may suggest the shape of the population.A histogram may suggest the shape of the population. Skewness – indicated by the direction of the longer tail of the histogram.Skewness – indicated by the direction of the longer tail of the histogram. It is influenced by the number of bins and bin limits.It is influenced by the number of bins and bin limits. Left-skewed – (negatively skewed) a longer left tail. Right-skewed – (positively skewed) a longer right tail. Symmetric – both tail areas are the same. Chapter 3 LO3-4: Identify skewness, modes, and outliers in a histogram. LO3-4 3.2 Frequency Distributions and Histograms

21 3-21 Chapter 3 3.2 Frequency Distributions and Histograms LO3-4

22 3-22 Chapter 3 Frequency Polygons and Ogives A frequency polygon is a line graph that connects the midpoints of the histogram intervals, plus extra intervals at the beginning and end so that the line will touch the X-axis. A frequency polygon is a line graph that connects the midpoints of the histogram intervals, plus extra intervals at the beginning and end so that the line will touch the X-axis. It serves the same purpose as a histogram, but is attractive when you need to compare two data sets (since more than one frequency polygon can be plotted on the same scale). It serves the same purpose as a histogram, but is attractive when you need to compare two data sets (since more than one frequency polygon can be plotted on the same scale). An ogive (pronounced “oh-jive”) is a line graph of the cumulative frequencies. An ogive (pronounced “oh-jive”) is a line graph of the cumulative frequencies. It is useful for finding percentiles or in comparing the shape of the sample with a known benchmark such as the normal distribution (that you will be seeing in the next chapter). It is useful for finding percentiles or in comparing the shape of the sample with a known benchmark such as the normal distribution (that you will be seeing in the next chapter). 3.2 Frequency Distributions and Histograms

23 3-23 Chapter 3 3.2 Frequency Distributions and Histograms Frequency Polygons and Ogives

24 3-24 Chapter 3 3.3 Excel Charts This section describes how to use Excel to create charts. Please refer to the text.

25 3-25 Used to display a time series or spot trends, or to compare time periods.Used to display a time series or spot trends, or to compare time periods. Can display several variables at once.Can display several variables at once. Simple Line Charts Simple Line Charts Chapter 3 LO3-5: Make an effective line chart using Excel. 3.4 Line Charts LO3-5

26 3-26 Two-scale line chart – used to compare variables that differ in magnitude or are measured in different units.Two-scale line chart – used to compare variables that differ in magnitude or are measured in different units. Simple Line Charts Simple Line Charts Chapter 3 3.4 Line Charts LO3-5

27 3-27 Log Scales Log Scales Arithmetic scale – distances on the Y-axis are proportional to the magnitude of the variable being displayed.Arithmetic scale – distances on the Y-axis are proportional to the magnitude of the variable being displayed. Logarithmic scale – (ratio scale) equal distances represent equal ratios.Logarithmic scale – (ratio scale) equal distances represent equal ratios. Use a log scale for the vertical axis when data vary over a wide range, say, by more than an order of magnitude.Use a log scale for the vertical axis when data vary over a wide range, say, by more than an order of magnitude. This will reveal more detail for smaller data values.This will reveal more detail for smaller data values. Chapter 3LO3-5 3.4 Line Charts

28 3-28 Log Scales Log Scales A log scale is useful for time series data that might be expected to grow at a compound annual percentage rate (e.g., GDP, the national debt, or your future income). It reveals whether the quantity is growing at an increasing percent (concave upward), constant percent (straight line), or declining percent (concave downward) Chapter 3 3.4 Line Charts LO3-5

29 3-29 Most common way to display attribute data. - Bars represent categories or attributes. - Lengths of bars represent frequencie s. Simple Bar Charts Simple Bar Charts Chapter 3 3.5 Bar Charts LO3-6: Know the rules for effective bar charts and pie charts. LO3-6

30 3-30 Special type of bar chart used in quality management to display the frequency of defects or errors of different types.Special type of bar chart used in quality management to display the frequency of defects or errors of different types. Categories are displayed in descending order of frequency.Categories are displayed in descending order of frequency. Focus on significant few (i.e., few categories that account for most defects or errors).Focus on significant few (i.e., few categories that account for most defects or errors). Pareto Charts Pareto Charts Chapter 3 3.5 Bar Charts LO3-6

31 3-31 Bar height is the sum of several subtotals. Areas may be compared by color to show patterns in the subgroups and total.Bar height is the sum of several subtotals. Areas may be compared by color to show patterns in the subgroups and total. Stacked Bar Chart Stacked Bar Chart Chapter 3 3.5 Bar Charts LO3-6

32 3-32 A pie chart can only convey a general idea of the data.A pie chart can only convey a general idea of the data. Pie charts should be used to portray data which sum to a total (e.g., percent market shares).Pie charts should be used to portray data which sum to a total (e.g., percent market shares). A pie chart should only have a few (i.e., 2 or 3) slices.A pie chart should only have a few (i.e., 2 or 3) slices. Each slice can be labeled with data values or percents.Each slice can be labeled with data values or percents. An Oft-Abused Chart An Oft-Abused Chart Chapter 3 3.6 Pie Charts LO3-6: Know the rules for effective bar charts and pie charts. LO3-6

33 3-33 Consider the following charts used to illustrate an article from the Wall Street Journal. Which type appears to be better?Consider the following charts used to illustrate an article from the Wall Street Journal. Which type appears to be better? An Oft-Abused Chart An Oft-Abused Chart Chapter 3 3.6 Pie Charts LO3-6

34 3-34 Exploded3-D pie chartsExploded and 3-D pie charts add strong visual impact. Pie Chart Options Pie Chart Options Chapter 3 3.6 Pie Charts LO3-6

35 3-35 Scatter plots can convey patterns in data pairs that would not be apparent from a table. Scatter plots can convey patterns in data pairs that would not be apparent from a table. Chapter 3 3.7 Scatter Plots LO3-7: Make and interpret a scatter plot using Excel. LO3-7 Refer to the text for EXCEL outputs.

36 3-36 TablesTables are the simplest form of data display. Arrangement of data is in rows and columns to enhance meaning.Arrangement of data is in rows and columns to enhance meaning. compound tableA compound table is a table that contains time series data down the columns and variables across the rows. The data can be viewed by focusing on the time pattern (down the columns) or by comparing the variables (across the rows).The data can be viewed by focusing on the time pattern (down the columns) or by comparing the variables (across the rows). Example: School Expenditures Example: School Expenditures Chapter 3 3.8 Tables

37 3-37 Units of measure are stated in the footnote. Note merged headings to group columns. See text for “Tips for Effective Bar and Column Charts.” Tables”. Example: School Expenditures Example: School Expenditures Chapter 3 3.8 Tables

38 3-38 Chapter 3 3.8 Tables Here are some tips for creating effective tables: 1. Keep the table simple, consistent with its purpose. Put summary tables in the main body of the written report and detailed tables in an appendix. 2. Display the data to be compared in columns rather than rows. 3. For presentation purposes, round off to three or four significant digits. 4. Physical table layout should guide the eye toward the comparison you wish to emphasize. 5. Row and column headings should be simple yet descriptive. 6. Within a column, use a consistent number of decimal digits. LO3-8: Make simple tables and Pivot tables LO3-8

39 3-39 A nonzero origin will exaggerate the trend.A nonzero origin will exaggerate the trend. Deceptive Correct Error 1: Nonzero Origin Error 1: Nonzero Origin Chapter 3 LO3-9: Recognize deceptive graphing techniques. LO3-9 3.9 Deceptive Graphs

40 3-40 Keep the aspect ratio (width/height) below 2.00 so as not to exaggerate the graph. By default, Excel uses an aspect ratio of 1.68.Keep the aspect ratio (width/height) below 2.00 so as not to exaggerate the graph. By default, Excel uses an aspect ratio of 1.68. Error 2: Elastic Graph Proportions Error 2: Elastic Graph Proportions Chapter 3LO3-9 3.9 Deceptive Graphs

41 3-41 Can make trends appear to dwindle into the distance or loom towards you.Can make trends appear to dwindle into the distance or loom towards you. Error 4: 3-D and Novelty Graphs Error 4: 3-D and Novelty Graphs Chapter 3 3.9 Deceptive Graphs LO3-9

42 3-42 Can make trends appear to dwindle into the distance or loom towards you.Can make trends appear to dwindle into the distance or loom towards you. Error 5: 3-D and Rotated Graphs Error 5: 3-D and Rotated Graphs Chapter 3 3.9 Deceptive Graphs LO3-9

43 3-43 Avoid if possible. Keep your main objective in mind. Break graph into smaller parts.Avoid if possible. Keep your main objective in mind. Break graph into smaller parts. Error 8: Complex Graphs Error 8: Complex Graphs Chapter 3LO3-9 3.9 Deceptive Graphs

44 3-44 As figure height increases, so does width, distorting the graph.As figure height increases, so does width, distorting the graph. Error 11: Area Trick Error 11: Area Trick Chapter 3 3.9 Deceptive Graphs LO3-9

45 3-45 Other deceptive graphing techniques.Other deceptive graphing techniques. Error 3: Dramatic Title and Distracting Pictures Error 6: Unclear Definitions or Scales Error 7: Vague Sources Error 9: Gratuitous Effects Error 10: Estimated Data Chapter 3LO3-9 3.9 Deceptive Graphs


Download ppt "McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics."

Similar presentations


Ads by Google