Visual Presentation. For categorical data, pie charts can be effective in their simplicity for portraying relative frequencies. Pie chart to show percentages.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Advantages & Disadvantages of Dot Plots, Histograms, and Box Plots
Excel 2003: Chart types. Column chart Shows data changes over a period of time Shows data changes over a period of time Or illustrates comparisons among.
Statistics 100 Lecture Set 6. Re-cap Last day, looked at a variety of plots For categorical variables, most useful plots were bar charts and pie charts.
Chapter 2 Summarizing and Graphing Data
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Measures of Dispersion
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
Types of Data Displays Based on the 2008 AZ State Mathematics Standard.
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Descriptive statistics (Part I)
Very Basic Statistics.
Presenting information
Presentation of Data.
1 How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics
The Stats Unit.
Chapter 3: Graphic Presentation
OCR Functional Skills Charts Presenting data – Good data presentation skills are important. – Poor graphs and tables lead to the wrong conclusions being.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
9 - 1 Module 9: Frequency Distributions This module includes descriptions of frequency distributions, frequency tables, histograms and frequency polygons.
Charts and Graphs V
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Objectives (BPS chapter 1)
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
REPRESENTATION OF DATA.
Graphic Presentation The Pie Chart The Bar Graph The Statistical Map
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Welcome to Math 6 Statistics: Use Graphs to Show Data Histograms.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Copyright © Cengage Learning. All rights reserved. 2 Organizing Data.
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
The introduction to SPSS Ⅱ.Tables and Graphs for one variable ---Descriptive Statistics & Graphs.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Chapter 2 Describing Data.
Describing and Displaying Quantitative data. Summarizing continuous data Displaying continuous data Within-subject variability Presentation.
Categorical vs. Quantitative…
Data: Presentation and Description Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Unit 4 Statistical Analysis Data Representations.
The hypothesis that most people already think is true. Ex. Eating a good breakfast before a test will help you focus Notation  NULL HYPOTHESIS HoHo.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
Applied Quantitative Analysis and Practices
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
SPSS Workshop Day 2 – Data Analysis. Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables.
CONFIDENTIAL Data Visualization Katelina Boykova 15 October 2015.
Introduction to statistics I Sophia King Rm. P24 HWB
Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Charts Overview PowerPoint Prepared by Alfred P.
3/13/2016 Data Mining 1 Lecture 2-1 Data Exploration: Understanding Data Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB)
Statistics - is the science of collecting, organizing, and interpreting numerical facts we call data. Individuals – objects described by a set of data.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Graphs & Charts: The Art of Data Visualisation Alasdair Rutherford SSPC9C6University of StirlingSpring 2016.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Data Coaching Services Chart Interpretation 1. o Bar o Stacked Bar o Pie o Line o Scatter plot 2.
Statistics Vocabulary. 1. STATISTICS Definition The study of collecting, organizing, and interpreting data Example Statistics are used to determine car.
Visual Presentation Lies, damned lies, and statistics? (Disraeli) Contributors Shelly Bradford EMPHO Helen Cooke SWPHO Sean McPhail SWPHO.
Exploratory Data Analysis
Unit 4 Statistical Analysis Data Representations
How could data be used in an EPQ?
Description of Data (Summary and Variability measures)
DAY 3 Sections 1.2 and 1.3.
Probability and Statistics
Presentation transcript:

Visual Presentation

For categorical data, pie charts can be effective in their simplicity for portraying relative frequencies. Pie chart to show percentages of each smoking status in a sample of 211 women.

You can try various things to spice up the chart. The inclination and distortion that produce the 3D effects make it difficult to interpret the data. In the view below, the Hazelnuts (4%) and the Cashews (8%) look the same. You should avoid using such distortions.

When both variables are continuous it is helpful to produce a graph, called a scatterplot, of one against the other. Axes for any continuous variable on any graph should be chosen with a suitable scale and, importantly, should make it very clear if a “false origin” is used.

Scatter Plots

Is this a real trend ? How might we check ?

Table 1. Percentage of people in England and Wales rating their health as “not good”: data from Census 2001 MalesFemales Age-standardised proportion White British7.8%8.0% White Irish10.2%9.3% Other White7.4%7.8% Mixed9.8%10.4% Indian8.7%12.0% Pakistani13.5%17.1% Bangladeshi13.9%15.5% Other Asian8.7%10.5% Black Caribbean10.0%12.2% Black African6.8%8.4% Other Black10.1%11.9% Chinese5.6%6.2% Any other ethnic group8.2%8.0% All ethnic groups7.9%8.2% Tabular Presentation: Crosstabs

Tables clearly give actual numbers involved. They can be colour coded to represent statistical significance. Very hard to misinterpret a table. However with a large numbers of data can get overwhelming.

Bar charts are similar to histograms, but are used to represent data classified into categories. The widths of blocks are always equal and there are spaces between the blocks. Bar chart showing the number of people in each of four groups.

Clustered and stacked bar charts are used to describe two or more categorical variables simultaneously. Clustered bar chart showing the percentage in each group within each gender. Stacked bar chart showing the number of people of each gender, within each of four groups.

100% stacked vertical bar chart Difficult to make comparisons among the second, third, or subsequent segments in a stacked bar because judgment is not being made from a common base. Which region has largest proportion of males 35-64?

3d barchart What % of abortions in England at gestation< 10 weeks?

3d barchart What % of abortions at 13+ gestation in East Midlands?

What % of abortions in England at gestation< 10 weeks?

Vertical Bar Charts

Horizontal Bar Chart Spurious ranking somebody's got to be at the bottom Encourages comparison when perhaps not justified 95% CI’s arbitrary No consideration of multiple comparisons

Dot plot with Confidence intervals Not a standard graph format in excel Gives a better visual representation of the width of confidence intervals, useful when CI’s are wide.

Line charts display the same information as bar charts except that the line connects the midpoints of tops of corresponding bars. Line charts are preferable to bar charts when there are many categories or a continuous variable. Line chart showing trend of frequencies of days spent in hospital. This connects mid-points of the top of bars for the corresponding bar chart, with straight lines.

Life Expectancy at Birth Target = by 2010 increase LE to 78.6 years for men and to 82.5 years for women Line Chart LE Gap = 5 years Useful for looking at trends of time

Not using a consistent scale leads to misrepresentation of trends.

Histograms are used to depict numeric variables. The areas of rectangles represent numbers of cases. If the widths of the rectangles are all the same, then the height of the blocks is proportional to the observed frequencies. Histogram showing the distribution of the number of days stayed in hospital. Short lengths of stay (less than 4 days) were more common than longer ones.

Histogram Means of summarising the frequency distribution of a single variable eg. Age distribution of North Notts residents off Exeter system Use where you havent already got the frequency distribution provided Easiest to do if have stats package Like SPSS or can use Data Analysis Tools add-in in Excel Alternatively use Access to get frequency count and export into Excel and do bar chart

Histogram: Age Distribution of 1% random sample off N Notts Exeter

Figure 10 Histogram with normal-curve and statistics for cholesterol measurements. useful for illustrating the shape of the distribution of a batch of numbers

Figure 10 Histogram with normal-curve and statistics for cholesterol measurements. Mean=6.6 useful for illustrating the shape of the distribution of a batch of numbers

Figure 10 Histogram with normal-curve and statistics for cholesterol measurements. Mean=6.6 Mode useful for illustrating the shape of the distribution of a batch of numbers

Figure 10 Histogram with normal-curve and statistics for cholesterol measurements. Mean=6.6 Mode Median  6.4 useful for illustrating the shape of the distribution of a batch of numbers

Boxplots are graphical representations which clearly show the quartiles and extremes or outliers of a set of data in a compact manner. Boxplots showing the distribution of 2-hour blood glucose among males and females.

Boxplots are graphical representations which clearly show the quartiles and extremes or outliers of a set of data in a compact manner. Boxplots showing the distribution of 2-hour blood glucose among males and females. Median 1 st Quartile 3 rd Quartile Outliers “unusual observations” Minimum Maximum

Radar Chart or Spider Chart Allows the comparisons more than one group across a few different values

Age RangeTotalMales Female s and over Totals population pyramids for districts and regions comparing against the age-gender distribution for the UK as a whole: available on the ONS website under “Census 2001” Population Pyramids East Midlands Region Not standard excel format

Spine charts- many different types, used for displaying lots of information of different variables The range to the left and right is not necessarily equal The range of clusters can be greatly distorted by one outlying point

No statistical significance Variables with small variance from mean will be hard to spot

Maps

Statistical Process Control Charts Used to detect and monitor process improvement over time Easy to read guide which allows you to take appropriate action Provides an understanding of process over time and capability of achieving targets Distinguish between –Controlled (Chance) variation Stable and stays within process limits –Uncontrolled (assignable) variation Pattern changes over time and/or breaches control limits

Funnel Plots No ranking Data Displayed No preset threshold for out of control Visual relationship with volume Emphasises increased variability of smaller centres

Teenage pregnancy - Conceptions below age 18 (change in rate) CHI Indicator: Percentage change in the under-18 conception rate between 1998 and 2001.

Tweaking with colour

Traffic lights on a graph

Exercise List 3 advantages and 3 disadvantages to Funnel plots

X-Bar and R Chart Traditional chart – data points with mean and control limits The X-bar chart monitors the mean over time, based on the average of a series of observations, called a subgroup.

X-Bar and R Chart Does not allow us to monitor progress in terms of the targets set out by the DH

SPC chart with Targets Has targets instead of mean of data points The “Action limits” are similar to control limits – are 2 and 3 SD’s from target Does not account for different trust sizes and/or number of patients seen

Interpretation and use When are SPC methods not appropriate? – are we stretching their use? What if all points lie outside control limits? Is this a: –Bad measure? –Bad system? Where should control limits be set and how should we decide?

Observations Relatively simple to create – funnel plots, double square root charts etc, can all be created in Excel (spreadsheets available) Most people get it and like it when they do Quick, simple, robust, Funnel plots convey more information and use “real” units BUT It’s the pattern that matters Only a screening tool – tells you where to look first