Download presentation
Presentation is loading. Please wait.
Published byDaniella Stevenson Modified over 9 years ago
1
Effective Use of Graphs Annie Herbert Medical Statistician Research & Development Support Unit Salford Royal (Hope) Hospitals NHS Foundation Trust annie.herbert@manchester.ac.uk (0161 720) 2227
2
Timetable TimeTask 60 minsPresentation 20 minsCoffee Break 90 mins Practical Tasks in IT Room
3
Outline Graphs for categorical data Graphs for numerical data Comparing groups Additional graphs (covered in other courses) Final tips & Computer packages
4
Categorical Data (1) Examples: Sex – Male/Female Blood Group – A/B/AB/O Employment Status – Unemployed/Part-time/Full-time
5
Categorical Data (2) Record: Frequency (discrete number) per category Summary: Frequency OR percentage/fraction/proportion Visually: - Bar Chart - Pie Chart
6
Example – Discharge Destination (1) Where Patient Livesn = 731 Alone339 (46.3%) Family210 (28.7%) Home180 (24.6%) Other2 (0.3%)
7
Example – Discharge Destination (2)
8
Example – Psychiatric Illness/ Discharge Destination (1) Psychiatric Illness? Where Patient Lives No n=208 Yes n=523 Alone117 (56%)222 (42%) Family81 (39%)129 (25%) Home9 (4%)171 (33%) Other1 (0%)
9
Example – Psychiatric Illness/ Discharge Destination (2) Where Patient Lives Psychiatric Illness? Alone (n=339)Family (n=210)Home (n=180)Other (n=2) No117 (35%)81 (39%)9 (5%)1 (50%) Yes222 (65%)129 (61%)171 (95%)1 (50%)
10
Example – Psychiatric Illness/ Discharge Destination Bar Chart
11
Stacked Bar Chart
12
Re-ordering categories can emphasize a certain effect:
13
The axis should always start from 0:
14
Bar Charts – Adv & Disadv Advantages: - Visually strong. - Easy to compare between more than one dataset. Disadvantages: -Categories can be ‘re-ordered’ to emphasize certain effects. -Misleading if not used for counts. -Misleading if y-axis not from 0.
15
Bar Charts – Things to consider: What group differences are you interested in? Frequencies or percentages? If percentage, it’s down to you to specify the totals. Is ‘Other’ a large frequency/percentage? Consider the categories as un-ordered when using a stacked bar chart.
16
Pie Charts
17
Pie Charts – Advantages: Easy to compare categories, are equidistant from each other. Ordering of categories does not emphasize certain effects as badly as stacked bar charts do.
18
Pie Charts – Disadvantages: No choice between frequencies and percentages (down to you to specify totals). Cannot put more than one data set into a pie chart. Lose individual values of the data. Limited space: if using more than 5 or 6 categories, chart can look complicated.
19
Numerical Data (1) Examples: Weight Blood Pressure Cholesterol Levels
20
Numerical Data (2) Record: Number/Value (discrete or continuous) Summary: - Mean (SD) - Median (IQR) Visually: - Histogram- Box plot- Spread plot
21
Data – Ages of Patients in Selenium Study Age 48 36 56 66 65 19 36 59 48 52 67 39 28 58 48 49 39 57 62 74 59 66 45 69 55 63 42 68 54 24 19 70 73 29 34 50
22
Histogram – Ages of Patients in Selenium Study
23
Histograms for the same data can vary:
24
Compromise:
25
Beware! Histogram is not Bar Chart
26
Histograms – Advantages: Visual display of interval frequencies, easy to compare intervals. Can give an idea of the distribution of the data, e.g. shape, typical value, spread.
27
Histograms – Disadvantages: Choice of interval width can alter appearance. Individual values lost. One data set per histogram, difficult to compare data sets.
28
Box Plot Upper Quartile Lower Quartile Median Extreme Outlier Outlier
29
Box Plots – Advantages: Defines many summary statistics in one plot. Defines ‘outliers’ explicitly. Can have more than one data set in a plot, so easy to compare data sets:
30
Box Plots – Disadvantages: More complicated visually than some other types of data plots. Individual values lost.
31
Spread Plots (1)
32
Spread Plots (2) Advantages: -Can give an idea of the distribution of the data, e.g. shape, typical value, spread. -Shows individual values of the data. -Can show more than one dataset in a plot. Disadvantages: -Not very widely used in journal publications. -Doesn’t explicitly summarise statistics or outliers as box plot does.
33
Relationships in Numerical Data
34
Serial Measurements
35
What information does this give? Mean ± SE, n ≈ 30 per group
36
Better to look at individual data…
37
…or give a sensible summary.
38
Kaplan-Meier Curve (step graph) Time-to-Event data.
39
Bland-Altman Plots (scatter plots) How well do two methods of measurement agree?
40
Forest Plots (Hi-Lo-Close charts) Meta-Analysis.
41
Final Pointers: Before plotting think about the type of data and what you would like to compare. Show all data rather than summaries where possible. Label axes clearly. Graph should ‘stand alone’. Make sure when comparing groups that outcome on the same scale. Make sure any colours used are sufficiently different from each other, and not red/green.
42
Using a Computer Package: PackageAdvantagesDisadvantages SPSSProduces journal quality graphs Difficult to start with Expensive StatsDirectWhen copied and pasted, these graphs may be edited in Word Difficult to draw bar/pie charts ExcelEasy to use for bar/pie charts Not a statistics package
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.