Introduction to Graphical Presentation Andy Wang CIS 5930-03 Computer Systems Performance Analysis.

Slides:



Advertisements
Similar presentations
Lecture 06: Design II February 5, 2013 COMP Visualization.
Advertisements

Lecture 1: Beautiful graphics in R
1 CS533 Modeling and Performance Evaluation of Network and Computer Systems The Art of Data Presentation.
Making effective plots: 1.Don’t use default Excel plots! 2.Figure should highlight the key relationships in the data. 3.Should be clear - no extraneous.
Excel Tutorial 4: Analyzing and Charting Financial Data
® Microsoft Office 2010 Excel Tutorial 4: Enhancing a Workbook with Charts and Graphs.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
1 Normal Probability Distributions. 2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores In.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 3.1 Chapter Three Art and Science of Graphical Presentations.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Graphing With Excel 2010 University of Michigan – Dearborn Science Learning Center Based on a presentation by James Golen Revised by Annette Sieg…
10-1 ©2006 Raj Jain The Art of Data Presentation.
Copyright © 2005 Department of Computer Science CPSC 641 Winter Data Analysis and Presentation There are many “tricks of the trade” used in data.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Lecture 13 Page 1 CS 239, Spring 2007 Data Presentation CS 239 Experimental Methodologies for System Software Peter Reiher May 22, 2007.
Data Visualization.
Business Communication, 15e
CS1100: Computer Science and Its Applications Creating Graphs and Charts in Excel.
Sometimes, Tables can be confusing
OCR Functional Skills Charts Presenting data – Good data presentation skills are important. – Poor graphs and tables lead to the wrong conclusions being.
Jeffrey Nichols Displaying Quantitative Information May 2, 2003 Slide 0 Displaying Quantitative Information An exploration of Edward R. Tufte’s The Visual.
Graphics and visual information English 314 Technical communication Note: To hide or reveal these lecture notes, go to VIEW and click COMMENTS. This lecture.
Charts and Graphs V
1 PowerPoint Presentation Design Wednesday, September 02, 2015Ms. Wear Info Tech 9/10.
Guidelines for Visual Aids and Presentations Suggestions for Presenters Society of Quality Assurance 2004 Annual Meeting Guidance M. Rosenberg/L. KvasnickaJune.
Mark P. Baldwin Northwest Research Associates, USA Cargese UTLS Summer School, 6 Oct Data Graphics AndTypography.
CMPT 880/890 Writing labs. Outline Presenting quantitative data in visual form Tables, charts, maps, graphs, and diagrams Information visualization.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Graphical Display and Presentation of Quantitative Information 13 February 2006.
Integrating Graphics, Charts, Tables Into your technical writing documents.
The Scientific Method Honors Biology Laboratory Skills.
Chapter 10 The Art of Data Presentation. Overview 2 Types of Variables Guidelines for Preparing Good Charts Common Mistakes in Preparing Charts Pictorial.
A Picture Is Worth A Thousand Words. DAY 7: EXCEL CHAPTER 4 Tazin Afrin September 10,
1 Eric Rasmusen, March 10, 2014 Graphs and Tables.
Graphics for Macroeconomics. Principles Graphing is done best when it clearly communicates ideas about data Focus on the main point while preventing distractions.
© 1998, Geoff Kuenning The Art of Graphical Presentation Types of Variables Guidelines for Good Graphics Charts Common Mistakes in Graphics Pictorial Games.
Visualizing Data in Excel Geof Hileman, FSA Kennell & Associates, Inc June 4, 2012.
Graphing Data: Introduction to Basic Graphs Grade 8 M.Cacciotti.
© 1998, Geoff Kuenning Common Mistakes in Graphics Excess information Multiple scales Using symbols in place of text Poor scales Using lines incorrectly.
MATH 3400 Computer Applications of Statistics Lecture 6 Data Visualization and Presentation.
Graphs in Physics PowerPoint #4. A graph is… A convenient way to show data.
Effective PowerPoints. This: Do not attempt to put all the text, code, or explanation of what you are talking about directly onto the slide, especially.
COMMUNICATING DATA USING GRAPHICS MIS2502 Data Analytics.
When data is collected from a survey or designed experiment, they must be organized into a manageable form. Data that is not organized is referred to as.
Unit 4 Statistical Analysis Data Representations.
Four types Data maps (17-19, Tufte, also History of the World in 100 Seconds)History of the World in 100 Seconds Time series Narrative graphics of space.
1 CSE 2337 Chapter 3 Data Visualization With Excel.
GRAPHICS GUIDELINES MUSE/CE 11B Anagnos/Williamson From Pfeiffer, W.S Technical Writing: A Practical Approach. 5th Edition. Prentice Hall. New Jersey.
Worth 1,000 Words How to use information graphics to make data meaningful National Association for Career and Technical Education Information May 17, 2012.
© 1998, Geoff Kuenning Vague idea “groping around” experiences Hypothesis Model Initial observations Experiment Data, analysis, interpretation Results.
Proposal: Preliminary Results and Discussion. Dos and Don’ts DoDon’t Include initial results if you have them You can also conduct and report on informal.
Microsoft® Excel Use the Chart Tools Design tab. 1 Use the Chart Tools Layout and Format tabs. 2 Create chart sheets and chart objects. 3 Edit.
Technical Communication A Practical Approach Chapter 13: Graphics William Sanborn Pfeiffer Kaye Adkins.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Charts Overview PowerPoint Prepared by Alfred P.
UNIT ONE: Science Skills  Chapter 1Measurement  Chapter 2The Scientific Process  Chapter 3Mapping Earth.
TECHNICAL WRITING November 26, Today Effective visuals.
Integrating Graphics, Illustrations, Figures, Charts.
2. Graphing Sci. Info Skills.
Title of your poster here: if you have a really long title,
Tutorial 4: Enhancing a Workbook with Charts and Graphs
QM222 A1 Visualizing data using Excel graphs
STAT 4030 – Jennifer Priestley, Ph.D. Programming in R
Proposal: Preliminary Results and Discussion
Module 6: Presenting Data: Graphs and Charts
Make Your Data Tell a Story
Title of your poster here: if you have a really long title,
Technical Writing (AEEE299)
More on Data Presentation CS 239 Experimental Methodologies for System Software Peter Reiher May 24, 2007.
Topic 7: Visualization Lesson 1 – Creating Charts in Excel
The Art of Graphical Presentation
Presentation transcript:

Introduction to Graphical Presentation Andy Wang CIS Computer Systems Performance Analysis

2 The Art of Graphical Presentation Reference Works Types of Variables Guidelines for Good Graphics Charts Common Mistakes in Graphics Pictorial Games Special-Purpose Charts

3 Useful Reference Works Edward R. Tufte, The Visual Display of Quantitative Information, Graphics Press, Cheshire, Connecticut, Edward R. Tufte, Envisioning Information, Graphics Press, Cheshire, Connecticut, Edward R. Tufte, Visual Explanations, Graphics Press, Cheshire, Connecticut, Darrell Huff, How to Lie With Statistics, W.W. Norton & Co., New York, 1954

4 Types of Variables Qualitative –Ordered (e.g., modem, Ethernet, satellite) –Unordered (e.g., CS, math, literature) Quantitative –Discrete (e.g., number of terminals) –Continuous (e.g., time)

5 Charting Based on Variable Types Qualitative variables usually work best with bar charts or Kiviat graphs –If ordered, use bar charts to show order Quantitative variables work well in X-Y graphs –Use points if discrete, lines if continuous –Bar charts sometimes work well for discrete

6 Guidelines for Good Graphics Charts Principles of graphical excellence Principles of good graphics Specific hints for specific situations Aesthetics Friendliness

7 Principles of Graphical Excellence Graphical excellence is the well- designed presentation of interesting data: –Substance –Statistics –Design

8 Graphical Excellence (2) Complex ideas get communicated with: –Clarity –Precision –Efficiency

9 Graphical Excellence (3) Viewer gets: –Greatest number of ideas –In the shortest time –With the least ink –In the smallest space

10 Graphical Excellence (4) Is nearly always multivariate Requires telling truth about data

11 Principles of Good Graphics Above all else show the data Maximize the data-ink ratio Erase non-data ink Erase redundant data ink Revise and edit

12 Above All Else Show the Data

13 Above All Else Show the Data

14 Maximize the Data-Ink Ratio

15 Maximize the Data-Ink Ratio

16 Erase Non-Data Ink

17 Erase Non-Data Ink East West North

18 Erase Redundant Data Ink East West North

19 Erase Redundant Data Ink East West North

20 Revise and Edit

21 Revise and Edit

22 Revise and Edit

23 Revise and Edit

24 Revise and Edit

25 Revise and Edit

26 Revise and Edit

27 Specific Things to Do Give information the reader needs Limit complexity and confusion Have a point Show statistics graphically Don’t always use graphics Discuss it in the text

28 Give Information the Reader Needs Show informative axes –Use axes to indicate range Label things fully and intelligently Highlight important points on the graph

29 Giving Information the Reader Needs

30 Giving Information the Reader Needs

31 Limit Complexity and Confusion Not too many curves Single scale for all curves No “extra” curves No pointless decoration (“ducks”)

32 Limiting Complexity and Confusion

33 Limiting Complexity and Confusion

34 Have a Point Graphs should add information not otherwise available to reader Don’t plot data just because you collected it Know what you’re trying to show, and make sure the graph shows it

35 Having a Point Sales were up 15% this quarter:

36 Having a Point

37 Having a Point

38 Having a Point

39 Show Statistics Graphically Put bars in a reasonable order –Geographical –Best to worst –Even alphabetic Make bar widths reflect interval widths –Hard to do with most graphing software Show confidence intervals on the graph –Examples will be shown later

40 Don’t Always Use Graphics Tables are best for small sets of numbers –Tufte says 20 or fewer Also best for certain arrangements of data –E.g., 10 graphs of 3 points each Sometimes a simple sentence will do Always ask whether the chart is the best way to present the information –And whether it brings out your message

41 Text Would Have Been Better

42 Discuss It in the Text Figures should be self-explanatory –Many people scan papers, just look at graphs –Good graphs build interest, “hook” readers But text should highlight and aid figures –Tell readers when to look at figures –Point out what figure is telling them –Expand on what figure has to say

43 Aesthetics Not everyone is an artist –But figures should be visually pleasing Elegance is found in –Simplicity of design –Complexity of data

44 Principles of Aesthetics Use appropriate format and design Use words, numbers, drawings together Reflect balance, proportion, relevant scale Keep detail and complexity accessible Have story about the data (narrative quality) Do professional job of drawing Avoid decoration and chartjunk

45 Use Appropriate Format and Design Don’t automatically draw a graph –Mentioned before Choose graphical format carefully Sometimes “text graphic” works best –Use text placement to communicate numbers –Very close to being a table

46 GNP: +3.8IPG: +5.8CPI: +7.7Profits: CEA: +4.7 DR: +4.5 NABE: +4.5 WEF: +4.5 CBO: +4.4 CB: +4.2 IBM: +4.1 CE: +2.9 NABE: +6.2 IBM: +5.9 CB: +5.5 DR: +5.2 WEF: +4.8 IBM: +6.6 NABE: +6.5 CB: +6.2 WEF: +21 DR: IBM: CE: +6.5 WEF: 6.8 CB: 6.7 NABE: 6.7 IBM: 6.6 DR: 6.5 CBO: 6.3 CEA: 6.3 Unempl: 6.0 About a year ago, eight forecasters were asked for their predictions on some key economic indicators. Here’s how the forecasts stack up against the probable 1978 results (shown in the black panel). (New York Times, Jan. 2, 1979) Using Text as a Graphic

47 The Stem-and-Leaf Plot From Tukey, via Tufte, heights of volcanoes in feet: 0| | | | | | | | |

48 Choosing a Graphical Format Many options, more being invented all the time –Examples will be given later –See Jain for some commonly useful ones –Tufte shows ways to get creative Choose a format that reflects your data –Or that helps you analyze it yourself

49 Use Words, Numbers, Drawings Together Put graphics near or in text that discusses them –Even if you have to murder your word processor Integrate text into graphics Tufte: “Data graphics are paragraphs about data and should be treated as such”

50 Reflect Balance, Proportion, Relevant Scale Much of this boils down to “artistic sense” Make sure things are big enough to read –Tiny type is OK only for young people! Keep lines thin –But use heavier lines to indicate important information Keep horizontal larger than vertical –About 50% larger works well

51 Poor Balance and Proportion Sales in the North and West districts were steady through all quarters East sales varied widely, significantly outperforming the other districts in the third quarter

52 Better Proportion Sales in North and West districts were steady through all quarters East sales varied widely, significantly outperforming other districts in third quarter

53 Keep Detail and Complexity Accessible Make your graphics friendly: –Avoid abbreviations and encodings –Run words left-to-right –Explain data with little messages –Label graphic, don’t use elaborate shadings and a complex legend –Avoid red/green distinctions –Use clean, serif fonts in mixed case

54 An Unfriendly Graph

55 A Friendly Version

56 Even Friendlier

57 Have a Story About the Data (Narrative Quality) May be difficult in technical papers But think about why you are drawing graph Example: –Performance is controlled by network speed –But it tops out at high end –And that’s because we hit a CPU bottleneck

58 Showing a Story About the Data

59 Do a Professional Job of Drawing This is easy with modern tools –But take the time to do it right Align things carefully Check final version in format you will use –I.e., print Postscript one last time before submission –Or look at your slides on projection screen Preferably in presentation room Color balance varies by projector

60 Avoid Decoration and Chartjunk Powerpoint, etc. make chartjunk easy Avoid clip art, automatic backgrounds, etc. Remember: data is the story –Statistics aren’t boring –Uninterested readers aren’t drawn by cartoons –Interested readers are distracted Does removing it change message? –If not, leave it out

61 Examples of Chartjunk Gridlines! Vibration Pointless Fake 3-D Effects Filled “Floor”Clip Art In or out? Filled “Walls” Borders and Fills Galore Unintentional Heavy or Double Lines Filled Labels Serif Font with Thin & Thick Lines

62 Common Mistakes in Graphics Excess information Multiple scales Using symbols in place of text Poor scales Using lines incorrectly

63 Excess Information Sneaky trick to meet length limits Rules of thumb: –6 curves on line chart –10 bars on bar chart –8 slices on pie chart But note that Tufte hates pie charts Extract essence, don’t cram things in

64 Way Too Much Information

65 What’s Important About That Chart? Times for cp and rcp rise with number of replicas Most other benchmarks are near constant Exactly constant for rm

66 The Right Amount of Information

67 Multiple Scales Another way to meet length limits Basically, two graphs overlaid on each other Confuses reader (which line goes with which scale?) Misstates relationships –Implies equality of magnitude that doesn’t exist

68 Some Especially Bad Multiple Scales

69 Using Symbols in Place of Text Graphics should be self-explanatory –Remember that the graphs often draw the reader in So use explanatory text, not symbols This means no Greek letters! –Unless your conference is in Athens...

70 It’s All Greek To Me...

71 Explanation is Easy

72 Poor Scales Plotting programs love non-zero origins –But people are used to zero Fiddle with axis ranges (and logarithms) to get your message across –But don’t lie or cheat Sometimes trimming off high ends makes things clearer –Brings out low-end detail

73 Nonzero Origins (Chosen by Microsoft)

74 Proper Origins

75 A Poor Axis Range

76 A Logarithmic Range

77 A Truncated Range

78 Using Lines Incorrectly Don’t connect points unless interpolation is meaningful Don’t smooth lines that are based on samples –Exception: fitted non-linear curves

79 Incorrect Line Usage

80 Pictorial Games Non-zero origins and broken scales Double-whammy graphs Omitting confidence intervals Scaling by height, not area Poor histogram cell size

81 Non-Zero Origins and Broken Scales People expect (0,0) origins –Subconsciously So non-zero origins are great way to lie More common than not in popular press Also very common to cheat by omitting part of scale –“Really, Your Honor, I included (0,0)”

82 Non-Zero Origins

83 The Three-Quarters Rule Highest point should be 3/4 of scale or more

84 Double-Whammy Graphs Put two related measures on same graph –One is (almost) function of other Hits reader twice with same information –And thus overstates impact

85 Omitting Confidence Intervals Statistical data is inherently fuzzy But means appear precise Giving confidence intervals can make it clear there’s no real difference –So liars and fools leave them out

86 Graph Without Confidence Intervals

87 Graph With Confidence Intervals

88 Scaling by Height Instead of Area Clip art is popular with illustrators: Women in the Workforce

89 The Trouble with Height Scaling Previous graph had heights of 2:1 But people perceive areas, not heights –So areas should be what’s proportional to data Tufte defines lie factor: size of effect in graphic divided by size of effect in data –Not limited to area scaling –But especially insidious there (quadratic effect)

90 Scaling by Area Same graph with 2:1 area: Women in the Workforce

91 Poor Histogram Cell Size Picking bucket size is always problem Prefer 5 or more observations per bucket Choice of bucket size can affect results:

92 Principles of Graphics Integrity (Tufte) Proportional representation of numbers Clear, detailed, thorough labeling Show data variation, not design variation Use deflated money units Don’t have more dimensions than data has Don’t quote data out of context

93 Proportional Representation of Numbers Maintain lie factor of 1.0 Use areas, not heights, with clip art Avoiding “decorative” graphs will do wonders –Not too hard for most engineers!

94 Clear, Detailed, Thorough Labeling Goal is to defeat distortion and ambiguity Write explanations on graphic itself Label important events in the data

95 Show Data Variation, Not Design Variation Use one design for entire graphic In papers, try to use one design for all graphs Again, artistic license is big culprit

96 Use Deflated Money Units Often necessary to show money over time –Even in computer science –E.g., price/performance over time –Or expected future cost of a disk Nominal dollars are meaningless Derate by some standard inflation measure –That’s what the WWW is for!

97 Don’t Have More Dimensions Than Data Has This gets back to the Lie Factor 1-D data (e.g., money) should occupy one dimension on the graph: not Clip art is prohibited by this rule –But if you have to, use an area measure $1.00 $2.00

98 Don’t Quote Data Out of Context Tufte’s example:

99 The Same Data in Context

100 Special-Purpose Charts Tukey’s box plot Histograms Scatter plots Gantt charts Kiviat graphs

101 Tukey’s Box Plot Shows range, median, quartiles all in one: Tufte can’t resist improvements: or or even minim um maxim um quarti le medi an

102 Histograms Tufte improves everything about them:

103 Scatter Plots Useful in statistical analysis Also excellent for huge quantities of data –Can show patterns otherwise invisible

104 Better Scatter Plots Again, Tufte improves the standard –But it can be a pain with automated tools Can use modified Tukey box plot for axes

105 Gantt Charts Shows relative duration of Boolean conditions Arranged to make lines continuous –Each level after first follows FTTF pattern

106 Kiviat Graphs Also called “star charts” or “radar plots” Useful for looking at balance between HB and LB metrics

107 A Few Examples A bad graph Two good graphs

108 A Very Bad Graph

109 A Good Graph: Sunspots

110 A Superb Graph: DEC Traces

White Slide