Data Visualization using R Audit Technology Group Mar 22th, 2013
DV in the News – Gun Owners Put on Map
DV in the News – Gun Owners Retaliated
DV in the News – Home Ownership
What is R? From Wikipedia R is an open source programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians for developing statistical software and data analysis. R is an implementation of the S programming language created by John Chambers while at Bell Labs. R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand.
More about R Some interesting facts R is now developed by the R Development Core Team, of which Chambers is a member R is named partly after the first names of the first two R authors (Robert Gentleman and Ross Ihaka), and partly as a play on the name of S. R is part of the GNU project. The source code for the R software environment is written primarily in C, Fortran, and R. R is freely available under the GNU General Public License, and pre-compiled binary versions are provided for various operating systems.
What should R Concern me as an Auditor? Good question! R uses a command line interface; however, several graphical user interfaces are available for use with R. R provides a wide variety of statistical and graphical techniques, including linear and nonlinear modeling, classical statistical tests, time-series analysis, classification, clustering, and others. According to Rexer's Annual Data Miner Survey in 2010, R has become the data mining tool used by more data miners (43%) than any other.
More Importantly Last but not least Another strength of R is static graphics, which can produce publication-quality graphs, including mathematical symbols. Dynamic and interactive graphics are available through additional packages. In other words, if you are serious about data visualization, using only Excel graphing features is not enough. At the end of the day, it is FREE! All you need is filing a deviation form with your supervisor’s approval, and some weekends spent in front of your computer screen.
R Heat Map is Perfect for Risk Analysis Heat Map is a very useful tool for Risk Analysis But Excel does not have built-in Heat Map feature You can mimic a Heat Map in Excel using Bubble Chart but it is a labor intensive process with less than satisfactory result Or you can learn a little VBA and write your own code to create Heat Maps in Excel, once again, not a completely painless proposition
Heat Map: Correlation Between Genes
Heat Map: NBA Top Scorers
Calendar Heat Map: Stock Close Price
Heat Map: Pollution Data on Calendar
Showing Data on US Map by State
Graphing Rainfall in France
Topology Graph
Topography Graph with Height
Data Visualization in Action 1
Data Visualization in Action 2
Interactive Graph – U.S. Data by State
Interactive Graph – Hurricane Andrew
Interactive Graph – U.S. City Popularity
Q&A Question? Comments?