Data Visualization using R

Presentation transcript:

Data Visualization using R
Audit Technology Group
Mar 22nd, 2013

DV in the News – Gun Owners Put on Map

DV in the News – Gun Owners Retaliated

DV in the News – Home Ownership

What is R? (From Wikipedia)
R is an open source programming language and software environment for statistical computing and graphics.
The R language is widely used among statisticians for developing statistical software and for data analysis.
R is an implementation of the S programming language, which was created by John Chambers while at Bell Labs.
R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand.

More about R: Some Interesting Facts
R is now developed by the R Development Core Team, of which Chambers is a member.
R is named partly after the first names of the first two R authors (Robert Gentleman and Ross Ihaka), and partly as a play on the name of S.
R is part of the GNU project.
The source code for the R software environment is written primarily in C, Fortran, and R.
R is freely available under the GNU General Public License, and pre-compiled binary versions are provided for various operating systems.

Why Should R Concern Me as an Auditor? Good question!
R uses a command line interface; however, several graphical user interfaces are available for use with R.
R provides a wide variety of statistical and graphical techniques, including linear and nonlinear modeling, classical statistical tests, time-series analysis, classification, clustering, and others.
According to Rexer's Annual Data Miner Survey in 2010, R has become the data mining tool used by more data miners (43%) than any other.
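As a rough illustration of two of the techniques listed above (linear modeling and a classical statistical test), here is a minimal sketch. It was not part of the original slides. It assumes only base R and the built-in `mtcars` dataset, chosen purely for convenience; the variable names `fit` and `tt` are illustrative.

```r
# Linear model: fuel economy (mpg) as a function of car weight (wt).
fit <- lm(mpg ~ wt, data = mtcars)
print(coef(fit))   # intercept and slope of the fitted line

# Classical test: Welch two-sample t-test comparing mpg between
# automatic (am = 0) and manual (am = 1) transmissions.
tt <- t.test(mpg ~ am, data = mtcars)
print(tt$p.value)
```

An auditor would replace `mtcars` with their own data frame, for example one loaded with `read.csv()`.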

More Importantly: Last but Not Least
Another strength of R is static graphics: it can produce publication-quality graphs, including mathematical symbols.
Dynamic and interactive graphics are available through additional packages.
In other words, if you are serious about data visualization, using only Excel's graphing features is not enough.
At the end of the day, it is FREE!
All you need is to file a deviation form with your supervisor's approval, and to spend some weekends in front of your computer screen.
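To make the point about mathematical symbols in static graphics concrete, the following sketch uses base R's `plotmath` expressions in axis labels and the title. The choice of the standard normal density is an assumption made for illustration; it does not come from the original slides.

```r
# Evaluate the standard normal density on a grid that includes 0.
x <- seq(-3, 3, length.out = 201)
y <- dnorm(x)

# expression() lets labels contain typeset math (Greek letters, fractions, roots).
plot(x, y, type = "l",
     xlab = expression(x),
     ylab = expression(phi(x)),
     main = expression(paste("Standard normal density: ",
                             phi(x) == frac(1, sqrt(2 * pi)) * e^{-x^2 / 2})))
```

Wrapping the `plot()` call between `pdf("density.pdf")` and `dev.off()` would write the figure to a vector file suitable for a report; the file name is hypothetical.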

R Heat Map Is Perfect for Risk Analysis
A heat map is a very useful tool for risk analysis.
But Excel does not have a built-in heat map feature.
You can mimic a heat map in Excel using a bubble chart, but it is a labor-intensive process with a less than satisfactory result.
Or you can learn a little VBA and write your own code to create heat maps in Excel; once again, not a completely painless proposition.
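By comparison, a basic risk heat map takes only a few lines of base R. The sketch below is not from the original slides. It assumes a hypothetical 5 x 5 risk matrix in which the score is simply likelihood times impact; real audit data would replace the `risk` matrix.

```r
# Hypothetical rating scales (1 = lowest, 5 = highest).
likelihood <- 1:5
impact     <- 1:5

# Risk score for every likelihood/impact pair (rows = likelihood, columns = impact).
risk <- outer(likelihood, impact)

# Draw the heat map; the reversed heat.colors palette maps low scores to
# pale colors and high scores to red.
image(x = likelihood, y = impact, z = risk,
      col = rev(heat.colors(25)),
      xlab = "Likelihood", ylab = "Impact",
      main = "Risk heat map (illustrative data)")

# Print each score in the middle of its cell.
grid <- expand.grid(likelihood = likelihood, impact = impact)
text(grid$likelihood, grid$impact, labels = as.vector(risk))
```

The built-in `heatmap()` function is an alternative when rows and columns should also be clustered and reordered, as in gene-correlation heat maps.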

Heat Map: Correlation Between Genes

Heat Map: NBA Top Scorers

Calendar Heat Map: Stock Close Price

Heat Map: Pollution Data on Calendar

Showing Data on US Map by State

Graphing Rainfall in France

Topology Graph

Topography Graph with Height

Data Visualization in Action 1

Data Visualization in Action 2

Interactive Graph – U.S. Data by State

Interactive Graph – Hurricane Andrew

Interactive Graph – U.S. City Popularity

Q&A
Questions? Comments?