Today’s Beginner Workshop R Overview Learn Data Types Enter Data Write Basic Code Analyze Data Create Charts Install Packages
R Overview Statistics and Data Analysis User Interfaces Packages Comparison to SAS, SPSS User Interfaces RStudio Microsoft Packages Growth Who is Creating These Packages?
Top 20 Skills of a Data Scientist https://rjmetrics.com/resources/reports/the-state-of-data-science/
R was created in 1996 by Ross Ihaka and Robert Gentleman of the Statistics Department at The University of Auckland in New Zealand http://cm.bell-labs.com/cm/ms/departments/sia/S/index.html
As of 2010, still active in R Development https://www.stat.auckland.ac.nz/~ihaka/?Ross_Ihaka As of 2010, still active in R Development
Notable Pre-Loaded R Packages Descriptions base Core functions for data management, basic descriptive statistics stats Hundreds of statistical functions including correlation, regression, anova, cluster analysis, normal distribution graphics Contains many basic plots including barplot, boxplot, histograms, line, pie, and many more. graphics demos demo(graphics, package = "graphics")
R was initially developed as FREE, OPEN SOURCE, STATISTICAL SOFTWARE versus alternative Commercial Products Acquired by IBM in 2009 Business vs. Academia
Many R Packages have been created that are equivalent to the base functionality of SAS and SPSS Source: Robert Muenchen
R has Grown Far Beyond its Data, Analytics, and Graphics Beginnings http://www.r-bloggers.com/on-the-growth-of-cran-packages/
Package Categories COMMON FUNCTIONS SPECIALIZED TASK VIEWS Data Input / Output Data Management Mathematics and Statistics Graphics SPECIALIZED TASK VIEWS Web High Performance Computing Subject Matter Specific
Who is creating these packages? R is the work of dozens upon dozens of people; primarily University Professors and PhD Students; though also individuals on their spare time; and more recently companies. They have written many packages to make it easier for everyone to analyze data.
Most From Universities Many University Researchers receive FEDERAL and other grants to work on projects. For some projects, part of that money goes to create R libraries. $$$
$300K https://ropensci.org/ http://srug.ropensci.org/
User Interfaces Basic R User Interface RStudio Microsoft
The Original and Current Basic R “Console” command line interface…. Write Code/ Program Input Data Analyze Graphics R User Interface
2008- 1996- RStudio leverages the BaseR Software. This is why both installations are needed! RStudio has a better User-Interface . 2008- + Software Comparison 1996-
Console Scripting/Coding Current Datasets Write Code/ Program Input Data Analyze Graphics Console http://www.rstudio.com/ Enter Single Commands See Output
Microsoft and Revolution Analytics https://mran.revolutionanalytics.com/open/ https://mran.revolutionanalytics.com/rro/
Which IDE?