Source Code: Tons of Code
Package: More Code, Statistical Functions, Datasets
Workspace: Fewer Lines of Code, Capability

Currently, how many R packages? At the command line enter:
dim(available.packages())
available.packages()
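A minimal sketch of that count, assuming a CRAN mirror is reachable. The mirror URL and the `tryCatch` fallback are assumptions added here, not part of the slide. `available.packages()` returns a matrix with one row per package, so the first number reported by `dim()` is the package count.

```r
# Assumed mirror; any CRAN mirror would do.
cran <- "https://cloud.r-project.org"

# One row per package available on the mirror; NULL if the lookup fails.
pkgs <- tryCatch(
  available.packages(repos = cran),
  error   = function(e) NULL,
  warning = function(w) NULL
)

if (!is.null(pkgs)) {
  print(dim(pkgs))   # rows = packages, columns = metadata fields
  print(nrow(pkgs))  # the package count on its own
}

# For comparison: packages already installed locally (no network needed).
installed <- installed.packages()
print(nrow(installed))
```

The number changes over time as packages are added to and archived from the repository.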

Is there an R App Store?

Two heavyweights in the statistical software market are SAS and SPSS/IBM

R packages have been created that provide functionality equivalent to that of SAS and SPSS

Packages for reading and writing various file formats:
XLConnect, XML, rhbase, sas7bdat, Rcpp, RJSONIO, Hmisc, RODBC / ROracle, foreign, RMySQL, RWeka, Comma-Separated Values (CSV)
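As a small illustration of the simplest case in that list, here is a sketch of a CSV round trip using only base R. The choice of the built-in `mtcars` dataset and the temporary file are assumptions made for the example. The packages named above cover the other formats and are not exercised here.

```r
# Write a built-in dataset to a temporary CSV file, then read it back.
csv_path <- tempfile(fileext = ".csv")
write.csv(head(mtcars), csv_path, row.names = TRUE)

# The first column holds the row names written above.
cars_back <- read.csv(csv_path, row.names = 1)
print(dim(cars_back))
str(cars_back)

# Remove the temporary file.
unlink(csv_path)
```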

R Being Integrated Into Other Data-Related Products
Oracle R Enterprise (ORE)
“Both R and SAS are here to stay, and finding ways to make them work better with each other is in the best interests of our customers.”

R “Machine Learning” Libraries

Analytic Technique | R Package/Library | Author | Organization
Support Vector Mach. | libsvm (ksvm) | Chih-Chung Chang, Chih-Jen Lin | National Taiwan Univ. + eBay Research Labs
Neural Networks | neuralnet | Frauke Gunther, Stefan Fritsch | Epidemiology and Prevention Research
Neural Networks | nnet | Brian Ripley | University of Oxford
Neural Networks | monmlp | Alex J. Cannon | Atmospheric Science
Randomized Forests | randomForest | Fortran original by Leo Breiman & Adele Cutler, R port by Andy Liaw and Matthew Wiener | Merck
Decision Trees | rpart | Terry M Therneau and Beth Atkinson, R port by Brian Ripley | Mayo Clinic, University of Oxford
Boosting Model | ada | Mark Culp | West Virginia University
Maximum Entropy | maxent | Yoshimasa Tsuruoka, Timothy Jurka | University of Tokyo, UC-Davis
Bagging, bootstrap | adabag | Esteban Alfaro-Cortes | La Universidad de Castilla-La Mancha
Latent Dirichlet | lda | Jonathan Chang | Facebook
Naïve Bayes | e1071 | David Meyer, Evgenia Dimitriadou | Vienna University
Bayesian Network | bnlearn | Marco Scutari | UCL Genetics Institute
Hidden Markov | HiddenMarkov | David Harte | Statistics Research

Industries / Organizations Creating and Using R

Industry | Pct.
Research | 24%
Higher Education | 7%
Information Technology | 9%
Computer Software | 7%
Financial Services | 6%
Banking | 2%
Pharmaceuticals | 4%
Biotechnology | 4%
Market Research | 3%
Management Consulting | 3%
Total | 69%

Hadley Wickham, Asst. Professor of Statistics at Rice University: ggplot2, plyr, reshape, rggobi, profr

Top 100 R packages for 2013 (Jan-May)

Beginner: stats, graphics (both built-in)
Some Coverage: Data Management (plyr, reshape), Graphics (ggplot2)
Specialized “Domain”: Bayesian, DifferentialEquations, Econometrics, Environmetrics, ExperimentalDesign, Finance, Genetics, HighPerformanceComputing, MachineLearning, MedicalImaging, NaturalLanguageProcessing, Pharmacokinetics, Phylogenetics, Psychometrics, SocialSciences, Spatial, TimeSeries

Visualization and Reporting: Easy to Use, Interactive, Standard Visualizations, Steep Learning Curve

The R Graphics Package
library(help="graphics")
Basic Chart Types
Graphing Parameters: Titles, X-Axis Title, Y-Axis Title, Legend, Scales, Color, Gridlines
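The graphing parameters listed on this slide map onto arguments and helper functions in base graphics. A minimal sketch, assuming the built-in `mtcars` dataset as example data and a temporary PDF as the output device (both are choices made for this illustration, not taken from the slides):

```r
# Draw to a temporary PDF so the sketch also runs without a display.
out_file <- tempfile(fileext = ".pdf")
pdf(out_file)

four_cyl  <- mtcars[mtcars$cyl == 4, ]
eight_cyl <- mtcars[mtcars$cyl == 8, ]

plot(four_cyl$wt, four_cyl$mpg,
     main = "Fuel economy by weight",  # title
     xlab = "Weight (1000 lbs)",       # x-axis title
     ylab = "Miles per gallon",        # y-axis title
     xlim = range(mtcars$wt),          # scales
     ylim = range(mtcars$mpg),
     col = "blue", pch = 19)           # color
points(eight_cyl$wt, eight_cyl$mpg, col = "red", pch = 19)
grid()                                 # gridlines
legend("topright",                     # legend
       legend = c("4 cylinders", "8 cylinders"),
       col = c("blue", "red"), pch = 19)

# Close the device so the file is written.
invisible(dev.off())
```

In an interactive session the `pdf()` and `dev.off()` calls can be dropped and the plot appears in the default graphics window.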

ggplot2: Grammar of Graphics
In ggplot2 a plot is made up of layers.
Plot: Layer (Data, Mapping, Geom, Stat, Position), Scale, Coord, Facet
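The components named on this slide can be seen in a single plot specification. The following is a sketch that assumes the ggplot2 package is installed and uses the built-in `mtcars` dataset. The particular geoms, scale, and facet variable are illustrative choices, not taken from the slides.

```r
library(ggplot2)

p <- ggplot(mtcars, aes(x = wt, y = mpg)) +         # data and mapping
  geom_point(position = "jitter") +                 # layer 1: geom with a position adjustment
  stat_smooth(method = "lm", formula = y ~ x) +     # layer 2: a stat (linear fit)
  scale_x_continuous(name = "Weight (1000 lbs)") +  # scale
  coord_cartesian(ylim = c(10, 35)) +               # coord
  facet_wrap(~ cyl)                                 # facet: one panel per cylinder count

print(p)
```

Each `+` adds one component. The two layers share the data and mapping given in `ggplot()` unless a layer overrides them.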

Correlations Matrix
library(car)
scatterplotMatrix(h)

The scatterplotMatrix() function in the car package is built on top of the base pairs() function
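The slides do not show what `h` contains, so the sketch below substitutes a few columns of the built-in `mtcars` dataset (the name `h_demo` is hypothetical). It computes the correlation matrix with `cor()` and draws the plain scatterplot matrix with base `pairs()`, which needs no extra packages.

```r
# Stand-in for the slides' `h`, whose contents are not shown.
h_demo <- mtcars[, c("mpg", "wt", "hp", "disp")]

# Pairwise Pearson correlations, rounded for display.
corr <- round(cor(h_demo), 2)
print(corr)

# Base scatterplot matrix, drawn to a temporary PDF so no display is needed.
out_file <- tempfile(fileext = ".pdf")
pdf(out_file)
pairs(h_demo, main = "Scatterplot matrix of a stand-in dataset")
invisible(dev.off())

# With the car package installed, car::scatterplotMatrix(h_demo)
# draws an enhanced version of the same display.
```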

The next data visual was produced with about 150 lines of R code

Additional Resources
good documentation and sample code
helpful for trouble-shooting code
helpful for hearing about new things