R for Research Data Analysis using R Day1: Basic R Baburao Kamble University of Nebraska-Lincoln.



Agenda
R Introduction
Installation
Data Types
Data Import/Export
Data Preprocessing
–Data Exploration
–Handling missing data
Data Visualization
–Basic Plots and annotation
–Advanced Plotting (ggplot2 and lattice)
Workshop Material:

What is R
R is a free programming environment for statistical computing and graphics.
R is
–a programming language
–an environment for data manipulation, calculation, and graphical display
–a language that includes conditionals, loops, user-defined recursive functions, and input and output facilities
–similar to the award-winning S system, which was developed at Bell Laboratories
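The language features named on this slide (conditionals, loops, user-defined recursive functions, and output) can be sketched in a few lines. The function names `sign_label` and `factorial_rec` are hypothetical and chosen only for illustration; they are not part of the workshop scripts.

```r
# Conditional: label a number by its sign
sign_label <- function(x) {
  if (x > 0) {
    "positive"
  } else if (x < 0) {
    "negative"
  } else {
    "zero"
  }
}

# Loop: collect the squares of 1 to 5
squares <- numeric(0)
for (i in 1:5) {
  squares <- c(squares, i^2)
}

# User-defined recursive function: factorial of n
factorial_rec <- function(n) {
  if (n <= 1) {
    return(1)
  }
  n * factorial_rec(n - 1)
}

# Output facilities: print results to the console
cat("Label for -3:", sign_label(-3), "\n")
cat("Squares:", squares, "\n")
cat("5! =", factorial_rec(5), "\n")
```

In everyday R code the loop would usually be replaced by the vectorized expression `(1:5)^2`; the explicit loop is kept here only to show the construct.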

Why R?
Open source and open development.
Design and deployment of portable, extensible, and scalable software.
Interoperability with other languages: C, XML.
Variety of statistical and numerical methods.
High-quality visualization and graphics tools.
Effective, extensible user interface.
Innovative tools for producing documentation and training materials: vignettes.
Supports the creation, testing, and distribution of software and data modules: packages.
A grad student’s 2 cents
Why biologists should use R

Installing, Running, and Interacting with R
How to get R:
–Google: “R”
–Windows, Linux, Mac OS X, source

The R Project for Statistical Computing (R-CRAN)

RStudio

Data Analysis
Data Basic
Import Data
Prepare, explore, and clean data
Statistical Analysis and Modeling
Export Data (Graph/Chart/Tables)
The workflow is split across Day 1 and Day 2.

Data Structure and Data Types
Data types: Numeric, Integer, Complex, Logical, Character
Data structures: Vectors, Matrices, Arrays, Data Frames, Lists, Factors
DataTypes.R

Data Types
A vector is a sequence of data elements of the same basic type.
A matrix is a rectangular array of data arranged in rows and columns.
Arrays are similar to matrices but can have more than two dimensions.
A data frame is more general than a matrix, in that different columns can have different modes of data (numeric, character, factor, etc.).
A list is an ordered collection of objects (components).
A factor is a vector whose elements can take on one of a specific set of values.
DataTypes.R
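The definitions above can be illustrated with one small object of each kind. This is a minimal sketch and not the contents of DataTypes.R; all variable names and values are made up for illustration.

```r
# Basic types
num  <- 3.14          # numeric (double)
int  <- 42L           # integer
cplx <- 1 + 2i        # complex
lgl  <- TRUE          # logical
chr  <- "Nebraska"    # character

# Vector: elements share one basic type
v <- c(2.5, 3.1, 4.8)

# Matrix: rows and columns, filled column by column
m <- matrix(1:6, nrow = 2, ncol = 3)

# Array: like a matrix, but with more than two dimensions
a <- array(1:24, dim = c(2, 3, 4))

# Data frame: columns may hold different modes of data
df <- data.frame(
  State     = c("NE", "IA", "KS"),
  Yield     = c(180.2, 175.5, 140.1),   # hypothetical values
  Irrigated = c(TRUE, FALSE, TRUE),
  stringsAsFactors = FALSE
)

# List: ordered collection of arbitrary objects (components)
lst <- list(name = "site1", values = v, table = df)

# Factor: values restricted to a fixed set of levels
f <- factor(c("low", "high", "medium", "low"),
            levels = c("low", "medium", "high"))

# Inspect what was built
class(cplx)
dim(m)
dim(a)
str(df)
names(lst)
levels(f)
```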

Data Import
–Read (write) a variety of data
Text: *.csv, *.tsv, *.txt
Excel: *.xls, *.xlsx
Images: *.tiff
Shapefile: *.shp
DataIO.R
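For the text formats, base R is enough. The sketch below writes a small, made-up data frame to temporary .csv and .tsv files and reads it back; it is not the contents of DataIO.R. Excel files, images, and shapefiles need add-on packages (for example readxl or sf), which are not shown here because they may not be installed.

```r
# Hypothetical data used only for this round trip
df <- data.frame(
  Site    = c("A", "B", "C"),
  Rain_mm = c(12.5, 0, 30.2),
  stringsAsFactors = FALSE
)

# Comma-separated values: write, then read back
csv_path <- tempfile(fileext = ".csv")
write.csv(df, csv_path, row.names = FALSE)
df_csv <- read.csv(csv_path, stringsAsFactors = FALSE)

# Tab-separated values: write.table with sep = "\t", read with read.delim
tsv_path <- tempfile(fileext = ".tsv")
write.table(df, tsv_path, sep = "\t", row.names = FALSE, quote = FALSE)
df_tsv <- read.delim(tsv_path, stringsAsFactors = FALSE)

# Both copies should match the original
print(df_csv)
print(df_tsv)
```

Setting `row.names = FALSE` keeps R from writing an extra column of row numbers, which would otherwise come back as a spurious first column on import.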

Data Preprocessing
Subset data from a vector or data frame
Merge two vectors or data frames
Aggregation of data (mean, mode, etc.)
Reshape the data frame
Sorting data in ascending and descending order
DataPreprocessing.R
DataProcessing_Example.R
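Each operation on this slide has a base R counterpart. The sketch below is one possible illustration using made-up data frames named `yield` and `region`; it is not taken from DataPreprocessing.R or DataProcessing_Example.R.

```r
# Hypothetical example data
yield <- data.frame(
  State = c("NE", "NE", "IA", "IA", "KS"),
  Year  = c(2010, 2011, 2010, 2011, 2010),
  Yield = c(166, 160, 165, 172, 125),
  stringsAsFactors = FALSE
)
region <- data.frame(
  State  = c("NE", "IA", "KS"),
  Region = c("Plains", "Midwest", "Plains"),
  stringsAsFactors = FALSE
)

# Subset a vector and a data frame
v     <- c(5, 12, 8, 20)
v_big <- v[v > 10]
ne    <- subset(yield, State == "NE")

# Merge: concatenate two vectors, join two data frames on a key
v_all  <- c(v, c(1, 2))
merged <- merge(yield, region, by = "State")

# Aggregate: mean yield per state
mean_yield <- aggregate(Yield ~ State, data = yield, FUN = mean)

# Reshape from long to wide: one row per state, one Yield column per year
wide <- reshape(yield, idvar = "State", timevar = "Year", direction = "wide")

# Sort in ascending and descending order of yield
asc  <- yield[order(yield$Yield), ]
desc <- yield[order(yield$Yield, decreasing = TRUE), ]

print(merged)
print(mean_yield)
print(wide)
```

Kansas has no 2011 record in this made-up data, so its 2011 cell in `wide` is `NA`; that is how reshaping exposes missing combinations.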

Data Exploration
Check the dimensionality of the object
–dim(x)
Variable names or column names
–names(DataFrameA)
Structure and Attributes
–str(DataFrameA)
Get the first and last rows (head and tail; six rows by default)
–head(DataFrameA); tail(DataFrameA)
Summary
–summary(DataFrameA)
Frequency
–table(DataFrameA$State)
Pair plot
–pairs(DataFrameA)
DataExploration.R
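The exploration calls listed above can be tried on a small stand-in for `DataFrameA`. The columns `Temp_C` and `Rain_mm` and all values are hypothetical; only the `State` column is implied by the slide. Note that R is case-sensitive, so the functions are `names()` and `table()`, in lower case.

```r
# Hypothetical stand-in for DataFrameA
DataFrameA <- data.frame(
  State   = c("NE", "NE", "IA", "KS", "IA", "NE", "KS", "IA"),
  Temp_C  = c(21.5, 23.0, 19.8, 25.1, 20.4, 22.2, 26.0, 18.9),
  Rain_mm = c(10.2, 0.0, 15.5, 2.1, 12.0, 5.4, 0.0, 20.3),
  stringsAsFactors = FALSE
)

dim(DataFrameA)          # number of rows and columns
names(DataFrameA)        # column names
str(DataFrameA)          # structure: type and first values of each column

head(DataFrameA)         # first rows (six by default)
tail(DataFrameA, n = 5)  # last five rows, requested explicitly

summary(DataFrameA)      # per-column summary statistics

# Frequency of each state
state_counts <- table(DataFrameA$State)
print(state_counts)

# Pair plot of the numeric columns; drawn only in an interactive session
if (interactive()) {
  pairs(DataFrameA[, c("Temp_C", "Rain_mm")])
}
```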

Data Visualization: Basic Plot
Creating a Graph
Customization of graphs (fonts, colors, axes, titles) through graphic options.
Saving Graphs
DataVisulization1.R
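The three steps on this slide (create, customize, save) can be sketched with base graphics. The data, colors, and output file name below are arbitrary choices for illustration and are not taken from DataVisulization1.R; the file is written to a temporary directory as a PDF.

```r
# Made-up data
x <- 1:10
y <- x^2

# Saving: open a PDF device, draw, then close the device
out_file <- file.path(tempdir(), "basic_plot.pdf")
pdf(out_file, width = 6, height = 4)

# Creating and customizing the graph
plot(x, y,
     type = "b",                 # points joined by lines
     col  = "blue", pch = 19,    # color and point symbol
     main = "Basic plot",        # title
     xlab = "x", ylab = "x squared",
     font.main = 2,              # bold title font
     axes = FALSE)               # draw the axes by hand below
axis(side = 1, at = x, cex.axis = 0.9)
axis(side = 2, las = 1, cex.axis = 0.9)   # horizontal tick labels
box()

dev.off()
cat("Plot saved to", out_file, "\n")
```

Other devices such as `png()` or `tiff()` follow the same open, draw, `dev.off()` pattern.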

Data Visualization: Advanced Plotting
Time series plots and plots with two Y axes
Publication-quality graphics
There is more: the ggplot2 and lattice packages
DataVisulization2.R
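One common way to draw a time series with two Y axes in base R is to overlay a second plot with `par(new = TRUE)` and add a right-hand axis with `axis(side = 4)`. The sketch below assumes made-up monthly temperature and rainfall values and is not the contents of DataVisulization2.R.

```r
# Hypothetical monthly series
months  <- seq(as.Date("2012-01-01"), by = "month", length.out = 12)
temp_c  <- c(-3, 0, 6, 12, 18, 23, 26, 25, 19, 12, 4, -2)
rain_mm <- c(15, 18, 45, 70, 105, 100, 85, 80, 65, 50, 30, 20)

out_file <- file.path(tempdir(), "two_axis_plot.pdf")
pdf(out_file, width = 7, height = 4)

# Widen the right margin to leave room for the second axis label
par(mar = c(5, 4, 4, 4) + 0.1)

# First series on the left axis
plot(months, temp_c, type = "l", col = "red", lwd = 2,
     xlab = "Month", ylab = "Temperature (C)",
     main = "Temperature and rainfall")

# Second series drawn on top, with its own Y scale
par(new = TRUE)
plot(months, rain_mm, type = "h", col = "blue",
     axes = FALSE, xlab = "", ylab = "")
axis(side = 4)
mtext("Rainfall (mm)", side = 4, line = 3)

legend("topleft", legend = c("Temperature", "Rainfall"),
       col = c("red", "blue"), lty = 1, bty = "n")

dev.off()
```

Because the two scales are independent, the visual relationship between the curves depends on the axis ranges chosen, so such plots should be read with care.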

Statistical Analysis
Advanced visualization (ggplot2, lattice, spatial)
Descriptive Statistics
Regression Analysis
Time Series Data Analysis
–Seasonal
–ARIMA Modeling
Forecasting/Prediction
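As a preview of these topics, the sketch below uses only the stats package and two datasets that ship with R (`cars` and `AirPassengers`). The choice of datasets and of the seasonal ARIMA orders is an assumption made for illustration; the slides do not say which data or models the workshop uses.

```r
# Descriptive statistics
summary(cars$dist)
dist_sd <- sd(cars$dist)

# Regression analysis: stopping distance as a linear function of speed
fit <- lm(dist ~ speed, data = cars)
print(coef(fit))

# Time series: monthly airline passengers, log scale to stabilize variance
ap <- log(AirPassengers)

# Seasonal decomposition into trend, seasonal, and random components
parts <- decompose(ap)

# Seasonal ARIMA; the (0,1,1)(0,1,1)[12] orders are an illustrative choice
arima_fit <- arima(ap, order = c(0, 1, 1),
                   seasonal = list(order = c(0, 1, 1), period = 12))

# Forecast the next 12 months and convert back from the log scale
fc <- predict(arima_fit, n.ahead = 12)
forecast_passengers <- exp(fc$pred)
print(forecast_passengers)
```

In practice the model orders would be chosen by inspecting autocorrelation plots or comparing information criteria rather than fixed in advance.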
