R tutorial http://people.musc.edu/~elg26/teaching/methods2.2009/R-intro.pdf.

Slides:



Advertisements
Similar presentations
Rrrrrr…… now we know what it is but how the heck do you use it? Andrew Trant PPS Arctic - Labrador Highlands Research Group.
Advertisements

Introduction to MATLAB for Biomedical Engineering BME 1008 Introduction to Biomedical Engineering FIU, Spring 2015 Lesson 2: Element-wise vs. matrix operations.
Jack Davis Andrew Henrey FROM N00B TO PRO. PURPOSE Create a simulator from scratch that: Generates data from a variety of distributions Makes a response.
MATLAB – What is it? Computing environment / programming language Tool for manipulating matrices Many applications, you just need to get some numbers in.
R tutorial g/methods2.2010/R-intro.pdf.
Basics of Using R Xiao He 1. AGENDA 1.What is R? 2.Basic operations 3.Different types of data objects 4.Importing data 5.Basic data manipulation 2.
Introduction to GTECH 201 Session 13. What is R? Statistics package A GNU project based on the S language Statistical environment Graphics package Programming.
Introduction to R A. Di Bucchianico. Introduction to R2 Types of statistical software command-line software –requires knowledge of syntax of commands.
MATLAB TUTORIAL Dmitry Drutskoy Some material borrowed from the departmental MATLAB info session by Philippe Rigollet Kevin Wayne.
LISA Short Course Series R Basics
LISA Short Course Series Basics of R Lin Zhang Feb. 16, 2015 LISA: Basics of RFeb. 16, 2015.
Introduction à R Olivier Mestre Ecole Nationale de la Météorologie.
Basic R Programming for Life Science Undergraduate Students Introductory Workshop (Session 1) 1.
MATLAB Lecture One Monday 4 July Matlab Melvyn Sim Department of Decision Sciences NUS Business School
LISA Short Course Series R Basics Ana Maria Ortega Villa Fall 2013 LISA: R BasicsFall 2013.
R-Graphics Day 2 Stephen Opiyo. Basic Graphs One of the main reasons data analysts turn to R is for its strong graphic capabilities. R generates publication-ready.
732A44 Programming in R.  Self-studies of the course book  2 Lectures (1 in the beginning, 1 in the end)  Labs (computer). Compulsory submission of.
Hands-on Introduction to R. Outline R : A powerful Platform for Statistical Analysis Why bother learning R ? Data, data, data, I cannot make bricks without.
Introduction to SAS BIO 226 – Spring Outline Windows and common rules Getting the data –The PRINT and CONTENT Procedures Manipulating the data.
REVIEW 2 Exam History of Computers 1. CPU stands for _______________________. a. Counter productive units b. Central processing unit c. Copper.
Introduction to R Part 2. Working Directory The working directory is where you are currently saving data in R. What is the current working directory?
Arko Barman with modification by C.F. Eick COSC 4335 Data Mining Spring 2015.
Introduction to R Lecture 3: Data Manipulation Andrew Jaffe 9/27/10.
Data Objects in R Vector1 dimensionAll elements have the same data types Data types: numeric, character logic, factor Matrix2 dimensions Array2 or more.
Piotr Wolski Introduction to R. Topics What is R? Sample session How to install R? Minimum you have to know to work in R Data objects in R and how to.
Matlab Basics Tutorial. Vectors Let's start off by creating something simple, like a vector. Enter each element of the vector (separated by a space) between.
An Introduction to R graphics Cody Chiuzan Division of Biostatistics and Epidemiology Computing for Research I, 2012.
I❤RI❤R Kin Wong (Sam) Game Plan Intro R Import SPSS file Descriptive Statistics Inferential Statistics GraphsQ&A.
R: Packages & Data. Presented here are a number of ways to accomplish a task, some are redundant or may not represent the best way to accomplish a task.
Installing R CRAN: –(R homepage: –Windows 95 and later  Base –rw2001.exe.
R Programming Yang, Yufei. Normal distribution.
Introduction to Programming in R Department of Statistical Sciences and Operations Research Computation Seminar Series Speaker: Edward Boone
Distributions, Iteration, Simulation Why R will rock your world (if it hasn’t already)
STAT 251 Lab 1. Outline Lab Accounts Introduction to R.
Introduction to R Carol Bult The Jackson Laboratory Functional Genomics (BMB550) Spring 2011.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
STAT 534: Statistical Computing Hari Narayanan
R tutorial Stat 140 Linjuan Qian
Basics in R part 2. Variable types in R Common variable types: Numeric - numeric value: 3, 5.9, Logical - logical value: TRUE or FALSE (1 or 0)
Computing for Research I Spring 2013 Primary Instructor: Elizabeth Garrett-Mayer Introduction to R March 5.
Descriptive Statistics using R. Summary Commands An essential starting point with any set of data is to get an overview of what you are dealing with You.
Introduction to R and Data Science Tools in the Microsoft Stack Jamey Johnston.
MIS2502: Data Analytics Introduction to Advanced Analytics and R.
Review > x[-c(1,4,6)] > Y[1:3,2:8] > island.data fishData$weight[1] > fishData[fishData$weight < 20 & fishData$condition.
16BIT IITR Data Collection Module If you have not already done so, download and install R from download.
Statistical Programming Using the R Language Lecture 2 Basic Concepts II Darren J. Fitzpatrick, Ph.D April 2016.
Working with data in R 2 Fish 552: Lecture 3. Recommended Reading An Introduction to R (R Development Core Team) –
Introduction to R and Data Science Tools in the Microsoft Stack Jamey Johnston.
Linear Algebra Review.
Programming in R Intro, data and programming structures
Sihua Peng, PhD Shanghai Ocean University
Arko Barman COSC 6335 Data Mining Fall 2014
Statistical Programming Using the R Language
Research methodology R Statistics – Introduction
MATLAB DENC 2533 ECADD LAB 9.
Matlab Workshop 9/22/2018.
Numerical Descriptives in R
Introduction to Exploratory Descriptive Data Analysis in S-Plus II
MATH 493 Introduction to MATLAB
Use of Mathematics using Technology (Maltlab)
Recoding II: Numerical & Graphical Descriptives
Sihua Peng, PhD Shanghai Ocean University
Code is on the Website Outline Comparison of Excel and R
MIS2502: Data Analytics Introduction to Advanced Analytics and R
Introduction to Exploratory Descriptive Data Analysis in S-Plus
MIS2502: Data Analytics Introduction to R and RStudio
Factors (or Grouping Variables) Checking Model Assumptions
Data analysis with R and the tidyverse
Matlab Basics Tutorial
Research methodology R Statistics – Introduction
Presentation transcript:

R tutorial http://people.musc.edu/~elg26/teaching/methods2.2009/R-intro.pdf

R interface Unix, linux, windows batching file: File -> open script run commands: Ctrl-R Save session: sink([filename])….sink() Quit session: q()

General Syntax result <- function(object(s), options…) Object-oriented programming Note that ‘result’ is an object

First things first: help([function]) help.search(“linear model”) help.start()

Choosing your default setwd(“[pathname for directory]”) need “\\” instead of “\” when giving paths .Rdata .Rhistory

Start with data read.table read.csv scan dget

Extracting variables from data Use $: data$AGE note it is case-sensitive! attach([data]) and detach([data])

Descriptive statistics summary mean, median var quantile range, max, min

Missing values sometimes cause ‘error’ message na.rm=T na.option=na.omit

Objects data.frame, as.data.frame, is.data.frame names([data]) row.names([data]) matrix, as.matrix, is.matrix dimnames([data]) factor, as.factor, is.factor levels([factor]) arrays lists functions vectors scalars

Creating and manipulating combine: c cbind: combine as columns rbind: combine as rows list: make a list rep(x,n): repeat x n times seq(a,b,i): create a sequence between a and b in increments of i seq(a,b, length=k): create a sequence between a and b with length k with equally spaced increments

ifelse ifelse(condition, true, false) cut(x, breaks) agelt50 <- ifelse(data$AGE<50,1,0) note for equality must use “==“ cut(x, breaks) agegrp <- cut(data$AGE, breaks=c(0,50,60,130)) agegrp <- cut(data$AGE, breaks=c(0,50,60,130), labels=c(0,1,2)) labels=F)

Looking at objects dim length sort

Subsetting Use [ ] Vectors Matrices & Dataframes data$AGE[data$REGION==1] data$AGE[data$LOS<10] Matrices & Dataframes data[data$AGE<50, ] data[ , 2:5] data[data$AGE<50, 2:5]

Some math abs(x) sqrt(x) x^k log(x) (natural log, by default) choose(n,k)

Matrix Manipulation Matrix multiplication: A%*%B transpose: t(X) diag(X)

Table table(x,y) tabulate(x)

Statistical Tests and CI’s t.test fisher.test and binom.exact wilcox.test

Plots hist boxplot plot pch xlab, ylab xlim, ylim xaxt, yaxt axis

Plot Layout par(mfrow=c(2,1)) par(mfrow=c(1,1)) par(mfcol=c(2,2)) help(par)

Probability Distributions Normal: rnorm(N,m,s): generate random normal data dnorm(x,m,s): density at x for normal with mean m, std dev s qnorm(p,m,s): quantile associated with cumulative probability of p for normal with mean m, std dev s pnorm(q,m,s): cumulative probability at quantile q for normal with mean m, std dev s Binomial rbinom etc.

Libraries Additional packages that can be loaded Example: epitools library library(help=[libname])

Keeping things tidy ls() and objects() rm() rm(list=ls())

Future Topics linear regression sourcing R code creating functions organizing R files