Advanced Spatial Methods in R Michael Mann George Washington University Department of Geography mmann1123@gwu.edu http://michaelmann.i234.me/
Overview Review Basics Setting up Space-time data Space-time plots library(RasterVis) library(plotKML) Vector & Other Data Visualization library(ggplot2) Mapping ggplot2 visualizations library(ggmap)
Before we begin Best places for help http://stackoverflow.com/ Question & Answer form. Quick & high quality responses ?function_name Look up help files for a function from any library http://gis.stackexchange.com/ Similar to stackoverflow, but targeted to spatial community The fridge – grab a beer and spend some time
Interpreting My Slides! Inputs into command line Outputs Notes Data is in the MMSpatialData folder s = c("aa", "bb", "cc", "dd", "ee") s [1] "aa” "bb” "cc” "dd” "ee" An important note
Your location Quick survey Please raise you hand if (before today) you have never used R or a similar language.
Let’s even the playing field Beginners Please look around you. Move if there is a beginner cluster! Experts Put on your teaching hats! Remember how difficult this material is. Make sure to help your teammates!
Overview Indexing a vector Indexing a data.frame Helpful Functions s = c("aa", "bb", "cc", "dd", "ee") s = data.frame(col1=c(1,2,3), col2=c(5,6,7)) s[1] s[c(2,5)] col1 col2 1 1 5 2 2 6 3 3 7 [1] ‘aa’ [1] ‘bb’ ‘ee’ s[2,1] s[,’col2’] s[c(2:5)] [1] 2 [1] 5 6 7 [1] ‘bb’ ‘cc’ ‘dd’ ‘ee’ vector_name[ position_# ] vector_name[ row#, col# ]
Objective 1 – Raster space-time plots library(raster) Raster Stacks & Bricks Multidimensional raster objects Multi-layer (red,green,blue) Multi-dim (time series, multi variable) Indexing Your Data stack[row# ,col#, Z#] Y Z X
Objective 1 – Raster space-time plots Multi-layer Raster ‘Brick’ b <- brick(system.file("external/rlogo.grd", package="raster) plot(b) Brick[X,Y,Z]
Objective 1 – Raster space-time plots Multi-layer Raster Brick plotRGB(b, r=1, g=2, b=3)
Objective 1 – Raster space-time plots Multi-dimentional Raster Stack Good for time-series of rasters, or multivariate analysis Time Series Multivariate stack[x,y, c(cwd2000-2010) ] stack[x,y, c(crime,pop) ]
Objective 1 – Raster space-time plots Multi-dimentional Raster Stack Good for time-series of rasters, or multivariate analysis Time Series Multivariate stack[x,y, c(cwd2000-2010) ] stack[x,y, c(crime,pop) ] Use: Data Visualization Use: Regression Analysis, modeling
Task 1.1 – Setup Raster Stack Data Helpful Functions dir() grep() dir(‘C://SESYNC//data’) grep(‘a’, c( ‘a’, ‘b’, ‘c’, ‘a’ ) ) [1] 1 4 grep(‘c’, c( ‘tab’, ‘car’, ‘bat’ ) ) [1] 2 [1] ‘data.zip’ ‘ggmap vinette.pdf’….
Task 1.1 – Setup Raster Stack Data Helpful Functions – Using grep to index a vector Find the location of a element with ‘c’ in it Query vector s with the grep s = c("aa", "bb", "cc", "dd", "ee") grep(‘c’, s ) [1] 3 s[ grep(‘c’, s ) ] [1] ‘cc’
Task 1.1 – Setup Raster Stack Data Helpful Functions – Using grep to index a vector Find the location of a element with ‘c’ in it Query vector s with the grep s = c("aa", "bb", "cc", "dd", "ee") grep(‘c’, s ) [1] 3 TRY THIS! s[ grep(‘c’, s ) ] [1] ‘cc’
Task 1.1 – Let’s get started!
Task 1.2 – Create Raster Stacks and Assign Name Labels Helpful Functions paste() names() paste(‘Hi',c('Bill','Bob', 'Sam'), sep=' ') names(rstack) = c(‘test2001’,’test2002’) [1] “Hi Bill" “Hi Bob" “Hi Sam" paste(’aet',c(’2001',’2002’), sep=’_') [1] "aet_2001" "aet_2002" test2002 test2001
Task 1.2 – Create Raster Stacks and Assign Name Labels Helpful Functions paste() names() paste(‘Hi',c('Bill','Bob', 'Sam'), sep=' ') names(rstack) = c(‘test2001’,’test2002’) [1] “Hi Bill" “Hi Bob" “Hi Sam" TRY PASTE()! paste(’aet',c(’2001',’2002’), sep=’_') [1] "aet_2001" "aet_2002" test2002 test2001
Task 1.2 – Create Raster Stacks and Assign Time Labels Helpful Functions seq() & as.Date() setz() seq(1,15, by=3) raster_stack = setZ(raster_stack, years) [1] 1 4 7 10 13 Years = seq(as.Date(’2001-01-01'), as.Date('2010-01-01'), by=’1 year') [1] "2000-01-01" "2001-01-01" … [10] "2009-01-01" "2010-01-01"
Task 1.2 – Create Raster Stacks and Assign Time Labels Helpful Functions seq() & as.Date() setz() seq(1,15, by=3) raster_stack = setZ(raster_stack, years) [1] 1 4 7 10 13 Important: setZ must be passed a series of ‘Dates’ (created with as.Date function) Years = seq(as.Date(’2001-01-01'), as.Date('2010-01-01'), by=’1 year') [1] "2000-01-01" "2001-01-01" … [10] "2009-01-01" "2010-01-01"
Task 1.2 – Let’s get started!
Task 1.3 – Visualize Stack Data Indexing Your Data Method 1 plot(raster_stack[[1]]) # plot first raster plot(raster_stack[[2]]) # plot second raster NOTE: [[ ]] is used b/c stack is a list object Method 2 plot( raster(raster_stack,'HDen_1989') ) NOTE: raster() is used b/c… well that is just how it works
Task 1.3 – Let’s get started!
Task 1.4 – Challenge Questions
Task 1.5 Visualize Space-Time Data rasterVis & plotKML packages Excellent tutorial available at http://rastervis.r-forge.r-project.org/ Data Format Raster stack with z-dim set to dates using: as.Date() Spatial points or polygons data.frame with z-dim set to dates Hovmoller Plot Horizon Plot
Task 1.5 Visualize Space-Time Data rasterVis & plotKML packages Data Format 2 Raster stack OR Spatial points or polygons data.frame with z-dim set to slope, or direction Vectorplot Stream Plot
Task 1.5 Visualize Space-Time Data rasterVis & plotKML packages Excellent tutorial: http://plotkml.r-forge.r-project.org/ Data Format Exports many formats including raster stacks Note: This code outputs both a Housing.kml file and a series of other image files (.png files). In order for this to work, the Housing.kml file needs to be in the same directory as all the image files.
Task 1.5 Visualize Space-Time Data rasterVis & plotKML packages Excellent tutorial: http://plotkml.r-forge.r-project.org/ Data Format Exports many formats including raster stacks All data must be unprojected Lat Lon "+proj=longlat +datum=WGS84” Note: This code outputs both a Housing.kml file and a series of other image files (.png files). In order for this to work, the Housing.kml file needs to be in the same directory as all the image files.
Task 1.5 – Let’s get started!
Task 1.6 – Let’s get started!
Objective 2: Intro to ggplot2 One of the best data visualization tools in R Documentation available here: http://ggplot2.org/
Objective 2: Intro to ggplot2 A plot is made up of multiple layers. A layer consists of data, a set of mappings between variables and aesthetics, a geometric object and a statistical transformation Scales control the details of the mapping. All components are independent and reusable.
Objective 2: Intro to ggplot2 aesthetics geometric object scales control ‘+’ add mappings Typical Command a = ggplot(movies, aes(y = budget, x = year, group = round_any(year, 10) )) + geom_boxplot() + scale_y_log10() plot(a)
Task 2.1 Setting up your data ggplot2 uses data.frames!
Task 2.1 Setting up your data Helpful Functions aggregate() Summarizes data of interest by factors (categorical data) aggregate( rating ~ year ,data= movies, FUN='mean')
Task 2.1 Setting up your data Helpful Functions fortify() Converts spatial polygons, lines, points to data.frame usable in ggplot2 jepson.points = fortify(jepson, region="id")
Task 2.1 Let’s get going!
Objective 2: Intro to ggplot2 plot reminder aesthetics geometric object scales control ‘+’ add mappings Typical Command a = ggplot(jepson.df) + aes(long,lat,group=group,fill=ECOREGION) + geom_polygon() plot(a)
Task 2.2 Let’s get going!
Task 2.3 Let’s get going!
Task 2.4 Challenge questions
Objective 3: Intro to ggmap Ggmap enables visualization of layered graphics using implementation similar to ggplot2 Combines the functionality of ggplot2 and spatial information of static maps from Google Maps, OpenStreetMap, Stamen Maps or CloudMade
Objective 3: Intro to ggmap Ggmap enables visualization of layered graphics using implementation similar to ggplot2 qmap() Downloads static maps from google, or OSM Defined by a central Lat and Lon and a ‘Zoom’ level Takes additional ‘+’ commands to overlay ggplot2 graphics Zoom = 5 Zoom = 14
Objective 3: Intro to ggmap plot aesthetics geometric object scales control coordinate system Done in background through qmap ‘+’ add mappings Typical Command HoustonMap <- qmap("houston", zoom = 13, color = "bw") HoustonMap + geom_point(data=violent_crimes,aes(x = lon, y = lat, colour = offense ) ) plot( HoustonMap )
Task 3.1 Setting up your data Helpful Functions projectRaster() & project() Converts spatial -polygons, -lines, -points to data.frame usable in ggplot2 & ggmap. Ggmap data must be in unprojected lat lon (defined below) # Project a raster to unprojected Lat Lon using nearest neighbor algorithm stack_proj = projectRaster(raster_stack, crs="+proj=longlat +datum=WGS84” , method='ngb') # Project a polygon to unprojected Lat Lon using nearest neighbor algorithm jepson_proj = project(jepson, crs="+proj=longlat +datum=WGS84”)
Task 3.1 Setting up your data Helpful Functions fortify() Converts spatial polygons, lines, points to data.frame usable in ggplot2 jepson.points = fortify(jepson, region="id")
Task 3.1 Setting up your data Helpful Functions geocode() revgeocode() Converts text addresses or location names to Lat Lon coordindates Converts Lat Lon to text addresses geocode("the white house") revgeocode(c(-77.03650, 38.89768), output = c("address"))
Task 3.1 Setting up your data Helpful Functions subset() data = data.frame(name=c(‘mike’,’john’,’jim’), age=c(4,3,6)) name age mike 4 john 3 jim 6 name age 2 john 3 3 jim 6 subset(data, name != ‘mike’ ) name age 1 mike 4 3 jim 6 subset(data, age > 3 )
Task 3.1: Learn Basic ggmap Functions
Task 3.2: Crime Mapping In Houston TX
Task 3.3: Challenge Questions
What should we review?