Spatial Data Analysis: An Elective Course for Advanced Undergraduates

Slides:



Advertisements
Similar presentations
IB Portfolio Tasks 20% of final grade
Advertisements

Sampling and monitoring the environment Marian Scott Sept 2006.
Spatial modelling an introduction
Spatial point patterns and Geostatistics an introduction
Spatial point patterns and Geostatistics an introduction
A. John Bailer Statistics and Statistical Modeling in The First Two Years of College Math.
Expert elicitation of the variogram Phuong N. Truong Gerard B.M. Heuvelink John Paul Gosling Wageningen University (NL) Food and Environmental Research.
Deterministic Solutions Geostatistical Solutions
Copyright © 2005 Department of Computer Science CPSC 641 Winter Data Analysis and Presentation There are many “tricks of the trade” used in data.
Applied Geostatistics
1 Practicals, Methodology & Statistics II Laura McAvinue School of Psychology Trinity College Dublin.
Deterministic Solutions Geostatistical Solutions
Spatial Statistics Wei Ding, Spring Some Thoughts It is hard to find a general spatial statistics paper. In the following slides, I recommend a.
STAT 592A(UW) 526 (UBC-V) 890-4(SFU) Spatial Statistical Methods NRCSE.
Research Topics for Spatial temporal processes Time Series:Time Series: nonlinear state space M. West (1996) In Bayesian Statistics 5, Oxford University.
Statistical Tools for Environmental Problems NRCSE.
Spatial statistics 2 Stat 518 Sp 08. Ordinary kriging where and kriging variance.
1 PERF EVAL (CONT’D) H There are many other “tools of the trade” used in performance evaluation H Only a few will be mentioned here: –queueing theory –verification.
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Difference Two Groups 1. Content Experimental Research Methods: Prospective Randomization, Manipulation Control Research designs Validity Construct Internal.
Applied Geostatistics geog. buffalo. edu/~lbian/GEO497_597
Esri International User Conference | San Diego, CA Technical Workshops | Spatial Statistics: Best Practices Lauren Rosenshein, MS Lauren M. Scott, PhD.
Spatial Interpolation of monthly precipitation by Kriging method
Modeling Spatial Correlation (The Semivariogram) ©2007 Dr. B. C. Paul.
Gridding Daily Climate Variables for use in ENSEMBLES Malcolm Haylock, Climatic Research Unit Nynke Hofstra, Mark New, Phil Jones.
Explorations in Geostatistical Simulation Deven Barnett Spring 2010.
Geographic Information Science
Beramiesha Thompson Laura Salman Dianna Baritot MST Inquiry Unit Plan – Summer 2010 Professor O’Connor-Petruso.
Where are you joining us from
Spatial Interpolation Chapter 13. Introduction Land surface in Chapter 13 Land surface in Chapter 13 Also a non-existing surface, but visualized as a.
Robust GW summary statistics & robust GW regression are used to investigate a freshwater acidification data set. Results show that data relationships can.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
Introduction to the Teacher Work Sample Portfolio Presented by Frank H. Osborne, Ph. D. © 2015 EMSE 3123 Math and Science in Education 1.
Uncertainty assessment in European air quality mapping and exposure studies Bruce Rolstad Denby, Jan Horálek 2, Frank de Leeuw 3, Peter de Smet 3 1 Norwegian.
Statistics 349.3(02) Analysis of Time Series. Course Information 1.Instructor: W. H. Laverty 235 McLean Hall Tel:
Short course on space-time modeling Instructors: Peter Guttorp Johan Lindström Paul Sampson.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Exposure Assessment for Health Effect Studies: Insights from Air Pollution Epidemiology Lianne Sheppard University of Washington Special thanks to Sun-Young.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Spatial Point Processes Eric Feigelson Institut d’Astrophysique April 2014.
Writing Assignments in Mechanical Engineering Anne Parker University of Manitoba A. Parker, CASDW, UVic,
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Amy Wagaman Amherst College Mathematics and Statistics.
L5 Computing Team Project Final Presentation Guidelines.
Why Model? Make predictions or forecasts where we don’t have data.
Writing and Submitting Student Learning Objectives
Performance Eval Slides originally from Williamson at Calgary
General Education Assessment
Joan Donohue University of South Carolina
Computational Reasoning in High School Science and Math
Linear Mixed Models in JMP Pro
Mathematics for Modeling
Data Structures Algorithms: (Slides to be Adopted from Goodrich and aligned with Weiss' book) Instructor: Ganesh Ramakrishnan
Special Topics In Scientific Computing
Statistics and Research Desgin
CHAPTER 29: Multiple Regression*
Session 2:50 PM - 4:20 PM; Saturday, Nov 18, 2017
Predictive distributions
CPSC 531: System Modeling and Simulation
Data Presentation Carey Williamson Department of Computer Science
Stochastic Hydrology Random Field Simulation
The Annotated Bib.
Five-Year Undergraduate Enrollments
SCIENCE AND ENGINEERING PRACTICES
MA171 Introduction to Probability and Statistics
MA171 Introduction to Probability and Statistics
Carey Williamson Department of Computer Science University of Calgary
Curriculum Coordinator: Marela Fiacco Date : February 29, 2015
Your SLO.
STAT 515 Statistical Methods I Lecture 1 August 22, 2019 Originally prepared by Brian Habing Department of Statistics University of South Carolina.
Presentation transcript:

Spatial Data Analysis: An Elective Course for Advanced Undergraduates Laura Boehm Vock Gustavus Adolphus College August 1, 2016 lboehmvo@gustavus.edu

Background Liberal arts college with ~2300 students 15-20 math majors per year. 2016 is first year students can graduate with a statistics major. Course satisfies a “capstone” criteria for the major Focus on presentation skills and team projects 5 team projects; Individual final project

Alignment with ASA guidelines Real applications (about half “textbook” and half “real” data) Diverse models and approaches Focus on problem solving Skills: Data manipulation, Simulation, reproducibility, use of discipline-specific knowledge, visualizations, teamwork, communication

Using class time Recapping readings Filling in background stats knowledge Putting knowledge to work in R Meetings with instructor & project teams

Project 3: Ordinary Kriging Team 1: Lili, John, Russ --- Particulate Matter --- 24-hour mean concentration of fine particulate matter (PM2.5) on March 15, 2015. (Yes, 2015). Info about PM2.5 here: http://www3.epa.gov/pm/ Team 2: Lauren, Liam, Kaitlin --- Snow --- Snow totals from snowstorm Feb 2-3, 2016. Cumulative total (inches) as reported by 12:34pm Feb 3. Most were reported between 7-9 am on Feb 3. 1. Plot the observed locations on a map using the map function (from maps package) in R. 2. Plot the observed locations on a map using a Google maps backdrop using the RGooglemaps package. Hint: For 1 & 2 check out AdvancedMapping.Rmd 3. Use ordinary kriging to predict PM/Snow at a grid of spatial locations. Be sure to consider: Which variogram model to use? How to fit parameters in variogram model? (WLS, REML, choosing various starting values) 4. Create images that show the predicted values and the kriging variance. 5. Generate 4 random surfaces from your model and compare them (can use the krige or predict.gstat function, don’t have to do this by hand), and compare to your images from #4 and to the original data.

Teams will produce: 1. A 10-15 minute presentation of the results of both problem 1 and 2. 2. A written document including relevant visualizations, tables, etc, with proper captions and explanations (for both problem 1 and 2) 3. An R script or Rmarkdown document which includes all the code to produce their visualizations. Teams will be graded on: 1. Content of the written document, including quality of visualizations and explanations. (50%) 2. Quality of their presentation including oral communication and visual quality of slides. (30%) 3. Quality and clarity of R code (if I enter appropriate filepaths, can I run it? Are there sufficient comments or notes that I can understand what is going on?) (10%) 4. Contribution to the team (assessed by teammates, and my own observations, e.g. do each speak relatively equally in the presentation?) (10% - will vary per individual)

Course topics Intro/definitions of geostatistical, point process, and areal data. (Gaussian) spatial processes Multivariate normal distribution, defining mean, variance, and covariance through expectations. Stationarity, isotropy Variograms Ordinary Kriging Universal Kriging Extension projects: Indicator kriging, bootstrap interval in OK & UK, anisotropy What are point process data? How do we describe patterns (aggregating, regular, CSR) Checking for CSR: K, L, F, G Monte Carlo testing Intensity estimation Modeling intensity as a Poisson process with covariates Visualizations for marked processes

Primary Skills/Ideas What it means for observations to be correlated Visualization Multiple estimators exist for the same parameter (e.g. WLS and REML for variogram parameters) Variance estimation can be tricky Likelihood estimation Monte Carlo simulation/testing The same statistical idea can work in many areas of application Development of statistical methods is ongoing

Arsenic (As) in MN Groundwater ordinary kriging, universal kriging with geologic soil type predictor

Bee Populations in Finland Tests for spatial heterogeneity

Arsons in Minneapolis point process model with neighborhood income and 2nd order polynomial trend

Challenges Computing, and different levels of skills in computing. Finding an appropriate text Connection/balance of theory and application Balancing needs of our majors with interest from students with little math background Balance of team/individual work during semester

What worked well Team based work, team member evaluations Exciting spatial data sources for projects Lots of creativity in final projects Final poster presentation Sense of accomplishment for students

Books Diggle (2013) Statistical Analysis of Spatial and Spatio-Temporal Point Patterns [3rd ed] Banerjee, Carlin, Gelfand (2014) Hierarchical Modeling and Analysis for Spatial Data [2nd ed] Brunson, Comber (2015) An Introduction to R for Spatial Analysis and Mapping Cressie (1993) Statistics for Spatial Data Data sources gisdata.mn.gov : State datasets from all the agencies. Environmental, agricultural, climate, demographic… Some stuff is point-located, some is county level reports (especially health stuff). www.opentwincities.org/data/ - also includes links to a spreadsheet of all kinds of sources in MN opendata.Minneapolismn.gov/ , nycopendata.socrata.com/, …. EPA also has lots if you search around