ENVIRONMENTAL LAYERS IPLANT MEETING WEBEX 2012-03-20 Roundup 3 Benoit Parmentier.

Slides:



Advertisements
Similar presentations
Global Land Cover 2000 China Window. Data preparation climatic stratification of China VGT data’s preparation remove the cloud contamination synthesizing.
Advertisements

Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier.
Climate Predictability Tool (CPT)
Nidal Salim, Walter Wildi Institute F.-A. Forel, University of Geneva, Switzerland Impact of global climate change on water resources in the Israeli, Jordanian.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
FOUR METHODS OF ESTIMATING PM 2.5 ANNUAL AVERAGES Yan Liu and Amy Nail Department of Statistics North Carolina State University EPA Office of Air Quality,
Geographic data: sources and considerations. Geographical Concepts: Geographic coordinate system: defines locations on the earth using an angular unit.
Deterministic Solutions Geostatistical Solutions
Classification and Prediction: Regression Analysis
Integrating Global Species Distributions, Remote Sensing and Climate Station Data to Assess Biodiversity Response to Climate Change Adam Wilson & Walter.
Integrating global species distributions, remote sensing and climate data to model change in species distributions.
Briggs Henan University 2010
Spatial Regression Model For Build-Up Growth Geo-informatics, Mahasarakham University.
Comparison of spatial interpolation techniques for agroclimatic zoning of Sardinia (Italy) Cossu A., Fiori M., Canu S. Agrometeorological Service of Sardinia.
Introduction Land surface temperature (LST) measurement is important for understanding climate change, modeling the hydrological and biogeochemical cycles,
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
Day 7 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.
Chapter 12 Multiple Regression and Model Building.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
Morphological Interpretation of Seamounts in Tutuila, American Samoa: Inferring Probable Genesis Through Shape and Distribution Analysis Morphological.
Spatial Interpolation of monthly precipitation by Kriging method
Climate Predictability Tool (CPT) Ousmane Ndiaye and Simon J. Mason International Research Institute for Climate and Society The Earth.
Hydrologic Modeling: Verification, Validation, Calibration, and Sensitivity Analysis Fritz R. Fiedler, P.E., Ph.D.
Spatial Interpolation (Kriging). Objectives Understand the general procedures for spatial interpolation Explore the use of Kriging for spatial interpolation.
Gridding Daily Climate Variables for use in ENSEMBLES Malcolm Haylock, Climatic Research Unit Nynke Hofstra, Mark New, Phil Jones.
Guidance on Intensity Guidance Kieran Bhatia, David Nolan, Mark DeMaria, Andrea Schumacher IHC Presentation This project is supported by the.
Geographic Information Science
ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier.
Performance of Growth Models for Salmonella and Other Pathogens Thomas P. Oscar, Agricultural Research Service, USDA, Room 2111, Center for Food Science.
Quality control of daily data on example of Central European series of air temperature, relative humidity and precipitation P. Štěpánek (1), P. Zahradníček.
Potential impact of climate change on growth and wood quality in white spruce Christophe ANDALO 1,2, Jean BEAULIEU 1 & Jean BOUSQUET 2 1 Natural Resources.
Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.
Lecture 5 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.
Adjustment of Global Gridded Precipitation for Orographic Effects Jennifer Adam.
Spatial Interpolation Chapter 13. Introduction Land surface in Chapter 13 Land surface in Chapter 13 Also a non-existing surface, but visualized as a.
Spatial distribution of snow water equivalent across the central and southern Sierra Nevada Roger Bales, Robert Rice, Xiande Meng Sierra Nevada Research.
Climate Predictability Tool (CPT) Ousmane Ndiaye and Simon J. Mason International Research Institute for Climate and Society The Earth.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Evaluation and simulation of global terrestrial latent heat flux by merging CMIP5 climate models and surface eddy covariance observations Yunjun Yao 1,
Spatial interpolation of Daily temperatures using an advection scheme Kwang Soo Kim.
ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier.
ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier.
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
The observational dataset most RT’s are waiting for: the WP5.1 daily high-resolution gridded datasets HadGHCND – daily Tmax Caesar et al., 2001 GPCC -
Montserrat Fuentes Statistics Department NCSU Research directions in climate change SAMSI workshop, September 14, 2009.
U.S. Department of the Interior U.S. Geological Survey Evaluating the drought monitoring capabilities of rainfall estimates for Africa Chris Funk Pete.
The ENSEMBLES high- resolution gridded daily observed dataset Malcolm Haylock, Phil Jones, Climatic Research Unit, UK WP5.1 team: KNMI, MeteoSwiss, Oxford.
Linear regression models. Purposes: To describe the linear relationship between two continuous variables, the response variable (y- axis) and a single.
1. Session Goals 2 __________________________________________ FAMINE EARLY WARNING SYSTEMS NETWORK Become familiar with the available data sources for.
Corn Yield Comparison Between EPIC-View Simulated Yield And Observed Yield Monitor Data by Chad M. Boshart Oklahoma State University.
Titre. Geographic Information System GIS offer powerful tools for adding spatial perspectives to: –Planning –Research –Technology transfer –Impact assessment.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Mapping the Thermal Climate of the HJ Andrews Experimental Forest, Oregon Jonathan Smith Spatial Climate Analysis Service Oregon State University Corvallis,
Using FAO NewClim to Build Climatologies Primary attempt – May a. verdin 08/20/2010.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Approach in developing PnET-BGC model inputs for Smoky Mountains
Overview Modern chip designs have multiple IP components with different process, voltage, temperature sensitivities Optimizing mix to different customer.
Basic Estimation Techniques
VegDRI History, Current Status, and Related Activities
PCB 3043L - General Ecology Data Analysis.
Advanced Analytics Using Enterprise Miner
Stefano Grassi WindEurope Summit
Concepts and Applications of Kriging
Analytics: Its More than Just Modeling
Jensen, et. al Winter distribution of blue crab Callinectes sapidus in Chesapeake Bay: application and cross-validation of a two-stage generalized.
Longitudinal Trends in the Performance of Scientific Peer Reviewers
Concepts and Applications of Kriging
Facultad de Ingeniería, Centro de Cálculo
Geostatistical Simulations – Preparing for Worst-Case Scenarios
Presentation transcript:

ENVIRONMENTAL LAYERS IPLANT MEETING WEBEX Roundup 3 Benoit Parmentier

What I have been doing working on: 1)Using Geographically Weighted regression Reading on GWR Writing a code in R using the spgwr package Prediction: first assessment using RMSE fit and different hold out proportion. 2) Screening data and prediction Screening data Some GAM prediction 3) Producing LST mean Preparing the LST data variable (extraction, projection, clipping) Calculating mean LST per day and adding variable in the dataset Writing up a script in python (with IDRISI API but with GDAL in mind) 4) Examining interactions in GAM Plotting graph to find interaction terms Some GAM prediction

GAM SCREENING GAM_ANUSPLIN1: tmax~ s(lat) + s (lon) + s (ELEV_SRTM)) GAM_PRISM1: tmax~ s(lat) + s (lon) + s (ELEV_SRTM) + s (Northness)+ s (Eastness) + s(DISTOC)) GAM_PRISM2: tmax~ s(lat) + s (lon) + s (ELEV_SRTM) + s (Northness_w)+ s (Eastness_w) + s(DISTOC))

SCREENING THE DATA FOR UNUSUAL DATA VALUES range(ghcn_all$tmax) [1] What is the valid range of temperature in OR ?? range(ghcn_all$ELEV_SRTM) [1] range(ghcn_all$DISTOC) [1]

screened not screened datesns loss ns SCREENING THE DATA FOR UNUSUAL DATA VALUES Range of values: 0<tmax<400) ELEV_SRTM>0 ghcn_all : observations Ghcn_test: observations (tmax screened) Ghcn_test2: observations 365X172=62,780 stations maximum for the year There were observations with elevation greater than 0m i.e. 631 below zero meters.

RMSE FOR ALL THREE MODELS FOR THE 10 dates. RMSE without screening of data values.

RMSE FOR ALL THREE MODELS FOR THE 10 dates after screening

AVERAGE AND MEDIAN RMSE FOR ALL THREE MODELS FOR THE 10 dates. For the 10 dates, we note that the number of loss of stations is very small but the impact on the RMSE is important.

GEOGRAPHICALLY WEIGTHED REGRESSION

GWR predictions were produced using the sgwr package in R. The following specifications were used to run the models: Dependent variable: tmax Independent variables: lon, lat, ELEV_SRTM, Eastness, Northness, DISTOC Bandwidth: determined from the data by CV (one leave out approach). Weight function model: Gaussian proportion of hold out: 0 %, 30%, 50%, 70% validation: RMSE fit GEOGRAPHICALLY WEIGTHED REGRESSION

No Hold-out: Proportion: 0 INTERPOLATION WITH GEOGRAPHICALLY WEIGHTED REGRESSION For the last date: Code: gwr_Oregon_ c.R

No Hold-out: Proportion: 30% INTERPOLATION WITH GEOGRAPHICALLY WEIGHTED REGRESSION For the last date:

RMSE FIT FOR GWR FOR DIFFERENT % HOLD-OUT AND DATES Note that the data was screened…

It is somewhat surprising that the lowest RMSE is obtained for the largest hold out (of 70%). It may be necessary to redo the prediction with the same proportion but by changing the sample!

RMSE COMPARISON: GWR AND GAM MODELS FOR THE TEN DATES Note that the RMSE is a fit for GWR and validation for GAM!! When data are not screened the GWR model performs poorly (purple spike).

RMSE COMPARISON: GWR AND GAM MODELS FOR THE TEN DATES GWR models The median and average RMSE is greater for GWR!

1)Approach 1 First GWR is performed on the training dataset to produce coefficients at every training stations. Second a surface of parameters (slope coefficient) is obtained by interpolation (Kriging). Third, tmax values at testing samples are then obtained by applying the parameters at the testing locations. Fourth an RMSE is calculated for the testing dataset. 2) Approach 2 First, GWR is performed on the training dataset and the bandwidth is obtained. Second, the training bandwidth is then used when running GWR on the testing dataset. Third, coefficients produced at testing sites are used to predict tmax values for testing samples. Fourth an RMSE is calculated for the testing dataset. VALIDATION APPROACHES

Harris P., A.S. Fotheringham, R. Crespo, M. Charlton. (2010). The Use of Geographically Weighted Regression for Spatial Prediction: An Evaluation of Models Using Simulated Data Sets. Math Geosci:: 657–680 Llyod C.D. (2010). Nonstationary models for exploring and mapping monthly precipitation in the United Kingdom. INTERNATIONAL JOURNAL OF CLIMATOLOGY Int. J. Climatol. 30: 390–405. Wimberly1 M.C., M. J. Yabsley, A. D. Baer1, V. G. Dugan, and W. R. Davidson (2008). Spatial heterogeneity of climate and land-cover constraints on distributions of tick-borne pathogens land-cover constraints on distributions of tick-borne pathogens Global Ecology and Biogeography, (Global Ecol. Biogeogr.) 17, 189–202. VALIDATION REFERENCES

LAND SURFACE TEMPERATURE PROCESSING

1.Check input and missing files… 2.Extract from hdf (idrisi/gdal) 3.Mosaic (idrisi/gdal) 4.Project (idrisi/gdal) 5.GROUP files per - year -day -per month 6.Calculate average per day (IDRISI-GRASS/R-RASTER or GDAL) 7.Calculate average per month (IDRISI-GRASS/R-RASTER or GDAL) PYTHON SCRIPT  Missing dates ordered on NASA REVERB…

Average for day 244 over : the LST values need to be rescaled (multiplication factor is 0.02). An example of the average for day 244 (Sept 1)

Oregon_2008_366_MOD11A1_Reprojected_QC_Day.rst TAKING INTO ACCOUNT THE QUALITY FLAGS

Oregon_2008_366_MOD11A1_Reprojected_LST_Day_1km.rst TAKING INTO ACCOUNT THE QUALITY FLAGS