ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON 2012-05-01 Roundup Benoit Parmentier.

Slides:



Advertisements
Similar presentations
Aims of a Variance Components Analysis Estimate the amount of variation between groups (level 2 variance) relative to within groups (level 1 variance)
Advertisements

Residuals.
Scaling Biomass Measurements for Examining MODIS Derived Vegetation Products Matthew C. Reeves and Maosheng Zhao Numerical Terradynamic Simulation Group.
ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier.
Multiple Linear Regression
An Assessment of CMAQ with TEOM Measurements over the Eastern US Michael Ku, Chris Hogrefe, Kevin Civerolo, and Gopal Sistla PM Model Performance Workshop,
Princeton University Global Evaluation of a MODIS based Evapotranspiration Product Eric Wood Hongbo Su Matthew McCabe.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Applied Geostatistics
Class 6: Tuesday, Sep. 28 Section 2.4. Checking the assumptions of the simple linear regression model: –Residual plots –Normal quantile plots Outliers.
Measure of Center A measure of center is a value at the center or middle of the data set Surprising, huh?
Statistics Intro Univariate Analysis Central Tendency Dispersion.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Statistics Intro Univariate Analysis Central Tendency Dispersion.
Regression Diagnostics - I
Deterministic Solutions Geostatistical Solutions
Why Geography is important.
Model Checking in the Proportional Hazard model
Integrating Global Species Distributions, Remote Sensing and Climate Station Data to Assess Biodiversity Response to Climate Change Adam Wilson & Walter.
Comparison of spatial interpolation techniques for agroclimatic zoning of Sardinia (Italy) Cossu A., Fiori M., Canu S. Agrometeorological Service of Sardinia.
Using My NASA Data to Explore Earth Systems Lynne H. Hehr John G. Hehr University of Arkansas Department of Geosciences And Center for Math and Science.
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
10/03/2005NOV-3300-SL M. Weiss, F. Baret D. Allard, S. Garrigues From local measurements to high spatial resolution VALERI maps.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Basic statistics 11/09/13.
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
ENVIRONMENTAL LAYERS IPLANT MEETING WEBEX Roundup 3 Benoit Parmentier.
Spatial Statistics in Ecology: Continuous Data Lecture Three.
Statistics Class 7 2/11/2013. It’s all relative. Create a box and whisker diagram for the following data(hint: you need to find the 5 number summary):
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier.
The climate and climate variability of the wind power resource in the Great Lakes region of the United States Sharon Zhong 1 *, Xiuping Li 1, Xindi Bian.
1 Upscaling of Site-Specific Nitrate-Nitrogen Measurements to the Watershed Scale in the Santa Fe River Watershed S. Lamsal, S. Grunwald, C. M. Bliss,
6.8Compare Statistics from Samples Example 1 Compare statistics from different samples Production The data set below give the number of computers assembled.
Section 6.8 Compare Statistics from Samples. Vocabulary Quartile: The median of an ordered data set Upper Quartile: The median of the upper half of an.
Numerical Measures of Variability
1 Quadratic Model In order to account for curvature in the relationship between an explanatory and a response variable, one often adds the square of the.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Lesson Correlation and Regression Wisdom. Knowledge Objectives Recall the three limitations on the use of correlation and regression. Explain what.
1. The Study of Excess Nitrogen in the Neuse River Basin “A Landscape Level Analysis of Potential Excess Nitrogen in East-Central North Carolina, USA”
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Applied Quantitative Analysis and Practices LECTURE#31 By Dr. Osman Sadiq Paracha.
Applied Quantitative Analysis and Practices LECTURE#30 By Dr. Osman Sadiq Paracha.
SWBAT: Assess the normality of a distribution. Do Now: A random sample of golf scores gives the following summary statistics: n = 20, x = 84.5, S[x] =
Chapter 4 – Measurements of Location and Position Math 22 Introductory Statistics.
Statistics with TI-Nspire™ Technology Module E Lesson 1: Elementary concepts.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
Midterm Review IN CLASS. Chapter 1: The Art and Science of Data 1.Recognize individuals and variables in a statistical study. 2.Distinguish between categorical.
Chapter 12 Understanding Research Results: Description and Correlation
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Cautions about Correlation and Regression
Chapter 12: Regression Diagnostics
Happiness comes not from material wealth but less desire.
Unit 7: Statistics Key Terms
1. The Study of Excess Nitrogen in the Neuse River Basin
Raster-based spatial analyses
The greatest blessing in life is
Additional notes on random variables
Challenges of ascertaining seasonality in tropical countries, when to vaccinate? Good morning. K Vandemaele GIP/WHO.
Additional notes on random variables
Indicator Variables Response: Highway MPG
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Exceedence Diagnostics
Model Adequacy Checking
Presentation transcript:

ENVIRONMENTAL LAYERS MEETING IPLANT TUCSON Roundup Benoit Parmentier

What I have been working on: 1) GAM prediction for 365 dates and first round up of results -Assessing results across the year. 2) GAM prediction: model diagnostics and residuals -Contribution of variables -Outliers: searching for patterns. -Improving screening of unreliable observations. -Land cover and LST 3) Examining the effect of sampling on the results -Examining the RMSE for different training and testing samples -Examining the RMSE for the different hold out proportions. 4) Incorporating spatial information: Kriging and spatial filtering -GAM + Kriging -Spatial eigenvectors

1) ASSESSING RESULTS ACROSS THE YEAR: Running GAM over 365 dates

GAM MODELS USED FOR THE ANALYSIS mod1<- tmax~ s(lat) + s (lon) + s (ELEV_SRTM) mod2<- tmax~ s(lat,lon) +s(ELEV_SRTM) mod3<- tmax~ s(lat) + s (lon) + s (ELEV_SRTM) + s (Northness)+ s (Eastness) + s(DISTOC) mod4<- tmax~ s(lat) + s (lon) + s(ELEV_SRTM) + s(Northness) + s (Eastness) + s(DISTOC) + s(LST) mod5<- tmax~ s(lat,lon) +s(ELEV_SRTM) + s(Northness,Eastness) + s(DISTOC) + s(LST) mod6<- tmax~ s(lat,lon) +s(ELEV_SRTM) + s(Northness,Eastness) + s(DISTOC) + s(LST,LC1) mod7<- tmax~ s(lat,lon) +s(ELEV_SRTM) + s(Northness,Eastness) + s(DISTOC) + s(LST,LC3) mod8<- tmax~ s(lat,lon) +s(ELEV_SRTM) + s(Northness,Eastness) + s(DISTOC) + s(LST) + s(LC1) Using monthly LST mean…

FIRST SUMMARY ROUND UP mod3<- tmax~ s(lat) + s (lon) + s (ELEV_SRTM) + s (Northness)+ s (Eastness) + s(DISTOC) Mean and median RMSE based on the 10 selected dates.

RMSE DISTRIBUTION FOR YEAR 2010 mod2<- tmax~ s(lat,lon) +s(ELEV_SRTM)

Working on 365 dates… RMSE DISTRIBUTION FOR YEAR 2010 mod6<- tmax~ s(lat,lon) +s(ELEV_SRTM) + s(Northness,Eastness) + s(DISTOC) + s(LST,LC1)

ASSESSING RESULTS ACROSS THE YEAR: Running GAM over 365 dates  Mean RMSE is between 2.4C and 2.5C with model 2 performing the best but…: - The data suggest that models with LST might perform better when some winter dates are removed. - thus we must assess the RMSE per month/seasons and different hold out.

2) GAM prediction: model diagnostics and residuals -Contribution of variables -Outliers: searching for patterns. -Improving screening of unreliable observations. -Land cover and LST

HIGHEST RMSE FOR DATE RESIDUALS FOR MODEL 3 mod3<- tmax~ s(lat) + s (lon) + s (ELEV_SRTM) + s (Northness)+ s (Eastness) + s(DISTOC)

GHCN_S_

GHCN_V_

3) ASSESSING THE STABILITY OF THE RESULTS: INFLUENCE OF SAMPLING

The first results indicate that models with the inclusion of LST have lowest median RMSE. mod5<- tmax~ s(lat,lon) +s(ELEV_SRTM) + s(Northness,Eastness) + s(DISTOC) + s(LST) SUMMARY STATISTICS FOR DIFFERENT SAMPLING Median and Averages were calculated for 260 runs (26x10dates).

Continue working on: 1) GAM prediction for 365 dates -Assessing results across the year:  per month and seasons 2) GAM prediction: model diagnostics and residuals -Contribution of variables -Outliers: searching for patterns. -Improving screening of unreliable observations. -Land cover and LST 3) Examining the effect of sampling on the results -Examining the RMSE for different training and testing samples -Examining the RMSE for the different hold out proportions. -Examining for 4) Incorporating spatial information: Kriging and spatial filtering -GAM + Kriging -Spatial eigenvectors