Uncertainty How “certain” of the data are we? How much “error” does it contain? How well does the model match reality? Goal: –Understand and document uncertainties.

Slides:



Advertisements
Similar presentations
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Advertisements

Model generalization Test error Bias, variance and complexity
Best Model Dylan Loudon. Linear Regression Results Erin Alvey.
Robert Plant != Richard Plant. Sample Data Response, covariates Predictors Remotely sensed Build Model Uncertainty Maps Covariates Direct or Remotely.
Spatial distribution models From the truth to the whole truth? Senait D. Senay & Sue P. Worner.
Maxent interface.
Spatial Analysis Longley et al., Ch 14,15. Transformations Buffering (Point, Line, Area) Point-in-polygon Polygon Overlay Spatial Interpolation –Theissen.
Geographic data: sources and considerations. Geographical Concepts: Geographic coordinate system: defines locations on the earth using an angular unit.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Laura Palamara, John Manderson, Josh Kohut, Matthew Oliver, John Goff, Steven Gray Developing Ecological Indicators for Fisheries Management using IOOS.
1 NOAA’s National Climatic Data Center April 2005 Climate Observation Program Blended SST Analysis Changes and Implications for the Buoy Network 1.Plans.
Integrating Global Species Distributions, Remote Sensing and Climate Station Data to Assess Biodiversity Response to Climate Change Adam Wilson & Walter.
Managing Uncertainty Geo580, Jim Graham. Topic: Uncertainty Why it’s important: –How to keep from being “wrong” Definitions: –Gross errors, accuracy (bias),
Chapter 13: Inference in Regression
Weed mapping tools and practical approaches – a review Prague February 2014 Weed mapping tools and practical approaches – a review Prague February 2014.
Ensemble-variational sea ice data assimilation Anna Shlyaeva, Mark Buehner, Alain Caya, Data Assimilation and Satellite Meteorology Research Jean-Francois.
Oceanography 569 Oceanographic Data Analysis Laboratory Kathie Kelly Applied Physics Laboratory 515 Ben Hall IR Bldg class web site: faculty.washington.edu/kellyapl/classes/ocean569_.
Dr. Richard Young Optronic Laboratories, Inc..  Uncertainty budgets are a growing requirement of measurements.  Multiple measurements are generally.
Spatial Interpolation of monthly precipitation by Kriging method
Satellite Cross comparisonMorisette 1 Satellite LAI Cross Comparison Jeff Morisette, Jeff Privette – MODLAND Validation Eric Vermote – MODIS Surface Reflectance.
NR 422 Quality Control Jim Graham Spring Staircase of Knowledge Increasing Subjectivity Human value added Observation And Measurement Data Information.
Jensen, et. al Winter distribution of blue crab Callinectes sapidus in Chesapeake Bay: application and cross- validation of a two- stage generalized.
9th International Symposium on Wild Boar and others Suids, Hannover 2012 Factors influencing wild boar presence in agricultural landscape: a habitat suitability.
Spatial Fisheries Values in the Gulf of Alaska Matthew Berman Institute of Social and Economic Research University of Alaska Anchorage Ed Gregr Ryan Coatta.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Ryan DiGaudio Modified from Catherine Jarnevich, Sunil Kumar, Paul Evangelista, Jeff Morisette, Tom Stohlgren Maxent Overview.
NR 422- Habitat Suitability Models Jim Graham Spring 2009.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Geographic Information Science
Montane Frogs in Rainforest 2013, Marcio et al., Understanding the mechanisms underlying the distribution of microendemic montane frogs (Brachycephalus.
Candidate KBA Identification: Modeling Techniques for Field Survey Prioritization Species Distribution Modeling: approximation of species ecological niche.
Why are there more kinds of species here compared to there? Theoretical FocusConservation Focus – Latitudinal Gradients – Energy Theory – Climate Attributes.
GEOSTATISICAL ANALYSIS Course: Special Topics in Remote Sensing & GIS Mirza Muhammad Waqar Contact: EXT:2257.
Why Model? Make predictions or forecasts where we don’t have data.
Museum and Institute of Zoology PAS Warsaw Magdalena Żytomska Berlin, 6th September 2007.
Role of Spatial Database in Biodiversity Conservation Planning Sham Davande, GIS Expert Arid Communities Technologies, Bhuj 11 September, 2015.
Uncertainty How “certain” of the data are we? How much “error” does it contain? Also known as: –Quality Assurance / Quality Control –QAQC.
Application of in situ Observations to Current Satellite-Derived Sea Surface Temperature Products Gary A. Wick NOAA Earth System Research Laboratory With.
2nd GODAE Observing System Evaluation Workshop - June Ocean state estimates from the observations Contributions and complementarities of Argo,
Spatial Interpolation Chapter 13. Introduction Land surface in Chapter 13 Land surface in Chapter 13 Also a non-existing surface, but visualized as a.
Definitions (Jim’s) Transformations: General term for anything that takes an input and provides an output (e.g. “transforms” data) Processing: Converting.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
1. Analysis and Reanalysis Products Adrian M Tompkins, ICTP picture from Nasa.
Remote-sensing and biodiversity in a changing climate Catherine Graham SUNY-Stony Brook Robert Hijmans, UC-Berkeley Lianrong Zhai, SUNY-Stony Brook Sassan.
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
Ryan DiGaudio Modified from Catherine Jarnevich, Sunil Kumar, Paul Evangelista, Jeff Morisette, Tom Stohlgren Maxent Overview.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
U.S. Department of the Interior U.S. Geological Survey Automatic Generation of Parameter Inputs and Visualization of Model Outputs for AGNPS using GIS.
Uncertainty “God does not play dice” –Einstein “the end of certainty” –Prigogine, 1977 Nobel Prize What remains is: –Quantifiable probability with uncertainty.
Infrared and Microwave Remote Sensing of Sea Surface Temperature Gary A. Wick NOAA Environmental Technology Laboratory January 14, 2004.
November 28, 2006 Derivation and Evaluation of Multi- Sensor SST Error Characteristics Gary Wick 1 and Sandra Castro 2 1 NOAA Earth System Research Laboratory.
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
A comparison of AMSR-E and AATSR SST time-series A preliminary investigation into the effects of using cloud-cleared SST data as opposed to all-sky SST.
Use of Maxent for predictive habitat mapping of CWC in the Bari canyon Bargain Annaëlle Foglini Federica, Bonaldo Davide, Pairaud Ivane & Fabri Marie-Claire.
Sandra Castro and Gary Wick.  Does direct regression of satellite infrared brightness temperatures to observed in situ skin temperatures result in.
Who will you trust? Field technicians? Software programmers?
Why Model? Make predictions or forecasts where we don’t have data.
1. Analysis and Reanalysis Products
Robert Plant != Richard Plant
Uncertainty How “certain” of the data are we?
Predicting species distributions for New England invasives
How Good is a Model? How much information does AIC give us?
MODELING THE CURRENT AND FUTURE DISTRIBUTIONS OF
Validation of Satellite-derived Lake Surface Temperatures
Bias and Variance of the Estimator
Managing Uncertainty Geo580, Jim Graham.
The SST CCI: Scientific Approaches
Uncertainty “God does not play dice”
More on Maxent Env. Variable importance:
Presentation transcript:

Uncertainty How “certain” of the data are we? How much “error” does it contain? How well does the model match reality? Goal: –Understand and document uncertainties from data collection to publication

LifeMapper: Tamarix chinensis LifeMapper.org

LifeMapper: Loggerhead Turtles LifeMapper.org

Sample Data Predictor Layers Modeling Software Spatial Precision Spatial Accuracy Sample Bias Identification Errors Date problems Gross Errors Gridding Over fitting? Assumptions? Response Curves Model Performance Measures Number of Parameters AIC, AICc, BIC, AUC Match expectations? Over-fit? What is the best model? Habitat Map Realistic? Uncertainty maps? How to determine? Settings Road Map of Uncertainty Accurate measures? Noise Correlation Interpolation Error Spatial Errors Measurement Errors Temporal Uncertainty

Occurrences of Polar Bears From The Global Biodiversity Information Facility ( 2011)

January 1 st Dates If you put just a “year”, like 2011, into a relational database, the database will return: –Midnight, January 1 st, of that year In other words: –2011 becomes: – :00:00.00

Predictor Layers Remotely sensed: –DEMs, Visual, IR, NIR, SST, NPP, Sea Height Integrated from multiple data sets: –Bathymetry Interpolated: –Temp, precipitation, wind –DO, Sub-surface temp, salinity, bottom type Processed from other layers: –Slope, aspect, distance to shore From other models: –Climate

BioClim/WorldClim Data

Red Snapper An important recreational and commercial species $ million/year outdooralabama.com lickyourownbowl.wordpress.com

SEAMAP Trawls (>47,000 records) Red Snapper Occurrences (>6,000 records)

Density of Platforms and Pipelines Created from Bureau of Ocean Energy Management (BOEM) Point Data Set Uncertainty? Bathymetry Resampled from 90m NOAA Coastal Inundation Dataset And others Uncertainty < 9km Net Primary Production (NPP) Milligrams of Carbon per Meter Squared per Day OSU Ocean Productivity Uncertainty? Sea Surface Temperature (SST) NOAA AVHRR Pathfinder Satellite Spatial: 9km Measured: <0.4 K

Sample Data Spatial Precision –Standard Deviation of about 2 km Spatial Accuracy: –< 1km Sampling Bias: –Some areas more heavily sampled Identification Errors: –Unknown but red snapper are pretty easy to identify Date problems: –Not a temporal model Gridding: –Not at 9 kilometers

Jiggling The Samples Randomly shifting the position of the points based on a given standard deviation based on sample uncertainty Running the model repeatedly to see the potential effect of the uncertainty

Predictor Resolution Should model at the lowest resolution of the predictor layers or lower Lower resolution: –Can reduce problems from sample data uncertainty –In same cases can combine sample data into measures such as CPUE, density, abundance –Reduces detail of final model For these tests: 9km

Histograms of Predictor Layers

Predictor Layer Uncertainty Noise: Minimal Correlation: Eliminate NPP Spatial Errors: Unknown, < 9km? Measurement errors: Minimal to Unknown Spearman’s Coefficient NPP to SST Correlation: 0.99

Modeling Software: Maxent Popular, relatively easy to use Only requires presence points Known for over-fitting Performs “piece-wise” regression Assumes: –Predictors are error free –Independence of errors in response –Lack of correlation in predictors –Constant variance –Random data collection over sample area

Experiments Different regularization multipliers –Select regularization for other tests Try different numbers of samples –Determine the number of samples to use “Jiggle” the sample data spatially –Introduce different amounts of spatial error

Regularization=0.1

Regularization=1.2

Regularization=10

Regularization=0.1Regularization=1.2Regularization=10

PercentNumber of SamplesNumber of ParametersAICAUC 10% % , % , % , % , % ,

10% Sample Points 100% Sample Points Poor Habitat Optimal Habitat

No Jiggling Std Dev=4.4km Std Dev=55km Jiggling

Spearman’s Correlation 0 vs. 4.4km, Correlation =.9970 vs. 55km, Correlation =.919

Uncertainty Maps Standard Deviation of Jiggling Points by 4.4km

Conclusions We can improve HSMs over the default settings in Maxent We can determine uncertainty in some cases Next Steps –Support uncertainty for each data point –Improve uncertainty analysis and visualization for predictor layers –Begin adding uncertainty analysis and maps to products

Communicating Uncertainy Colleen Sullivan, 2012

References Anderson, Model Based Inference in the Life Sciences: A Primer on Evidence AVHRR Technical Specifications: df df Chil’s JP, Delfiner P, Geostatistics: Modeling Spatial Uncertainty Franklin J, Mapping Species Distributions: Spatial Inference and Prediction. Cambridge University Press, Cambridge Hunsaker CT et. al., Spatial Uncertainty in Ecology: Implications for Remote Sensing and GIS Applications Maxent: Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of species geographic distributions. Ecol Model 190 (3-4): Warren DL, Seifert SN (2011) Ecological niche modeling in Maxent: the importance of model complexity and the performance of model selection criteria. Ecological Applications 21 (2): Wenzhong S, Principles of Modeling Uncertainties in Spatial Data and Spatial Analyses

Other Issues… Selecting background (absence points) Clamping: –Prevents prediction outside the predictors used to create the model Thresholds

Additional Slides

Communicating Uncertainy Colleen Sullivan, 2012

Staircase of Knowledge Increasing Subjectivity Human value added Observation And Measurement Data Information Knowledge Understanding Wisdom Organization Interpretation Verification Selection Testing Comprehension Integration Judgment Environmental Monitoring and Characterization, Aritola, Pepper, and Brusseau