Uncertainty “God does not play dice”

Slides:



Advertisements
Similar presentations
Student Quiz Grades Test Grades 1.Describe the association between Quiz Grades and Test Grades. 2.Write the.
Advertisements

Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
Sampling Distributions (§ )
Best Model Dylan Loudon. Linear Regression Results Erin Alvey.
Robert Plant != Richard Plant. Sample Data Response, covariates Predictors Remotely sensed Build Model Uncertainty Maps Covariates Direct or Remotely.
Approaches to Data Acquisition The LCA depends upon data acquisition Qualitative vs. Quantitative –While some quantitative analysis is appropriate, inappropriate.
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
3.3 Toward Statistical Inference. What is statistical inference? Statistical inference is using a fact about a sample to estimate the truth about the.
More Raster and Surface Analysis in Spatial Analyst
Spatial Analysis Longley et al., Ch 14,15. Transformations Buffering (Point, Line, Area) Point-in-polygon Polygon Overlay Spatial Interpolation –Theissen.
SAMPLING DISTRIBUTIONS. SAMPLING VARIABILITY
T T07-01 Sample Size Effect – Normal Distribution Purpose Allows the analyst to analyze the effect that sample size has on a sampling distribution.
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Standard error of estimate & Confidence interval.
Fundamentals of GIS Lecture Materials by Austin Troy except where noted © 2008 Lecture 14: More Raster and Surface Analysis in Spatial Analyst Using.
Review of normal distribution. Exercise Solution.
Accuracy Assessment. 2 Because it is not practical to test every pixel in the classification image, a representative sample of reference points in the.
© Harry Campbell & Richard Brown School of Economics The University of Queensland BENEFIT-COST ANALYSIS Financial and Economic Appraisal using Spreadsheets.
Populations, Samples, Standard errors, confidence intervals Dr. Omar Al Jadaan.
Probabilistic Mechanism Analysis. Outline Uncertainty in mechanisms Why consider uncertainty Basics of uncertainty Probabilistic mechanism analysis Examples.
Orthorectification using
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Interpolation Tools. Lesson 5 overview  Concepts  Sampling methods  Creating continuous surfaces  Interpolation  Density surfaces in GIS  Interpolators.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc.. Chap 7-1 Developing a Sampling Distribution Assume there is a population … Population size N=4.
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
Uncertainty How “certain” of the data are we? How much “error” does it contain? How well does the model match reality? Goal: –Understand and document uncertainties.
Experiences in assessing deposition model uncertainty and the consequences for policy application Rognvald I Smith Centre for Ecology and Hydrology, Edinburgh.
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
Uncertainty of runoff flow path on a small agricultural watershed Unit of Soil and Water System Departement of Environment Science and Technology Gembloux.
Simulated data sets Extracted from:. The data sets shared a common time period of 30 years and age range from 0 to 16 years. The data were provided to.
Machine Design Under Uncertainty. Outline Uncertainty in mechanical components Why consider uncertainty Basics of uncertainty Uncertainty analysis for.
BOT / GEOG / GEOL 4111 / Field data collection Visiting and characterizing representative sites Used for classification (training data), information.
Statistical Surfaces Any geographic entity that can be thought of as containing a Z value for each X,Y location –topographic elevation being the most obvious.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Statistics for Engineer. Statistics  Deals with  Collection  Presentation  Analysis and use of data to make decision  Solve problems and design.
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.
Lecture 5 Introduction to Sampling Distributions.
Optimization formulation Optimization methods help us find solutions to problems where we seek to find the best of something. This lecture is about how.
Uncertainty “God does not play dice” –Einstein “the end of certainty” –Prigogine, 1977 Nobel Prize What remains is: –Quantifiable probability with uncertainty.
Nonlinear regression Review of Linear Regression.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
ISRIC Spring School – Hand’s on Global Soil Information Facilities, 9-13 May 2016 Uncertainty quantification and propagation Gerard Heuvelink.
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Chapter 9 Day 2. Warm-up  If students picked numbers completely at random from the numbers 1 to 20, the proportion of times that the number 7 would be.
INTERPOLATION Procedure to predict values of attributes at unsampled points within the region sampled Why?Examples: -Can not measure all locations: - temperature.
Markov Chain Monte Carlo in R
Who will you trust? Field technicians? Software programmers?
Why Model? Make predictions or forecasts where we don’t have data.
Statistical Inference
Robert Plant != Richard Plant
Uncertainty How “certain” of the data are we?
How Good is a Model? How much information does AIC give us?
Sampling Distributions
Statistical surfaces: DEM’s
Monte Carlo Simulation Managing uncertainty in complex environments.
Professor S K Dubey,VSM Amity School of Business
Spatial Analysis Longley et al..
Direct or Remotely sensed
Predictive distributions
Sampling Distribution
Sampling Distribution
Lecture 2 – Monte Carlo method in finance
QQ Plot Quantile to Quantile Plot Quantile: QQ Plot:
Sampling Distribution of a Sample Proportion
Empirical Bayesian Kriging and EBK Regression Prediction – Robust Kriging as Geoprocessing Tools Eric Krause.
Sampling Distribution of a Sample Proportion
Sampling Distributions (§ )
Quantitative Methods Varsha Varde.
Presentation transcript:

Uncertainty “God does not play dice” Einstein “The more we know about our universe, the more difficult it becomes to believe in determinism.” Prigogine, 1977 Nobel Prize What remains is: Quantifiable probability with uncertainty

Situation All data includes some uncertainty The uncertainty is usually not documented Most modeling methods do not provide uncertainty outputs

Solution? Estimate the uncertainty in the measured values and the predictor variables Use “Monte Carlo” methods: Inject “noise” into the input data Create the model Repeat 1 and 2 over and over Find the distribution of the model outputs i.e. the parameters and statistical measures E.g. coefficients, R2, p values, etc.

Douglas-Fir sample data Create the Model Noise Model “Parameters” Precip Extract Prediction To Points Text File Attributes To Raster Noise

Estimating Uncertainty Field data Distribution of x,y values Measurements Predictor layers Interpolated Remotely Sensed

Down Sampling

Original raster at 10 meter resolution (DEM)

Down sampled to 90 meters (each pixel is the mean of the pixels it overlaps with)

Mean Raster Just for fun!

Standard deviation of the pixels that each pixel was derived from.

Additional Error What was the distribution of the contents of each pixel when it was sampled? What’s in a Pixel? Cracknell, 2010

Pixel Sampling Each pixel represents an area that is: Elliptical Larger than the pixels dimensions

Point-Spread Function AVHRR

Resampling

Approach? Estimate the standard deviation of the original scene that the pixels represent Use this estimate to create predictor rasters that we down sample for the modeling

Original raster at 10 meter resolution (DEM)

Original raster with noise injection Original raster with error Original raster with noise injection

Monte Carlo Error Injection Create the model with the “mean raster” Inject normally distributed random “error” into the predictors Recreate the model Repeat 2 & 3 saving results Create distribution of the parameters and performance measures (R2, AIC, AUC, etc.)

Interpolated Predictors Many predictors are interpolated from point-source data Kriging provides a standard deviation raster as one of it’s output (these are rarely available) By injecting error into the point data and recreating the interpolated surface, we can characterize the error in it. We can also use this to characterize the error’s impact on the model as above

"GSENM" by User:Axcordion - http://en. wikipedia. org/wiki/File:GSENM "GSENM" by User:Axcordion - http://en.wikipedia.org/wiki/File:GSENM.JPG. Licensed under CC BY-SA 3.0 via Wikimedia Commons - http://commons.wikimedia.org/wiki/File:GSENM.jpg#mediaviewer/File:GSENM.jpg

http://www.planetware.com/tourist-attractions/utah-usut.htm

Zion http://www.planetware.com/tourist-attractions/utah-usut.htm

Where was the data collected? On flat spots Near roads Often at airports! The data is not representative of our entire landscape

We’re Missing Data! Interpolated Raster Canyon Where Are the weather stations? Canyon

Approach If you created the interpolated surface: Regardless: Use Monte Carlo methods to repeatedly recreate the interpolated surface to see the effect of missing data Regardless: Estimate the variability that was missed Maybe from a DEM? Use this as an uncertainty raster?

Lab This Week Characterizing the uncertainty in Remotely Sensed data