Using the Model to Evaluate Observation Locations and Parameter Information in the Context of Predictions.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Design of Experiments Lecture I
Mean, Proportion, CLT Bootstrap
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
+ Gladstone Bioinformatics Core Kirsten E. Eilertson + Statistics in Science: Best Practices.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Sensitivity Analysis In deterministic analysis, single fixed values (typically, mean values) of representative samples or strength parameters or slope.
Sampling Distributions
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Sampling Distributions (§ )
Computational statistics 2009 Sensitivity analysis (SA) - outline 1.Objectives and examples 2.Local and global SA 3.Importance measures involving derivatives.
This presentation can be downloaded at Water Cycle Projections over Decades to Centuries at River Basin to Regional Scales:
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 4: Modeling Decision Processes Decision Support Systems in the.
Uncertainty analysis and Model Validation.
The Calibration Process
Lecture 2 Data Processing, Errors, Propagation of Uncertainty.
Section 7.4 Table summarizes hypothesis tests about one variance (standard deviation). (These tests assume that the sample is taken from a normal.
Introduction: Psychological Assessment. Terms & Definitions Psychological Assessment The gathering and integrating of psychological data for psychological.
Chapter 9: Introduction to the t statistic
Correlation & Regression
Quantify prediction uncertainty (Book, p ) Prediction standard deviations (Book, p. 180): A measure of prediction uncertainty Calculated by translating.
Uses of Modeling A model is designed to represent reality in such a way that the modeler can do one of several things: –Quickly estimate certain aspects.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
Bayesian and Geostatistical Approaches to Inverse Problems Peter K. Kitanidis Civil and Environmental Engineering Stanford University.
IV. Sensitivity Analysis for Initial Model 1. Sensitivities and how are they calculated 2. Fit-independent sensitivity-analysis statistics 3. Scaled sensitivities.
BsysE595 Lecture Basic modeling approaches for engineering systems – Summary and Review Shulin Chen January 10, 2013.
1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence.
Probabilistic Mechanism Analysis. Outline Uncertainty in mechanisms Why consider uncertainty Basics of uncertainty Probabilistic mechanism analysis Examples.
Identify Parameters Important to Predictions using PPR & Identify Existing Observation Locations Important to Predictions using OPR.
III. Ground-Water Management Problem Used for the Exercises.
Using Ground-Water Model Predictions and the ppr and opr Statistics to Guide Data Collection.
(Zheng and Bennett) Steps in Transport Modeling Calibration step (calibrate flow model & transport model) Adjust parameter values Traditional approach.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
V. Nonlinear Regression Objective-Function Surfaces Thus far, we have: Parameterized the forward model Obtained head and flow observations and their weights.
Geocenter, Copenhagen September 2007 Instructors: Mary C. Hill (US Geological Survey, USA) Matthew Tonkin (SS Papadopulos and Associates, USA) Calibration,
VIII: Methods for Evaluating Model Predictions 1. Define predictive quantity and calculate sensitivities and standard deviations (Ex8.1a) 2. Assess data.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Statistics and the Verification Validation & Testing of Adaptive Systems Roman D. Fresnedo M&CT, Phantom Works The Boeing Company.
1 A Bayesian statistical method for particle identification in shower counters IX International Workshop on Advanced Computing and Analysis Techniques.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
StatisticSpelled outCalculated for each.. LeverageObs DSSDimensionless SSObs and parameter CSSComposite SSParameter PCCParam. Cor. Coef.Parameter pair.
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
IX. Transient Model Nonlinear Regression and Statistical Analysis.
1 Commissioned by PAMSA and German Technical Co-Operation National Certificate in Paper & Pulp Manufacturing NQF Level 4 Apply knowledge of statistics.
Steady-State and Transient Models of Groundwater Flow and Advective Transport, Eastern Snake River Plain Aquifer, INL and Vicinity, Idaho Jason C. Fisher,
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
1 Groundwater Modeling. 2 Introduction 3 Lecture Outline What Is A Model? Modeling Axioms Guiding Thoughts and Protocol Governing Equations Practical.
IX. Transient Forward Modeling. Ground-Water Management Issues Recall the ground-water management issues for the simple flow system considered in the.
Goal of Stochastic Hydrology Develop analytical tools to systematically deal with uncertainty and spatial variability in hydrologic systems Examples of.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Computacion Inteligente Least-Square Methods for System Identification.
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
Probability & Statistics Review I 1. Normal Distribution 2. Sampling Distribution 3. Inference - Confidence Interval.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
Regression Analysis Module 3.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
B&A ; and REGRESSION - ANCOVA B&A ; and
Calibration.
Transport Modeling in Groundwater
Computer Modeling Vadose Zone Models Groundwater Flow Model
Sampling Distributions (§ )
Transport Modeling in Groundwater
Presentation transcript:

Using the Model to Evaluate Observation Locations and Parameter Information in the Context of Predictions

Data Worth and Predictions Predictions: Most models are developed with the intention of making predictions about future system behavior Predictions can also be posed as some unknown occurrence at some location in the model domain that can be simulated by the model – such as flow in an unmonitored stream reach Approach : The best possible model is developed and calibrated under the assumption that a model that reproduces past system response and system properties makes reliable predictions But does the best fitting model always produces the best predictions? Evaluations (questions we should ask!): How much uncertainty accompanies simulated predictions? Which observations and parameters most influence the value of this predictive uncertainty?

Predictions – Death Valley Model: Hydrogeology described by many parameters Large number of head and flow observations to calibrate the model Prediction: Groundwater flow directions and velocities in the Yucca Flats area

Predictions – Death Valley Prediction Advection is often the dominant aspect of groundwater transport Advection can be simulated using particle tracking or path lines This is available within MODFLOW using the Advection (ADV) package MODFLOW can also provide sensitivities pertaining to the pathway

Predictions – Advection Prediction Specifically – MODFLOW calculates the travel path in three directions: X - East-West Y - North-South Z - Up-Down Using calculations described later, the variance of these predictions can be easily determined

Parameters – Predictions (PPR) 1. Parameter Information Which parameters are most important to predictions What information might be cost- effectively collected to further reduce predictive uncertainty?

Observations – Predictions (OPR) 2. Observation Locations Existing observations Existing observations: Which existing observation locations are most influential? New observations: New observations: Where could new measurements be made to reduce predictive uncertainty?

Data Worth and Predictions Evaluations: How much uncertainty accompanies simulated predictions? Evaluate deterministically – trial and error sensitivity. Evaluate using multiple models and Monte-Carlo methods Evaluate using statistics – variance, standard deviation Which parameters and observations are most influential in the calculation of this predictive uncertainty? Statistical methods enable rapid evaluation of the contribution of (a) parameters and / or (b) observations to the uncertainty (standard deviation) of one or more predictions First order second moment (FOSM) methods First order – linear sensitivities Second moment – variances and standard deviations

Predictions – Uncertainty Standard Deviation Measure of spread of values for a variable Has a stochastic basis Involves assumptions Regardless – a means for comparing relative predictive uncertainty Normal distribution

Data Worth and Predictions Approach: Define the prediction: How can we simulate the predicted quantity? How can we summarize the ‘value’ of the prediction numerically? Gather information obtained through calibration: Observation sensitivities and weights Prior information on parameters Calculate prediction sensitivities: Using methods similar to when calculating observation sensitivities Include all parameters if possible calibrated parameter values measured parameter values

Data Worth and Predictions Approach: Calculate the current prediction variance Calculate a hypothetical prediction variance assuming changes to (a) information about parameters or (b) available observations The Parameter-PRediction (PPR) Statistic: Evaluate worth of potential knowledge about parameters, posed in the form of prior information - add this to calculations The Observation-PRediction (OPR) Statistic: Evaluate existing observation locations by omitting them from the calculations Evaluate potential new observation locations by adding them to the calculations Note: PPR was called Value of Improved Information (VOII) in one journal article [1- (s znew / s z )] x 100

Data Worth and Predictions

Approach: Recently encapsulated in a program: OPR-PPR Program In review Class Exercise will use the OPR and PPR methods with a synthetic model that is described both in the OPR- PPR documentation, and in Hill and Tiedeman (2007)

Relevant References Tiedeman, C.R., Hill, M.C., D’Agnese, F.A., and Faunt, C.C., 2003, Methods for using groundwater model predictions to guide hydrogeologic data collection, with application to the Death Valley regional ground-water flow system: Water Resources Research, 39(1): 5-1 to 5-17, /2001WR (The PPR statistic is the same as what is called the VOII statistic in this paper) Tiedeman, C.R., D.M. Ely, M.C. Hill, and G.M. O'Brien, 2004, A method for evaluating the importance of system state observations to model predictions, with application to the Death Valley regional groundwater flow system, Water Resources Res., 40, W12411, doi: /2004WR Tonkin, M.J., Tiedeman C.R., Ely, D.M., and Hill M.C., (in press), Documentation of OPR-PPR, a Computer Program for Assessing Data Importance to Model Predictions Using Linear Statistics, Constructed using the JUPITER API, JUPITER: Joint Universal Parameter IdenTification and Evaluation of Reliability API: Application Programming Interface, U.S. Geological Survey Techniques and Methods Report TM6-E2.