TOWARD TRANSPARENT, REFUTABLE HYDROLOGIC MODELS IN KANSAS OR OZ In class: Mary C. Hill U.S. Geological Survey, Boulder Olda Rakovec Wageningen University,

Slides:



Advertisements
Similar presentations
Martyn Clark Short course on “Model building, inference and hypothesis testing in hydrology” May, 2012.
Advertisements

Experimental Design, Response Surface Analysis, and Optimization
Analysis of variance (ANOVA)-the General Linear Model (GLM)
1 Annual Report by Work Group 2 on Uncertainty Analysis and Parameter Estimation Tom Nicholson, U.S. Nuclear Regulatory Commission Mary Hill, U.S. Geological.
What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.
EARS1160 – Numerical Methods notes by G. Houseman
Statistics for the Social Sciences
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
Interdisciplinary Modeling of Aquatic Ecosystems Curriculum Development Workshop July 18, 2005 Groundwater Flow and Transport Modeling Greg Pohll Division.
Statistics 800: Quantitative Business Analysis for Decision Making Measures of Locations and Variability.
Development of hyper-resolution large-ensemble continental-scale hydrologic model simulations AGU, San Francisco, CA 14 December 2014 Martyn Clark, Naoki.
The Calibration Process
Abstract In the case of the application of the Soil Moisture and Ocean Salinity (SMOS) mission to the field of hydrology, the question asked is the following:
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Horizontal Infiltration using Richards Equation. The Bruce and Klute approach for horizontal infiltration.
Decision analysis and Risk Management course in Kuopio
Introduction to Multilevel Modeling Using SPSS
Quantify prediction uncertainty (Book, p ) Prediction standard deviations (Book, p. 180): A measure of prediction uncertainty Calculated by translating.
VIRTUAL BUSINESS RETAILING
Regression Analysis. Regression analysis Definition: Regression analysis is a statistical method for fitting an equation to a data set. It is used to.
Inference for regression - Simple linear regression
Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation.
National Weather Service River Forecast System Model Calibration Fritz Fiedler Hydromet 00-3 Tuesday, 23 May East Prospect Road, Suite 1 Fort.
Economic Cooperation Organization Training Course on “Drought and Desertification” Alanya Facilities, Antalya, TURKEY presented by Ertan TURGU from Turkish.
Figure 1: Schematic representation of the VIC model. 2. Model description Hydrologic model The VIC macroscale hydrologic model [Liang et al., 1994] solves.
IV. Sensitivity Analysis for Initial Model 1. Sensitivities and how are they calculated 2. Fit-independent sensitivity-analysis statistics 3. Scaled sensitivities.
Testing modeling frameworks or… how can we improve predictive skill from our data? Mary C. Hill, University of Kansas; Emeritus, U.S. Geological Survey.
Advancements in Simulating Land Hydrologic Processes for Land Surface Modeling (LSM) Hua Su Presentation for Physical Climatology.
Hydrologic model benchmarks: Synthetic test cases, CZO data, and continental-scale diagnostics CUAHSI Community Modeling Working Group, San Francisco,
Basic linear regression and multiple regression Psych Fraley.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
Streamflow Predictability Tom Hopson. Conduct Idealized Predictability Experiments Document relative importance of uncertainties in basin initial conditions.
HEC-HMS Runoff Computation.
Preliminary Applications of the HL-RDHM within the Colorado Basin River Forecast Center Ed Clark, Hydrologist Presented July 26 th, 2007 as part of the.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Evaluating hydrological model structure using tracer data within a multi-model framework Hilary McMillan 1, Doerthe Tetzlaff 2, Martyn Clark 3, Chris Soulsby.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Sensitivity Analysis of Mesoscale Forecasts from Large Ensembles of Randomly and Non-Randomly Perturbed Model Runs William Martin November 10, 2005.
Estimating  0 Estimating the proportion of true null hypotheses with the method of moments By Jose M Muino.
Understanding hydrologic changes: application of the VIC model Vimal Mishra Assistant Professor Indian Institute of Technology (IIT), Gandhinagar
Chapter 13 Multiple Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. 3 Forecasting.
Towards development of a Regional Arctic Climate System Model --- Coupling WRF with the Variable Infiltration Capacity land model via a flux coupler Chunmei.
Physics 114: Lecture 18 Least Squares Fit to Arbitrary Functions Dale E. Gary NJIT Physics Department.
Experimental Algorithmics Reading Group, UBC, CS Presented paper: Fine-tuning of Algorithms Using Fractional Experimental Designs and Local Search by Belarmino.
[1] Simple Linear Regression. The general equation of a line is Y = c + mX or Y =  +  X.  > 0  > 0  > 0  = 0  = 0  < 0  > 0  < 0.
Biases in land surface models Yingping Wang CSIRO Marine and Atmospheric Research.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 7 - Chapter 25.
Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights.
Assessing the Influence of Decadal Climate Variability and Climate Change on Snowpacks in the Pacific Northwest JISAO/SMA Climate Impacts Group and the.
Review of statistical modeling and probability theory Alan Moses ML4bio.
Cell#1 Program Instructions. Don’t run. Used to load a Statistical Package Cell #3 Defines standard normal pdf and cdf functions Ignore spelling warning.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 6 - Chapters 22 and 23.
1 Spring 2003 Prof. Tim Warburton MA557/MA578/CS557 Lecture 32.
Simulation. Types of simulation Discrete-event simulation – Used for modeling of a system as it evolves over time by a representation in which the state.
Data Assimilation in Hydrology
Data Mining: Neural Network Applications by Louise Francis CAS Convention, Nov 13, 2001 Francis Analytics and Actuarial Data Mining, Inc.
Modeling with WEAP University of Utah Hydroinformatics - Fall 2015.
LSM Hind Cast for the Terrestrial Arctic Drainage System Theodore J. Bohn 1, Dennis P. Lettenmaier 1, Mark C. Serreze 2, and Andrew G. Slater 2 1 Department.
BUILDING AND RUNNING THE HYDROLOGICAL MODEL
Chapter 4: Basic Estimation Techniques
Midterm Review.
Part 7 - Chapter 25.
Two-Stage Upscaling of Two-Phase Flow: From Core to Simulation Scale
The Calibration Process
Effects of Temperature and Precipitation Variability on Snowpack Trends in the Western U.S. JISAO/SMA Climate Impacts Group and the Department of Civil.
Part 7 - Chapter 25.
M248: Analyzing data Block D UNIT D2 Regression.
HEC-HMS Runoff Computation Modeling Direct Runoff with HEC-HMS Empirical models Empirical models - traditional UH models - traditional UH models - a.
Presentation transcript:

TOWARD TRANSPARENT, REFUTABLE HYDROLOGIC MODELS IN KANSAS OR OZ In class: Mary C. Hill U.S. Geological Survey, Boulder Olda Rakovec Wageningen University, Netherlands Other co-authors: Martyn Clark UCAR, Boulder A. H. Weerts Deltares, Wageningin University A. J. Teuling Wageningin University R. Uijlenhoet Wageningin University

Problem Transparency and refutability are championed in articles by Oreskes and others as necessary for meaningful environmental models Transparency and refutability are supposed to be served by model analysis methods. – Analyze model fit, sensitivity analysis, uncertainty Many model analysis methods – Nearly every field and even report use a different method. Many analysis methods require heroic efforts – 10,000s to millions of model runs. – Modelers forced to choose between processes and analysis. How well are transparency and refutability served? Can we do better?

Include computationally frugal methods and organize by purpose

Exposes what is important to a given situation, test, etc. Needed???? 4 “Would you go to a orthopedist who didn’t take x-rays and show them to you?” SimLab web site: Sensitivity analysis

Outline Some basics about sensitivity analysis Sensitivity analysis – Global and local methods – Total-order statistics Simple test case (analytical example relevant to FUSE) – run local sensitivity analysis in class Sensitivity analysis results for the test case Some results from FUSE

Some basics about sensitivity analysis Decide on model Decide on parameters – Can include parameters on unusual aspects of the model, like additive or multiplicative parameters for precipitation – Decide on ranges for each parameter – The defined parameter space has the dimensions of the number of parameters, k. Easy to plot results for 1- and 2-D parameter space. Could easily be 10-D parameter space! Decide on performance metric, ψ Calculate performance metric variability over parameter space. Measure it with the variance V(ψ)

Some basics about sensitivity analysis Total-order sensitivity for parameter j S T j – 1 minus Fraction of V(ψ) explained by variation in all other parameters. – Includes of interdependence of parameter j with the other parameters. Values of S T j range from 0.0 to 1.0. – Small value  unimportant parameter – Large value  important parameter

Sensitivity analysis – Global method Define a ranges for each parameter Sample the parameter space R times using Sobol’. For each, calculate performance metric, ψ V(ψ) = Σ r=1,R ψ r 2 - [Σ r=1,R ψ r ] 2 S T j = 1 – [U - Σ r=1,R ψ r 2 ] / V(ψ) U = (1/R)Σ r=1,R [ψ r ψ r -j ] Number of model runs = R(k+1) R typically 10,000

Sensitivity analysis – Local methods Sample the parameter space N times. For each, calculate ψ and ∂ψ/∂θ j (θ j is parameter j) V L (ψ) = (∂ψ/∂θ) T V(θ) (∂ψ/∂θ) S T(L) j = 1 - [(∂ψ/∂θ) T V(θ) (j) (∂ψ/∂θ)] / V L (ψ) (New) V(θ) – Here, V(θ) diagonals equal parameter variances and V(θ) (j) has a zero for parameter j. – Make variances consistent with parameter ranges used for global method. Here, st dev = (max-min)/2. Number of model runs = L(2k + 1) N typically 1 for local results. We suggest the idea of multi-location local methods, and N>1.

Simple Test Case (analytical example relevant to FUSE) Nonlinear reservoir with 2 parameters ODE: dS/dt = p(t) – q(t) Here, p(t)=0, so: q(t)=K(S(t)/S sc ) c – where K[mm/day] and c[-] are analyzed parameters – S sc [mm] is a scaling parameter that is held constant Goal: investigate sensitivity of K and c to ψ=mean(S) Used R script that solves the problem using fixed- time step, implicit Euler. Solution is in R script FUN_Nonlinear_Implicit_v02.r

Simple Test Case Run R script “Sensitivities_Global_Local_v02.r” Time series plots of storage (Files: 1_Plot_S__K900_vs_c.pdf, 1_Plot_S__K100_vs_c.pdf) ψ = mean storage over time, so average over each curve

Simple Test Case Performance metric ψ (mean storage) File: 2_Image_OF_2D_K_vs_c.pdf

Typical Sobol’ Global Sensitivity Analysis File: 7_Barchart_SobolSens_NONLIN888.pdf Global Sobol’ suggests the two parameters have the same sensitivity. 10,000 model runs First-order sensitivity Total order sensitivity

Can we say more? Consider evaluations at many locations within the parameter space. Like these 100 locations. How does parameter importance change as the solution conditions change? File: 2_Image_OF_2D_K_vs_c.pdf Global and local statistics both need V(ψ). What values of V(ψ) do we get at each point using Sobol’ and local methods?

Sobol’ Global variance ½, V(ψ) 1/2 Bootstrapping suggests 3,000,000 model runs are needed to obtain stable results These are the 100 values -- V(ψ) 1/2 calculated for each of the 100 points plotted two ways. Both in file: 3_Variances_Sobol.pdf

Sobol’ variance (3,000,000 model runs) file: 3_Variances_Sobol.pdf Local variance (300 model runs) file: 4_Variances_psi_local.pdf

Sobol’ variance (3,000,000 model runs) file: 3_Variances_Sobol.pdf Local variance (300 model runs) file: 4_Variances_psi_local.pdf The way we calculate parameter variance from the range needs to be different. Total-order statistics divide variances, so this constant multiplicative difference divides out for the results here. We seem to be getting the relative values for different parameters right.

Total sensitivity indices (S T ) at many locations Results obtained from local (300 model runs) and global (3,000,000 model runs) Parameter is… important not important Importance of File: 5_Image_sens_tot_2D_K_vs_cx.pdf

Exercise The Sobol’ results are provided – the runs take too long for class (about 12 hours). You can produce the local sensitivity results in class using an R script written by Olda.

Exercise instructions How to run – Start R using R /bin/i386/Rgui.exe or R.bat – File/Open script/ Sensitivities_Global_Local_v02.r Change path on line 17 to where the R script is. Change any \ to /. Spaces ok. – Run program: With R script window active, cntl-a, cntl-r. PDF files are produced – All the files in the ppt presentation. You did all the calculations for the local method right here. Explore – For the global method defined parameter ranges (max,min). – For local method used st dev=(max-min)/2. – R script Sensitivities_Global_Local_v02.r Defines st dev for K as WW1=499.5 and for c WW2=1.5. Use caps to search for variable names. Try changing these values by a constant such as multiplying by 10. Do the figures change? Try changing just one of the values by a factor of 10. Do the figures change?

FUSE (Clark et al, 2008)

FUSE accommodates different decisions regarding process selection and representation influenceduninfluencedsaturated Consider an example conceptual model of hydrological processes

FUSE accommodates different options for model architecture Consider the “mosaic” approach to representing heterogeneity in vegetation, as in the VIC model … and contrast against other model architectures: should different vegetation types compete for the same soil moisture? should the architecture allow for sub- grid variability in soil moisture, to simulate the co-variability between vegetation type and water availability?

FUSE Separates the hypothesized model equations from their solutions Many hydrologic models use ad-hoc approaches to add/subtract model fluxes from model stores in a pre-determined sequence For example, the Sacramento model implementation Critically: Lack of attention applied to numerical error control

SOILR GFLWR GWATR IZONE LAYR2 LAYR1 PCTIM ADIMP ADIMC UZTWCUZFWC LZTWC LZFSCLZFPC IMPZR RZONE IFLWR GFLWR LZONE PRMSSACRAMENTO ARNO/VICTOPMODEL FUSE supports quantifying uncertainty using multiple models…

Define model building decisions—1 SOILR GFLWR GWATR IZONE LAYR2 LAYR1 PCTIM ADIMP ADIMC UZTWCUZFWC LZTWC LZFSCLZFPC IMPZR RZONE IFLWR GFLWR LZONE PRMSSACRAMENTO ARNO/VICTOPMODEL UPPER LAYER ARCHITECTURE

SOILR GFLWR GWATR IZONE LAYR2 LAYR1 PCTIM ADIMP ADIMC UZTWCUZFWC LZTWC LZFSCLZFPC IMPZR RZONE IFLWR GFLWR LZONE PRMSSACRAMENTO ARNO/VICTOPMODEL LOWER LAYER ARCHITECTURE / BASEFLOW Define model building decisions—2

SOILR GFLWR GWATR IZONE LAYR2 LAYR1 PCTIM ADIMP ADIMC UZTWCUZFWC LZTWC LZFSCLZFPC IMPZR RZONE IFLWR GFLWR LZONE PRMSSACRAMENTO ARNO/VICTOPMODEL PERCOLATION Define model building decisions—3

SOILR GFLWR GWATR IZONE LAYR2 LAYR1 PCTIM ADIMP ADIMC UZTWCUZFWC LZTWC LZFSCLZFPC IMPZR RZONE IFLWR GFLWR LZONE PRMSSACRAMENTO ARNO/VICTOPMODEL SURFACE RUNOFF Define model building decisions—4

SOILR GFLWR GWATR IZONE LAYR2 LAYR1 PCTIM ADIMP ADIMC UZTWCUZFWC LZTWC LZFSCLZFPC IMPZR RZONE IFLWR GFLWR LZONE PRMSSACRAMENTO ARNO/VICTOPMODEL and mix architecture + parameterizations… TO CREATE HUNDREDS OF MODELS, ALL WITH DIFFERENT STRUCTURE

SOILR GFLWR GWATR IZONE LAYR2 LAYR1 PCTIM ADIMP ADIMC UZTWCUZFWC LZTWC LZFSCLZFPC IMPZR RZONE IFLWR GFLWR LZONE PRMSSACRAMENTO ARNO/VICTOPMODEL FUSE-016

Total sensitivity results from FUSE-016 Global Local

“Numerical Daemons” (Clark, Kavetski, 2010; Kavetski and Clark, 2010, WRR) produced by thresholds can produce large local sensitivities Kavetski, Kuczera 2007 WRR

Organizing computationally frugal and demanding methods by purpose

TOWARD TRANSPARENT, REFUTABLE HYDROLOGIC MODELS IN KANSAS OR OZ In class: Mary C. Hill U.S. Geological Survey, Boulder Olda Rakovec Wageningen University, Netherlands Other co-authors: Martyn Clark UCAR, Boulder A. H. Weerts Deltares, Wageningin University A. J. Teuling Wageningin University R. Uijlenhoet Wageningin University