This presentation can be downloaded at – This work is carried out within the SWITCH-ON.

Slides:



Advertisements
Similar presentations
Introduction to parameter optimization
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
1 Uncertainty in rainfall-runoff simulations An introduction and review of different techniques M. Shafii, Dept. Of Hydrology, Feb
Econometric Modeling Through EViews and EXCEL
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Importance of Land use management on the Flood Management in the Chi River Basin, Thailand Kittiwet Kuntiyawichai Bart Schultz Stefan Uhlenbrook F.X. Suryadi.
Lwando Kondlo Supervisor: Prof. Chris Koen University of the Western Cape 12/3/2008 SKA SA Postgraduate Bursary Conference Estimation of the parameters.
Model calibration using. Pag. 5/3/20152 PEST program.
AGC DSP AGC DSP Professor A G Constantinides©1 Modern Spectral Estimation Modern Spectral Estimation is based on a priori assumptions on the manner, the.
Testing hydrological models as hypotheses: a limits of acceptability approach and the issue of disinformation Keith Beven, Paul Smith and Andy Wood Lancaster.
Experimental Uncertainties: A Practical Guide What you should already know well What you need to know, and use, in this lab More details available in handout.
The Simple Linear Regression Model: Specification and Estimation
Model calibration and validation
458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15.
Additional Topics in Regression Analysis
Simulation.
1 Validation and Verification of Simulation Models.
7-2 Estimating a Population Proportion
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Relationships Among Variables
TOPLHCWG. Introduction The ATLAS+CMS combination of single-top production cross-section measurements in the t channel was performed using the BLUE (Best.
Identification of Misfit Item Using IRT Models Dr Muhammad Naveed Khalid.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Structural Equation Modeling 3 Psy 524 Andrew Ainsworth.
1 MADE WHAT IF SOME OLS ASSUMPTIONS ARE NOT FULFILED?
Sampling Theory Determining the distribution of Sample statistics.
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Hydrologic Modeling: Verification, Validation, Calibration, and Sensitivity Analysis Fritz R. Fiedler, P.E., Ph.D.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
A Neural Network MonteCarlo approach to nucleon Form Factors parametrization Paris, ° CLAS12 Europen Workshop In collaboration with: A. Bacchetta.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
Ch9. Inferences Concerning Proportions. Outline Estimation of Proportions Hypothesis concerning one Proportion Hypothesis concerning several proportions.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 11: Bayesian learning continued Geoffrey Hinton.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
ETM 607 – Input Modeling General Idea of Input Modeling Data Collection Identifying Distributions Parameter estimation Goodness of Fit tests Selecting.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Extreme values and risk Adam Butler Biomathematics & Statistics Scotland CCTC meeting, September 2007.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
Robust Estimators.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Discrepancy between Data and Fit. Introduction What is Deviance? Deviance for Binary Responses and Proportions Deviance as measure of the goodness of.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
Machine Learning 5. Parametric Methods.
- 1 - Calibration with discrepancy Major references –Calibration lecture is not in the book. –Kennedy, Marc C., and Anthony O'Hagan. "Bayesian calibration.
Sampling Theory Determining the distribution of Sample statistics.
CHAPTER 4 ESTIMATES OF MEAN AND ERRORS. 4.1 METHOD OF LEAST SQUARES I n Chapter 2 we defined the mean  of the parent distribution and noted that the.
Computacion Inteligente Least-Square Methods for System Identification.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Building Valid, Credible & Appropriately Detailed Simulation Models
Sandeep Bisht Assistant Director Basin planning
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
Yun, Hyuk Jin. Theory A.Nonuniformity Model where at location x, v is the measured signal, u is the true signal emitted by the tissue, is an unknown.
Performance assessment of a Bayesian Forecasting System (BFS) for realtime flood forecasting Biondi D. , De Luca D.L. Laboratory of Cartography and Hydrogeological.
Probability Theory and Parameter Estimation I
Linear Regression.
Stats Club Marnie Brennan
Introduction to Instrumentation Engineering
Modern Spectral Estimation
Prepared by Lee Revere and John Large
Discrete Event Simulation - 4
Techniques for Data Analysis Event Study
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Simple Linear Regression
Presentation transcript:

This presentation can be downloaded at – This work is carried out within the SWITCH-ON Research Project financed by the European Union within the 7 th Framework Programme, and the Panta Rhei research initiative of the International Association of Hydrological Sciences

This presentation can be downloaded at – The selection of the objective function for calibrating hydrological models is still a delicate task, impacting hydrological practice. Thanks to recent valuable contributions, there is an increasing pressure to optimise hydrological models by maximising their likelihood. This recommendation is frequently suggested and adopted without realising its practical implications. The purpose of this study is to investigate, from an engineering perspective, what are the implications of using a likelihood for calibrating hydrological models. From Wikipedia: In statistics, a likelihood function (often simply the likelihood) is a function of the parameters of a statistical model. The likelihood of a set of parameter values, θ, for given outcomes, is equal to the probability of those observed outcomes given those parameter values, that is: L(θ|x) = P(x |θ).

This presentation can be downloaded at – Assumptions What if assumptions are not satisfied? Probabilities and likelihood are not well estimated. See next slides.

This presentation can be downloaded at – For a given model, parameter set and observations the likelihood is fixed. The likelihood is a probability and not a probability distribution. The likelihood is a perfect candidate for the objective function. The likelihoods depends on predictive uncertainty.

This presentation can be downloaded at – The likelihood is conditioned by assumptions that are rarely met in hydrology. The most common assumption is independence and heteroscedasticity of model residuals. Recent contributions proposed new formulations for the likelihood (Schoups and Vrugt, 2009; Pianosi and Raso, 2012). However, the fit provided by the likelihood is still not satisfactory in many cases. If the assumptions are violated, then the probability of the observed data is not well estimated, at least for some data points. My conclusion is that the likelihood is no more a likelihood if its assumptions are violated. It becomes just another objective function. Remember: we often speak of “true” model parameters. Actually, if the model is uncertain a true parameter set does not exist. We have several optimal parameter sets (along with their probability distributions), one for each objective function we may consider.

This presentation can be downloaded at – If the assumptions conditioning the likelihood are violated (99% of the cases), then: - Parameters estimation is biased: namely, the model does not reproduce all the data with the same reliability, but some data (e.g. floods) are better simulated than others (e.g. droughts ). -Uncertainty for the model output is not correctly estimated: for some data (e.g. floods) uncertainty maybe correctly estimated, while for others ((e,g. droughts) not. -The objective function should be identified by considering the purpose of the analysis and the behaviour of each candidate from a practical point of view. - It is necessary to study the behaviours of the objective functions (for some of them the behaviours are well known already) and the behaviours of the considered likelihood formulation, with the awareness that it is not a likelihood.

This presentation can be downloaded at – Sieve River Basin (Tuscany region – Italy) – Contributing area 822 km 2 Hourly rainfall, evapotranspiration, river flow for the period 1992 – 1996 Hymod rainfall-runoff model: lumped, 5 parameters Calibration and validation performed for the 1992 – 1994 and 1995 – 1996 periods, respectively. Optimal parameter set obtained by minimising: - Sum of squared residuals. - Sum of absolute errors. - Parametric approximate maximum likelihood (autoregressive model of order 1 estimated for the residuals) with autoregressive parameter estimated off-line with respect to hydrological model parameters. Optimization of objective functions has been carried out by using DREAM (Vrugt et al., 2008) Sieve River at Pontassieve

This presentation can be downloaded at – Sum of squares: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.47 Sum of absolute errors: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.65 Approximate maximum likelihood: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.50 Sum of squares: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.47 Sum of absolute errors: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.65 Approximate maximum likelihood: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.50 For each of the three objective functions the distribution of the parameters was checked to assess if the optimum is well identified -- Observed data -- Least squares -- Absolute error -- MLE

This presentation can be downloaded at – Sum of squares: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.47 Sum of absolute errors: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.65 Approximate maximum likelihood: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.50 Sum of squares: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.47 Sum of absolute errors: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.65 Approximate maximum likelihood: - Calibration: Nash-Sutcliffe efficiency = Validation: Nash-Sutcliffe efficiency = 0.50 O Least squares O Absolute errors O MLE

This presentation can be downloaded at – The shock produced by the occurrence of a flood that is incorrectly predicted by the model lasts very long if the error is correlated. The same shock is repeated for several time steps and the related error is squared. Parameterisation is much focused on floods Observed flow Least squares simulation Least squares error MLE error Let us focus on a generic flood event for the Sieve River and the related simulation error If the error is decorrelated the shock lasts for a few steps only. Less focus on floods.

This presentation can be downloaded at – The likelihood is not likely! There is no theoretical reason to prefer the use of a likelihood if assumptions are not satisfied. On the other hand, approximate likelihoods should be considered as candidate objective functions, but their behaviours should be inspected. The objective function should be selected by considering the purpose of the analysis and the behaviours of the candidate functions. More research is needed on alternative objective functions and their behaviours, in terms of shape of the response surface and capability to meet practical needs. Such additional research should clearly highlight the asset of each proposed function. Such research should not be focused on likelihoods only. We dedicated a lot of attention to optimization but little to objective functions. Practical experience suggests that a proper objective function, taking into account that a hydrological model is never perfect, may improve model performances significantly.

This presentation can be downloaded at – Call for research themes and working groups coming soon

This presentation can be downloaded at – Final deadline: March 15, 2014