Improving wave model validation based on RMSE L. Mentaschi 1, G. Besio 1, F. Cassola 2, A. Mazzino 1 1 Dipartimento di Ingegneria Civile, Chimica ed Ambientale.

Slides:



Advertisements
Similar presentations
L. Mentaschi, G. Besio, F. Cassola, A. Mazzino Dipartimento di Ingegneria Civile, Chimica ed Ambientale (DICCA), Università degli studi di Genova.
Advertisements

Chapter 3 Properties of Random Variables
Regression and correlation methods
Multiple Regression W&W, Chapter 13, 15(3-4). Introduction Multiple regression is an extension of bivariate regression to take into account more than.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Simple Linear Regression and Correlation
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
The Simple Linear Regression Model: Specification and Estimation
Chapter 10 Simple Regression.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Simple Linear Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
The Simple Regression Model
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Chapter 11 Multiple Regression.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Autocorrelation Lecture 18 Lecture 18.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation & Regression
Lecture 16 Correlation and Coefficient of Correlation
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
(2) Ratio statistics of gene expression levels and applications to microarray data analysis Bioinformatics, Vol. 18, no. 9, 2002 Yidong Chen, Vishnu Kamat,
Hypothesis Testing in Linear Regression Analysis
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Regression Analysis (2)
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
BPS - 3rd Ed. Chapter 211 Inference for Regression.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
Correlation and Regression SCATTER DIAGRAM The simplest method to assess relationship between two quantitative variables is to draw a scatter diagram.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Wsws wpwp dw wpwp wsws NARR ONLY Variations in x (size parameter)
PS 225 Lecture 20 Linear Regression Equation and Prediction.
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Correlation Assume you have two measurements, x and y, on a set of objects, and would like to know if x and y are related. If they are directly related,
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
5.4 Line of Best Fit Given the following scatter plots, draw in your line of best fit and classify the type of relationship: Strong Positive Linear Strong.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Correlation. Correlation is a measure of the strength of the relation between two or more variables. Any correlation coefficient has two parts – Valence:
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Regression and Correlation of Data Correlation: Correlation is a measure of the association between random variables, say X and Y. No assumption that one.
Chapter 13 Simple Linear Regression
AP Statistics Chapter 14 Section 1.
Correlation and Simple Linear Regression
Basic Estimation Techniques
Linear Regression and Correlation Analysis
Chapter 11: Simple Linear Regression
...Relax... 9/21/2018 ST3131, Lecture 3 ST5213 Semester II, 2000/2001
Correlation and Simple Linear Regression
Basic Estimation Techniques
Correlation and Simple Linear Regression
Correlation and Regression
Simple Linear Regression and Correlation
Statistics II: An Overview of Statistics
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Improving wave model validation based on RMSE L. Mentaschi 1, G. Besio 1, F. Cassola 2, A. Mazzino 1 1 Dipartimento di Ingegneria Civile, Chimica ed Ambientale (DICCA), Università degli studi di Genova 2 Dipartimento di Fisica (DIFI), Università degli studi di Genova

Motivation This work has been developed during a tuning of WWIII in the Mediterranean sea. RMSE, NRMSE and SI provided unsatisfactory indication of wave model performances during a validation of WWIII in the Mediterranean sea in storm conditions. A best fit of the model based on RMSE led to parametrizations affected by strong negative bias.

Conclusions RMSE, NRMSE and SI tend to be systematically better for simulations affected by negative bias. This is mostly evident when: We are tuning parameters involving an amplification of the results. Correlation coefficient appreciably smaller than 1,. Standard deviation of the observations of the same order of the average,. The indicator HH introduced by Hanna and Heinold (1985), defined by: provides more reliable information, being minimum for simulations unaffected by bias.

Validation of wave model in the Mediterranean sea Model Wavewatch III –Ardhuin et al. (2010) source terms (41 parameterizations). –Tolman and Chalikov (1996) source terms, not tuned to the Mediterranean sea conditions. Normalized Bias (NBI) Statistical indicators used for validation Normalized Root Mean Square Error (NRMSE) Correlation coeff. (ρ) Root Mean Square Error

Statistics on 17 storms and 23 buoys ACC350BJAT & C NBI2.1%-4.6%-11.2% ρ NRMSE (-2.3%) February 1990 storm

Drawback of using RMSE A numerical example ρ=0.614 for both of the simulations NBI = -12% for the red simulation One would say the best simulation is the blue one. But … ρ=0.614 for both of the simulations NBI = -12% for the red simulation One would say the best simulation is the blue one. But … NRMSE(blue) = NRMSE(red) = (~ -7.2%) NRMSE(blue) = NRMSE(red) = (~ -7.2%)

Geometrical decomposition of RMSE Bias component Scatter component

Geometrical decomposition of NRMSE Bias component Scatter component (also called Scatter Index) Are SI and BC independent?

Example of set of simulations with constant ratio : In general and are not independent. where is the unbiased simulation. Let’s assume this relation: Holds for amplifications Amplification factor:

Relationship SI – NBI SI grows linearly in NBI around NBI=0 We can express both and as functions of NBI Also SI can be expressed as a function of NBI

Fits well real world data Significant wave height Mean period

The simulation with the minimum value of RMSE/NRMSE underestimates the average value

Conditions when this effect is most evident Standard deviation of the same order of the mean Correlation appreciably smaller than 1 ( ), since SI is minimum for

How to overcome this problem? Hanna and Heinold (1985) indicator: Property of HH: ρ constant HH minimum for null bias

HH has a minimum for null bias

Wavewatch III validation on the Mediterranean sea. ACC350BJAT & C NBI2.1%-4.6%-11.2% ρ NRMSE (-2.3%) HH (+4.8%) HH has a minimum for null bias Mentaschi et al. 2013

RMSE, NRMSE and SI tend to be systematically better for simulations affected by negative bias. This is mostly evident when: We are tuning parameters involving an amplification of the results. Correlation coefficient appreciably smaller than 1,. Standard deviation of the observations of the same order of the average, HH indicator overcomes this problem introducing a different normalization of the root mean square error. Conclusions

Thank you!