Assimilation of Observations and Bayesianity Mohamed Jardak and Olivier Talagrand Laboratoire de Météorologie Dynamique, École Normale Supérieure, Paris,

Slides:



Advertisements
Similar presentations
Data-Assimilation Research Centre
Advertisements

Introduction to Data Assimilation NCEO Data-assimilation training days 5-7 July 2010 Peter Jan van Leeuwen Data Assimilation Research Center (DARC) University.
What is a good ensemble forecast? Chris Ferro University of Exeter, UK With thanks to Tom Fricker, Keith Mitchell, Stefan Siegert, David Stephenson, Robin.
ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.
Creating probability forecasts of binary events from ensemble predictions and prior information - A comparison of methods Cristina Primo Institute Pierre.
Observers and Kalman Filters
Data assimilation schemes in numerical weather forecasting and their link with ensemble forecasting Gérald Desroziers Météo-France, Toulouse, France.
CS 547: Sensing and Planning in Robotics Gaurav S. Sukhatme Computer Science Robotic Embedded Systems Laboratory University of Southern California
Validation Olivier Talagrand WMO Workshop on 4D-Var and Ensemble Kalman Filter Intercomparisons Buenos Aires, Argentina 13 November 2008.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Nonlinear and Non-Gaussian Estimation with A Focus on Particle Filters Prasanth Jeevan Mary Knox May 12, 2006.
Introducing Some Basic Concepts Linear Theories of Waves (Vanishingly) small perturbations Particle orbits are not affected by waves. Dispersion.
1 Validation of Ensemble Methods O. Talagrand G. Candille and L. Descamps Laboratoire de Météorologie Dynamique, École Normale Supérieure Paris, France.
CHAPTER 6 Statistical Analysis of Experimental Data
Estimation and the Kalman Filter David Johnson. The Mean of a Discrete Distribution “I have more legs than average”
QMS 6351 Statistics and Research Methods Probability and Probability distributions Chapter 4, page 161 Chapter 5 (5.1) Chapter 6 (6.2) Prof. Vera Adamchik.
1 Optimal solution error covariances in nonlinear problems of variational data assimilation Victor Shutyaev Institute of Numerical Mathematics, Russian.
Introduction to Numerical Weather Prediction and Ensemble Weather Forecasting Tom Hamill NOAA-CIRES Climate Diagnostics Center Boulder, Colorado USA.
Lecture II-2: Probability Review
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
Ensemble Data Assimilation and Uncertainty Quantification Jeffrey Anderson, Alicia Karspeck, Tim Hoar, Nancy Collins, Kevin Raeder, Steve Yeager National.
Workshop Uncertainty Analysis in Geophysical Inverse Problems Lorentz Center, Leiden, The Netherlands A Few Concluding Comments … Olivier Talagrand Laboratoire.
A Neural Network MonteCarlo approach to nucleon Form Factors parametrization Paris, ° CLAS12 Europen Workshop In collaboration with: A. Bacchetta.
1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence.
The Importance of Atmospheric Variability for Data Requirements, Data Assimilation, Forecast Errors, OSSEs and Verification Rod Frehlich and Robert Sharman.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Statistical Characteristics of High- Resolution COSMO.
CSDA Conference, Limassol, 2005 University of Medicine and Pharmacy “Gr. T. Popa” Iasi Department of Mathematics and Informatics Gabriel Dimitriu University.
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss High-resolution data assimilation in COSMO: Status and.
Probabilistic Robotics Bayes Filter Implementations.
Celeste Saulo and Juan Ruiz CIMA (CONICET/UBA) – DCAO (FCEN –UBA)
École Doctorale des Sciences de l'Environnement d’Île-de-France Année Universitaire Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
2004 SIAM Annual Meeting Minisymposium on Data Assimilation and Predictability for Atmospheric and Oceanographic Modeling July 15, 2004, Portland, Oregon.
A unifying framework for hybrid data-assimilation schemes Peter Jan van Leeuwen Data Assimilation Research Center (DARC) National Centre for Earth Observation.
Applications of optimal control and EnKF to Flow Simulation and Modeling Florida State University, February, 2005, Tallahassee, Florida The Maximum.
MODEL ERROR ESTIMATION EMPLOYING DATA ASSIMILATION METHODOLOGIES Dusanka Zupanski Cooperative Institute for Research in the Atmosphere Colorado State University.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Mobile Robot Localization (ch. 7)
Uncertainty Management in Rule-based Expert Systems
1 Value of information – SITEX Data analysis Shubha Kadambe (310) Information Sciences Laboratory HRL Labs 3011 Malibu Canyon.
Errors, Uncertainties in Data Assimilation François-Xavier LE DIMET Université Joseph Fourier+INRIA Projet IDOPT, Grenoble, France.
Quality of model and Error Analysis in Variational Data Assimilation François-Xavier LE DIMET Victor SHUTYAEV Université Joseph Fourier+INRIA Projet IDOPT,
Local Predictability of the Performance of an Ensemble Forecast System Liz Satterfield and Istvan Szunyogh Texas A&M University, College Station, TX Third.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Machine Learning 5. Parametric Methods.
École Doctorale des Sciences de l'Environnement d’Île-de-France Année Universitaire Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
École Doctorale des Sciences de l'Environnement d’Île-de-France Année Universitaire Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
Pg 1 SAMSI UQ Workshop; Sep Ensemble Data Assimilation and Uncertainty Quantification Jeff Anderson National Center for Atmospheric Research.
École Doctorale des Sciences de l'Environnement d’ Î le-de-France Année Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
École Doctorale des Sciences de l'Environnement d’Île-de-France Année Universitaire Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
École Doctorale des Sciences de l'Environnement d’ Î le-de-France Année Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Figures from “The ECMWF Ensemble Prediction System”
On triangular norms, metric spaces and a general formulation of the discrete inverse problem or starting to think logically about uncertainty On triangular.
ECMWF/EUMETSAT NWP-SAF Satellite data assimilation Training Course Mar 2016.
Statistics & Evidence-Based Practice
Data Assimilation Theory CTCD Data Assimilation Workshop Nov 2005
Department of Civil and Environmental Engineering
Verifying and interpreting ensemble products
PSG College of Technology
Course: Autonomous Machine Learning
Analyzing Reliability and Validity in Outcomes Assessment Part 1
Sampling Distributions
2. University of Northern British Columbia, Prince George, Canada
Parametric Methods Berlin Chen, 2005 References:
Multivariate Methods Berlin Chen
Analyzing Reliability and Validity in Outcomes Assessment
the performance of weather forecasts
What is a good ensemble forecast?
International Conference on Ensemble Methods in Geophysical Sciences
Presentation transcript:

Assimilation of Observations and Bayesianity Mohamed Jardak and Olivier Talagrand Laboratoire de Météorologie Dynamique, École Normale Supérieure, Paris, France Fifth Meeting of THORPEX Workshop Group on Predictability and Dynamical Processes Reading University, UK 20 June 2012

Project PREVASSEMBLE Funded by French Agence Nationale pour la Recherche Duration : initially 3 years, extended to 4 (started 01/2009) ; Total funding : 780,000 euros Purpose: Study all aspects of ensemble methods (theory, practical implementation, validation, possible limitations, ….) for data assimilation and prediction in atmospheric sciences. Three partners 1. Institut Pierre-Simon Laplace pour les Sciences de l'Environnement Global, Paris (O. Talagrand, general coordinator of the project) 2. Institut de Recherche en Informatique et Systèmes Aléatoires, Rennes (F. Le Gland) 3. Météo-France, Toulouse (G. Desroziers) Positions funded : 4 post-doctoral positions (Paris, Toulouse), 1 scientist position (Paris), 1 graduate student position (Rennes) Contact:

Purpose of assimilation : reconstruct as accurately as possible the state of the atmospheric or oceanic flow, using all available appropriate information. The latter essentially consists of  The observations proper, which vary in nature, resolution and accuracy, and are distributed more or less regularly in space and time.  The physical laws governing the evolution of the flow, available in practice in the form of a discretized, and necessarily approximate, numerical model.  ‘Asymptotic’ properties of the flow, such as, e. g., geostrophic balance of middle latitudes. Although they basically are necessary consequences of the physical laws which govern the flow, these properties can usefully be explicitly introduced in the assimilation process.

Both observations and ‘model’ are affected with some uncertainty  uncertainty on the estimate. For some reason, uncertainty is conveniently described by probability distributions ( Jaynes, E. T., 2007, Probability Theory: The Logic of Science, Cambridge University Press ). Assimilation is a problem in bayesian estimation. Determine the conditional probability distribution for the state of the system, knowing everything we know (unambiguously defined if a prior probability distribution is defined; see Tarantola, 2005).

Present approaches to data assimilation  Obtain some ‘central’ estimate of the conditional probability distribution (expectation, mode, …), plus some estimate of the corresponding spread (standard deviations and a number of correlations).  Produce an ensemble of estimates which are meant to sample the conditional probability distribution (dimension N ≈ O( ) ).

A large part of presently existing algorithms for assimilation are still linear and gaussian, in the sense that they are empirical and heuristic extensions to weakly nonlinear and non-gaussian situations of algorithms which achieve bayesianity in linear and gaussian cases.

Two main classes of algorithms for ensemble assimilation exist at present - Ensemble Kalman filter, which is bayesian in its ‘forecast’ phase, but linear and gaussian in its updating phase. Abundantly used. - Particle filters, which are totally bayesian in their principle (P. J. van Leeuwen), but are still at research stage.

Data z =  x +  Then conditional posterior probability distribution P(x  z) = N [x a, P a ] with x a = (  T S -1  ) -1  T S -1 [z  ] P a = (  T S -1  ) -1 Ready recipe for producing sample of independent realizations of posterior probability distribution (resampling method) Perturb data vector additively according to error probability distribution N [0, S], and compute analysis x a for each perturbed data vector

Available data consist of - Background estimate at time 0 x 0 b = x 0 +  0 b  0 b  N [ , P 0 b ] - Observations at times k = 0, …, K y k = H k x k +  k  k  N [ , R k ] - Model (supposed to be exact) x k+1 = M k x k k = 0, …, K-1 Errors assumed to be unbiased and uncorrelated in time, H k and M k linear Then optimal state (mean of bayesian gaussian pdf) at initial time 0 minimizes objective function  0  S  J (  0 ) = (1/2) (x 0 b -  0 ) T [P 0 b ] -1 (x 0 b -  0 ) + (1/2)  k [y k - H k  k ] T R k -1 [y k - H k  k ] subject to  k+1 = M k  k,k = 0, …, K-1

Work done here. Apply ‘ensemble’ recipe described above to variational assimilation in nonlinear and non-gaussian cases, and look at what happens. Everything synthetic, Two one-dimensional toy models : Lorenz ’96 model and Kuramoto-Sivashinsky equation. Perfect model assumption

(Nonlinear) Lorenz’96 model

There is no (and there cannot be) a general objective test of bayesianity. We use here as a substitute the much weaker property of reliability. Reliability. Statistical consistency between predicted probability of occurrence and observed frequency of occurrence (it rains 40% of the time in circumstances when I predict 40%-probability for rain). Observed frequency of occurrence p‘(p) of event, given that it has been predicted to occur with probability p, must be equal to p. For any p, p‘(p) = p More generally, frequency distribution F‘(F) of reality, given that probability distribution F has been predicted for the state of the system, must be equal to F. Reliability can be objectively assessed, provided a large enough sample of realizations of the estimation process is available.

Reliability diagramme, NCEP, event T 850 > T c - 4C, 2-day range, Northern Atlantic Ocean, December February 1999

Rank histograms, T 850, Northern Atlantic, winter Top panels: ECMWF, bottom panels: NMC (from Candille, Doctoral Dissertation, 2003)

In both linear and nonlinear cases, size of ensembles N e =30 Number of realizations of the process M =3000 and 7000 for Lorenz and Kuramoto-Sivashinsky respectively.

Non-gaussianity of the error, if model is kept linear, has no significant impact on scores.

Brier Score (Brier, 1950), relative to binary event E B  E[(p - p o ) 2 ] where p is predicted probability of occurrence, p o = 1 or 0 depending on whether E has been observed to occur or not, and E denotes average over all realizations of the prediction system. Decomposes into B = E[(p-p’) 2 ] - E[(p’-p c ) 2 ] + p c (1-p c ) where p c  E(p o ) = E(p’) is observed frequency of occurrence of E. First term E[(p-p’) 2 ] measures reliability. Second term E[(p’-p c ) 2 ] measures dispersion of a posteriori calibrated probabilities p’. The larger that dispersion, the more discriminating, or resolving, and the more useful, the prediction system. That term measures the resolution of the system. Third term, called uncertainty term, depends only on event E, not on performance of prediction system.

Brier Skill Score B SS  1 - B/ p c (1-p c ) (positively oriented) and components B rel  E[(p-p’) 2 ] / p c (1-p c ) B res  1 - E[(p’-p c ) 2 ] / p c (1-p c ) (negatively oriented)

Very similar results obtained with the Kuramoto-Sivashinsky equation. u t + uu x + u xx + u xxxx = 0

Preliminary conclusions In the linear case, ensemble variational assimilation produces, in agreement with theory, reliable estimates of the state of the system. Nonlinearity significantly degrades reliability (and therefore bayesianity) of variational assimilation ensembles. Resolution, i. e., capability of ensembles to reliably estimate a broad range of different probabilities of occurrence, is also degraded.

Perspectives Perform further studies on physically more realistic dynamical models (shallow-water equations) Further comparison with performance Particle Filters, and with performance of Ensemble Kalman Filter.

Announcement International Conference on Ensemble Methods in Geophysical Sciences Supported by World Meteorological Organization, Météo-France, ANR, CNRS, INRIA Dates and location : November 2012, Toulouse, France Forum for study of all aspects of ensemble methods for estimation of state of geophysical systems (atmosphere, ocean, solid earth, …), in particular in assimilation of observations and prediction : theory, algorithmic implementation, practical applications, evaluation, … Deadline for submission of abstracts : 20 June 2012