Estimating emissions from observed atmospheric concentrations: A primer on top-down inverse methods Daniel J. Jacob.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Introduction to Data Assimilation NCEO Data-assimilation training days 5-7 July 2010 Peter Jan van Leeuwen Data Assimilation Research Center (DARC) University.
CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.
Quantifying North American methane emissions using satellite observations of methane columns Daniel J. Jacob with Alex Turner, Bram Maasakkers, Melissa.
Pattern Classification. Chapter 2 (Part 1): Bayesian Decision Theory (Sections ) Introduction Bayesian Decision Theory–Continuous Features.
Aspects of Conditional Simulation and estimation of hydraulic conductivity in coastal aquifers" Luit Jan Slooten.
CHAPTER 16 MARKOV CHAIN MONTE CARLO
Bayesian statistics – MCMC techniques
Visual Recognition Tutorial
ECMWF CO 2 Data Assimilation at ECMWF Richard Engelen European Centre for Medium-Range Weather Forecasts Reading, United Kingdom Many thanks to Phil Watts,
0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Nonlinear and Non-Gaussian Estimation with A Focus on Particle Filters Prasanth Jeevan Mary Knox May 12, 2006.
Retrieval Theory Mar 23, 2008 Vijay Natraj. The Inverse Modeling Problem Optimize values of an ensemble of variables (state vector x ) using observations:
REMOTE SENSING & THE INVERSE PROBLEM “Remote sensing is the science and art of obtaining information about an object, area, or phenomenon through the analysis.
CHAPTER 3: SIMPLE MODELS
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Lecture II-2: Probability Review
Classification: Internal Status: Draft Using the EnKF for combined state and parameter estimation Geir Evensen.
Use of satellite and suborbital observations to constrain North American methane emissions in the Carbon Monitoring System Daniel Jacob (PI), Steven.
Image Analysis and Markov Random Fields (MRFs) Quanren Xiong.
: Appendix A: Mathematical Foundations 1 Montri Karnjanadecha ac.th/~montri Principles of.
1 Linear Methods for Classification Lecture Notes for CMPUT 466/551 Nilanjan Ray.
Principles of Pattern Recognition
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
COST 723 Training School - Cargese October 2005 OBS 2 Radiative transfer for thermal radiation. Observations Bruno Carli.
G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
Intercomparison methods for satellite sensors: application to tropospheric ozone and CO measurements from Aura Daniel J. Jacob, Lin Zhang, Monika Kopacz.
Machine Learning Lecture 23: Statistical Estimation with Sampling Iain Murray’s MLSS lecture on videolectures.net:
GLOBAL MODELS OF ATMOSPHERIC COMPOSITION Daniel J. Jacob Harvard University.
1 ESTIMATING THE STATE OF LARGE SPATIOTEMPORALLY CHAOTIC SYSTEMS: WEATHER FORECASTING, ETC. Edward Ott University of Maryland Main Reference: E. OTT, B.
CSDA Conference, Limassol, 2005 University of Medicine and Pharmacy “Gr. T. Popa” Iasi Department of Mathematics and Informatics Gabriel Dimitriu University.
Jamal Saboune - CRV10 Tutorial Day 1 Bayesian state estimation and application to tracking Jamal Saboune VIVA Lab - SITE - University.
Finding Scientific topics August , Topic Modeling 1.A document as a probabilistic mixture of topics. 2.A topic as a probability distribution.
Comparing droplet activation parameterisations against adiabatic parcel models using a novel inverse modelling framework Warsaw: April 20 th 2015: Eulerian.
Scattering by Earth surface Instruments: Backscattered intensity I B absorption   Methane column  Application of inverse methods to constrain methane.
Overview of Techniques for Deriving Emission Inventories from Satellite Observations Frascati, November 2009 Bas Mijling Ronald van der A.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
2004 SIAM Annual Meeting Minisymposium on Data Assimilation and Predictability for Atmospheric and Oceanographic Modeling July 15, 2004, Portland, Oregon.
Research Vignette: The TransCom3 Time-Dependent Global CO 2 Flux Inversion … and More David F. Baker NCAR 12 July 2007 David F. Baker NCAR 12 July 2007.
INVERSE MODELING OF ATMOSPHERIC COMPOSITION DATA Daniel J. Jacob See my web site under “educational materials” for lectures on inverse modeling atmospheric.
Clustering and Testing in High- Dimensional Data M. Radavičius, G. Jakimauskas, J. Sušinskas (Institute of Mathematics and Informatics, Vilnius, Lithuania)
Satellite-based inversion of NOx emissions using the adjoint of CMAQ Amir Hakami, John H. Seinfeld (Caltech) Qinbin Li (JPL) Daewon W. Byun, Violeta Coarfa,
Data assimilation and forecasting the weather (!) Eugenia Kalnay and many friends University of Maryland.
FastOpt Quantitative Design of Observational Networks M. Scholze, R. Giering, T. Kaminski, E. Koffi P. Rayner, and M. Voßbeck Future GHG observation WS,
Inverse modeling in a linear algebra framework State vector x (dimension n) Observation vector y (dimension m) correlations between vector elements Use.
INVERSE MODELING TECHNIQUES Daniel J. Jacob. GENERAL APPROACH FOR COMPLEX SYSTEM ANALYSIS Construct mathematical “forward” model describing system As.
July 11, 2006Bayesian Inference and Maximum Entropy Probing the covariance matrix Kenneth M. Hanson T-16, Nuclear Physics; Theoretical Division Los.
Bayes Theorem The most likely value of x derived from this posterior pdf therefore represents our inverse solution. Our knowledge contained in is explicitly.
Bayesian Travel Time Reliability
Unscented Kalman Filter (UKF) CSCE 774 – Guest Lecture Dr. Gabriel Terejanu Fall 2015.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
Inverse Modeling of Surface Carbon Fluxes Please read Peters et al (2007) and Explore the CarbonTracker website.
RESULTS: CO constraints from an adjoint inversion REFERENCES Streets et al. [2003] JGR doi: /2003GB Heald et al. [2003a] JGR doi: /2002JD
Top-down and bottom-up estimates of North American methane emissions Daniel J. Jacob with Bram Maasakkers, Jianxiong Sheng, Melissa Sulprizio, Alex Turner.
Comparison of adjoint and analytical approaches for solving atmospheric chemistry inverse problems Monika Kopacz 1, Daniel J. Jacob 1, Daven Henze 2, Colette.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Canadian Bioinformatics Workshops
Potential of Observations from the Tropospheric Emission Spectrometer to Constrain Continental Sources of Carbon Monoxide D. B. A. Jones, P. I. Palmer,
Carbon Cycle Data Assimilation with a Variational Approach (“4-D Var”) David Baker CGD/TSS with Scott Doney, Dave Schimel, Britt Stephens, and Roger Dargaville.
Generalization Performance of Exchange Monte Carlo Method for Normal Mixture Models Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Case study: using GOSAT to estimate US emissions
CO2 sources and sinks in China as seen from the global atmosphere
(Towards) anthropogenic CO2 emissions through inverse modelling
Adjoint modeling and applications
Constraining methane emissions in North America
Filtering and State Estimation: Basic Concepts
Models of atmospheric chemistry
Estimation Error and Portfolio Optimization
Yalchin Efendiev Texas A&M University
Presentation transcript:

Estimating emissions from observed atmospheric concentrations: A primer on top-down inverse methods Daniel J. Jacob

The inverse modeling problem Quantify selected variables driving a physical system (state vector x, dim n) by using: –The observable manifestations of the system (observation vector y, dim m ) –a physical model relating x to y (forward model F) –consideration of errors (error covariance matrix S). prior estimate x A ± S A observations y ± S O forward model y = F(x) ± S M optimal estimate statistical fit Forward model gives y = F(x), but solving x = F -1 (y) is not possible due to errors Need instead a statistical fit: “inversion” “data assimilation”

Inverting for emissions: a conceptual view 2-D gridded emission field x Observed atmospheric concentration y i Sensitivity Fit x to y Emission footprint Incude prior information x A ± S A to: target the fit to most relevant components prevent unphysical solutions weigh observations vs. prior knowledge quantify information content of observations from chemical transport model (forward model) transport Adjoint of forward model (reverse flow)

Chemical transport model (CTM): forward model for inverting emissions from atmospheric observations Solve 3-D continuity equation for chemical concentrations in the atmosphere: change in concentration with time grid-resolved transport (advection) subgrid transport (turbulence) chemical production and loss emission, deposition solve for fixed gridboxes over  t utut toto to+tto+t Eulerian framework solve for point masses moving with the flow Lagrangian framework Resolution of current models: ~100 km (global) ~ 10 km (continental) ~ 1 km (regional)

Backward Lagrangian particle dispersion model (LPDM) Run Lagrangian model backward from receptor location, with points released at receptor location only Single simulation defines footprint of a given observation Adjoint of forward Lagrangian model Efficient with limited number of local observations flow backward in time

CTM relating methane emissions to concentrations is linear simulated concentrations are proportional to emissions This is not an obvious result because methane and OH are sinks for each other: Emission ↑ [CH 4 ] ↑ [OH] ↓ [CH 4 ] ↑ ↑ However, we know [CH 4 ] from global background observations to within a few % so we should not let model changes in [CH 4 ] affect [OH] The linear CTM is solely defined by its Jacobian matrix K: The CTM adjoint is the transpose K T (reverse flow)

Bayes’ theorem: foundation for inverse modeling P(x) = probability density function (PDF) of x P(x,y) = PDF of (x,y) P(y | x) = PDF of y given x prior PDF observation PDF normalizing factor (unimportant) posterior PDF Optimal estimate solution for x given y is max[P(x | y)] solve for  P(x,y)dxdy  Bayes’ theorem

General Bayesian solution to the inverse problem State vector Observation vector Forward model Prior x A with prior error covariance matrix S A Observations y with observation error covariance matrix S O prior PDF P(x)Observation PDF P(y | x) solve

Error covariance matrices and PDFs for vectors Consider vector x with expected value E[x] and error Error covariance S : Gaussian error PDF:

Optimal estimate solution assuming Gaussian errors prior PDF observation PDF Minimize cost function: solve

Linear forward model allows analytical solution y = F(x) = Kx where Is the Jacobian matrix Solution:with gain matrix posterior error covariance matrix Relate solution to true value: with A = GK averaging kernel matrix smoothing error observational error truth

A little more on the averaging kernel matrix A describes the sensitivity of the optimized estimate to the true state and hence the smoothing of the solution: Analytical inversion gives A as part of the solution: The trace of A gives the # of independent pieces of information in the inversion – also called the degrees of freedom for signal (DOFS) smoothing error observational error truth

Pros and cons of analytical solution to inverse problem PRO: Complete error characterization as part of the solution CON: Calculating Jacobian matrix can be expensive Assumption of Gaussian errors may not be best, allows negative solutions Prior error PDFs Lognormal error PDF may be more physical and enforces positivity - but does not allow analytical solution

Adjoint-based numerical solution ° ° ° ° AA 22 11 33 x2x2 x1x1 x3x3 xAxA Minimum of cost function J Solve the inverse problem by applying adjoint K T to 1. Starting from prior x A, calculate 2. Using a steepest-descent algorithm get next guess x 1 3. Calculate, get next guess x 2 4. Iterate until convergence

Pros and cons of adjoint-based solution to inverse problem PRO: No computational limitation on size of state vector Non-Gaussian error PDFs can be accommodated CON: No error characterization as part of the solution Requires availability of CTM adjoint

Monte Carlo Markov Chain (MCMC) solution to inverse problem Brute-force calculation of posterior PDF: Start from x A, calculate P(x A ) and P(y|x A ) and from there P(x A |y) Now apply a random displacement x 1 = x A + ∆x, calculate P(x 1 |y) Apply next displacement ∆x to either x A or x 1, calculate P(x 2 |y) Apply next displacement ∆x to either x 1 or x 2, calculate P(x 3 |y) … eventually the full PDF P(x|y) is constructed xAxA x1x1 x2x2 x3x3 sample population PDF isolines PRO: Use any form of prior PDF Full posterior PDF is calculated Positivity of solution can be enforced CON: Computationally expensive

Summary flow diagram for inverse methods

Selection of state vector If state vector is too large, cost function is dominated by prior: smoothing error Correct this by aggregating state vector elements, but this incurs aggregation error There is an optimal state vector dimension for fitting observations: # state vector elements aggregation smoothing As dim(x) increases, the importance of the prior terms increases Prior Observations native grid aggregated grid

Optimal clustering of 1/2 o x2/3 o gridsquares Correction factor to bottom-up emissions Native CTM resolution (n = 7096) 1000 clusters SCIAMACHY data cannot constrain emissions at 1/2 o x2/3 o resolution; reduce to 1000 clusters Wecht et al. [2014a] Selection of emission state vector in methane source inversions

Quantifying stratospheric contribution to methane columns solar occultation satellite data Stratosphere contributes 5% of methane column in tropics, 20% at high latitudes Stratospheric meridional transport in models tends to be too fast, causing overestimate of methane columns at high latitudes Not correcting for this bias can cause ~5% bias in inversion results at northern mid- latitudes, 40% in Arctic Patra et al. [2011], Ostler et al. [2015] transport observations model

Observing System Simulation Experiments (OSSEs) to evaluate future satellite missions