Tim Hoar: National Center for Atmospheric Research with a whole lot of help from: Bill Sacks, Mariana Vertenstein, Tony Craig, Jim Edwards: NCAR Andrew.

Slides:



Advertisements
Similar presentations
Variational data assimilation and forecast error statistics
Advertisements

OSE meeting GODAE, Toulouse 4-5 June 2009 Interest of assimilating future Sea Surface Salinity measurements.
Initialization Issues of Coupled Ocean-atmosphere Prediction System Climate and Environment System Research Center Seoul National University, Korea In-Sik.
Empirical Localization of Observation Impact in Ensemble Filters Jeff Anderson IMAGe/DAReS Thanks to Lili Lei, Tim Hoar, Kevin Raeder, Nancy Collins, Glen.
03/06/2015 Modelling of regional CO2 balance Tiina Markkanen with Tuula Aalto, Tea Thum, Jouni Susiluoto and Niina Puttonen.
Toward State-Dependent Moisture Availability Perturbations in a Multi-Analysis Ensemble System with Physics Diversity Eric Grimit.
Globally distributed evapotranspiration using remote sensing and CEOP data Eric Wood, Matthew McCabe and Hongbo Su Princeton University.
B1 -Biogeochemical ANL - Townhall V. Rao Kotamarthi.
Carbon Flux Bias Estimation at Regional Scale using Coupled MLEF-PCTM model Ravindra Lokupitiya Department of Atmospheric Science Colorado State University.
© 2012 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. THE DATA ASSIMILATION RESEARCH TESTBED (DART) FOR ECOLOGICAL FORECASTING Andy.
MDSS Challenges, Research, and Managing User Expectations - Weather Issues - Bill Mahoney & Kevin Petty National Center for Atmospheric Research (NCAR)
1 Terminating Statistical Analysis By Dr. Jason Merrick.
Ensemble-variational sea ice data assimilation Anna Shlyaeva, Mark Buehner, Alain Caya, Data Assimilation and Satellite Meteorology Research Jean-Francois.
Kalman filtering techniques for parameter estimation Jared Barber Department of Mathematics, University of Pittsburgh Work with Ivan Yotov and Mark Tronzo.
Ensemble Data Assimilation and Uncertainty Quantification Jeffrey Anderson, Alicia Karspeck, Tim Hoar, Nancy Collins, Kevin Raeder, Steve Yeager National.
ESA DA Projects Progress Meeting 2University of Reading Advanced Data Assimilation Methods WP2.1 Perform (ensemble) experiments to quantify model errors.
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Observing Strategy and Observation Targeting for Tropical Cyclones Using Ensemble-Based Sensitivity Analysis and Data Assimilation Chen, Deng-Shun 3 Dec,
Open Source Ensemble Kalman Filtering: the Data Assimilation Research Testbed - DART Tim Hoar, Jeffrey Anderson, Nancy Collins, Kevin Raeder, Hui Liu,
Earth Science Division National Aeronautics and Space Administration 18 January 2007 Paper 5A.4: Slide 1 American Meteorological Society 21 st Conference.
Parameter estimation: To what extent can data assimilation techniques correctly uncover stochasticity? Jim Hansen MIT, EAPS (with lots.
Page 1© Crown copyright WP4 Development of a System for Carbon Cycle Data Assimilation Richard Betts.
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss High-resolution data assimilation in COSMO: Status and.
The Merton Report an AIMES/IGBP-ESA partnership As Earth System science advances and matures, it must be supported by robust and integrated observation.
1 Geophysical Fluid Dynamics Laboratory Review June 30 - July 2, 2009.
Institute for Mathematics Applied to Geoscience Geophysical Statistics Project - GSP Data Assimilation Research Section - DAReS Turbulence Numerics Team.
Tim Hoar : National Center for Atmospheric Research with a whole lot of help from: Jeff Anderson, Nancy Collins, Kevin Raeder, Bill Sacks : NCAR Yongfei.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Architectural Design l Establishing the overall structure of a software system.
Dusanka Zupanski And Scott Denning Colorado State University Fort Collins, CO CMDL Workshop on Modeling and Data Analysis of Atmospheric CO.
Ensemble Kalman Filters for WRF-ARW Chris Snyder MMM and IMAGe National Center for Atmospheric Research Presented by So-Young Ha (MMM/NCAR)
Applications of optimal control and EnKF to Flow Simulation and Modeling Florida State University, February, 2005, Tallahassee, Florida The Maximum.
Assimilating chemical compound with a regional chemical model Chu-Chun Chang 1, Shu-Chih Yang 1, Mao-Chang Liang 2, ShuWei Hsu 1, Yu-Heng Tseng 3 and Ji-Sung.
Soil moisture generation at ECMWF Gisela Seuffert and Pedro Viterbo European Centre for Medium Range Weather Forecasts ELDAS Interim Data Co-ordination.
MODEL ERROR ESTIMATION EMPLOYING DATA ASSIMILATION METHODOLOGIES Dusanka Zupanski Cooperative Institute for Research in the Atmosphere Colorado State University.
Development of an EnKF to estimate CO 2 fluxes from realistic distributions of X CO2 Liang Feng, Paul Palmer
Pg 1 A Loosely Coupled Ocean-Atmosphere Ensemble Assimilation System. Tim Hoar, Nancy Collins, Kevin Raeder, Jeffrey Anderson, NCAR Institute for Math.
# # # # An Application of Maximum Likelihood Ensemble Filter (MLEF) to Carbon Problems Ravindra Lokupitiya 1, Scott Denning 1, Dusanka Zupanski 2, Kevin.
© Crown copyright Met Office The EN4 dataset of quality controlled ocean temperature and salinity profiles and monthly objective analyses Simon Good.
AIRS Radiance and Geophysical Products: Methodology and Validation Mitch Goldberg, Larry McMillin NOAA/NESDIS Walter Wolf, Lihang Zhou, Yanni Qu and M.
WP3 WP6 USE CASE DATA MODEL FUSION USING PHENOLOGICAL DATA TO INFORM PRODUCTIVITY MODEL Andy Fox, David Moore, Jesus Marco de Lucas, Jeff Taylor, and many.
Local Predictability of the Performance of an Ensemble Forecast System Liz Satterfield and Istvan Szunyogh Texas A&M University, College Station, TX Third.
Goal: to understand carbon dynamics in montane forest regions by developing new methods for estimating carbon exchange at local to regional scales. Activities:
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
Classification Ensemble Methods 1
Geogg124: Data assimilation P. Lewis. What is Data Assimilation? Optimal merging of models and data Models Expression of current understanding about process.
Pg 1 SAMSI UQ Workshop; Sep Ensemble Data Assimilation and Uncertainty Quantification Jeff Anderson National Center for Atmospheric Research.
1 Information Content Tristan L’Ecuyer. 2 Degrees of Freedom Using the expression for the state vector that minimizes the cost function it is relatively.
Ensemble forecasting/data assimilation and model error estimation algorithm Prepared by Dusanka Zupanski and Milija Zupanski CIRA/CSU Denning group meeting.
Future Directions in Ensemble DA for Hurricane Prediction Applications Jeff Anderson: NCAR Ryan Torn: SUNY Albany Thanks to Chris Snyder, Pavel Sakov The.
École Doctorale des Sciences de l'Environnement d’ Î le-de-France Année Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
École Doctorale des Sciences de l'Environnement d’Île-de-France Année Universitaire Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
CESM-CAM5 Large Ensemble Update May 13, list for all future updates -
Demonstration and Comparison of Sequential Approaches for Altimeter Data Assimilation in HYCOM A. Srinivasan, E. P. Chassignet, O. M. Smedstad, C. Thacker,
Introduction to emulators Tony O’Hagan University of Sheffield.
June 20, 2005Workshop on Chemical data assimilation and data needs Data Assimilation Methods Experience from operational meteorological assimilation John.
Pg 1 ESPC DA Workshop; Sept The Data Assimilation Research Testbed: A Community Ensemble DA Facility Jeffrey Anderson, Nancy Collins, Tim Hoar, Hui.
Jeffrey Anderson, NCAR Data Assimilation Research Section
Alexander Loew1, Mike Schwank2
Jeffrey Anderson, NCAR Data Assimilation Research Section
Information content in ensemble data assimilation
Jeff Anderson, NCAR Data Assimilation Research Section
Parallel Implementations of Ensemble Kalman Filters for Huge Geophysical Models Jeffrey Anderson, Helen Kershaw, Jonathan Hendricks, Nancy Collins, Ye.
Jeff Anderson, NCAR Data Assimilation Research Section
Kostas M. Andreadis1, Dennis P. Lettenmaier1
Andy Wood and Dennis P. Lettenmaier
Error statistics in data assimilation
Assimilation of Global Positioning System Radio Occultation Observations Using an Ensemble Filter in Atmospheric Prediction Models Hui Liu, Jefferey Anderson,
Data Assimilation Initiative, NCAR
Sarah Dance DARC/University of Reading
Presentation transcript:

Tim Hoar: National Center for Atmospheric Research with a whole lot of help from: Bill Sacks, Mariana Vertenstein, Tony Craig, Jim Edwards: NCAR Andrew Fox: National Ecological Observatory Network (NEON) Nancy Collins, Kevin Raeder, Jeff Anderson: NCAR Yongfei Zhang: University of Texas Austin Data Assimilation with CLM & DART

Is DA different for NWP and ecosystem models? Data Assimilation in NWPData Assimilation in CLM Main objectiveImproved initial conditions Forecast improvement Process understanding Regional quantification Forecasting DynamicsPhysics – essentially well known from first principles Physical, biological, chemical – Only partially known, empirical relationships ObservationsHigh spatial and temporal densityVery different spatial and temporal characteristics Mathematical problem Optimization of initial conditionsInitial value problem (e.g. pools) Boundary conditions (e.g. fluxes) Parameter optimization

CLM instances The increments are regressed onto as many CLM state variables as you like. If there is a correlation, the CLM state gets adjusted in the restart file. 1. A way to make model forecasts. 2. A way to estimate what the observation would be – given the model state. This is the observation operator – h. A generic ensemble filter system like DART needs:

A multi-instance version of CESM has been developed that more easily facilitates ensemble-based DA For example, multiple land models can be driven by multiple data-atmospheres in a single executable. This capability should be available in the next CESM release. Multi-instance CESM code Right now, this is only available on the development branch!

Creating the initial ensemble of CLM. model time “a long time” “spun up” Replicate what we have N times. Use a unique (and different!) realistic DATM for each. Run them forward for “a long time”. Getting a proper initial ensemble is an area of active research. We don’t know how much spread we NEED to capture the uncertainty in the system.

The ensemble advantage. In a free run, the ensemble spread frequently grows. With a good assimilation: ensemble spread ultimately remains stable and small enough to be informative You can represent uncertainty. observation times

Atmospheric Reanalysis Assimilation uses 80 members of 2 o FV CAM forced by a single ocean (Hadley+ NCEP-OI2) and produces a very competitive reanalysis. O(1 million) atmospheric obs are assimilated every day. Generates spread in the land model. Each CLM ensemble member is forced with a different atmospheric reanalysis member. 500 hPa GPH Feb x daily is available x daily is available.

Our goal has been to “Do no harm” to CLM DART’s namelist allows you to choose what CLM variables get updated by the assimilation New routines communicate between CLM and DART At predetermined assimilation intervals: 1.CESM/CLM stops and writes restart and history files 2.DART state vector extracted from CLM restart & history files 3.Increments calculated and applied to DART state vector 4.CLM restart files updated with adjusted DART state vector 5.CESM postrun script executes CLM-DART coupling Proof-of-concept using Perfect Model Experiment of leafc follows. 18 synthetic observation locations of leaf carbon 40 CLM instances spun up for several months

Information from a site is extrapolated across space through the covariance matrix represented by the ensemble of CLM instances. Generally, largest updates closest to observation sites. Innovation map of leafc on 4 May 2000 leaf carbon

40 member ensemble of CLM forced with meteorology from 40 different data atmospheres in 2° grid global runs Leaf carbon is a key variable in CLM strongly influencing productivity, evapotranspiration and radiation dynamics Run 1 ensemble member forward from 1 May 2000, harvesting daily observations of leafc at 16 FLUXNET locations Run 40 ensemble members forward from 1 May 2000 for 30 days, assimilating synthetic observations An example of data assimilation in the CLM Global leafc, 1 May 2000

Proof-of-concept with leaf carbon Prior and posterior probability distributions of leaf carbon in a single grid cell at 60°W, 4°S for nine days of assimilation

40 member ensemble of leafc in a single grid cell corresponding to 60.21°W, 2.61°S (Manaus, Brazil). Ensemble members (blue lines) show impact of assimilation. Time series of “truth”, obs and 40 ens members Andy has a CSL proposal for 420,000 core-hours on Yellowstone to continue.

CESM/CLM DART logic 1.CESM advances CLM to some time when we have observations – and STOPS. 1.This is done through the normal CESM framework: env_run.xml 2.A call to a DART shell script “assimilate.csh” is made. 1.DART makes a ‘clean’ directory in the CESM $RUN directory 2.The valid time of the model is determined and the appropriate file containing the observations is linked to a static file name – IMPORTANT. 3.DART makes a directory for each CLM instance and converts the CLM restart file to a DART initial conditions file – serially, but simultaneously. 4.‘filter’ runs and performs the assimilation on the DART initial conditions files and writes out DART restart files. 5.A subset of the information in the DART restart files is used to update the CLM restart files. Ask me why. Again – serially, but simultaneously. 3.The CESM ccsm_postrun.csh script executes as normal. 1.It may advance the model 2.It may call the short-term archiver 3.… whatever you told it in env_run.xml

Details DART allows you to choose what CLM variables get updated by the assimilation. &clm_vars_nml clm_state_variables = 'frac_sno', 'KIND_SNOWCOVER_FRAC', 'DZSNO', 'KIND_SNOW_THICKNESS', 'H2OSNO', 'KIND_SNOW_WATER’, 'T_SOISNO', 'KIND_SOIL_TEMPERATURE', ‘leafc’, ‘KIND_LEAF_CARBON’ / These are read from a CLM restart file and reinserted after the assimilation. Potential problem … balance/consistency?

Assimilation of MODIS snow cover fraction 80 member ensemble for onset of NH winter Assimilate once per day Level 3 MODIS product – regridded to a daily 1 degree grid Observation error variance is 0.1 (for lack of a better value) Observations can impact state variables within 200km CLM variable to be updated is the snow water equivalent “ H2OSNO ” Standard deviation of the CLM snow cover fraction initial conditions for Oct. 2002

An early result: assimilation of MODIS snowcover fraction on total snow water equivalent in CLM. Prior for Nov 30, 2002 Increments (Prior – Posterior) Focus on the non-zero increments The model state is changing in reasonable places, by reasonable amounts. At this point, that’s all we’re looking for. kg/m2 Thanks Yongfei!

Slushy Snow? New Snow? Snow Albedo? Snow Density? Dirty Snow? Dry Snow? Wet Snow? Old Snow? Early Season Snow? Packed Snow? Crusty Snow? Corn Snow? Sugar Snow? “Champagne Powder”? The ensemble must have some uncertainty, it cannot use the same value for all. The model expert must provide guidance. It’s even worse for the hundreds of carbon-based quantities!

Use the CESM multi-instance capability to run simultaneous instances of CLM. Force each instance with different realistic atmospheric conditions (say, from an offline CAM/DART assimilation). Assimilate observations every time CESM stops. Modify the CLM restart file contents to be more consistent with observations – and not just at the observation location! Use CLM history files to provide model states to compare with observations, i.e. the observation operator IS the history file (GRACE observations, NEE, … ). What can CLM-DART do right now:

Use the CESM multi-instance capability to run up to 80 simultaneous instances of CLM Force each instance with different realistic atmospheric conditions (from an offline CAM/DART assimilation) Use the multi-instance capability to assimilate every midnight Modify the CLM restart file contents to be more consistent with observations – and not just at the observation location Can use CLM history files to provide model states to compare with observations, i.e. the observation operator IS the history file (GRACE observations, NEE, … ) Defeat any (and all?) balance checks Erik can throw at us … Blow your file quota on any machine, any time, without breaking a sweat … What can CLM-DART do right now:

Observation support / Forward operators obs_def_1d_state_mod.f90obs_def_GWD_mod.f90obs_def_simple_advection_mod.f90 obs_def_AIRS_mod.f90obs_def_metar_mod.f90obs_def_TES_nadir_mod.f90 obs_def_altimeter_mod.f90obs_def_ocean_mod.f90obs_def_tower_mod.f90 obs_def_AOD_mod.f90obs_def_pe2lyr_mod.f90obs_def_upper_atm_mod.f90 obs_def_cloud_mod.f90obs_def_QuikSCAT_mod.f90obs_def_vortex_mod.f90 obs_def_dew_point_mod.f90obs_def_radar_mod.f90obs_def_wind_speed_mod.f90 obs_def_gps_mod.f90obs_def_reanalysis_bufr_mod.f90 obs_def_gts_mod.f90obs_def_rel_humidity_mod.f90 There is a clean separation between DART, the model, and the observations. This allows for modular support of multiple models and multiple forward operators, and a tremendous amount of code reuse. The same code is used to assimilate with the Lorenz ‘63 model as is used with every other model. The same observation datasets can be used with regional or global models, data denial experiments, etc. This is a list of the modules used to support different observation types:

Problems to be solved: Proper initial ensemble Creating snow with the right characteristics Bounded quantities When all ensembles have identical values the observations cannot have any effect with the current algorithms Forward operators – many flux observations are over timescales that are inconvenient – need soil moisture from last month and now… CLM has a lot of carbon species, hard to support all the forward operators required CLM’s abstraction of grid cells, land units, etc., make the treatment of observations very peculiar. All land units in a grid cell share a location. Easy to have ‘contradictory’ observations.

For more information:

CLM instances The increments are regressed onto as many CLM state variables as you like. If there is a correlation, the CLM state gets adjusted in the restart file. 1. A way to make model forecasts; 2. A way to compute the observation operators, h. A generic ensemble filter system like DART needs:

History Files History file games: We can query a history file for the CLM state at cat ! user_nl_clm &clm_inparm hist_empty_htapes =.false. hist_fincl1 = 'NEP' hist_fincl2 = 'NEP' hist_nhtfrq = -24,1, hist_mfilt = 1,48 hist_avgflag_pertape = 'A','A' / EOF

Niwot Ridge

slide held in reserve

Ensemble Filter for Large Geophysical Models 1. Use model to advance ensemble (3 members here) to time at which next observation becomes available. Ensemble state estimate after using previous observation (analysis) Ensemble state at time of next observation (prior)

2. Get prior ensemble sample of observation, y = h(x), by applying forward operator h to each ensemble member. Theory: observations from instruments with uncorrelated errors can be done sequentially. Ensemble Filter for Large Geophysical Models

3. Get observed value and observational error distribution from observing system. Ensemble Filter for Large Geophysical Models

4. Find the increments for the prior observation ensemble (this is a scalar problem for uncorrelated observation errors). Note: Difference between various ensemble filters is primarily in observation increment calculation. Ensemble Filter for Large Geophysical Models

5. Use ensemble samples of y and each state variable to linearly regress observation increments onto state variable increments. Theory: impact of observation increments on each state variable can be handled independently! Ensemble Filter for Large Geophysical Models

6. When all ensemble members for each state variable are updated, there is a new analysis. Integrate to time of next observation … Ensemble Filter for Large Geophysical Models