Uncertainty in eddy covariance data and its relevance to gap filling David HollingerAndrew Richardson USDA Forest ServiceUniversity of New Hampshire Durham,

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

The Simple Linear Regression Model Specification and Estimation Hill et al Chs 3 and 4.
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Chapter 3 Properties of Random Variables
Mobile Robot Localization and Mapping using the Kalman Filter
Why gap filling isn’t always easy Andrew Richardson University of New Hampshire Jena Gap Filling Workshop September 2006.
Pattern Recognition and Machine Learning
Improving Understanding of Global and Regional Carbon Dioxide Flux Variability through Assimilation of in Situ and Remote Sensing Data in a Geostatistical.
Data-model assimilation for manipulative experiments Dr. Yiqi Luo Botany and microbiology department University of Oklahoma, USA.
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
Effects of model error on ensemble forecast using the EnKF Hiroshi Koyama 1 and Masahiro Watanabe 2 1 : Center for Climate System Research, University.
Integration of sensory modalities
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 10: The Bayesian way to fit models Geoffrey Hinton.
Errors and uncertainties in measuring and modelling surface-atmosphere exchanges Andrew D. Richardson University of New Hampshire NSF/NCAR Summer Course.
Uncertainty in eddy covariance datasets Dario Papale, Markus Reichstein, Antje Moffat, Ankur Desai, many others..
Gap filling using a Bayesian-regularized neural network B.H. Braswell University of New Hampshire.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Today Today: Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Estimation of parameters. Maximum likelihood What has happened was most likely.
Lecture 8 The Principle of Maximum Likelihood. Syllabus Lecture 01Describing Inverse Problems Lecture 02Probability and Measurement Error, Part 1 Lecture.
Parameter Estimation and Data Assimilation Techniques for Land Surface Modeling Qingyun Duan Lawrence Livermore National Laboratory Livermore, California.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
7. Least squares 7.1 Method of least squares K. Desch – Statistical methods of data analysis SS10 Another important method to estimate parameters Connection.
Estimation and the Kalman Filter David Johnson. The Mean of a Discrete Distribution “I have more legs than average”
Statistical Treatment of Data Significant Figures : number of digits know with certainty + the first in doubt. Rounding off: use the same number of significant.
Lecture II-2: Probability Review
TOPLHCWG. Introduction The ATLAS+CMS combination of single-top production cross-section measurements in the t channel was performed using the BLUE (Best.
1 Oceanic sources and sinks for atmospheric CO 2 The Ocean Inversion Contribution Nicolas Gruber 1, Sara Mikaloff Fletcher 2, and Kay Steinkamp 1 1 Environmental.
Hydrologic Statistics
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
The role of the Chequamegon Ecosystem-Atmosphere Study in the U.S. Carbon Cycle Science Plan Ken Davis The Pennsylvania State University The 13 th ChEAS.
The “EPRI” Bayesian M max Approach for Stable Continental Regions (SCR) (Johnston et al. 1994) Robert Youngs AMEC Geomatrix USGS Workshop on Maximum Magnitude.
Jamal Saboune - CRV10 Tutorial Day 1 Bayesian state estimation and application to tracking Jamal Saboune VIVA Lab - SITE - University.
Applications of optimal control and EnKF to Flow Simulation and Modeling Florida State University, February, 2005, Tallahassee, Florida The Maximum.
Regression. Types of Linear Regression Model Ordinary Least Square Model (OLS) –Minimize the residuals about the regression linear –Most commonly used.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Mixed Effects Models Rebecca Atkins and Rachel Smith March 30, 2015.
CAMELS CCDAS A Bayesian approach and Metropolis Monte Carlo method to estimate parameters and uncertainties in ecosystem models from eddy-covariance data.
Geology 5670/6670 Inverse Theory 21 Jan 2015 © A.R. Lowry 2015 Read for Fri 23 Jan: Menke Ch 3 (39-68) Last time: Ordinary Least Squares Inversion Ordinary.
FastOpt Quantitative Design of Observational Networks M. Scholze, R. Giering, T. Kaminski, E. Koffi P. Rayner, and M. Voßbeck Future GHG observation WS,
INTRODUCTION TO Machine Learning 3rd Edition
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Recursive Calibration of Ecosystem Models Using Sequential Data Assimilation Mingshi Chen¹, Shuguang Liu¹, Larry L. Tieszen², and David Y. Hollinger 3.
Model-Data Synthesis of CO 2 Fluxes at Niwot Ridge, Colorado Bill Sacks, Dave Schimel NCAR Climate & Global Dynamics Division Russ Monson CU Boulder Rob.
NON-LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 6 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Prepared by Dusanka Zupanski and …… Maximum Likelihood Ensemble Filter: application to carbon problems.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
Inverse Modeling of Surface Carbon Fluxes Please read Peters et al (2007) and Explore the CarbonTracker website.
ESTIMATION METHODS We know how to calculate confidence intervals for estimates of  and  2 Now, we need procedures to calculate  and  2, themselves.
Towards a Robust and Model- Independent GNSS RO Climate Data Record Chi O. Ao and Anthony J. Mannucci 12/2/15CLARREO SDT Meeting, Hampton, VA1 © 2015 California.
June 4-15, 2007 Boulder, CO National Center for Atmospheric Research Advanced Study Program Carbon and water flux measurements and applications to regional.
The Unscented Kalman Filter for Nonlinear Estimation Young Ki Baik.
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
Learning Theory Reza Shadmehr Distribution of the ML estimates of model parameters Signal dependent noise models.
CHAPTER 4 ESTIMATES OF MEAN AND ERRORS. 4.1 METHOD OF LEAST SQUARES I n Chapter 2 we defined the mean  of the parent distribution and noted that the.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
Data Modeling Patrice Koehl Department of Biological Sciences
Biointelligence Laboratory, Seoul National University
Calibration of land surface models
Probability Theory and Parameter Estimation I
Ch3: Model Building through Regression
Special Topics In Scientific Computing
Slope of the regression line:
Modelling data and curve fitting
Estimates and 95% CIs of between- and within-pair variations for SS and OS twin pairs and achievement test scores in mathematics and reading assessed in.
Linear regression Fitting a straight line to observations.
Determining the Influence of Winter on Ecosystem
6.7 Practical Problems with Curve Fitting simple conceptual problems
Microeconometric Modeling
Probabilistic Surrogate Models
Presentation transcript:

Uncertainty in eddy covariance data and its relevance to gap filling David HollingerAndrew Richardson USDA Forest ServiceUniversity of New Hampshire Durham, NH USA The Howland flux research was supported by the USDA Forest Service Northern Global Change Program, the Office of Science (BER), U.S. Department of Energy, through the Northeast Regional Center of the National Institute for Global Environmental Change under Cooperative Agreement No. DE-FC03-90ER61010, and by the Office of Science (BER), U.S. Department of Energy, Interagency Agreement No. DE-AI02- 00ER63028

All measurements are corrupted by random error: We want the flux, F, but we measure F +  +  where  is the random error and  is a bias The random error is characterized by its PDF; the first moment (  ) referred to as “uncertainty” What is uncertainty and why do we care? Uncertainty information is required to properly fit models to data; e.g. for estimating the parameters of models used to fill gaps.  Maximum Likelihood  Data assimilation (Kalman filter)  Bayesian methods Uncertainty useful for assessing model fits, annual uncertainties, risk analysis, etc.

"To put the point provocatively, providing data and allowing another researcher to provide the uncertainty is indistinguishable from allowing the second researcher to make up the data in the first place." –Raupach et al. (2005). Model data synthesis in terrestrial carbon observation: methods, data requirements and data uncertainty specifications. Global Change Biology 11:

Compare independent measurements of the same thing (x and y) - The surface must be homogenous If x and y are independent and have the same random measurement uncertainty: Determining measurement uncertainty: Two-tower method: Hollinger & Richardson (2005), Tree Physiology 25: A “one-tower” method trades time for space (Compare with next day values if environmental conditions similar) Howland

a double exponential PDF better represents the random error distribution of eddy fluxes Flux Uncertainties are non- Gaussian with non- constant variance Simultaneous measurements at 2 towers (Howland) 1 Single tower next day comparisons (Howland, Harvard, Duke, Lethbridge, WLEF, Nebraska) 2 Data-model residuals 3 1.Hollinger & Richardson (2005) Tree Physiology 25: Richardson et al. (2006) Ag. Forest Met. 136: Hagen et al. (2006), Journal of Geophysical Research 111, D08S03.

Richardson et al. (2006) Ag. Forest Met. 136: Flux uncertainty increases linearly with flux magnitude ~proportional to flux (for least squares uncertainty is constant) Forests F C > 0  = *F C F C < 0  = *F C (reduce one-tower estimates by ~25%)

Maximum likelihood – “given the data, what are the most likely model coefficients?” Determined by minimizing the difference between data and model: For Gaussian data For double exponential data

Relevance of uncertainty to gap filling: 1. Information needed to correctly determine model parameters (likelihood function). 2. The data we are trying to “fill” are contaminated by random noise so there is a minimal MAE or MSE error we can achieve, even with a perfect model!

Summary Uncertainty is a characterization of measurement error (  and PDF) Flux measurement error is not well described by a Gaussian model (least squares inappropriate) –distribution is peaked with long tails –heteroscedastic (uncertainty ↑ with magnitude of flux) Double exponential model better (median) Uncertainty can be estimated by the difference between measurements made at 1 tower under similar conditions (reduce by ~20-25%)