A Bayesian hierarchical modeling approach to reconstructing past climates David Hirst Norwegian Computing Center.

Slides:



Advertisements
Similar presentations
Uncertainty Quantification (UQ) and Climate Change Talking Points Mark Berliner, Ohio State Issues of continuing interest: Models, Data, Impacts & Decision.
Advertisements

Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Design of Experiments Lecture I
Multivariate Regression
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008.
16 March 2011 | Peter Janssen & Arthur Petersen Model structure uncertainty A matter of (Bayesian) belief?
Climate Research Branch / CCCma Discussion of use of statistical methods in palaeo-reconstructions Photo: F. Zwiers Francis Zwiers Climate Research Division,
Petter Mostad Linear regression Petter Mostad
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
Prediction and model selection
Monitoring and Pollutant Load Estimation. Load = the mass or weight of pollutant that passes a cross-section of the river in a specific amount of time.
Eric J. Steig & David P. Schneider University of Washington C. A. Shuman NASA/Goddard WAIS Workshop September, 2003 Reconstruction of Antarctic climate.
The Autoregressive Model of Change David A. Kenny.
Gaussian process modelling
Spatial Interpolation of monthly precipitation by Kriging method
Inferential Statistics 2 Maarten Buis January 11, 2006.
Using Resampling Techniques to Measure the Effectiveness of Providers in Workers’ Compensation Insurance David Speights Senior Research Statistician HNC.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Regional climate prediction comparisons via statistical upscaling and downscaling Peter Guttorp University of Washington Norwegian Computing Center
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Relationship between global mean sea-level, global mean temperature and heat-flux in a climate simulation of the past millennium Hans von Storch, Eduardo.
Regression Regression relationship = trend + scatter
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Optimizing Shipping Times Using Fractional Factorial Designs Steven Walfish June 6, 2002.
FAME Time Series Econometrics Daniel V. Gordon Department of Economics University of Calgary.
Statistical approach Statistical post-processing of LPJ output Analyse trends in global annual mean NPP based on outputs from 19 runs of the LPJ model.
1 Data Mining, Data Perturbation and Degrees of Freedom of Projection Regression T.C. Lin * appear in JRSS series C.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
BioSS reading group Adam Butler, 21 June 2006 Allen & Stott (2003) Estimating signal amplitudes in optimal fingerprinting, part I: theory. Climate dynamics,
Validity and Item Analysis Chapter 4.  Concerns what instrument measures and how well it does so  Not something instrument “has” or “does not have”
Line of Best fit, slope and y- intercepts MAP4C. Best fit lines 0 A line of best fit is a line drawn through data points that represents a linear relationship.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Review #2.
Dept of Bioenvironmental Systems Engineering National Taiwan University Lab for Remote Sensing Hydrology and Spatial Modeling STATISTICS Linear Statistical.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
R. Ty Jones Director of Institutional Research Columbia Basin College PNAIRP Annual Conference Portland, Oregon November 7, 2012 R. Ty Jones Director of.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 14 l Time Series: Understanding Changes over Time.
Forecasting is the art and science of predicting future events.
Comparison of Large-Scale Proxy-Based Temperature Reconstructions over the Past Few Centuries M.E. Mann, Department of Environmental Sciences University.
- 1 - Preliminaries Multivariate normal model (section 3.6, Gelman) –For a multi-parameter vector y, multivariate normal distribution is where  is covariance.
ISEN 315 Spring 2011 Dr. Gary Gaukler. Forecasting for Stationary Series A stationary time series has the form: D t =  +  t where  is a constant.
REGRESSION MODELS OF BEST FIT Assess the fit of a function model for bivariate (2 variables) data by plotting and analyzing residuals.
Leobardo Diosdado David Galaz. “Longitudinal data analysis represents a marriage of regression and time series analysis.” Source: Edward Frees (
Downscaling and Why Local Predictions are Difficult to Make Preparing Your Coast.
Inference for Regression
Multiple Imputation using SOLAS for Missing Data Analysis
Regression Chapter 6 I Introduction to Regression
What is Correlation Analysis?
Econometric methods of analysis and forecasting of financial markets
Materials for Lecture 18 Chapters 3 and 6
Multivariate Regression
Assessing uncertainties on production forecasting based on production Profile reconstruction from a few Dynamic simulations Gaétan Bardy – PhD Student.
Autocorrelation.
FORECASTING 16-Jan-19 Dr.B.Sasidhar.
Crime survey Neuchatel, 7-8 July 2011
Task 6 Statistical Approaches
Maximizing NHL Player Usage Using a Linear Optimization Model
The general linear model and Statistical Parametric Mapping
Forecasting.
Adequacy of Linear Regression Models
Seasonal Forecasting Using the Climate Predictability Tool
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Autocorrelation.
Lecturer Dr. Veronika Alhanaqtah
Statistics for genomics
FORECASTING 11-Dec-19 Dr.B.Sasidhar.
Presentation transcript:

A Bayesian hierarchical modeling approach to reconstructing past climates David Hirst Norwegian Computing Center

Temperature data Many locations Direct measure of temperature Annual or better resolution small (known?) error Not too many missing values Short series

Proxy data Long series Few (”strange”) locations Relationship with temperature unclear, may change over time Often coarse resolution Large (unknown) error Lots of missing values Pre-processing critical

Current reconstruction methods: 1)Choose proxies 2)Create matrix X of pre-processed proxy by time 3)Create matrix Y of instrumental temperatures. 4)Relate X to Y (by PCA of one or both, then regression of X on Y or Y on X) 5)Use X to predict Y back in time

Difficulties with existing methods: Missing data Spatial association between proxies and instruments lost PCA of proxy data dangerous Uncertainty in temperature data ignored Difficult to include proxies at different resolutions

Consequences: Underestimation of past climate variability Wrong uncertainty

An alternative approach Regard both instruments and proxies as observations of an underlying temperature process. Model all observations including appropriate error terms

In general: Model temperature as an underlying space-time field Model data (proxies and thermometers) as observations of this field Use appropriate functional relationship between proxies and temperature Use appropriate error terms

Specifically: True temperature T(t) an AR(1) process: Observations O = linear function of T plus AR(1) error E + measurement error For low resolution proxy replace T by mean over appropriate period

A simulation study 50 years of thermometer data 250 years of proxies True temperature AR1, coefficient=0.95, sd =1 10 thermometers, small AR1 error (coef=0.7, sd=0.1) 5 proxies, (coef=0.7, sd=1)

For comparison, regression estimator Find first pc of proxies Regress thermometer mean on pc predict ”temperature” (actually thermometer mean) using regression

Add uncertainty to proxies Only 2 proxies error sd = 2

The effect of missing data 5 proxies, error sd = 1 50% proxy data missing at random

Add a trend Only 150 years for proxies cosine trend, cycle 50 years, amplitude 4 (first 50 years) 8 (next 50) and 12 (last 50) AR1 model for temperature no longer correct

Add lots of ”bad” proxies 2 proxies linearly related to temperture 20 proxies unrelated to temperature

Some data from China Two proxies used in Moberg et at closest instrumental data sets

InstrumentalBeijingChina

Modelling conclusions A flexible model which can take account of many sources of uncertainty Theoretically easy to include spatial correlations Can include proxies at different resolutions Missing data not a problem Avoids underestimation of variability if model correct Functional form of temperature and error series very important

Other conclusions Impossible to work with proxies without help from appropriate scientists (preferably those who collected the data) Pre-processing crucial Selection of proxies important Some assumptions impossible to verify