Bob Livezey Climate Prediction Center Seminar February 20, 2013.

Slides:



Advertisements
Similar presentations
Kin 304 Regression Linear Regression Least Sum of Squares
Advertisements

Part II – TIME SERIES ANALYSIS C5 ARIMA (Box-Jenkins) Models
Data mining and statistical learning - lecture 6
Exponential Smoothing Methods
Validation and Monitoring Measures of Accuracy Combining Forecasts Managing the Forecasting Process Monitoring & Control.
The Simple Linear Regression Model: Specification and Estimation
Kernel methods - overview
Linear Methods for Regression Dept. Computer Science & Engineering, Shanghai Jiao Tong University.
Statistics for Managers Using Microsoft® Excel 5th Edition
Analyzing and Forecasting Time Series Data
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Tracking using the Kalman Filter. Point Tracking Estimate the location of a given point along a sequence of images. (x 0,y 0 ) (x n,y n )
Part II – TIME SERIES ANALYSIS C2 Simple Time Series Methods & Moving Averages © Angel A. Juan & Carles Serrat - UPC 2007/2008.
Time Series Report - Basic 1.Introduction 2.Long term trend 3.Seasonal Component 4.Residual Component 5.Predictions 6.Conclusion (Refer to “Stats enquiry.
Statistical Forecasting Models
Statistical Methods for long-range forecast By Syunji Takahashi Climate Prediction Division JMA.
Business Forecasting Chapter 5 Forecasting with Smoothing Techniques.
Fall, 2012 EMBA 512 Demand Forecasting Boise State University 1 Demand Forecasting.
Relationships Among Variables
Ordinary Least Squares
Constant process Separate signal & noise Smooth the data: Backward smoother: At any give T, replace the observation yt by a combination of observations.
Lecture 4 Time-Series Forecasting
Two and a half problems in homogenization of climate series concluding remarks to Daily Stew Ralf Lindau.
Regression and Correlation Methods Judy Zhong Ph.D.
STA291 Statistical Methods Lecture 27. Inference for Regression.
COAPS Workshop March 9-11, 2004 WFO Site Specific Seasonal Outlook Product Victor Murphy Southern Region Climate Service Program Manager Nicole Kempf Nicole.
CORRELATION & REGRESSION
Dr. Richard Young Optronic Laboratories, Inc..  Uncertainty budgets are a growing requirement of measurements.  Multiple measurements are generally.
Non-Linear Models. Non-Linear Growth models many models cannot be transformed into a linear model The Mechanistic Growth Model Equation: or (ignoring.
TRENDS IN U.S. EXTREME SNOWFALL SEASONS SINCE 1900 Kenneth E. Kunkel NOAA Cooperative Institute for Climate and Satellites - NC David R. Easterling National.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Time Series Forecasting Chapter 16.
DAVIS AQUILANO CHASE PowerPoint Presentation by Charlie Cook F O U R T H E D I T I O N Forecasting © The McGraw-Hill Companies, Inc., 2003 chapter 9.
Operations Management For Competitive Advantage 1Forecasting Operations Management For Competitive Advantage Chapter 11.
Breaks in Daily Climate Records Ralf Lindau University of Bonn Germany.
MBA.782.ForecastingCAJ Demand Management Qualitative Methods of Forecasting Quantitative Methods of Forecasting Causal Relationship Forecasting Focus.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Time Series Forecasting and Index Numbers Statistics.
Time series Model assessment. Tourist arrivals to NZ Period is quarterly.
1 Climate Test Bed Seminar Series 24 June 2009 Bias Correction & Forecast Skill of NCEP GFS Ensemble Week 1 & Week 2 Precipitation & Soil Moisture Forecasts.
1 Motivation Motivation SST analysis products at NCDC SST analysis products at NCDC  Extended Reconstruction SST (ERSST) v.3b  Daily Optimum Interpolation.
Federal Departement of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Revisiting Swiss temperature trends * Paulo.
On the reliability of using the maximum explained variance as criterion for optimum segmentations Ralf Lindau & Victor Venema University of Bonn Germany.
Statistical Post Processing - Using Reforecast to Improve GEFS Forecast Yuejian Zhu Hong Guan and Bo Cui ECM/NCEP/NWS Dec. 3 rd 2013 Acknowledgements:
Non-Linear Models. Non-Linear Growth models many models cannot be transformed into a linear model The Mechanistic Growth Model Equation: or (ignoring.
Bob Livezey NWS Climate Services Seminar February 13, 2013.
Dynamic Models, Autocorrelation and Forecasting ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Machine Learning 5. Parametric Methods.
Linear Regression Linear Regression. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Purpose Understand Linear Regression. Use R functions.
Time-Series Forecast Models  A time series is a sequence of evenly time-spaced data points, such as daily shipments, weekly sales, or quarterly earnings.
The joint influence of break and noise variance on break detection Ralf Lindau & Victor Venema University of Bonn Germany.
Towards a Robust and Model- Independent GNSS RO Climate Data Record Chi O. Ao and Anthony J. Mannucci 12/2/15CLARREO SDT Meeting, Hampton, VA1 © 2015 California.
Homogenization of daily data series for extreme climate index calculation Lakatos, M., Szentimey T. Bihari, Z., Szalai, S. Meeting of COST-ES0601 (HOME)
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Anthony Arguez NOAA National Climatic Data Center Phone: (828) OPTIMAL NORMALS On Improving NOAA’S Climate Normals: An.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Chapter 11 – With Woodruff Modications Demand Management and Forecasting Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Time Series Forecasting Trends and Seasons and Time Series Models PBS Chapters 13.1 and 13.2 © 2009 W.H. Freeman and Company.
Yandell – Econ 216 Chap 16-1 Chapter 16 Time-Series Forecasting.
Lecture 9 Forecasting. Introduction to Forecasting * * * * * * * * o o o o o o o o Model 1Model 2 Which model performs better? There are many forecasting.
Demand Forecasting Fall, 2016 EMBA 512 Demand Forecasting
The simple linear regression model and parameter estimation
Bob Livezey and Marina Timofeyeva
Statistical Data Analysis - Lecture /04/03
Non-Parametric Tests 12/1.
Kin 304 Regression Linear Regression Least Sum of Squares
Chapter 4: Seasonal Series: Forecasting and Decomposition
MGS 3100 Business Analysis Regression Feb 18, 2016
F test for Lack of Fit The lack of fit test..
Presentation transcript:

Bob Livezey Climate Prediction Center Seminar February 20, 2013

Outline Introduction and motivation Climates are dominantly warming so official normals are dominantly cold biased Two challenges: Estimating normals as “expected values” rather than as retrospective references Tracking the normal history; signal separation/detrending For tracking, how important is data homogenization? Methods and their expected merits Moving averages/running means Simple prescribed models assuming linear change A note about other smoothers Independent tests Impact of data sets Validation of hinge choices Relative performance on homogenized station records Conclusions and Recommendations

Introduction and Motivation The climate is warming in most locations in every season, so official normals are cold biased

Introduction and Motivation OK, so what? If a normal is only used as a reference, the cold bias doesn’t matter and the consistency of official normals might be preferred If the normal is used as the “expected value,” it does matter! Every deg F difference in normals represents a difference of over 200 expected heating degree days per unit

Introduction and Motivation Aside from possible usefulness in estimating current normals, why would we want to track the climate (i.e. detrend/separate climate change signal from climate noise)? To get the best, most relevant estimates of: Rates of warming Variability Current probabilities and conditional probabilities Assume that at least to 1 st order, so far climate noise (variability) is independent of climate change: Track the normal smoothly and simply Recenter residuals to the current climate

Is use of homogenized data necessary and important? Emphatically yes if your goals are best estimates of current climate, warming trends, probabilities and conditional probabilities!

Is use of homogenized data necessary and important? NCDC provides easy public access to homogenized station records for the 1218 UCHCN along with corresponding raw and time-of-obs (TOB) corrected series. NWS (CSD)/NCDC provides field office access to homogenized records at least at 4000 additional stations. NCDC is addressing requirements for homogenized records for both monthly mean divisional data and daily station data. Are CPC in-house records as free of inhomogeneities? In this context CPC and NCDC goals are compatible, so shouldn’t leveraged data sets be consistent?

Methods and their expected merits (demerits) Time averages: 30-years Less than 30-years Optimum Climate Normals (OCN) minimize sum of bias error (increases with averaging period) and sampling error (decreases with averaging period) Fixed 10- or 15 years (CPC 10 & CPC 15 ) Tailored to case (location/season): Best performer over dependent period (OCN) Optimize based on trend estimates (OCN 1P & OCN 2P ) Intercept of weighted regression fit to series of estimates on more and more recent training periods (OCN M )

Methods and their expected merits (demerits) Trend-based methods Full-period trend Post-1975 trend 1975 hinge (Livezey et al., 2007; L7) Estimated change-point and 2-phase hinges (3 variants) Fit change point case by case (C Est) Fit slope case by case (Two-Phase C=1975) Fit both of the above (Two-Phase C Est) Various time series smoothers (autoregressive or spline methods)

Methods and their expected merits (demerits) Desirable attributes of methods: Small squared error in estimating next year Small bias error in estimating next year Current normal stable when updated each year Can be used to track the climate smoothly and realistically through the entire record Krakuaer (Advances in Meteorology, 2012) OCNs are the least stable and can’t be used to track the full record smoothly and without compromises, but are expected to have small bias and squared errors when warming is moderate Post-1975 trend still unstable, but less so, with similar errors, but cannot track the full record Full-period trend is very stable and can track the full record smoothly but not realistically, and has larger biases and squared errors 1975 hinges (1- and 2-phase) have all desirable attributes; parsimonious, well-supported model of climate change Time series smoothers are the most arbitrary and require more compromises; generally just produce smoothed out hinges

Independent Tests of OCNs, Full-Period Trend and Hinges Wilks’ (W13; JCAM, 2013) tested CPC’s OCNs and L7 and other hinges on periods ( and respectively) after the methods were proposed; the tests were on CPC mega-divisional data W13 found for 1-year in advance temperature prediction: CPC 15 overwhelmingly best in terms of reduction of variance (RV) with respect to 30-year averages : Had 8/9 region/season cases out of 12 where an alternative beat 30-year averages, the 2-phase 1975 hinge had the other : Had 4/9 region/season cases out of 12 where an alternative beat 30-year averages, the 2-phase 1975 hinge had 3 others and CPC 10 1 Estimated hinges uniformly degraded badly the 1975 hinge results, while the phase hinge performed very comparably to the 2-phase, thereby validating the choices made by L7

Independent Tests of OCNs, Full-Period Trend and Hinges Wilks and Livezey (WL13; JCAM, 2013) repeated the tests with data through 2012 on: Megadivisional data To repeat W13’s results Station data with TOB corrections from the 1218 station USHCN TOB-corrected station data is noisier but contains steeper trend cases than megadivisional Expectation is that hinge-based methods will improve, but not empirically-determined OCNs Fully-homogenized station data from the 1218 station USHCN Expected to improve the performance of all methods

Independent Tests of OCNs, Full-Period Trend and Hinges WL13

Independent Tests of OCNs, Full-Period Trend and Hinges ( ) 15-year OCN remains best overall Impact of stations vs megadivisions as expected Homogenization improves performance for 9/11 methods! phase hinge gets even stronger validation!

Homogenized Data Results ( ) Winter: No method outperformed 30-yr average in West; 15-year average best in Central and East Spring: 1975 hinges best 2 in Central & East; 15- year average in West Overall advantage of 15- year average over 1975 hinges largely accounted for by winter and spring West

Homogenized Data Results ( ) Summer: 1975 hinges best 2 everywhere Fall: 15-year average best in Central & East; only trend beats 30- year average in West

Homogenized Data Results ( ) Alternatives to 30-year averages performed better in 11/12 regions/seasons: the winter West was the only exception 15-year fixed OCNs were best 5/12 times, fall and winter East and Central and spring West 1975 hinges were best 5/12 times, spring and summer East and Central and summer West The advantage of the fixed 15-year average over the 1975 hinges is dominantly a consequence of unusually cold halves of the year (especially in the West) during the almost 7-year test period 1975 hinges had the best two overall biases in 6/12 cases and 2 nd and 3 rd in another, no other method had more than 2

Conclusions and Discussion Warming is so ubiquitous that relevant current normals are dominantly best estimated with alternatives to 30-year averages except under extreme departures from this warming: We don’t know in advance when the exceptions will occur 15-year averages have been the most resilient for all data sets, the 1975 hinges otherwise The 1975 hinges are the best choice if bias reduction is more important than reduction of variance with respect to 30-year averages For detrending or signal separation (when relevant estimates of warming trends, or current interannual variability, probabilities and conditional probabilities are needed): The changing climate needs to be tracked smoothly and reasonably and the preferred methodology is the 1975 hinge When possible, tracking and distribution estimation should be based on homogenized records If uniformity is not a requirement, the best methodology depends on your objectives

Conclusions and Discussion WL13 Hybrid 15-year average used unless 1975 hinge slope exceeds significance threshold Horizontal axis shows increasing use of hinge from right to left Using the 1975 hinge in 14% of all cases reduces the average bias by 1/3 but increases the RMSE by less than 1%

Recommendations All retrospective work on climate variability and change should leverage the best homogenization science available; CPC should work with NCDC on this Noisy methods with artificial boundary conditions or methods that don’t reflect ubiquitous features of climate change should not be used for detrending or signal separation; why not use the hinge, it now has an even solider basis The 10-(11-?)year OCN for forecasting should immediately be replaced with a 15-year version, the hinge, or a hybrid approach