Malaquias Peña and Huug van den Dool: Consolidation of Multi Method Forecasts. Application to monthly predictions of Pacific SST. NCEP Climate Meeting

Presentation transcript:

1 Malaquias Peña and Huug van den Dool Consolidation of Multi Method Forecasts: Application to monthly predictions of Pacific SST. NCEP Climate Meeting, April 4, 2007. Acknowledgments: Suru Saha retrieved and organized the data; Dave Unger and Peitao Peng contributed discussion on the subject.

2 DATA
Forecasting tools: 8 CGCMs, 1 statistical model
– NCEP CFS: 15 members, 9 leads
– DEMETER: 9 members, 6 leads (ECMWF, MPI, MF, UKMO, INGV, LODYC, CERFACS)
– CPC's Constructed Analog (CA): 12 members, 12 leads
What all have in common:
– Monthly forecasts, leads 0 to 5
– Initial months: Feb, May, Aug, Nov
– Length of retrospective forecasts: 21 years
FOCUS: TROPICAL PACIFIC SST, 12.5°S TO 12.5°N

3 DEFINITIONS Consolidation: making the best single forecast out of a number of forecast inputs. Objective consolidation is necessary because a large supply of forecasts is available. If K is the number of participant forecast systems, ζ, predicting a particular target month at a given lead time, the consolidation is a linear combination of the ζ_i, i = 1, ..., K, with weights α_i. For convenience, systematic errors and the observed climatology are removed from ζ. The regression coefficients (weights), α, are based on the past performance of each forecast system. o is the verifying field (e.g. the observation, with climatology removed). Given N cases of retrospective forecasts, one can train a consolidation method by comparing the consolidated forecasts with o over those cases.
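The equation for the linear combination did not survive in this transcript. A plausible reconstruction from the definitions on the slide is given below; the symbol C for the consolidated forecast and the case index j are assumptions introduced here, not taken from the slides.

```latex
C = \sum_{i=1}^{K} \alpha_i \, \zeta_i ,
\qquad
\text{training pairs: } \bigl( C_j ,\; o_j \bigr), \quad j = 1, \dots, N
```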

4 OPTIMIZING WEIGHTS Find the weights α_i, one for each forecasting tool ζ_i, that minimize the sum of squared errors ε_j in the regression of o on Z. Here Z is a matrix whose columns are the forecasting tools and whose rows are the data points in the training period, o is the column vector containing the verifying field, and ε is the vector of errors. The least-squares method (unconstrained regression, UR) solves this without any constraint on the weights.
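As a minimal sketch of the unconstrained regression described on this slide, assuming the standard least-squares setup o = Zα + ε: the function name, the synthetic data, and the sizes N and K below are hypothetical and chosen only for illustration.

```python
import numpy as np

def unconstrained_weights(Z, o):
    """Least-squares weights alpha that minimize ||Z @ alpha - o||^2."""
    alpha, *_ = np.linalg.lstsq(Z, o, rcond=None)
    return alpha

# Hypothetical training set: N cases (rows), K forecasting tools (columns).
rng = np.random.default_rng(0)
N, K = 84, 3
Z = rng.standard_normal((N, K))
true_alpha = np.array([0.5, 0.3, 0.2])
o = Z @ true_alpha  # noise-free verifying field, so the fit is exact

alpha = unconstrained_weights(Z, o)
print(alpha)
```

When Z has full column rank, the same weights follow from the normal equations, np.linalg.solve(Z.T @ Z, Z.T @ o); lstsq is used here because it is better behaved numerically.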

5 ILL-POSED MATRIX PROBLEM [Table: eigenvalues for Nino 3.4, PNA, and NAO, with the corresponding weights for UR for lead 1, im 1; annotation: "too large"]
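The ill-posedness can be illustrated with a small synthetic sketch. It assumes two hypothetical forecasting tools that are almost identical, which is not the data behind the slide's table. The near-zero eigenvalue of Z^T Z makes the individual weights poorly determined, even though their sum is well constrained.

```python
import numpy as np

rng = np.random.default_rng(1)
N = 84  # hypothetical number of training cases
signal = rng.standard_normal(N)

# Two hypothetical tools that differ only by tiny perturbations.
Z = np.column_stack([
    signal + 1e-3 * rng.standard_normal(N),
    signal + 1e-3 * rng.standard_normal(N),
])
o = signal + 0.1 * rng.standard_normal(N)

eigvals = np.linalg.eigvalsh(Z.T @ Z)  # returned in ascending order
cond = eigvals[-1] / eigvals[0]        # ratio of largest to smallest
alpha = np.linalg.solve(Z.T @ Z, Z.T @ o)

print("eigenvalues:", eigvals)
print("condition number:", cond)
print("weights:", alpha, "sum:", alpha.sum())
```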

6 RIDGE REGRESSION Minimizing the sum of squared errors subject to a constraint on the weights leads to ridge regression (DelSole, 2007) (ad hoc). Van den Dool estimates λ such that the weights are small and stable. There are many more ways to find λ; the choice depends on the characteristics of the covariance matrix Z^T Z.
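The ridge equations are missing from this transcript. The sketch below uses the textbook form α = (Z^T Z + λI)^{-1} Z^T o, which is an assumption about what the slide showed. Here λ is in the units of Z^T Z; the slides may scale it differently. The function name and the synthetic data are hypothetical.

```python
import numpy as np

def ridge_weights(Z, o, lam):
    """Textbook ridge solution: (Z^T Z + lam * I)^{-1} Z^T o."""
    K = Z.shape[1]
    return np.linalg.solve(Z.T @ Z + lam * np.eye(K), Z.T @ o)

# Hypothetical, strongly correlated forecasting tools.
rng = np.random.default_rng(2)
N = 84
signal = rng.standard_normal(N)
Z = np.column_stack([signal + 0.05 * rng.standard_normal(N) for _ in range(3)])
o = signal + 0.3 * rng.standard_normal(N)

w_ur = ridge_weights(Z, o, 0.0)     # lam = 0 reduces to unconstrained regression
w_ridge = ridge_weights(Z, o, 0.5)  # lam > 0 shrinks the weight vector

for lam in (0.0, 0.1, 0.5):
    w = ridge_weights(Z, o, lam)
    print(lam, w, np.linalg.norm(w))
```

The norm of the weight vector never increases as λ grows, which is the sense in which the ridge weights become small and stable.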

7 RIDGE REGRESSION Model weights (α_i, i = 1..9) as a function of λ for three ridge consolidation methods. The figure illustrates asymptotic values. Our methods stop at λ = 0.5. Unconstrained regression (λ = 0) results in a wide range of weights, including negative values. [Figure panels: RID, RIM, RIW; horizontal axis: λ]

8 CONSOLIDATION METHODS ASSESSED

9 CROSS-VALIDATION Anomaly Pattern correlation over the tropical Pacific. Average for all leads and initial months. Empty bar: Full (dependent), filled bar: 3-yr out cross-validated.
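A minimal sketch of a 3-years-out cross-validation follows. It assumes that the withheld years are the target year and its two neighbors; the slides do not say how the three years are chosen. The function names and the synthetic data are hypothetical, and the correlation is computed on a single time series rather than on a spatial pattern.

```python
import numpy as np

def fit_ur(Z, o):
    """Unconstrained least-squares weights."""
    return np.linalg.lstsq(Z, o, rcond=None)[0]

def cv3_predictions(Z, o, fit, half_width=1):
    """Predict each case with weights trained without it and its neighbors."""
    N = len(o)
    pred = np.empty(N)
    for j in range(N):
        keep = np.ones(N, dtype=bool)
        keep[max(0, j - half_width):min(N, j + half_width + 1)] = False
        pred[j] = Z[j] @ fit(Z[keep], o[keep])
    return pred

def anomaly_correlation(f, o):
    """Uncentered correlation; assumes climatology is already removed."""
    return float(f @ o / np.sqrt((f @ f) * (o @ o)))

# Hypothetical data: 21 years, 3 forecasting tools.
rng = np.random.default_rng(3)
N = 21
signal = rng.standard_normal(N)
Z = np.column_stack([signal + 0.5 * rng.standard_normal(N) for _ in range(3)])
o = signal + 0.5 * rng.standard_normal(N)

ac_full = anomaly_correlation(Z @ fit_ur(Z, o), o)  # dependent (full) sample
pred_cv = cv3_predictions(Z, o, fit_ur)
ac_cv = anomaly_correlation(pred_cv, o)             # cross-validated
print(ac_full, ac_cv)
```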

10 GRIDPOINT BY GRIDPOINT PERFORMANCE

11 EQUATORIAL PACIFIC

12 WESTERN TROPICAL PACIFIC Trusts the good models where they performed well at a gridpoint, and goes in the opposite direction of the bad models.

13 WESTERN TROPICAL PACIFIC MIXES CLOSEST NEIGHBORING GRIDPOINT Trusts the good models where they performed well in a 3x3 box of gridpoints, and goes in the opposite direction of the bad models.

14 WESTERN TROPICAL PACIFIC DOUBLE PASS AND MIXES CLOSEST NEIGHBORING GRIDPOINT Trusts the good models less and damps towards climatology, as negative weights are set to zero.

15 INCREASING EFFECTIVE SAMPLE
1 GRIDPOINT BY GRIDPOINT
2 3X3 BOXES
3 5X5 BOXES
4 ALL GRIDPOINTS IN THE DOMAIN
5 GRIDPOINTS IN AND OUT OF THE DOMAIN
The skill of most consolidation methods improves when the effective sample size increases. [Figure: AC for tropical Pacific SST, averaged over all leads and initial months; reference: multi-methods average]

16 INCREASING EFFECTIVE SAMPLE Consistency: percentage of cases (leads and initial months) outperforming MM

17 RELATIVE OPERATING CHARACTERISTIC CURVES Assess the ability to correctly anticipate whether or not SST anomalies will fall in the upper, middle, and lower terciles. Class limits are defined by the observed SST during the training period. Probability information from the ensemble: count the fraction of ensemble members that fall into the "above-normal", "near-normal", and "below-normal" categories, and interpret this fraction as the probability that the forecast will fall in each category. Approach for the optimized weights: each ensemble member forecast is multiplied by normalized weights. [Figure: ROC curves at lead 3; panels: upper tercile, lower tercile]
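The member-counting step can be sketched as follows. The function names and example values are hypothetical. Using the normalized weights as weights on the counts is one possible reading of "multiplied by normalized weights", not a confirmed description of the authors' procedure, and the sketch assumes non-negative weights.

```python
import numpy as np

def tercile_limits(obs_train):
    """Class limits from the observed values in the training period."""
    return np.quantile(obs_train, [1 / 3, 2 / 3])

def tercile_probabilities(members, limits, weights=None):
    """Return (below, near, above) probabilities from ensemble members.

    With weights=None every member counts equally, so each probability is
    the fraction of members in that category. Otherwise the non-negative
    weights are normalized to sum to one and summed per category.
    """
    members = np.asarray(members, dtype=float)
    w = np.ones(members.size) if weights is None else np.asarray(weights, dtype=float)
    w = w / w.sum()
    lower, upper = limits
    below = float(w[members < lower].sum())
    above = float(w[members > upper].sum())
    return below, 1.0 - below - above, above

# Hypothetical SST anomalies (degrees C) from six ensemble members.
members = [-1.0, -0.2, 0.1, 0.7, 0.9, 1.2]
limits = (-0.5, 0.5)  # hypothetical lower and upper tercile limits

p_equal = tercile_probabilities(members, limits)
p_weighted = tercile_probabilities(members, limits, weights=[3, 1, 1, 1, 0, 0])
print(p_equal, p_weighted)
```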

18 UPPER TERCILE

19 LOWER TERCILE

20 SUMMARY
All the points below are for the particular case of SST anomalies in the tropical Pacific.
– Forecasts arising from a combination of multiple models of similar skill generally outperform those from individual models; the exception is UR after CV-3.
– Even the simple average of multi-methods shows consistent improvement over the individual participant models.
– Overall, and after cross-validation, sophisticated consolidation methods improve only marginally over the simple average.
– Increasing the effective sample size increases the skill and consistency of consolidation methods.
– Consolidation methods improve significantly over the multi-methods average in the western Pacific.
– Probabilistic assessment, as measured by ROC, shows some improvement of consolidation methods over MM.
– Construction of the probability density function of the consolidation requires optimization.