Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Seasonal Forecasting From DEMETER to ENSEMBLES Francisco J. Doblas-Reyes ECMWF.

Slides:



Advertisements
Similar presentations
Page 1© Crown copyright 2004 Seasonal forecasting activities at the Met Office Long-range Forecasting Group, Hadley Centre Presenter: Richard Graham ECMWF.
Advertisements

Sub-seasonal to seasonal prediction David Anderson.
LRF Training, Belgrade 13 th - 16 th November 2013 © ECMWF Sources of predictability and error in ECMWF long range forecasts Tim Stockdale European Centre.
ECMWF long range forecast systems
WCRP OSC 2011: Strategies for improving seasonal prediction © ECMWF Strategies for improving seasonal prediction Tim Stockdale, Franco Molteni, Magdalena.
Caio A. S. Coelho Centro de Previsão de Tempo e Estudos Climáticos (CPTEC) Instituto Nacional de Pesquisas Espaciais (INPE) 1 st EUROBRISA.
© Crown copyright Met Office Decadal Climate Prediction Doug Smith, Nick Dunstone, Rosie Eade, Leon Hermanson, Adam Scaife.
Seasonal Predictability in East Asian Region Targeted Training Activity: Seasonal Predictability in Tropical Regions: Research and Applications 『 East.
1 Seasonal Forecasts and Predictability Masato Sugi Climate Prediction Division/JMA.
Assessment of CFSv2 hindcast (seasonal mean) CPC/NCEP/NOAA Jan 2011.
© Crown copyright Met Office Andrew Colman presentation to EuroBrisa Workshop July Met Office combined statistical and dynamical forecasts for.
Exeter 1-3 December 2010 Monthly Forecasting with Ensembles Frédéric Vitart European Centre for Medium-Range Weather Forecasts.
1 Assessment of the CFSv2 real-time seasonal forecasts for Wanqiu Wang, Mingyue Chen, and Arun Kumar CPC/NCEP/NOAA.
Caio A. S. Coelho Supervisors: D. B. Stephenson, F. J. Doblas-Reyes (*) Thanks to CAG, S. Pezzulli and M. Balmaseda.
Performance of the MOGREPS Regional Ensemble
WWRP/WCRP/THORPEX Workshop – Representing model uncertainty on seasonal time scales2 December 2010 INSTITUT CATALÀ DE CIÈNCIES DEL CLIMA Representing model.
Caio A. S. Coelho Department of Meteorology University of Reading Met Office, Exeter (U.K.), 20 February 2006 PLAN OF TALK.
Climate Forecasting Unit Second Ph’d training talk Prediction of climate extreme events at seasonal and decadal time scale Aida Pintó Biescas.
Climate Forecasting Unit Prediction of climate extreme events at seasonal and decadal time scale Aida Pintó Biescas.
© Crown copyright Met Office An Introduction to Long-range Forecasting Emily Wallace Nov 2012.
Barcelona, 2015 Ocean prediction activites at BSC-IC3 Virginie Guemas and the Climate Forecasting Unit 9 February 2015.
First EUROBRISA Workshop17 March 2008The ENSEMBLES project Model uncertainty in climate forecasting from seasons to decades: the ENSEMBLES project Francisco.
DEMETER Taiwan, October 2003 Development of a European Multi-Model Ensemble System for Seasonal to Interannual Prediction   DEMETER Noel Keenlyside,
The La Niña Influence on Central Alabama Rainfall Patterns.
EUROBRISA Workshop – Beyond seasonal forecastingBarcelona, 14 December 2010 INSTITUT CATALÀ DE CIÈNCIES DEL CLIMA Beyond seasonal forecasting F. J. Doblas-Reyes,
NLOA2012: Global Climate PredictionMadrid, 5 July 2012 INSTITUT CATALÀ DE CIÈNCIES DEL CLIMA Global Climate Prediction: Between Weather Forecasting and.
EUROBRISA WORKSHOP, Paraty March 2008, ECMWF System 3 1 The ECMWF Seasonal Forecast System-3 Magdalena A. Balmaseda Franco Molteni,Tim Stockdale.
Improved ensemble-mean forecast skills of ENSO events by a zero-mean stochastic model-error model of an intermediate coupled model Jiang Zhu and Fei Zheng.
Caio A. S. Coelho, D. B. Stephenson, F. J. Doblas-Reyes (*) and M. Balmaseda (*) Department of Meteorology, University of Reading and ECMWF (*)
Page 1© Crown copyright 2006 Matt Huddleston With thanks to: Frederic Vitart (ECMWF), Ruth McDonald & Met Office Seasonal forecasting team 14 th March.
1 Climate Test Bed Seminar Series 24 June 2009 Bias Correction & Forecast Skill of NCEP GFS Ensemble Week 1 & Week 2 Precipitation & Soil Moisture Forecasts.
ENSEMBLES RT4/RT5 Joint Meeting Paris, February 2005 Overview of the WP5.3 Activities Partners: ECMWF, METO/HC, MeteoSchweiz, KNMI, IfM, CNRM, UREAD/CGAM,
18 September 2009: On the value of reforecasts for the TIGGE database 1/27 On the value of reforecasts for the TIGGE database Renate Hagedorn European.
© Crown copyright Met Office Standard Verification System for Long-range Forecasts (SVSLRF) Richard Graham, Met Office Hadley Centre. With acknowledgements.
The Centre for Australian Weather and Climate Research A partnership between CSIRO and the Bureau of Meteorology Verification and Metrics (CAWCR)
Page 1© Crown copyright 2004 WP5.3 Assessment of Forecast Quality ENSEMBLES RT4/RT5 Kick Off Meeting, Paris, Feb 2005 Richard Graham.
Antje Weisheimer Meteorological Training Course 27 April 2006 Antje Weisheimer Multi-model ensemble predictions on seasonal to decadal timescales.
© Crown copyright Met Office Strategy for Seasonal Prediction Development: UKMO and WGSIP activities Adam Scaife Head Monthly to Decadal Prediction Met.
One-year re-forecast ensembles with CCSM3.0 using initial states for 1 January and 1 July in Model: CCSM3 is a coupled climate model with state-of-the-art.
SASCOF 2010 Météo-France GCM forecasts JP. Céron – Météo-France
ECMWF Training course 26/4/2006 DRD meeting, 2 July 2004 Frederic Vitart 1 Predictability on the Monthly Timescale Frederic Vitart ECMWF, Reading, UK.
1 Malaquias Peña and Huug van den Dool Consolidation of Multi Method Forecasts Application to monthly predictions of Pacific SST NCEP Climate Meeting,
© Crown copyright Met Office Seasonal forecasting: Not just seasonal averages! Emily Wallace November 2012.
WGSIP12 – Miami - January 2009 – Impacts verification - Andy Morse Preliminary report on verification scores for malaria and the climate drivers DEMETER.
Details for Today: DATE:13 th January 2005 BY:Mark Cresswell FOLLOWED BY:Practical Dynamical Forecasting 69EG3137 – Impacts & Models of Climate Change.
Verification of operational seasonal forecasts at RA-VI Regional Climate Center South East European Virtual Climate Change Centre Goran Pejanović Marija.
1 An Assessment of the CFS real-time forecasts for Wanqiu Wang, Mingyue Chen, and Arun Kumar CPC/NCEP/NOAA.
1 A review of CFS forecast skill for Wanqiu Wang, Arun Kumar and Yan Xue CPC/NCEP/NOAA.
Fifth Session of the South Asian Climate Outlook Forum (SASCOF-5) JMA Seasonal Prediction of South Asian Climate for Summer 2014 Hitoshi Sato Climate Prediction.
Seasonal Outlook for 2010 Southwest Monsoon Rainfall D. S. Pai Director, Long Range Forecasting South Asian Climate Outlook Forum (SASCOF -1) April.
1 Summary of CFS ENSO Forecast December 2010 update Mingyue Chen, Wanqiu Wang and Arun Kumar Climate Prediction Center 1.Latest forecast of Nino3.4 index.
1 Summary of CFS ENSO Forecast August 2010 update Mingyue Chen, Wanqiu Wang and Arun Kumar Climate Prediction Center 1.Latest forecast of Nino3.4 index.
ENSEMBLES: Seamless seasonal-to-decadal forecasting
Synthesis of Seasonal Prediction Skill
JMA Seasonal Prediction of South Asian Climate for OND 2017
JMA Seasonal Prediction of South Asian Climate for OND 2017
Teleconnections in MINERVA experiments
Question 1 Given that the globe is warming, why does the DJF outlook favor below-average temperatures in the southeastern U. S.? Climate variability on.
GPC CPTEC: Seasonal forecast activities update
El Nino and La Nina An important atmospheric variation that has an average period of three to seven years. Goes between El Nino, Neutral, and La Nina (ENSO.
Seasonal prediction of South Asian summer monsoon 2010: Met Office
Sub-seasonal prediction at ECMWF
DEMETER Development of a European Multi-model Ensemble System for
Seasonal Prediction Activities at the South African Weather Service
Predictability assessment of climate predictions within the context
GloSea4: the Met Office Seasonal Forecasting System
Operational Seasonal Forecast Systems:
Decadal Climate Prediction at BSC
Sub-seasonal and Seasonal prediction at ECMWF
Presentation transcript:

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Seasonal Forecasting From DEMETER to ENSEMBLES Francisco J. Doblas-Reyes ECMWF

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ●Coupled ocean-atmosphere systems without assimilation of sub-surface ocean observations. ●Multi-model (ECMWF, GloSea, Météo-France, IfM-Kiel, CERFACS, INGV, LODYC) ensemble re-forecasts. ●Re-forecast period , seasonal (6 months, February, May, August and November start date), 9- member ensembles, ERA40 initialization in most cases. DEMETER

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ●Model uncertainty is a major source of forecast error. Three approaches to deal with model uncertainty are being investigated in ENSEMBLES: multi-model (ECMWF, GloSea, DePreSys, Météo-France, IfM-Kiel, CERFACS, INGV), stochastic physics (ECMWF) and perturbed parameters (DePreSys). ●Hindcasts in two streams: o Stream 1: hindcast period , seasonal (7 months, May and November start date), annual (14 months, November start date), 9-member ensembles, ERA40 initialization in most cases; DePreSys (IC and PP ensembles) 10-year runs in every instance. o Stream 2: As in Stream 1 but over , with 4 start dates for seasonal hindcasts, at least 1 for annual and at least one 3- member decadal hindcast every 5 years; DePreSys 10-year runs once a year and 30-year runs every 5 years. ENSEMBLES

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Feb 87 May 87 Aug 87 Nov 87 Feb K models x M ensemble members M*K-member ensemble Assume a multi-model ensemble system with coupled initialized GCMs Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 Ensemble climate forecast systems Lead time = 7

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Feb 87 May 87 Aug 87 Nov 87 Feb Ensemble climate forecast systems Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 Assume a multi-model ensemble system with coupled initialized GCMs Lead time = 4

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Some definitions: o Forecast system: the forecast model (e.g., a coupled ocean/atmosphere model), the initialization (and ensemble generation) method and the statistical model that creates the forecast probabilities (an ensemble is not a probability forecast). o Forecast quality: Statistical description of how well the forecasts match the observations; it has multiple attributes. Important in probability forecast systems: o Systematic error: difference in the statistical properties of the forecast and the reference distribution. o Accuracy: some sort of distance between forecasts and observations. It is used as an indication of the association, the discrimination or resolution of the forecasts. o Reliability: A measure of trustworthiness (not accuracy). Error and accuracy

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Main systematic errors in dynamical climate forecasts: o Differences between the model climatological pdf (computed for a lead time from all start dates and ensemble members) and the reference climatological pdf (for the corresponding times of the reference dataset): systematic errors in mean and variability. o Conditional biases in the forecast pdf: errors in conditional probabilities implying that probability forecasts are not trustworthy. This type of systematic error is best assessed using the reliability diagram. Temperature Differences in climatological pdfs Reference pdfModel pdf Systematic error in ensemble forecasts Threshold Forecast PDF t=1 t=2 t=3 Mean bias Different variabilities Actual occurrences

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ENSEMBLES Stream 2 T2m mean bias wrt ERA40/OPS, First month May Months 2-4 JJA Months 5-7 SON Systematic error in seasonal forecasts ECMWF Météo-France

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ENSEMBLES Stream 2 T2m ratio of standard deviation wrt ERA40/OPS ECMWF Météo-France Systematic error in seasonal forecasts First month May Months 2-4 JJA Months 5-7 SON

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ENSEMBLES Stream 2 precip. mean bias wrt GPCP, ECMWF Météo-France Systematic error in seasonal forecasts First month May Months 2-4 JJA Months 5-7 SON

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ENSEMBLES Stream 2 precipitation ratio of standard deviation wrt GPCP ECMWF Météo-France Systematic error in seasonal hindcasts First month May Months 2-4 JJA Months 5-7 SON

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Lower quintile of DJF T2m for ERA-40/OPS (top row) and ECMWF ENSEMBLES Stream 2 (bottom row) in (temperatures in °C) ERA/OPS Systematic error in seasonal forecasts ECMWF Stream 2

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Climatological PDF of DJF T2m for ERA-40/OPS and ECMWF ENSEMBLES Stream 2 computed over the period Systematic error: climatological pdf Equatorial Pacific (0, 180°) For deterministic forecasts, compute anomalies with respect to the corresponding mean For probabilistic forecasts, compute probabilities with respect to hindcast and reference thresholds (terciles) Central Europe (50°N, 10°E)

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stratospheric winds in seasonal forecasts Daily evolution of the zonal mean of the 50-hPa zonal wind for the ENSEMBLES Stream 1 hindcasts with November start date (the horizontal axis runs from the 1 st of November to the 31 st of May). Results are averaged over Black lines are for the ERA40 climatology (with shaded region covering ±σ). Every forecast system underestimates the mean wind. The same happens with the intraseasonal variability that prevents stratospheric sudden warming events from happening with the correct frequency and amplitude. From Amanda Maycock (Univ. of Reading)

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Blocking frequency in seasonal forecasts Northern Hemisphere DJF blocking frequency for 1-month lead 9-member ensemble seasonal hindcasts for DEMETER (left) and ENSEMBLES Stream 1 (right) over the period Results are for the Tibaldi and Molteni index (reversal of the meridional gradient of Z500). The dots on top of each panel show the longitudes where the mean frequency of an experiment is significantly equal with 95% confidence to the mean frequency in ERA40. DEMETER ERA40 ECMWF CNRM Met Office LODYC CERFACS INGV MPI ENSEMBLES MM Stream 1 ERA40 ECMWF Glosea CNRM IfM DePreSys

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Anomalies: ENSO forecasts

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ECMWF EUROSIP Analysis T2m forecasts for MAM 2009

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Reliability of verifiable predictions Users might have problems when dealing with probability forecasts, e.g., what is the meaning of “ 70% chance of rain ” ? Assuming that we agree on what “ rain ” means and that a set of past predictions exist, let us interpret it as “ It will rain 70% of the times the forecast system predicts the event with 70% probability ”. In this case, the user can trust the forecast. What happens if it actually rains only 50% of the times the probability forecast is 70%? The user can not trust the predictions, which are not reliable by being overconfident. This sort of information is essential for the user, especially if the user employs this forecast information in another model. When reliability information is not available, cognitive illusions (Nicholls, 1999) are favoured.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Sea surface temperature RMSE (solid) and spread (dashed) averaged over the Niño3.4 region for the ENSEMBLES Stream 1hindcasts of the 1 st November start dates over the period Persistence RMSE in dashed black. Multi-model (5 models, 45 members) ECMWF Stochastic Physics (9 members) Perturbed Parameters (9 members) Ensemble-mean error vs spread All forecast systems beat persistence. The multi-model is the most skilful system, with highest deterministic reliability (RMSE~spread), in the first 6 months, while perturbed parameters is as good for longer lead times.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Attributes diagrams for near-surface temperature in Niño3.4 of ENSEMBLES Stream 1 multi-model over Multi-model matches RMSE and spread, but gives unreliable probabilities: matching RMSE and spread does not imply reliable probability forecasts. RMSE-spread and probabilistic reliability 1-month lead DJF 3-month lead FMA

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 ●Reliability is a property of a forecast system that measures the ability to issue trustworthy probabilities oA probabilistic model is required to go from the ensemble members to probability forecasts (using e.g., probabilities as the proportion of ensemble members beyond a threshold, dressing). oForecast reliability measures the conditional biases of the forecast probabilities and is part of the systematic error of the forecast system. oReliability for dichotomous events is best illustrated in an attributes diagram (made of a reliability diagram and a forecast probability histogram). Reliability in probability forecasts Threshold Forecast PDF Actual occurrences

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Overconfident Perfect reliability Reliability diagram Forecast reliability measures conditional biases of forecast probabilities; example for event “values above the median” Climatological frequency

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Forecast reliability measures conditional biases of forecast probabilities Reliability (the smaller, the better) Reliability diagram Overconfident Perfect reliability

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Reliability score (the smaller, the better) Resolution score (the bigger, the better) Reliability diagram Overconfident Perfect reliability

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Attributes diagrams for 1-month lead seasonal (JJA) precipitation above the upper tercile over the tropical band for the ENSEMBLES Stream 1 multi-model (left, 45 members), stochastic physics (centre, 9 members) and perturbed parameters (right, 9 members) hindcasts started in May over the period verified against GPCP. The Brier and ROC skill scores, along with 95% confidence intervals (in brackets) computed using a bootstrap method, are shown on top of each panel. Multi-model (0.082,0.178) (0.378,0.502) Stochastic physics (0.005,0.105) (0.322,0.453) Perturbed parameters (0.002,0.105) (0.329,0.443) Attributes diagram

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Reliability diagrams for 1-month lead seasonal (JJA) precipitation above the upper tercile over the tropical band for the ENSEMBLES Stream 1 multi-model (left, 45 members), stochastic physics (centre, 9 members) and perturbed parameters (right, 9 members) hindcasts over the period verified against GPCP. Direct model output (no bias correction) and threshold (upper tercile) computed from the reference climatology Multi-model Stochastic physicsPerturbed parameters Direct model output

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Scores for southern South America precipitation from Stochastic Physics, Perturbed Parameters (both with 9- member ensembles) and Multi-model (5 models, 45 members). Sample values are shown with black dots along with 95% confidence intervals obtained using a bootstrap method (verified against GPCP over ). Anomaly correlation coefficient Ratio between spread and RMSE ROCSS for anomalies above the upper tercile Stream 1 seasonal hindcasts

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Scores for Mediterranean precipitation from Stochastic Physics, Perturbed Parameters (both with 9-member ensembles) and Multi-model (5 models, 45 members). Sample values are shown with black dots along with 95% confidence intervals obtained using a bootstrap method (verified against GPCP over ). Anomaly correlation coefficient Ratio between spread and RMSE ROCSS for anomalies above the upper tercile Stream 1 seasonal hindcasts

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 seasonal hindcasts Anomaly correlation coefficient of the three different forecast systems for several regions, lead times (1-1, 1-3, 2-4, 3-5, 4-6 months), start dates (May and November) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 seasonal hindcasts Standardized spread of the three different forecast systems for several regions, lead times (1-1, 1-3, 2-4, 3-5, 4-6 months), start dates (May and November) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 seasonal hindcasts Ratio of spread/RMSE of the three different forecast systems for several regions, lead times (1-1, 1-3, 2-4, 3-5, 4-6 months), start dates (May and November) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Brier skill score of the three different forecast systems for several regions, lead times (1-1, 1-3, 2-4, 3-5, 4-6 months), events (anomalies above/below the upper/lower tercile and above the median), start dates (May and November) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior with 95% confidence. Stream 1 seasonal hindcasts

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Brier skill score for infinite ensemble size of the three different forecast systems for several regions, lead times (1-1, 1-3, 2-4, 3-5, 4-6 months), events (anomalies above/below the upper/lower tercile and above the median), start dates (May and November) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior with 95% confidence. Stream 1 seasonal hindcasts

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 seasonal hindcasts Reliability skill score of the three different forecast systems for several regions, lead times (1-1, 1-3, 2-4, 3-5, 4-6 months), events (anomalies above/below the upper/lower tercile and above the median), start dates (May and November) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 seasonal hindcasts ROC skill score of the three different forecast systems for several regions, lead times (1-1, 1-3, 2-4, 3-5, 4-6 months), events (anomalies above/below the upper/lower tercile and above the median), start dates (May and November) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Sea surface temperature RMSE (solid) and spread (dashed) averaged over the Niño3.4 region for the 1 st November (bottom row) start dates over the period RMSE of simple persistence in dashed black. Multi-model (5 models, 45 members) ECMWF Stochastic Physics (9 members) Perturbed Parameters (9 members) Stream 1 annual hindcasts All forecast systems beat persistence. Multi-model is the most skilful system, with highest reliability (RMSE~spread), in the first 6 months, while the perturbed parameter system is as good for longer lead times.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Reliability diagrams for nine-month lead seasonal (ASO) precipitation above the upper tercile over the tropical band for the ENSEMBLES Stream 1 multi-model (left, 45 members), stochastic physics (centre, 9 members) and perturbed parameters (right, 9 members) hindcasts started in May over the period verified against ERA40. The Brier and ROC skill scores, along with 95% confidence intervals (in brackets) computed using a bootstrap method, are shown on top of each panel. Multi-model (-0.123,0.115) (0.052,0.402) Stochastic physics (-0.290,-0.030) (0.013,0.334) Perturbed parameters (-0.121,0.115) (0.125,0.430) Stream 1 annual hindcasts

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 annual hindcasts Anomaly correlation coefficient of the three different forecast systems for several regions, lead times (6-8, 8-10, months), events (anomalies above/below the upper/lower tercile and above the median) and variables (T2m, precipitation, Z500, T850 and MSLP) computed over the period for the November start dates. The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 annual hindcasts Standardized spread of the three different forecast systems for several regions, lead times (6-8, 8-10, months), events (anomalies above/below the upper/lower tercile and above the median) and variables (T2m, precipitation, Z500, T850 and MSLP) computed over the period for the November start dates. The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 annual hindcasts Ratio spread/RMSE of the three different forecast systems for several regions, lead times (6-8, 8-10, months), events (anomalies above/below the upper/lower tercile and above the median) and variables (T2m, precipitation, Z500, T850 and MSLP) computed over the period for the November start dates. The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 annual hindcasts Brier skill score of the three different forecast systems for several regions, lead times (6-8, 8-10, months), events (anomalies above/below the upper/lower tercile and above the median) and variables (T2m, precipitation, Z500, T850 and MSLP) computed over the period for the November start dates. The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 annual hindcasts Brier skill score for infinite ensembles of the three different forecast systems for several regions, lead times (6-8, 8-10, months), events (anomalies above/below the upper/lower tercile and above the median) and variables (T2m, precipitation, Z500, T850 and MSLP) computed over the period for the November start dates. The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 annual hindcasts Reliability Skill Score of the three different forecast systems for several regions, lead times (6-8, 8-10, months), events (anomalies above/below the upper/lower tercile and above the median) and variables (T2m, precipitation, Z500, T850 and MSLP) computed over the period for the November start dates. The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 1 annual hindcasts ROC skill score of the three different forecast systems for several regions, lead times (6-8, 8-10, months), events (anomalies above/below the upper/lower tercile and above the median) and variables (T2m, precipitation, Z500, T850 and MSLP) computed over the period for the November start dates. The inset numbers indicate the number of cases where a system is superior with 95% confidence.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Scores for Northern extratropics T2m (left) and tropical band precipitation (right) from ECMWF, Météo-France, INGV, IfM, Met Office, Perturbed Parameters, reduced multi-model (nine-member ensembles) and multi- model (5 models, 45 members). Temperature verified with ERA40/ERA-Int over and precipitation verified against GPCP over Stream 2 seasonal hindcasts Ensemble-mean anomaly correlation Ratio between spread and RMSE RelSS for anomalies above the upper tercile

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts Anomaly correlation coefficient of the two different forecast systems for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 64

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts Standardized spread of the two different forecast systems for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 72

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts Ratio spread/RMSE of the two different forecast systems for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 56

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts Brier skill score for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), events (anomalies above/below the upper/lower tercile), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precipitation, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 176

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts Debiased Brier skill score for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), events (anomalies above/below the upper/lower tercile), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precip, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 176

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts Brier skill score (infinite ensemble size) for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), events (anomalies above/below the upper/lower tercile), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precip, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 184

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts Reliability skill score for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), events (anomalies above/below the upper/lower tercile), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precip, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 200

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Stream 2 seasonal hindcasts ROC skill score for several regions (Northern Hemisphere, Tropics, Southern Hemisphere), events (anomalies above/below the upper/lower tercile), lead times (2-4, 5-7 months), start dates (Feb, May, Aug and Nov) and variables (near-surface temperature, precip, Z500, T850 and MSLP) computed over the period The inset numbers indicate the number of cases where a system is superior. 116

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 DEMETER vs ENSEMBLES Brier skill score for Niño3 SST re-forecasts in DEMETER, ENSEMBLES and DEMETER+ENSEMBLES using all start dates over the period Forecast period 2-4 months Forecast period 4-6 months x’>upper tercilex’<lower tercile

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 DEMETER vs ENSEMBLES Brier skill score for re-forecasts of near-surface temperature and precipitation for different land regions over the period DEMETER ENSEMBLES DEMETER+ENSEMBLES

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 EUROBRISA at IC3 ●The Catalan Institute for Climate Sciences (IC3, Barcelona, Spain) will start this winter a new group on seasonal and interannual climate forecasting. ●The main goals are to develop a capability to perform research on climate forecasting and to work on methods that provide useful climate information; the target regions are the Mediterranean area, South America and Africa. ●A solid link to EUROBRISA is expected at IC3. Two members of the group will work on seasonal forecasting calibration and combination with a focus on the Mediterranean region and the calibration of forecasts using non-stationary series (in the presence of trends and low- frequency variability).

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009 Summary ●Substantial systematic error, including lack of reliability, is still a fundamental problem in dynamical seasonal and interannual forecasting and forces a posteriori corrections to obtain useful predictions. ●Comprehensive assessments of the forecast quality measures (including estimates of their standard error) are indispensable in forecast system comparisons. ●Perturbed-parameter ensembles are competitive with multi-model ensembles. ●The ENSEMBLES multi-model is marginally better than the DEMETER multi-model; much is still to be gained from robust calibration and combination.

Seasonal forecasting from DEMETER to ENSEMBLES21 July 2009