Detection and attribution of long-term change (“trend” detection)

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Detection and attribution of temperature change in the lower stratosphere Nathan Gillett, Ben Santer.
Component Analysis (Review)
Climate change detection and attribution methods Exploratory Workshop DADA, Buenos Aires, Oct 2012 Francis Zwiers, Pacific Climate Impacts Consortium,
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Dimension reduction (1)
Detection and Attribution of Changes in Arctic Temperature Mohammad Reza Najafi, Francis Zwiers, Pacific Climate Impacts Consortium (PCIC), Nathan Gillett,
© Crown copyright Met Office Regional/local climate projections: present ability and future plans Research funded by Richard Jones: WCRP workshop on regional.
Analysis of Extremes in Climate Science Francis Zwiers Climate Research Division, Environment Canada. Photo: F. Zwiers.
IPCC Climate Change Report Moving Towards Consensus Based on real world data.
Outline Further Reading: Detailed Notes Posted on Class Web Sites Natural Environments: The Atmosphere GE 101 – Spring 2007 Boston University Myneni L29:
Detection of Human Influence on Extreme Precipitation 11 th IMSC, Edinburgh, July 2010 Seung-Ki Min 1, Xuebin Zhang 1, Francis Zwiers 1 & Gabi Hegerl.
Application of Statistical Techniques to Neural Data Analysis Aniket Kaloti 03/07/2006.
The limited contribution of model uncertainty to the uncertainty in observationally-constrained estimates of anthropogenic warming Nathan Gillett, Canadian.
Detection of anthropogenic climate change Gabi Hegerl, Nicholas School for the Environment and Earth Sciences, Duke University.
Explaining Changes in Extreme U.S. Climate Events Gerald A. Meehl Julie Arblaster, Claudia Tebaldi.
Statistical Methods for long-range forecast By Syunji Takahashi Climate Prediction Division JMA.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Hypothesis test in climate analyses Xuebin Zhang Climate Research Division.
1 The Venzke et al. * Optimal Detection Analysis Jeff Knight * Venzke, S., M. R. Allen, R. T. Sutton and D. P. Rowell, The Atmospheric Response over the.
Detection of an anthropogenic climate change in Northern Europe Jonas Bhend 1 and Hans von Storch 2,3 1 Institute for Atmospheric and Climate Science,
Photo: F. Zwiers Assessing Human Influence on Changes in Extremes Francis Zwiers, Climate Research Division, Environment Canada Acknowledgements – Slava.
Model dependence and an idea for post- processing multi-model ensembles Craig H. Bishop Naval Research Laboratory, Monterey, CA, USA Gab Abramowitz Climate.
Volcanic Climate Impacts and ENSO Interaction Georgiy Stenchikov Department of Environmental Sciences, Rutgers University, New Brunswick, NJ Thomas Delworth.
Evaluation of climate models, Attribution of climate change IPCC Chpts 7,8 and 12. John F B Mitchell Hadley Centre How well do models simulate present.
TEMPLATE DESIGN © DATA: Analysis is performed on annual mean, near-surface (2m) temperature from 14 CMIP5 climate models.
Human fingerprints on our changing climate Neil Leary Changing Planet Study Group June 28 – July 1, 2011 Cooling the Liberal Arts Curriculum A NASA-GCCE.
Environment Canada Environnement Canada Effects of elevated CO 2 on modelled ENSO variability Bill Merryfield Canadian Centre for Climate Modelling and.
Components of the Global Climate Change Process IPCC AR4.
BioSS reading group Adam Butler, 21 June 2006 Allen & Stott (2003) Estimating signal amplitudes in optimal fingerprinting, part I: theory. Climate dynamics,
The evolution of climate modeling Kevin Hennessy on behalf of CSIRO & the Bureau of Meteorology Tuesday 30 th September 2003 Canberra Short course & Climate.
Based on data to 2000, 20 years of additional data could halve uncertainty in future warming © Crown copyright Met Office Stott and Kettleborough, 2002.
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Climate Change Information Seminar Intergovernmental Panel on Climate Change Fourth Assessment Report (AR4) – the relevance to FAO’s activities Claudia.
Dimension reduction (2) EDR space Sliced inverse regression Multi-dimensional LDA Partial Least Squares Network Component analysis.
I. Statistical Methods for Genome-Enabled Prediction of Complex Traits OUTLINE THE CHALLENGES OF PREDICTING COMPLEX TRAITS ORDINARY LEAST SQUARES (OLS)
RAL, 2012, May 11 Research behaviour Martin Juckes, 11 May, 2012.
Computational Intelligence: Methods and Applications Lecture 14 Bias-variance tradeoff – model selection. Włodzisław Duch Dept. of Informatics, UMK Google:
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Sea Ice Component Description Number of historical ensemble members
Chapter 2 Projection of Future Climate Scenarios
Seasonal Arctic heat budget in CMIP5 models
A.Liudchik, V.Pakatashkin, S.Umreika, S.Barodka
Climate Change Climate change scenarios of the
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
Ruth Doherty, Edinburgh University Adam Butler & Glenn Marion, BioSS
LECTURE 10: DISCRIMINANT ANALYSIS
Visualizing Intermodel Comparison
Instrumental Surface Temperature Record
Presented by: Javed Akhter1 Contributors: Lalu Das 2and Monami Dutta2
Ronald J Stouffer Karl Taylor, Jerry Meehl and many others
The Arctic Ocean in CMIP5 Simulations
Adam Butler & Glenn Marion, Biomathematics & Statistics Scotland •
Instrumental Surface Temperature Record
Modeling the Atmos.-Ocean System
Detection of anthropogenic climate change
Contrasts & Statistical Inference
10701 / Machine Learning Today: - Cross validation,
Emerging signals at various spatial scales
An Approach to Enhance Credibility of Decadal-Century Scale Arctic
Lamont-Doherty Earth Observatory
Connecting Data with Domain Knowledge in Neural Networks -- Use Deep learning in Conventional problems Lizhong Zheng.
OVERVIEW OF LINEAR MODELS
20th Century Sahel Rainfall Variability in IPCC Model Simulations and Future Projection Mingfang Ting With Yochanan Kushnir, Richard Seager, Cuihua Li,
LECTURE 09: DISCRIMINANT ANALYSIS
Instrumental Surface Temperature Record
Inez Fung University of California, Berkeley April 2007
Kalman Filter: Bayes Interpretation
Volcanic Climate Impacts and ENSO Interaction
Contrasts & Statistical Inference
Presentation transcript:

Detection and attribution of long-term change (“trend” detection) Francis Zwiers PCIC, University of Victoria CSI, 19 July 2017 Photo: F. Zwiers (Strait of Juan de Fuca)

Introduction Today’s talk Tomorrow’s talk Key reference Detection and attribution of change in the mean state Science using this technique has become the foundation for the series of attribution assessments that have been made by the IPCC SAR – 1995 – “discernable evidence” AR5 – 2013 – it is “extremely likely that most …” Tomorrow’s talk Detection and attribution of changes in extremes Event attribution Key reference WCRP summer school on extremes, ICTP, July, 2014

Outline Overview A simpler method A more complex method Covariance matrix estimation Discussion Photo: F. Zwiers (Marsh Wren)

Definition of D & A Detection of change is defined as the process of demonstrating that climate or a system affected by climate has changed in some defined statistical sense without providing a reason for that change. Attribution is defined as the process of evaluating the relative contributions of multiple causal factors to a change or event with an assignment of statistical confidence. In WG1, casual factors usually refer to external influences, which may be anthropogenic (GHGs, aerosols, ozone precursors, land use) and/or natural (volcanic eruptions, solar cycle modulations). IPCC Good Practice Guidance Paper on Detection and Attribution, 2010

Methods Usual assumptions  leads to a regression formulation Involve simple statistical models Complex implementation due to data volumes (which are both small and large) Usual assumptions Key forcings have been identified Signals and noise are additive The large-scale patterns of response are correctly simulated by climate models, but signal amplitude is uncertain  leads to a regression formulation

X Y 11 decades (1901-1911 to 2001-2011) Evaluate Evaluate residuals Observations (HadCRUT4) Multi-model mean (ALL forcings) 1901-1910 1911-1920 1921-1930 1931-1940 1941-1950 1951-1960 1961-1970 1971-1980 1981-1990 1991-2000 2001-2010 11 decades (1901-1911 to 2001-2011) Y X After Weaver and Zwiers (2000) Evaluate scaling factors Evaluate residuals

Detection and Attribution Paradigms Hasselmann (1979, 1993) Hegerl et al (1996, 1997) Tett et al (1999) Allan and Stott (2003) Huntingford et al (2006) Hegerl and Zwiers (2011) Ribes et al (2013a, 2013b) Hannart et al (2014) Hannart (2015) Key features – single vector of observations Signals estimated with climate models Covariance matrices estimated from climate models – initially regularized by taking the Moore-Penrose inverse; more recently using, for example, the Ledoit-Wolf estimator Most studies use the Allan and Stott TLS method (ignores model uncertainty) A few studies have discussed EIV approaches, but these are very difficult to apply New paper by Ribes et al addresses model uncertainty and balance issues by avoid the regression formulation Ribes et al (2016)

Ordinary least squares regression

Y  Observations X  Expected changes – one vector for each “signal” β  Regression coefficients – aka “scaling factors” ε  Residuals – internal variability Idea is to interpret the observations with a regression model, where physics is used to provide representations of expected changes due to external influences, statistics is used to demonstrate a good fit, and physics is used to interpret the fit and rule out other putative explanations Key statistical questions relate to the βi’s and residuals ε

No assumptions about the “covariance structure” of the residuals Key assumptions Responses to forcings are additive Expected patterns of response in vectors Xi are correct Residuals εj, j=1, …, n are zero-mean … some more, discussed later No assumptions about the “covariance structure” of the residuals This is a “small sample” statistical inference problem (even if vector Y is big, covering essentially the globe and the entire instrumental period)

To fit, chose β to minimize where That is, we have a choice as to how we measure distance …. Simple least squares, non-optimal Weighted least squares, partially optimized Generalized linear regression, fully optimized

Minimizing yields Let where . Then Where Thus the signals X and observations Y are being rotated and scaled

IPCC WG1 TAR Box 12.1 (after Hasselmann, 1976) Optimization maximize S/N ratio by projecting observations onto the signal component that is least affected by noise IPCC WG1 TAR Box 12.1 (after Hasselmann, 1976)

Applying the simple OLS form

Observations Y Most studies of surface air temperature use decadal averages and some kind of spatial averaging To reduce noise from internal variability To reduce the dimension of Y Recent studies (e.g., Jones et al, 2013) use Gridded (5°×5°) monthly mean surface temperature anomalies (e.g., HadCRUT4, Morice et al, 2012) Reduced to decadal means for 1901-1920, 1911-1920 … 2001-2010 (11 decades) Often spatially reduced using a “T4” spherical harmonic decomposition ⇒ global array of 5°×5°decadal anomalies reduced to 25 coefficients Yn×1 therefore has dimension n=11×25=275

Signals Xi, i=1, …, s Number of signals s is small s=1  ALL s=2  ANT and NAT s=3  GHG, OANT and NAT s=4  … Can’t separate signals that are “co-linear” Signals estimated from either single model ensembles (size 3-10 in CMIP5) or multi-model ensembles (~172 ALL runs available in CMIP5 from 49 models, ~67 NAT runs from 21 models , ~54 GHG runs from 20 models) Process as we do the observations Transferred to observational grid, “masked”, centered, averaged using same criteria, etc. Courtesy Hui Wan, 10 July 2014 _____________ historical (ALL) simulations ___________NAT_______________GHG_______ ACCESS1-0 1 1850-2005 0 0 ACCESS1-3 3 1850-2005 0 0 bcc-csm1-1 3 1850-2012 1 1850-2012 1 1850-2012 bcc-csm1-1-m 3 1850-2012 0 0 BNU-ESM 1 1850-2005 1 1850-2005 1 1850-2005 CanESM2 5 1850-2012 5 1850-2012 5 1850-2012 CCSM4 6 1850-2005 4 1850-2005 3 1850-2005 CESM1-BGC 1 1850-2005 0 0 CESM1-CAM5-1-FV2 4 1850-2005 2 1850-2005 2 1850-2005 CESM1-CAM5 3 1850-2005 0 0 CESM1-FASTCHEM 3 1850-2005 0 0 CESM1-WACCM 1 1850-2005 0 0 CMCC-CM 1 1850-2005 0 0 CMCC-CESM 1 1850-2005 0 0 CMCC-CMS 1 1850-2005 0 0 CNRM-CM5 10 1850-2012 6 1850-2012 6 1850-2012 CSIRO-MK3-6-0 10 1850-2005 5 1850-2012 5 1850-2012 EC-EARTH 5 1850-2012 0 0 FGOALS-g2 4 1850-2005,2006,2009 3 1850-2009 1 1850-2009 FIO-ESM 3 1850-2005 0 0 GFDL-CM2p1 10 1861-2040 0 0 GFDL-CM3 5 1860-2005 3 1860-2005 3 1860-2005 GFDL-ESM2G 1 1861-2005 0 0 GFDL-ESM2M 1 1861-2005 1 1861-2005 1 1861-2005 GISS-E2-H-CC 1 1850-2010 0 0 GISS-E2-Hp1 6 1850-2012 5 1850-2012 5 1850-2012 GISS-E2-Hp2 5 1850-2005 0 0 GISS-E2-Hp3 6 1850-2005 5 1850-2012 0 GISS-E2-R-CC 1 1850-2010 0 0 GISS-E2-Rp1 5 1850-2012 5 1850-2012 5 1850-2012 GISS-E2-Rp2 6 1850-2005 0 0 GISS-E2-Rp3 6 1850-2012 5 1850-2012 0 HadCM3 10 185912-2005 0 0 HadGEM2-AO 1 1860-2005 0 0 HadGEM2-ES 2 1860-2005 4 1859-2019 4 1859-2030 IPSL-CM5A-LR 6 1850-2005 3 1850-2012 3 1850-2012 (p1&p2) IPSL-CM5B-LR 1 1850-2005 0 0 IPSL-CM5A-MR 3 1850-2005 3 1850-2012 3 1850-2012 (p2) MIROC-ESM 3 1850-2005 3 1850-2005 3 1850-2005 MIROC-ESM-CHEM 1 1850-2005 1 1850-2005 1 1850-2005 MIROC5 5 1850-2012 0 0 MPI-ESM-LR 3 1850-2005 0 0 MPI-ESM-MR 3 1850-2005 0 0 MPI-ESM-P 2 1850-2005 0 0 MRI-CGCM3p1 3 1850-2012 1 1850-2012 1 1850-2005 MRI-CGCM3p2 2 1850-2012 0 0 MRI-ESM1 1 1851-2005 0 0 NorESM1-ME 1 1850-2005 0 0 NorESM1-M 3 1850-2012 1 1850-2012 1 1850-2012 ---------------------------------------------------- total 49 models 172 runs 21 models 67 runs 20 models 54 runs ------------------------------------------------------

Examples of forced signals Solar Volcanic PCM simulated 20th century temperature response to different kinds of forcing GHGs Ozone Direct SO4 aerosol All IPCC WG1 AR4 Fig. 9.1

The generalized regression estimator of β is Need an estimate of Σ Usually estimated from control runs Even with decadal+T4 filtering, Σ is 275x275 need >275 110-year “chunks” of control run for a full-rank estimate  Need further dimension reduction Constraints on dimensionality Need to be able to invert covariance matrix Covariance needs to be well estimated Climate model should represent internal variability well Should be able to represent signal vector well

And one further constraint on estimating Σ To avoid bias, optimization and uncertainty analysis should be performed separately (Hegerl et al, 1997)  Require two independent estimates of of the covariance matrix An estimate for the optimization step and to estimate scaling factors β An estimate to make estimate uncertainties and make inferences Residuals from the regression model, are used to assess misfit and evaluate model based estimates of internal variability

Total least squares Do we really know the signal perfectly, and how do proceed if we don’t know it completely?

= ⇒ + Statistical model for Xi where a single climate simulation j, j=1,…mi, for forcing i produces Simulated 110 year change vector Deterministic forced response Internal variability = + where ⇒ That is, we assume that the δi,j ’s are independent, and that they represent repeated realizations of the internal variability ε of the observed system.

Leads to a more complicated regression model Fitting involves finding the scaling factors β and signals XForced that minimize the “size” of the n×(s+1) matrix of residuals [Δ, ε]

Covariance matrix estimation

Covariance matrix estimation “Even” with CMIP5, we often do not have enough information to estimate Σ well Keep dimensionality small (i.e., make the D&A problem only as complex as necessary) Increases signal-to-noise ratio Eliminates the need for EOF truncation Forces explicit space- and time-filtering decisions prior to conducting the D&A analysis Involves a trade off (e.g., we might lose the ability to distinguish between different signals) Use a “regularized” covariance matrix estimator

Discussion

Global warming attribution Trend in global surface temperature (1951-2010) Global mean temperature relative to 1880-1919 Jones et al, 2013 All forcings See also Figure 10.1, IPCC WG1 AR5 It is extremely likely that human influence has been the dominant cause of the observed warming since the mid-20th century. Figure TS.10, IPCC WG1 AR5

“Trend” D&A Summary Quantifies how the mean state (or some other statistic) has changed over time due to forcing Methods are regression based, rely on additivity assumption, and continue to evolve Examples Global and regional mean temperature Large body of literature, very high confidence Temperature extremes Growing literature, high confidence Precipitation extremes Emerging evidence, medium or lower confidence

Some concerns Most studies implicitly assume Gaussian noise (generally not a large concern) Sampling variability in the estimated noise covariance matrix is not accounted for well in inferences Hannart (2016) proposes a solution Most studies treat inter-model differences as sampling variability equivalent to internal variability Hannart et al (2014) proposes a partial solution Ribes et al (2016) propose an alternative approach In reality, we do not have a comprehensive statistical framework that allows us to describe how the available ensembles of opportunity have been obtained

Some concerns … Many studies still use ad-hoc methods for covariance matrix regularization (e.g., EOF-truncation) Some now use better approaches (e.g., the Ledoit-Wolf (2004) estimator) following Ribes et al (2013a, 2013b) Many studies do not discuss basic assumptions Key forcings have been identified (and thus there are no other confounding influences) Additivity of signals and noise, independence of noise on mean state Tendency to attribute based only on statistical evidence (see discussion in Mitchell et al., 2001)

Questions? Photo: F. Zwiers