Trend analysis: methodology

Slides:



Advertisements
Similar presentations
Marian Scott SAGES, March 2009
Advertisements

Decomposition Method.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Experimental Design, Response Surface Analysis, and Optimization
S Larssen: PM-PP-Stockholm-Oct-2003.ppt slide 1 PM in Europe - State and past trends Emissions and concentration levels Steinar Larssen Norwegian Institute.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Derwent R.G. et al., 1998, “Observation and interpretation of the seasonal cycles in the surface concentration of ozone and carbon monoxide at Mace Head,
Analyzing and Forecasting Time Series Data
Curve-Fitting Regression
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Chapter 11 Multiple Regression.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Trend analysis for HMs and POPs Applications I. Ilyin, EMEP / MSC-East.
SIMPLE LINEAR REGRESSION
Copyright 2013 John Wiley & Sons, Inc. Chapter 8 Supplement Forecasting.
Fault Prediction and Software Aging
AGEC 622 Mission is prepare you for a job in business Have you ever made a price forecast? How much confidence did you place on your forecast? Was it correct?
Investigating the long- and short-term variations in meteorology in Atlanta Lucas Henneman 25 April, 2013.
SIMPLE LINEAR REGRESSION
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Simple Linear Regression Models
Comparative analysis of climatic variability characteristics of the Svalbard archipelago and the North European region based on meteorological stations.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Chapter 8 Curve Fitting.
AP Statistics Chapter 15 Notes. Inference for a Regression Line Goal: To determine if there is a relationship between two quantitative variables. Goal:
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
AP Statistics Chapter 15 Notes. Inference for a Regression Line Goal: To determine if there is a relationship between two quantitative variables. –i.e.
Time-Series Forecasting Overview Moving Averages Exponential Smoothing Seasonality.
Agresti/Franklin Statistics, 1 of 88  Section 11.4 What Do We Learn from How the Data Vary Around the Regression Line?
Trend analysis of HMs and POPs on the basis of measurements and modelling data Victor Shatalov and Oleg Travnikov, MSC-E.
TFEIP Workshop, Istanbul, May 2013 Emissions data for of heavy metal and POP modelling Oleg Travnikov, Alexey Gusev, Ilia Ilyin, Olga Rozovskaya, Victor.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression.
Class 23, November 19, 2015 Lesson 4.2.  By the end of this lesson, you should understand (that): ◦ Linear models are appropriate when the situation.
Economics 173 Business Statistics Lecture 25 © Fall 2001, Professor J. Petry
Air Quality trend analyses under EMEP/TFMM and link to EEA work Augustin COLETTE (INERIS), Chair of the TFMM/CLRTAP TFMM National Experts, CCC, MSC-E,
EMEP/WGE Bureaux, March 2015 MSC-E work plan, 2015 TaskItem Calculations of HMs/POPs for b Testing of HM/POP models in the new EMEP grid1.3.4.
FORECASTIN G Qualitative Analysis ~ Quantitative Analysis.
EMEP WGSR, EMEP Progress on HMs, 2006  Review and evaluation of the MSCE-HM model (TFMM)  Atmospheric pollution in 2004 (emissions, monitoring.
29 th TF meeting of the ICP-Vegetation, March, 2016, Dubna, Russia ANALYSIS OF LONG-TERM TRENDS OF ATMOSPHERIC HEAVY METAL POLLUTION IN THE EMEP COUNTRIES.
Application of regression analysis Economic structure and air pollution in a transition economy: The case of the Czech republic Gabriela Jandová Michaela.
Evaluation of pollution levels in urban areas of selected EMEP countries Alexey Gusev, Victor Shatalov Meteorological Synthesizing Centre - East.
Canadian Bioinformatics Workshops
Joint EMEP/WGE meeting, Geneva, 2016 Evaluation of B[a]P pollution in the EMEP region: temporal trends and spatial variability Alexey Gusev, Olga Rozovskaya,
A.Liudchik, V.Pakatashkin, S.Umreika, S.Barodka
Part 5 - Chapter 17.
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Part 5 - Chapter 17.
Forecasting Qualitative Analysis Quantitative Analysis.
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Maximum daily concentrations of the surface ozone in Minsk
POPs and HMs Summary , EMEP TFMM.
Alexey Gusev, Victor Shatalov, Olga Rozovskaya, Nadejda Vulyh
10th TFMM meeting, June, 2009, France, Paris
MSC-E contribution concerning heavy metals
Progress and problems of POP modelling
Trend analysis of contamination in the EMEP region by HMs & POPs
Summary: TFMM trends analysis
Trend analysis for HMs and POPs
Model assessment of HM and POP pollution of the EECCA region
Comparison of model results with measurements
based on EMEP/MSC-W model and EMEP monitoring data
Iakovos Barmpadimos, André Prévôt, Johannes Keller, Christoph Hüglin
Presentation transcript:

Trend analysis: methodology Victor Shatalov Meteorological Synthesizing Centre East : methodology 1

Main topics Trend analysis of annual averages of concentration/deposition fluxes Trend analysis of monthly averages (with seasonal variations)

Trend analysis: generalities Aim: investigation of general tendencies in time series such as: Measured and calculated pollutant concentrations at monitoring sites Average concentrations/deposition fluxes in EMEP countries … Method: trend analysis – decomposition of the considered series into regular component (trend) and random component (residue) Trend Residue 3

Main steps Detection of trend and its character: increasing decreasing mixed Identification of trend type: linear quadratic exponential other Quantification of trend: total reduction annual reduction magnitude of seasonal variations magnitude of random component Interpretation of the obtained results Presentation by Markus Wallasch, 15 TFMM meeting, April 2014 4

Determination of trend existence Mann-Kendall test: Z = (number of increasing pairs) – (number of decreasing pairs) with normalization. Critical values: ± 1.44 at 85% level ± 1.65 at 90% level ± 1.96 at 95% level B[a]P measurements: SE12 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 ng/m 3 Decreasing pair Increasing pair Z = - 1.49 Decreasing trend at 85% significance level Z = - 4.05 Z = 1.8 Mixed trend character: In the period from 1990 to 2000 – statistically significant (at 95% level) decreasing trend In the period from 2004 to 2010 – statistically significant (at 90% level) increasing trend Typical situation for HMs and POPs 5

Determination of trend type: linear trend Conc = A · Time + B + ω Calculation of A and B: regression or Sen’s slope ω – residues (random component) Z = - 3.1 decrease Z = 3.8 increase Residual trend exists Criterion of the choice of trend type: Mann-Kendall test should not show statistically significant trend on all sub-periods of the time series

Criterion of non-linearity Criterion of non-linearity of the obtained trend in time: NL = max[abs(Δi /Cichord)] · 100% i Non-linear trend C ichord Δ i Chord Fraction of non-linear trends Heavy metals (Pb) 87% POPs (B[a]P) 62% Linear trend Supposed threshold value: 10% 7

Determination of trend type: mono-exponential trend Conc = A · exp(- Time / t) + ω, t – characteristic time Calculation of A and t: least square method Z = - 3.3 Z = 3.2 decrease increase Residual trend exists

Determination of trend type: polynomial trend Calculation of A, B and C: least square method Conc = A · Time2 + B · Time + C + ω Z = 0.5 Z = -2.3 no trend decrease Residual trend exists

Determination of trend type: bi-exponential trend Conc = A1 · exp(- Time /t1) + A2 · exp(- Time /t2) Ai – amplitudes, ti – characteristic times Calculated by least square method Z = 0 Z = -1.4 no trend no trend Nonlnear regression, Gordon K. Smith, in Encyclopedia on Environmetrix, ISBN 0471899976, Wiley@Sons, 2002, vol 3, pp. 1405 - 1411 No statistically significant residual trend obtained See [Smith, 2002]

Statistical significance of increasing trend Typical situation for B[a]P: increase in the end of the period Mann-Kendall test for 2004 – 2010: does not confirm statistically significant increasing trend does not claim the absence of increasing trend Z = 1.8 TS0 – slope of calculated trend Confidence interval for trend slope: [TS0 + A, TS0 + B] [A, B] – confidence interval for slope of random component Increase is statistically significant 11

Non-linear trend analysis Conc = A1 · exp(- Year / t1) + A2 · exp(- Year / t2) + ω Regression model, non-linear in the parameters t1 and t2 Non-linear regression models are widely investigated, for example: Nonlinear regression, Gordon K. Smith, in Encyclopedia on Environmetrics, ISBN 0471899976, Wiley&Sons, 2002, vol 3, pp. 1405 – 1411 Estimating and Validating Nonlinear Regression Metamodels in Simulation, I. R. dos Santos and A. M. O. Porta Nova, Communications in Statistics, Simulation and Computation, 2007, vol. 36: pp. 123 – 137 Nonlinear regression, G. A. F. Seber and C. J. Wild, Wiley-Interscience, 2003 Nonlnear regression, Gordon K. Smith, in Encyclopedia on Environmetrix, ISBN 0471899976, Wiley@Sons, 2002, vol 3, pp. 1405 - 1411

Parameters for trend characterization: reduction/growth Total reduction per period Rtot = (Сbeg–Cend)/Cbeg=1–Cend/Cbeg Cbeg ΔCi Relative annual reduction Ri = ΔCi / Ci = (1 – Ci+1 / Ci) For the considered example: Rmin = - 6% (growth) Rmax = 15% Rav = 6% Rtot = 69% Cend Average annual reduction Rav = 1 – (Cend / Cbeg) 1/(N-1) where N – number of years Negative values of reduction mean growth Reduction parameters Rmin = min (Ri) Rmax = max (Ri) Rav Rtot

Parameters for trend characterization: random component Frand Δ Parameter: standard deviation of random component normalized by trend values Frand = σ(Δ/Ctrend) For the considered example: Frand = 11% 14

Seasonal variations of pollution B[a]P concentrations measured at EMEP site CZ3 from 1996 to 2010. Pronounced seasonal variations are seen. Pb concentrations measured at EMEP site DE7 from 1990 to 2008. Seasonal variations are also seen. Seasonal variations are characteristic of heavy metals and (particularly) for POPs 15

Possible approaches to description of seasonal variations t – time t – chatracteristic times, A, B – constants, φ – phase shifts. Bi-exponential approximation Conc = A1 · exp(– t / t1) · (1 + B1 · cos(2p · t – φ1)) + A2 · exp(– t / t2) · (1 + B2 · cos(2p · t – φ2)) Mono-exponential approximation *) Conc = A · exp(– t / t + B · cos(2p · t – φ)) or Log(Conc) = A’ – t / t + B · cos(2p · t – φ) *) Kong et al., Statistical analysis of long-term monitoring data… Environ. Sci. Techn., 10/2014 16

Usage of higher harmonics Measurement data at CZ3 from 1996 to 2010 Trend calculated by bi-exponential approach. Possible artifact: negative trend values Possibility to avoid negative values: usage of higher harmonics Conc = Tr1 + Tr2 , Tri = Ai·exp(– t / ti)·(1+Bi·cos(2p·t–φi)+Ci·cos(4p·t–ψi)) Statistical significance of second harmonic: Fisher’s test F 17

Usage of higher harmonics Average B[a]P concentrations in Europe from 1990 to 2010 (main harmonic only) Poor approximation for small values of concentrations Residues for one-harmonic approximation Pronounced harmonic trend with doubled frequency 18

Usage of higher harmonics Average B[a]P concentrations in Europe from 1990 to 2010 (main harmonic only) Poor approximation for small values of concentrations Trend including two harmonics Significance of second harmonic is confirmed by Fisher’s test 19

Splitting trends to particular components Example: average B[a]P concentrations for Germany from 1990 to 2010. Ctot Ctot = Cmain + Cseas + Crand Full trend Cseas Cmain Crand Relative annual reductions (as above): Rmin, Rmax, Rav, Rtot 20

Splitting trends to particular components Example: average B[a]P concentrations for Germany from 1990 to 2010. Ctot Ctot = Cmain + Cseas + Crand Full trend Cseas Cmain Crand Fraction of trends with essential seasonality Heavy metals (Pb) 93% POPs (B[a]P) 100% Normalization: Cseas/Cmain Average value of the annual amplitude of the normalized seasonal component Fseas Threshold value: 10% 21

Splitting trends to particular components Example: average B[a]P concentrations for Germany from 1990 to 2010. Ctot Ctot = Cmain + Cseas + Crand Full trend Cseas Cmain Crand Normalization: Crand/Cmain Standard deviation of normalized random component Frand 22

Phase shift as a fingerprint of source type Trends for PB concentrations at CZ1 2 4 6 8 10 12 14 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Air concentrations, ng/m 3 Anthropogenic Secondary Δφ Difference Δφ of phase shift φ between Pb pollution at CZ1 location due to anthropogenic and secondary sources. Phase shift can be used to determine which source type (anthropogenic or secondary) mainly contributes to the pollution at given location (in a particular country). 23

List of trend parameters Parameters for trend characterization: Relative reduction over the whole period (Rtot), Relative annual reductions of contamination: average over the period (Rav), maximum (Rmax), minimum (Rmin). Relative contribution of seasonal variability (Fseas). Relative contribution of random component (Frand). Phase shift of maximum values of contamination with respect to the beginning of the year (φ). Statistical tests: Non-linearity parameter (NL) 10% Relative contribution of seasonal variability (Fseas) 10% 24