R. Werner Solar Terrestrial Influences Institute - BAS Time Series Analysis by descriptive statistic.

Slides:



Advertisements
Similar presentations
Chapter 9. Time Series From Business Intelligence Book by Vercellis Lei Chen, for COMP
Advertisements

Multiple Regression and Model Building
Managerial Economics in a Global Economy
DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Exam 1 review: Quizzes 1-6.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
R. Werner Solar Terrestrial Influences Institute - BAS Time Series Analysis by means of inference statistical methods.
Correlation and regression
Chapter 10 Simple Regression.
BA 555 Practical Business Analysis
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
Chapter 11 Multiple Regression.
Simple Linear Regression Analysis
Part II – TIME SERIES ANALYSIS C2 Simple Time Series Methods & Moving Averages © Angel A. Juan & Carles Serrat - UPC 2007/2008.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. by Lale Yurttas, Texas A&M University Chapter 171 CURVE.
Correlation and spectral analysis Objective: –investigation of correlation structure of time series –identification of major harmonic components in time.
Correlation and Regression Analysis
Simple Linear Regression Analysis
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Lecture 5 Correlation and Regression
Forecast for the solar activity based on the autoregressive desciption of the sunspot number time series R. Werner Solar Terrestrial Influences Institute.
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
Chapter 11 Simple Regression
LSS Black Belt Training Forecasting. Forecasting Models Forecasting Techniques Qualitative Models Delphi Method Jury of Executive Opinion Sales Force.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
Regression Method.
Time-Series Analysis and Forecasting – Part V To read at home.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Chapter 12 Multiple Regression and Model Building.
Simple Linear Regression Models
STAT 497 LECTURE NOTES 2.
Correlation.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
Chapter 16: Time-Series Analysis
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Chapter 8 Curve Fitting.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Ch 5-1 © 2004 Pearson Education, Inc. Pearson Prentice Hall, Pearson Education, Upper Saddle River, NJ Ostwald and McLaren / Cost Analysis and Estimating.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
MANAGEMENT SCIENCE AN INTRODUCTION TO
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Review and Summary Box-Jenkins models Stationary Time series AR(p), MA(q), ARMA(p,q)
Module 4 Forecasting Multiple Variables from their own Histories EC 827.
Components of Time Series Su, Chapter 2, section II.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Chapter 11 – With Woodruff Modications Demand Management and Forecasting Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Regression Analysis AGEC 784.
Inference for Least Squares Lines
Part 5 - Chapter
Part 5 - Chapter 17.
What is Correlation Analysis?
Correlation – Regression
BA 275 Quantitative Business Methods
Part 5 - Chapter 17.
SIMPLE LINEAR REGRESSION
3.2. SIMPLE LINEAR REGRESSION
Presentation transcript:

R. Werner Solar Terrestrial Influences Institute - BAS Time Series Analysis by descriptive statistic

Def.: A time series is a sequence of data points measured at successive times (often) spaced in uniform time intervals. Time series analysis comprises methods that attempt to understand time series, often either to understand the underlying context of the data points - where did they come from, what generated them, or to make forecasts (predictions). From Wikipedia.org

Using methods of descriptive Statistic of quantitative cross-section analysis, important measures are: - arithmetic mean - variance - correlation coefficient Do not forget visualization scatter plots example: histogramms For the time series meaningful only for stationarity!

Auto-correlation For the time series auto-covariance used in practice: symmetric for k lag k

It is not known which series is the leading series Correspondence of the cross-correlation to the quantitative cross-section analysis with lag k or Relation of two time series, co-variance: cross-correlation: non-symmetric for k

Time series decomposition into components often non-stationary (we have trends) and periodical variations Models: additive : multiplicative: mixed: T: trend S: seasonal R: rest, noise by logarithmizing → transition to additive model

Step by step: 1.Trend determination 2.Trend subtraction from the series and determination of the seasonal component 3. After removing the seasonal component, the rest remains After this: analysis of the rest, correlation, seasonality or other periodicities or a trend

Determination of the trend Global trend (over the entire observation interval) or polynomial regression model of order p, splines Square sum of errors: F-test Not to be used for prognoses, (increasing with p)

Other linear models: exponential model logistic trend functions A>0 C>0 Local trend: moving average (running mean), to remove oscillations (seasonality) odd: even: point numbers

How does the variance change? 2q+1 is the number of sampling points b i are the weights where Besides, for removing the seasonal means, we have to calculate the running mean over 13 months, with b i = 1/24 for the first and the last month, otherwise b i =1/12 ! For the given examples:

Trend removing by calculation of differences Linear trend: Polynomial trend: recursive formulae

Problems related to the trend determination 1.For short time series, the determined trend will not be equal to the long time trend, and will not be distinguishable from the longer periodicities 2.By smoothing the reversal points of the time series are shifted 3.The production of autocorrelations by smoothing with running averages (quasi-periodicities – Slutzky effect)

FFT of the basic period,without trend FFT of the basic period with trend

Determination of the seasonal component Phase average Assumption: no trend! i is the month k is the number of years also: the perfect case: in practice: A very simple method for constant seasonal variations Standardized phase average

Or dummy regression with: 1 if the month number i 0 else 12 equations ! or together with a polynomial trend For a multiplicative model:

Periodogram analysis Strategies: - Step by step determination of the period T p - Test of a theoretical hypothesis Fourier frequencies (n odd) The entire time interval is used for T 1 Harmonic analysis - non-equidistant time intervals - choice of the basic period

Harmonic series oIf j/n are Fourier frequencies, the regressor functions are orthogonal. All coefficients can be calculated together and they are not changed by the choice of a new m oIf j/n are not Fourier frequencies, then we have to calculate all coefficients again by changing m oIf the data number is equal to the calculated coefficients, then we have no degree of freedom, the calculated series is not an estimation. The error term is zero! → filter

It can be proven that r 2 is the determination coefficient, the part of the explained sum of the squared deviations, besides is the explained sum of the squared deviations Plot of the intensities against the periods T j Periodogram Spectrogram Plot of the intensities against the frequencies f j

Other methods are:  Lomb-Scargle Periodogram  Wavelet

How to determine which is the better model approximation, additive or multiplicative? Analysis of the variance: spread versus level plot (SLP-diagram) - splitting the time series in to intervals, - determination of the standard deviations in the intervals - plotting the stand. dev. against the means line parallel to x-axis → additive model if the SLP linear line → multiplicative model no decision → mixture model

Box/Cox Transformation for λ ≠ 0 or in a simpler form for λ ≠ 0 for λ = 0 λ = 0 multiplicative model λ = 1 additive model for λ = 0 Determination of λ: - stand. dev. plot against logarithms of the mean time interval points - combination with SLP Use simple coeff. λ 1/4;1/3; 1/2;...

I want to acknowledge to the Ministery of Education and Science to support this work under the contract DVU01/0120 Acknowledgement