Dr Mark Cresswell Statistical Forecasting [Part 1] 69EG6517 – Impacts & Models of Climate Change.

Slides:



Advertisements
Similar presentations
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Advertisements

Lesson 10: Linear Regression and Correlation
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Maximum Covariance Analysis Canonical Correlation Analysis.
Lecture 9 Today: Ch. 3: Multiple Regression Analysis Example with two independent variables Frisch-Waugh-Lovell theorem.
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. by Lale Yurttas, Texas A&M University Chapter 171 CURVE.
4 Forecasting PowerPoint presentation to accompany Heizer and Render
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
Correlation and Regression Analysis
Statistical Methods for long-range forecast By Syunji Takahashi Climate Prediction Division JMA.
Calibration & Curve Fitting
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
Creating Empirical Models Constructing a Simple Correlation and Regression-based Forecast Model Christopher Oludhe, Department of Meteorology, University.
3. Multiple Regression Analysis: Estimation -Although bivariate linear regressions are sometimes useful, they are often unrealistic -SLR.4, that all factors.
Relationship of two variables
Chapter 11 Simple Regression
Extensions of PCA and Related Tools
Faculty of Social Sciences Induction Block: Maths & Statistics Lecture 3 Precise & Approximate Relationships Between Variables Dr Gwilym Pryce.
Basic linear regression and multiple regression Psych Fraley.
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
Regression Examples. Gas Mileage 1993 SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Managerial Economics Demand Estimation & Forecasting.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Time series Decomposition Farideh Dehkordi-Vakil.
Mathematics of PCR and CCA Simon Mason Seasonal Forecasting Using the Climate Predictability Tool Bangkok, Thailand, 12 – 16 January.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Dr Mark Cresswell Statistical Forecasting [Part 2] 69EG6517 – Impacts & Models of Climate Change.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Chapter 8: Simple Linear Regression Yang Zhenlin.
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
ANOVA, Regression and Multiple Regression March
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
DEPARTMENT OF MECHANICAL ENGINEERING VII-SEMESTER PRODUCTION TECHNOLOGY-II 1 CHAPTER NO.4 FORECASTING.
Meteorology 485 Long Range Forecasting Friday, February 13, 2004.
Details for Today: DATE:13 th January 2005 BY:Mark Cresswell FOLLOWED BY:Practical Dynamical Forecasting 69EG3137 – Impacts & Models of Climate Change.
MBF1413 | Quantitative Methods Prepared by Dr Khairul Anuar 8: Time Series Analysis & Forecasting – Part 1
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Chapter 11 – With Woodruff Modications Demand Management and Forecasting Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter Outline EMPIRICAL MODELS 11-2 SIMPLE LINEAR REGRESSION 11-3 PROPERTIES OF THE LEAST SQUARES ESTIMATORS 11-4 SOME COMMENTS ON USES OF REGRESSION.
Welcome to MM305 Unit 5 Seminar Forecasting. What is forecasting? An attempt to predict the future using data. Generally an 8-step process 1.Why are you.
Stats Methods at IC Lecture 3: Regression.
Statistical Forecasting
Correlation & Regression
Part 5 - Chapter
Part 5 - Chapter 17.
Linear Regression.
Regression Analysis Module 3.
Correlation, Regression & Nested Models
What is Correlation Analysis?
“The Art of Forecasting”
CHAPTER 29: Multiple Regression*
Part 5 - Chapter 17.
Seminar in Economics Econ. 470
Seasonal Forecasting Using the Climate Predictability Tool
Correlation & Regression
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Dr Mark Cresswell Statistical Forecasting [Part 1] 69EG6517 – Impacts & Models of Climate Change

Lecture Topics What is statistical forecasting? Simple linear regression Multiple linear regression Analog forecasting Model Output Statistics (MOS) forecasts PCA and EOF Canonical Correlation Analysis

What is Statistical Forecasting? In nature, observed phenomena are intrinsically linked to each other by physical processes. Such processes are referred to as the causality or causal links. In statistical forecasting, we can exploit this causality mathematically by replicating the pattern of change observed for a particular set of conditions. The physical processes represent the forcing and the observed pattern (of weather!) is the direct result. Different sets of conditions (forcing) will give rise to replicable and specific patterns of weather

What is Statistical Forecasting? Example: Unusually warm sea-surface temperature (SST) conditions in the Indian Ocean is usually associated with a greater than normal frequency (and magnitude) of tropical cyclones. The forcing here is the increased SSTs whilst the observed pattern is enhanced cyclogenesis. Cyclone Eline (Feb 2000)

What is Statistical Forecasting? Model: Since we know there is a causal link (enhanced energy flux, more evaporation over the ocean, greater convection etc) we can determine a statistical relationship from historical observations

Simple Linear Regression Model: The previous example illustrated how a specific forcing can be seen to alter future weather conditions. We can summarise this relationship mathematically in a regression equation This type of Relationship Is known as empirical Regression equation Regression equation

Simple Linear Regression Model: The regression model informs us of the dependence one variable has on another. Usually, we will select variables that are correlated with one another

Simple Linear Regression We must be careful however when using relationships based purely on a correlation as association is not causation In children, shoe size may be strongly correlated with reading skills. This does not mean that children who learn to read new words sprout longer feet ! The simple calculation of one variable from another based on a regression equation is known as the method of least squares. Normally we can insert a line of best fit through a scatter-plot of X and Y data pairs. The line that makes the smallest r.m.s (root mean square) error in predicting Y from X is the regression line

The regression line is often referred to as the least squares line. We can use the slope and intercept characteristics of a least squares line to derive constants, m (slope) and b (intercept) that can be used in our linear regression model equation: The intercept is the height of the least squares line when X is zero. The slope is the rate at which Y increases per unit increase in X Thus, for any given value of our X variable (and values for m and b which we calculate from observations) we can estimate a value for Y Simple Linear Regression

intercept * * * * ** * * * * X variable Y variable

Multiple Linear Regression Often, we will not look at the contribution of a single variable in isolation – but instead a number of predictors will be included in a forecast Different predictors may be causally related to the same weather phenomena. If K is the number of predictor variables then: ŷ = b 0 + b 1 x 1 + b 2 x 2 + ····· + b k x k

Multiple Linear Regression Linear regression models provide a “fit” for our estimate of y for a number of observations of x A straight-line fit (simple linear regression) will not go through all points…but a multivariate regression line will be curved thus allowing a better fit and a more accurate estimate of y for a given value of x Multivariate models are often used on weather prediction to estimate future change based on historical observations of a trend.

Multiple Linear Regression

Analog Forecasting Not all objective statistical forecast procedures are based on regression Some methods were in use prior to the advent of fairly accurate NWP forecasts (12-48hr range). One such method is analog forecasting Analog forecasting is still in use for long-range (seasonal) forecasting – although the climatological database it uses is deemed to be too short for AF forecasts to be competitive in ordinary short-range weather forecasting

Analog Forecasting The idea underlying AF is to search the archives of climatological synoptic data for maps closely resembling current observations, and assume that the future evolution of the atmosphere will be similar to the flows that followed the historical analogs The method is intuitive and gains from the value provided from experienced weather forecasters AF is limited however as the atmosphere apparently does not exactly repeat itself – so matches can only be approximate

Model Output Statistics (MOS) The MOS approach is a preferred method of incorporating NWP forecast information into statistical weather forecasts The MOS approach has the capacity to include directly into the regression equations the influences of specific characteristics of different NWP models at different time projections in the future To develop MOS forecast equations it is necessary to have a developmental data set composed of historical records of the predictand, together with archived records of the forecasts produced by the NWP model for the same days on which the predictand was observed

PCA and EOF Sometimes we might need to compare sets of variables against patterns of change – and synthesise them It might be the case that environmental change (shifts in weather patterns) are due to more than one variable. In order to determine the spatial limits of their influence (in a geographical sense) we can use a spatially dependent correlation scheme – called Principal Components Analysis (PCA). The technique allows data reduction PCA as a technique, became popular following papers by Lorenz in the mid 1950s – who called the technique Empirical Orthogonal Function (EOF) analysis. Both names refer to the same set of procedures

PCA and EOF The purpose of PCA is to reduce a data set containing a large number of variables to a new data set containing far fewer new variables – but which nevertheless represent a large fraction of the variability contained in the original data Following PCA analysis the method provides a number of principal components – which constitute a compact representation of the original data. PCA can yield substantial insights into both the spatial and temporal variations exhibited by the field or fields being analysed

Canonical Correlation Analysis CCA is a statistical technique that identifies a sequence of pairs of patterns in two multivariate data sets – and constructs sets of transformed variables by projecting the original data onto these patterns The patterns are chosen such that the new variables defined by projection of the two data sets onto these patterns exhibit maximum correlation CCA is an extension of multiple regression models. It is often applied to fields – such as SST or heights of pressure.