1 TIME OF DAY CORRELATIONS FOR IMPROVED WIND SPEED PREDICTIONS ANDREW OLIVER, PhDKRISTOFER ZARLING VP TECHNOLOGIESWIND DATA ANALYST 7 TH MAY 2009.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Describing Data: Measures of Dispersion
Copyright © 2010 Pearson Education, Inc. Slide
Copyright © 2010 Pearson Education, Inc. Slide A least squares regression line was fitted to the weights (in pounds) versus age (in months) of a.
Overview of Lecture Parametric vs Non-Parametric Statistical Tests.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
CS1512 Foundations of Computing Science 2 Week 3 (CSD week 32) Probability © J R W Hunter, 2006, K van Deemter 2007.
Assumptions underlying regression analysis
Demand Resource Operable Capacity Analysis – Assumptions for FCA 5.
Chapter 7 Sampling and Sampling Distributions
Chapter 4: Basic Estimation Techniques
1 Challenge the future Subtitless On Lightweight Design of Submarine Pressure Hulls.
Copyright © Cengage Learning. All rights reserved.
Media Terminology.
Statistical Analysis SC504/HS927 Spring Term 2008
Introduction to Science and the Scientific Method
10b.1 Van Horne and Wachowicz, Fundamentals of Financial Management, 13th edition. © Pearson Education Limited Created by Gregory Kuhlemeyer. Chapter.
Simple Linear Regression Analysis
Correlation and Linear Regression
Multiple Regression and Model Building
Chapter 16: Correlation.
1 McGill University Department of Civil Engineering and Applied Mechanics Montreal, Quebec, Canada.
Chapter 8 Estimating with Confidence
Lesson 10: Linear Regression and Correlation
Brief introduction on Logistic Regression
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Neural Network Based Approach for Short-Term Load Forecasting
MAE 552 Heuristic Optimization Instructor: John Eddy Lecture #19 3/8/02 Taguchi’s Orthogonal Arrays.
1 Econometric Load Forecasting Peak and Energy Forecast 06/14/2005 Econometric Load Forecasting Peak and Energy Forecast 06/14/2005.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. by Lale Yurttas, Texas A&M University Chapter 171 CURVE.
Experimental Evaluation
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Research Study on Wind Turbine Acoustics DRAFT March 7, 2014 Interim Results II for WNTAG.
Uncertainty in Wind Energy
Correlation and Linear Regression
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Understanding Research Results
Linear Regression and Correlation
Correlation and Linear Regression
A Statistical Comparison of Weather Stations in Carberry, Manitoba, Canada.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D.
Going to Extremes: A parametric study on Peak-Over-Threshold and other methods Wiebke Langreder Jørgen Højstrup Suzlon Energy A/S.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
1 MCP: Pitfalls & Common Mistakes Dr Jeremy Bass & RES Wind Analysis Teams (UK & USA) SENIOR TECHNICAL MANAGER AWEA Wind Resource & Project Assessment.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Component Factors of the Time-Series Model Smoothing of Data Series  Moving Averages  Exponential.
Robust Estimators.
CHAPTER 2: Basic Summary Statistics
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Preliminary Analysis by: Fawn Hornsby 1, Charles Rogers 2, & Sarah Thornton 3 1,3 North Carolina State University 2 University of Texas at El Paso Client:
Forecast 2 Linear trend Forecast error Seasonal demand.
Application of the CRA Method Application of the CRA Method William A. Gallus, Jr. Iowa State University Beth Ebert Center for Australian Weather and Climate.
Model Selection and the Bias–Variance Tradeoff All models described have a smoothing or complexity parameter that has to be considered: multiplier of the.
A.Liudchik, V.Pakatashkin, S.Umreika, S.Barodka
Part 5 - Chapter
Part 5 - Chapter 17.
Basic Estimation Techniques
PCB 3043L - General Ecology Data Analysis.
1 Chapter 1: Introduction to Statistics. 2 Variables A variable is a characteristic or condition that can change or take on different values. Most research.
Calculating Sample Size: Cohen’s Tables and G. Power
Part 5 - Chapter 17.
Sociology 690 – Data Analysis
Descriptive Statistics
Sociology 690 – Data Analysis
Facultad de Ingeniería, Centro de Cálculo
Linear Regression and Correlation
Presentation transcript:

1 TIME OF DAY CORRELATIONS FOR IMPROVED WIND SPEED PREDICTIONS ANDREW OLIVER, PhDKRISTOFER ZARLING VP TECHNOLOGIESWIND DATA ANALYST 7 TH MAY 2009

WHY THE NEED? Diurnal wind speed profile is often very different at lower versus higher levels Atmospheric stability effects on wind speed at two levels on the same tower Consequently the relationship between Site and Reference station often varies with Time Of Day An MCP process that fails to take account of Time Of Day is likely to be deficient for many sites Many, if not most, long term reference stations (airports, etc) used in M.C.P. analysis (wind speed predictions) measure close to the ground (around 10 m.) Ratio = 1.20 Ratio = 1.61

AN EXAMPLE: DIRECTIONAL RATIOS AND TIME OF DAY RATIOS Binning by DIRECTION 12 sectors (of 30°) wind speed ratio ranges from 1.35 to 1.92 Binning by TIME 12 sectors (of 2 hours) wind speed ratio ranges from 1.21 to 1.98 Standard Deviation of ratios much greater for TIME than for DIRECTION Number of counts is few for some DIRECTIONS. TIME sectors have equal counts Standard Deviation 0.32 Standard Deviation 0.19 Ratio of means analysis for a typical Texas site

A STATEMENT AND A QUESTION STATEMENT It seems obvious that better relationships between a site and a reference station can be obtained for SOME SITES by taking a TIME sectoring approach as opposed to a DIRECTION sectoring approach. Particularly where the terrain is less complex and atmospheric stability varies significantly on a diurnal basis. QUESTION Can it lead to better predictions though?

A TEST CASE CORRELATE 1 YEAR OF DATA WITH A NEARBY REFERENCE STATION First binned the data into the following: Formed 12 Least-squares regressions (Site versus Reference station). - Note: This was just a test. We do not recommend the use of least-squares for MCP!!! Back Predicted Site wind speed using regressions formed & Reference station data Determined regression coefficient of Back Predictions - How well did the regressions describe the relationship between Site and Reference?

CORRELATIONS IMPROVED WHEN TIME WAS INTRODUCED 12 TIME SECTORS PRODUCED A BETTER BACK PREDICTION THAN 12 DIRECTION SECTORS FOR THIS SITE BACK PREDICTION IMPROVED FURTHER WHEN A COMBINATION OF DIRECTION AND TIME WAS USED

A STEP FURTHER

FURTHER IMPROVEMENT WAS MARGINAL WITH MORE (24), AND VARIABLE WIDTH, TIME AND DIRECTION SECTORS

SO WHAT NEXT? ?

PREDICTION METHODOLOGIES (NON EXHAUSTIVE) Regression Types Pre-averaging of data before correlation Least Squares Ratio Of Means Orthogonal Method York Method Matrix Method New Parameter Time (a proxy for stability) Number of Direction Sectors 1 (no binning) 12 Pre-averaging of data before Correlation 10 minute (none) Hourly Daily Monthly 40 ways of predicting wind speed in this non-exhaustive list! Are You Crazy? 1 (no binning) 12 Some combination of m Direction sectors and n Time sectors 4 ways were compared in this study

GATHERING THE DATA & PREDICTING Selected 19 sites where RES Americas had collected at least 2 years of data Good data availability (All >95%) Geographically dispersed Meteorologically varied Use one year to predict another year. Two Ways: Sliced: 2 nd Year Predicts 1 st Year & Vice Versa Akin to what happens in the real world Diced: Every even day Predicts every Odd Day & Vice Versa Removes annual trending biases (if any) Site Reference Station Year 2Year 1 Time Regression Odd Day Even Day

GATHERING THE DATA & PREDICTING, continued A 6 month subset was also used to establish the relationship between Site & Reference Tested ability of prediction method to account for seasonality The total number of predictions carried out was: 19Sites * 4Regression methods * 2Ways (Sliced & Diced) * 2Years (Use 1 st Year to predict 2 nd Year & Vice Versa) * 2Concurrent data lengths (1 Year and 6 months of data) 608 Total For the Direction / Time Combination method the following sectoring was used: 6 Direction Sectors (1 st centered on North) and 2 Time Sectors (6:00am to 6:00pm) No finesse about how the hours, or directions were selected

RESULTS In all cases, the r value (weighted by sector counts) was greatest for the 12 Time sector regressions YEAR SLICED

RESULTS cont. Mean Absolute Error 6 Direction & 2 Time gave the lowest Mean Absolute Error in each case tested: No significant differences in distributions of errors were observed Direction is obviously still important. Mixed results compared with Time only Not a huge improvement, but 6 Direction / 2 Time was chosen arbitrarily Simply keeping 12 directions and dividing the data into day and night could yield improved results without leaving too few data points for regression (50% reduced)

RESULTS cont. Head to head: Direction versus Combination method (which is best?) If we cannot a priori figure out which method to use then using 6 Direction and 2 Time sectors produces a better result than 12 Direction sectors in the majority of cases!!! An important result

CONCLUSIONS Time of Day has been introduced as a binning parameter in MCP as a proxy for atmospheric stability Time of Day is an important factor to consider in MCP Regression coefficients were improved in 100% of Year Sliced test cases (12 Time Sectors versus 12 Direction Sectors). 95% for 6 Months Sliced Choosing 6 Direction sectors and 2 Time sectors instead of the traditional 12 Direction approach produced a better prediction in the majority of cases This choice was fairly arbitrary. It is postulated that varying Time and Direction sector widths on a case by case basis will yield improved results It is further postulated that keeping the traditional 12 Direction Sectors, but adding 2 Time Sectors (for a total of 24 sectors) will also improve predictions while still retaining enough data points in each sector (50% reduced)

FUTURE WORK Understand how to optimize the binning of data into m Direction and n Time sectors Ideal problem for Neural Networks? Understand Energy Bias errors (how well is the wind speed distribution is predicted) But the wider question and the Holy Grail for MCP is still: Which regression method should I use for a given site in order to minimize my error? And the most important consideration (often overlooked) is that one has to have a decent long term reference station in the first place!

THANK YOU ANDREW OLIVER, PhDKRISTOFER ZARLING VP TECHNOLOGIES WIND DATA ANALYST RES AMERICAS, INC West 120th Avenue, Suite 400 BROOMFIELD, CO (303) With thanks to Mike Anderson, Jerry Bass, Rajan Arora, Alex Kapetanovic & Karen-Anne Hutton