Introduction to Weather Data Cleaning Weather data cleaning is fundamental to the provision of high quality weather data. Speedwell Weather offer extensive.

Slides:



Advertisements
Similar presentations
1 Welcome Safety Regulatory Function Handbook April 2006.
Advertisements

SolidWorks Enterprise PDM Data Loading Strategies
Speedwell Weather System Back Office Features 1 Speedwell, Veronica Chamaedrys Known since Roman times as a medicinal herb. A perennial flowering March-July.
Weather Station Data Quality and Interpolation Issues in Modeling Joe Russo International Workshop on Plant Epidemiology Surveillance for the Pest Forecasting.
Speedwell Weather System
Configuration management
Configuration management
Softricity LLC Advance slides with arrow keys. Without PDMLynx Informal processes based upon excel, access, paper files No consistency across organization.
We have developed CV easy management (CVem) a fast and effective fully automated software solution for effective and rapid management of all personnel.
Speedwell Weather System Back Office Functionality - An Overview The Open Weather Derivative and Risk Management System.
As presented to the Weather Risk Management Organisation Australia Feb 2011.
Hydrological information systems Svein Taksdal Head of section, Section for Hydroinformatics Hydrology department Norwegian Water Resources and Energy.
09/04/2015Unit 2 (b) Back-Office processes Unit 2 Assessment Criteria (b) 10 marks.
Strategic Decisions (Part II)
1 Continuity Equations: Analytical Monitoring of Business Processes and Anomaly Detection in Continuous Auditing Michael G. Alles Alexander Kogan Miklos.
Copyright  2003 Pearson Education Canada Inc. CHAPTER 18 Audit of the Inventory and Warehousing Cycle.
Who are Speedwell Weather ?
Monitoring and Controlling the Project
Auditing Concepts.
MIS 325 PSCJ. 2  Business processes can be quite complex  Process model: any abstract representation of a process  Process-modeling tools provide a.
Information System Engineering
Information and Site Visit for tender: “supply and installation of furniture and related services” EIOPA/OP/50/2014 Frankfurt, 6 June 2014.
Speedwell Weather System a high-level summary of the latest release of SWS, v8.0.
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
Business Intelligence Michael Gross Tina Larsell Chad Anderson.
Quantitative Business Forecasting Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
System Development Life Cycle (SDLC)
1 Workshop on Quality Control and Quality Assurance of Greenhouse Gas Inventories and the Establishment of National Inventory Systems 2-3 September 2004.
ISO 9000 Certification ISO 9001 and ISO
Speedwell Weather System SWS SWS is the dominant third-party software for pricing weather risk contracts, managing a portfolio of weather risk contracts.
MSF Testing Introduction Functional Testing Performance Testing.
Object-Oriented Analysis and Design
GLOWEBSTORE GLODASH – General Buying Guide. Packaging Dashboards GLODASH – Explaining Packs Each Pack consists of list of Dashboards Each Dashboard has.
SharePoint at SRP Perry Bellaire Content Management Salt River Project 2/9/2010.
Dr Mark Cresswell Model Assimilation 69EG6517 – Impacts & Models of Climate Change.
PTCS Service Provider Review 0 Background RTF assumed responsibility for maintaining PTCS specifications in March 2003  Developed PTCS Service Provider.
Model Bank Testing Accelerators “Ready-to-use” test scenarios to reduce effort, time and money.
Ethiopia Improving Availability, Access and Use of Climate Information in Ethiopia Tufa Dinku 1, Kinfe Hilemariam 2, David Grimes 3, and Stephen Connor.
Engineering System Design
© Crown copyright Met Office Standard Operating Procedures in service to Disaster and Media Communities Sarah Davies 29 th November 2012.
ZHRC/HTI Financial Management Training Session 9: Stores and Supplies Management.
LSME: L. Flora 1 Detailed Activities Key Activities Role Description To identify expendable and consumable supply needs and requisition, receive, and issue.
Update on e-Placement at Aon Ian Summers. Aon Limited is authorised and regulated by the Financial Services Authority in respect of insurance mediation.
Quality control of daily data on example of Central European series of air temperature, relative humidity and precipitation P. Štěpánek (1), P. Zahradníček.
Project quality management. Introduction Project quality management includes the process required to ensure that the project satisfies the needs for which.
CHAPTER 19 Audit of the Inventory and Warehousing Cycle
New Fire Weather System Bernard Miville Manager of Operational Forecasting.
© 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke Slide 1 Chapter 2 Business Processes, Information, and Information Systems.
University of Calgary Continuing Education Construction Contract Law and Documents Week 12 Changes Delays Claims Contract Documentation.
Copyright © 2007 Pearson Education Canada 9-1 Chapter 9: Internal Controls and Control Risk.
The Right Choice for Call Recording Voice Documentation for Healthcare HIPAA Compliant Communications Documentation.
The Value of USAP in Software Architecture Design Presentation by: David Grizzanti.
Workshop on Accreditation of Bodies Certifying Medical Devices Kiev, November 2014.
Analytics and Value Creation
Auditing Concepts.
Procurement- Lecture 3 Customer service and logistics
Introduction To DBMS.
Demand Planning Scenario Overview
Internal Control.
System.
ITEC 3220A Using and Designing Database Systems
SOFTWARE TESTING OVERVIEW
Monitoring and Controlling the Project
System Development Life Cycle (SDLC)
System Development Life Cycle (SDLC)
Demand Planning Scenario Overview
Industry Best Practice Guide: Schedule 2, Part 1 Paragraph 2 Completing RAD 2: Step by Step Guide This document is to be used as a guidance on how to complete.
System Development Life Cycle (SDLC)
Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.
Data validation handbook
Presentation transcript:

Introduction to Weather Data Cleaning Weather data cleaning is fundamental to the provision of high quality weather data. Speedwell Weather offer extensive packs of cleaned historical weather data for sites around the world as well as cleaned weather data feeds. We believe we offer a high quality product. This document shows how we approach the task.

Problem: Weather data is not perfect Missing values Erroneous observations Consistency problems Multiple data sets claiming to be for the same weather station Never purchase data from anyone unless they can provide satisfactory answers to these questions: - What is the original source of the data? - What is the observation convention? - What are the attributes (lat, lon, elevation)? - What has been done to the data? Solution (3): Clean / fill the data Fill missing values Detect and replace erroneous observations Confirm the consistency of the data Solution (1): Ignore the problem Erroneous observations will lead to inaccurate pricing Solution (2): Use only good stations Difficult to determine what is good without cleaning it Greatly limits your ability to trade Data Cleaning

The quality of Meteorological observations varies significantly Missing / erroneous observation are common place To safeguard against data problems use cleaned data where available Fundamentals of a proper data cleaning (1)Organization (2)Redundancy (3)Flexibility (4)Human interaction (5)Transparency Fundamental to satisfying the above is the implementation of software systems infrastructure...but data cleaning cannot and SHOULD not be FULLY automated (see 4) Part of the Speedwell Data cleaning process diagram Data Cleaning

Fundamentals of a proper data cleaning (1)Organization -logical flow -data management (2)Redundancy (3)Flexibility (4)Human interaction (5)Transparency Data preparation Initial Review In-depth analysis / data filling Manual Review Data delivery Data Cleaning..Organization Speedwell data quality types

Fundamentals of a proper data cleaning (1)Organization (2)Redundancy -data sources -testing -estimates -delivery (3)Flexibility (4)Human interaction (5)Transparency Data sources bring in as much as possible and keep what is useful. Typical processing includes: Climate data (daily / hourly), Synoptic data, METAR, ECMWF forecast data, climatology If one source fails there are others Testing no one test is applicable for all situations. - comparison against itself -physical consistency -statistical probability -comparison against neighbors -Observations are compared against the median of a basket of proxies and the MAD (median absolute deviation). If the observation is statistically different from the surrounding stations it is sent to the filling process Estimates (filling) Why have one when you can have many? Useful for more in- depth manual analysis Data delivery - Multiple FTP deliveries - 24-hour support - logging of all deliveries -Description of data quality and type Data Cleaning..Redundancy A fundamental pre-requisite for effective data cleaning is access to a library of weather data providing access to near by sites allowing plausibility testing for the site being cleaned Speedwell Weather maintains a very large inventory of weather data for over 50 different weather elements. This is all warehoused by us in a manner that fully respects differing data types (Synoptic/Climate, Cleaned/Raw etc) with a full audit trail. This allows us to document data point changes which may occur when national met offices change data records to reflect their internal QC procedures. Example of weather variables stored for a single site

Fundamentals of a proper data cleaning (1)Organization (2)Redundancy (3)Flexibility -consider the situation -appropriateness of tests (4)Human interaction (5)Transparency Estimate #1 surrounding station regression using deseasonalized data Estimate #2 Estimates of daily observations from hourly observations (curve fitting) Estimate #3 Estimates of daily observations by manipulating other data types (Synoptic, METAR, ½ hourly) Estimate #4 Day +1 forecasts can actually be very good… Estimate #5 Climatology – worst case scenario Estimate #6, #7, #8,… Flexibility allows you to add any appropriate estimates. The possibilities are unlimited. - satellite derived values - installed stations - reanalysis Data Cleaning..Flexibility

Fundamentals of a proper data cleaning (1)Organization (2)Redundancy (3)Flexibility (4)Human interaction -meteorology is complicated -introduction of non-automated information (5)Transparency -explanation of the process -share what has been cleaned -no-one likes black boxes Data Cleaning..the Human element and Transparency

Contact Us Regarding world-wide weather data and forecast matters please see or contact: Phil David Whitehead Telephone: UK office:+44 (0) US office:+1 (0) Address UK: Mardall House, Vaughan Rd, Harpenden, Herts, AL5 4HU Address USA:101 N Columbus Street, Second Floor, Alexandria VA USA Regarding software and consultancy services please see or contact: Stephen Doherty Dr Michael Moreno David Whitehead Telephone: UK office:+44 (0) US office:+1 (0) Speedwell Weather Derivatives Limited is authorised and regulated by the Financial Services Authority. Registered Offices Mardall House, 9-11 Vaughan Road, Harpenden, Herts AL5 4HU, UK. Company No