Verification in NCEP/HPC Using VSDB-fvs Keith F. Brill November 2007.

Slides:



Advertisements
Similar presentations
Model Evaluation Tools MET. What is MET Model Evaluation Tools ( MET )- a powerful and highly configurable verification package developed by DTC offering:
Advertisements

Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Design of Experiments and Analysis of Variance
Details for Today: DATE:3 rd February 2005 BY:Mark Cresswell FOLLOWED BY:Assignment 2 briefing Evaluation of Model Performance 69EG3137 – Impacts & Models.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
BCOR 1020 Business Statistics Lecture 22 – April 10, 2008.
Security in Databases. 2 Srini & Nandita (CSE2500)DB Security Outline review of databases reliability & integrity protection of sensitive data protection.
Chapter 11 Multiple Regression.
8. ANALYSIS OF VARIANCE 8.1 Elements of a Designed Experiment
Security in Databases. 2 Outline review of databases reliability & integrity protection of sensitive data protection against inference multi-level security.
Chapter 2 Simple Comparative Experiments
Inferences About Process Quality
MOS Performance MOS significantly improves on the skill of model output. National Weather Service verification statistics have shown a narrowing gap between.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Assumption of Homoscedasticity
Version 4 for Windows NEX T. Welcome to SphinxSurvey Version 4,4, the integrated solution for all your survey needs... Question list Questionnaire Design.
Go to Table of ContentTable of Content Analysis of Variance: Randomized Blocks Farrokh Alemi Ph.D. Kashif Haqqi M.D.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Hypothesis Testing.
Means Tests Hypothesis Testing Assumptions Testing (Normality)
Copyright 2012, University Corporation for Atmospheric Research, all rights reserved Verifying Ensembles & Probability Fcsts with MET Ensemble Stat Tool.
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
Oceanography 569 Oceanographic Data Analysis Laboratory Kathie Kelly Applied Physics Laboratory 515 Ben Hall IR Bldg class web site: faculty.washington.edu/kellyapl/classes/ocean569_.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,
Project 6 Using The Analysis ToolPak To Analyze Sales Transactions Jason C. H. Chen, Ph.D. Professor of Management Information Systems School of Business.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.4 Analyzing Dependent Samples.
Real-time Verification of Operational Precipitation Forecasts using Hourly Gauge Data Andrew Loughe Judy Henderson Jennifer MahoneyEdward Tollerud Real-time.
1 Nonparametric Statistical Techniques Chapter 17.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
TYPES OF DATA KEEP THE ACTIVITIES ROLLING Data, Standard Deviation, Statistical Significance.
Chapter 10 Verification and Validation of Simulation Models
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
1 Where we are going : a graphic: Hypothesis Testing. 1 2 Paired 2 or more Means Variances Proportions Categories Slopes Ho: / CI Samples Ho: / CI Ho:
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Chapter Eight: Using Statistics to Answer Questions.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Software Engineering1  Verification: The software should conform to its specification  Validation: The software should do what the user really requires.
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Chapter 1 Introduction to Statistics. Section 1.1 Fundamental Statistical Concepts.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Building Valid, Credible & Appropriately Detailed Simulation Models
1 Where we are going : a graphic: Hypothesis Testing. 1 2 Paired 2 or more Means Variances Proportions Categories Slopes Ho: / CI Samples Ho: / CI / CI.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.
Two-Sample Hypothesis Testing
Part Four ANALYSIS AND PRESENTATION OF DATA
LEPS VERIFICATION ON MAP CASES
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Spatial Verification Intercomparison Meeting, 20 February 2007, NCAR
Verifying and interpreting ensemble products
Chapter 2 Simple Comparative Experiments
WELCOME TO THE WORLD OF INFERENTIAL STATISTICS
Chapter 10 Verification and Validation of Simulation Models
Discrete Event Simulation - 4
Chapter 10: The t Test For Two Independent Samples
Power Section 9.7.
Section 11-1 Review and Preview
Adding variables. There is a difference between assessing the statistical significance of a variable acting alone and a variable being added to a model.
Inferences from Matched Pairs
Presentation transcript:

Verification in NCEP/HPC Using VSDB-fvs Keith F. Brill November 2007

2 Overview Introduce VSDB-fvs VSDB component fvs component Examples of fvs output

3 VSDB-fvs Major Components VSDB = Verification Statistics DataBase fvs = forecast verification system VSDB-fvs refers to a collection of software 1. Scripted programs to create VSDB records Grid (GEMPAK) to grid comparison (“perfect obs” assumption) Polygon (VG) to grid comparison Grid to point (GEMPAK) comparison Image overlay visual comparison as a by-product 2. fvs ( command line or scripted ) Define VSDB search conditions Search VSDB for matching data Compute and display (GEMPLT) performance measures

4 Brief History of VSDB-fvs ~1995: EMC begins to specify VSDB format 1997: I began coding fvs in the EMC 1998: I delivered first version of fvs 1999: I accepted a position in HPC 1999 – present: I continued the following...  Support fvs for EMC use  Develop HPC oriented VSDB components  Support fvs for HPC use  Add new capabilities to fvs as time permits

5 VSDB Records Space delimited text, single-line records, multiple records per file 9 identifier fields “=” count_value data_value1... data_valueN Number of data values depends on statistic type in ID field 7 data_values (summary statistics) are sums scaled by count_value  these may be combined arbitrarily by fvs  count_value is number of ob-forecast pairs used for data Example record (split into two lines): V01 WWD/ CSQ G212 PBS_WWD/4 SF SFC = vrsn frcst_source fhr valid_date-time obs_src region statistic_type parm level count Defect: “Perfect obs” assumption.

6 Partial List of Statistic Types and Corresponding Data L1L2 – means of forecasts, observations, their squares, their products, and absolute error. FHO – fractions of events Forecast, Hit, and Observed for dichotomous forecasts PHSE – east-west phase error: phase error (km), amplitude error, variances, analysis phase angle and amplitude, more variances PBS – partitioned Brier Score: fractions forecast in probability categories followed by fractions both forecast and observed in categories

7 HPC Verification Regions Defect: No stratification by elevation in West.

8 fvs – the tool for using VSDB data “fvs u” lets user set search conditions  specify statistic type  specify lists of identifier strings for data combination (forecast hours, time, areas, etc.) “fvs s” executes the search through VSDB  applies event equalization consistency checking “fvs p”  computes user requested performance measures  displays graphics under user control  performs statistical significance test for paired data fvs can be run in “batch” mode to populate web pages

9 Examples of Graphs Produced Using VSDB-fvs

10 Example Visual Comparison For HPC Basic Wx Forecasts of Broken Precipitation Purple is QPE based estimate Broken Precip area forecast

11

12 Significance Test: Does HPC beat the 00Z ENS mean? STATISTICAL SIGNIFICANCE ANALYSIS FOR PAIRED DATA FROM TWO TRACES STAT=SSAL1L2 PARAM=SLP V_ANL=HPC/SFC V_RGN=MRDG VHHMM= TO TRACE # 1 MODEL=MEDR/ FHOUR=120 SCALR_CORR TRACE # 2 MODEL=ENSMN FHOUR=132 SCALR_CORR Null Hypothesis (trace1 = trace2): The mean of the differences of the paired values from the two traces is zero. Alternative Hypothesis (trace1 > trace2): The mean of the paired differences exceeds zero. Gaussian probability of wrongly rejecting null hypothesis: Test Level Decision (one-sided Gaussian) REJECT null hypothesis REJECT null hypothesis REJECT null hypothesis REJECT null hypothesis REJECT null hypothesis ZRAW = E+00 ZTST = E+00 N = 90 Lag1 Corr =

13 Significance Test: Does HPC beat the 12Z ENS mean? STATISTICAL SIGNIFICANCE ANALYSIS FOR PAIRED DATA FROM TWO TRACES STAT=SSAL1L2 PARAM=SLP FHOUR=120 V_ANL=HPC/SFC V_RGN=MRDG VHHMM= TO TRACE # 1 MODEL=MEDR/ SCALR_CORR TRACE # 2 MODEL=ENSMN SCALR_CORR Null Hypothesis (trace1 = trace2): The mean of the differences of the paired values from the two traces is zero. Alternative Hypothesis (trace1 > trace2): The mean of the paired differences exceeds zero. Gaussian probability of wrongly rejecting null hypothesis: Test Level Decision (one-sided Gaussian) ACCEPT null hypothesis ACCEPT null hypothesis ACCEPT null hypothesis ACCEPT null hypothesis ACCEPT null hypothesis ZRAW = E-01 ZTST = E-01 N = 88 Lag1 Corr =

14

15

16

17

18

Brier Score Decompositon Attribute Diagram Perfect Reliability Zero Resolution Histogram shows forecast use frequency NO SKILL Brier Score Brier Skill Score NO SKILL SKILL NO SKILL SKILL Sample Climatological Frequency

20 SUMMARY VSDB-fvs is an end-to-end system for verifying forecast objects against analyses or points VSDB is text data base of summary statistics VSDB data are combinable in a variety of ways fvs searches for VSDB data and combines it under user control fvs computes performance metrics and displays them in a variety of ways