Model evaluation for forecasts of wildfire activity and spread

Slides:



Advertisements
Similar presentations
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 23, 2012.
Advertisements

Structural Equation Modeling
Polynomial Regression and Transformations STA 671 Summer 2008.
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
Estimating ETAS 1. Straightforward aspects. 2. Main obstacles. 3. A trick to simplify things enormously. 4. Simulations and examples. 1.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
TRIM Workshop Arco van Strien Wildlife statistics Statistics Netherlands (CBS)
Budapest May 27, 2008 Unifying mixed linear models and the MASH algorithm for breakpoint detection and correction Anders Grimvall, Sackmone Sirisack, Agne.
HSRP 734: Advanced Statistical Methods July 24, 2008.
RELATIVE RISK ESTIMATION IN RANDOMISED CONTROLLED TRIALS: A COMPARISON OF METHODS FOR INDEPENDENT OBSERVATIONS Lisa N Yelland, Amy B Salter, Philip Ryan.
TigerStat ECOTS Understanding the population of rare and endangered Amur tigers in Siberia. [Gerow et al. (2006)] Estimating the Age distribution.
Details for Today: DATE:3 rd February 2005 BY:Mark Cresswell FOLLOWED BY:Assignment 2 briefing Evaluation of Model Performance 69EG3137 – Impacts & Models.
A methodology for translating positional error into measures of attribute error, and combining the two error sources Yohay Carmel 1, Curtis Flather 2 and.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Data Freshman Clinic II. Overview n Populations and Samples n Presentation n Tables and Figures n Central Tendency n Variability n Confidence Intervals.
REGRESSION AND CORRELATION
Team 2: Final Report Uncertainty Quantification in Geophysical Inverse Problems.
Measures of Association Deepak Khazanchi Chapter 18.
Classification and Prediction: Regression Analysis
Inference for regression - Simple linear regression
Geo479/579: Geostatistics Ch13. Block Kriging. Block Estimate  Requirements An estimate of the average value of a variable within a prescribed local.
Day 7 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.
Simple Linear Regression
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
Bayesian Hierarchical Models of Individual Differences in Skill Acquisition Dr Jeromy Anglim Deakin University 22 nd May 2015.
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
1. Difficulty of point process model evaluation. 2. RELM and CSEP. 3. Numerical summaries (L-test, N-test, etc.). 4. Functional summaries (error diagrams,
Mathematical Models & Optimization?
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Data Analysis, Presentation, and Statistics
Regression Analysis1. 2 INTRODUCTION TO EMPIRICAL MODELS LEAST SQUARES ESTIMATION OF THE PARAMETERS PROPERTIES OF THE LEAST SQUARES ESTIMATORS AND ESTIMATION.
Stat 112 Notes 6 Today: –Chapters 4.2 (Inferences from a Multiple Regression Analysis)
The ENSEMBLES high- resolution gridded daily observed dataset Malcolm Haylock, Phil Jones, Climatic Research Unit, UK WP5.1 team: KNMI, MeteoSwiss, Oxford.
Geo597 Geostatistics Ch11 Point Estimation. Point Estimation  In the last chapter, we looked at estimating a mean value over a large area within which.
ESTIMATION METHODS We know how to calculate confidence intervals for estimates of  and  2 Now, we need procedures to calculate  and  2, themselves.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Chap 4-1 Chapter 4 Using Probability and Probability Distributions.
GNS Science Testing by hybridization – a practical approach to testing earthquake forecasting models David Rhoades, Annemarie Christophersen & Matt Gerstenberger.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Howard Community College
Missing data: Why you should care about it and what to do about it
making certain the uncertainties
How Good is a Model? How much information does AIC give us?
Overview of Deterministic Computer Models
Statistical Methods for Model Evaluation – Moving Beyond the Comparison of Matched Observations and Output for Model Grid Cells Kristen M. Foley1, Jenise.
The Matching Hypothesis
Chapter 2 – Properties of Real Numbers
(Residuals and
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
"Did your model account for earthworms?" Rick Paik Schoenberg, UCLA
Log Linear Modeling of Independence
Statement of the Tomography Problem
Rick Paik Schoenberg, UCLA Statistics
Rick Paik Schoenberg, UCLA Statistics
Modelling data and curve fitting
Discrete Event Simulation - 4
Tutorial 8 Table 3.10 on Page 76 shows the scores in the final examination F and the scores in two preliminary examinations P1 and P2 for 22 students in.
Simple Linear Regression
Simple Linear Regression
Analytics – Statistical Approaches
SEM: Step by Step In AMOS and Mplus.
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Ordinary Least Square estimator using STATA
Presentation transcript:

Model evaluation for forecasts of wildfire activity and spread Rick Paik Schoenberg, UCLA Statistics http://www.stat.ucla.edu/~frederic/papers1.html Thanks to Robert Clements, Andrew Bray, Ka Wong, Matt Baltar, Jon Keeley. 1) Overview, problems facing statisticians and wildfire experts. 2) Currently used evaluation methods for spread models. 3) Currently used evaluation methods for forecasts of occurrence. 4) Likelihood methods for spread models. 5) Residual methods for occurrence models a) Deviance residuals b) Superthinned residuals c) Voronoi residuals (from Alexander & Cruz, 2013)

What do we statisticians want from wildfire managers? 1) Overview, cooperation between statisticians and wildfire managers. What do we statisticians want from wildfire managers? * Easy to download online datasets, rather than pages with details on 1 fire at a time, * Information on more covariates, not just weather variables. What statistical things might wildfire managers want from us? * Better models for occurrence? * Better models for spread? But there are currently so many models proposed. * Better ways to evaluate forecasts?

(from Pastor et al., 2003)

What do we statisticians want from wildfire managers? 1) Overview, cooperation between statisticians and wildfire managers. What do we statisticians want from wildfire managers? * Easy to download online datasets, rather than pages with details on 1 fire at a time. * Information on more covariates, not just weather variables. What statistical things might wildfire managers want from us? * Better models for occurrence? * Better models for spread? But there are currently so many models proposed. * Better ways to evaluate forecasts. to help distinguish between competing models. to suggest where a given model might improve.

a) Is the problem of modeling spread solved? 2) Currently used evaluation methods for spread models. a) Is the problem of modeling spread solved? Many simplifying assumptions in current models. Simulations can do very well retrospectively, on data used to make the sims. b) Methods for model assessment Evaluate a particular statistic, like surface temperature or spread rate. Comparison of final burn areas. Error matrices, Cohen’s K, Sørensen’s Q.

(Alexander & Cruz, 2013)

Mell et al. (2007), retrospective Mell et al. (2007), retrospective. The experiment shown here was used in model construction.

Observed vs. Predicted spread rate (Cruz & Alexander, 2013), retrospective:

Observed vs. Predicted spread rate (Cruz & Alexander, 2013), prospective:

Observed vs. Predicted spread rate (Alexander & Cruz, 2013)

Observed and Predicted surface temperature (de Mestre et al Observed and Predicted surface temperature (de Mestre et al., 1989, from Weber 1991)

Errors in windspeed measurements (Sullivan and Knight, 2001)

Observed and Simulated burn polygons (Arca et al., 2007)

Observed and Simulated burn polygons (Fujioka, 2002)

Remote Sensing techniques. Error matrix (Congalton 1991) Cohen’s K (Congalton 1991)

Histogram of over-prediction and under-prediction of areas (Cruz and Alexander, 2013).

Overlap proportion (Duff et al., 2013). Black = overlapping burn area, white = burn area predicted but not observed, grey = burn area observed but not predicted.

Difference between burn areas, evaluated radially as a function of q (Cui & Perera, 2010) (Duff et al., 2013)

Few studies look at evolution of predicted & observed burn areas over time (Mell et al. 2007)

3. Currently used evaluation methods for forecasts of occurrence. Percentage of days with wildfires vs. Index (Viegas et al., 1999)

Mean area burned per day vs. Index (Viegas et al., 1999)

Error diagrams (Molchan 1990, Xu and Schoenberg 2011)

4) Likelihood methods for spread models. Model evaluation may be simplified using probabilistic spread models. Can simply use the likelihood as a measure of fit. If model A routinely assigns lower likelihoods to the observed wildfires than model B, then model B should be preferred. * Simple, straightforward, meaningful comparisons of multiple competing models. * Less potential for subtle disagreements between model and data to be amplified. * L-test can be used to see if discrepancies for 1 model are statistically significant. (Simulate, and see if the likelihood for the data is in the middle 95% range.) Schorlemmer et al. 2007

5) Recent methods for point process models for occurrences. a) Deviance residuals b) Superthinned residuals c) Voronoi residuals -- Given two competing models, can consider the difference between residuals, number of observed fires – number expected, over each pixel. Problem: Hard to interpret. If difference = 3, is this because model A overestimated by 3? Or because model B underestimated by 3? Or because model A overestimated by 1 and model B underestimated by 2? -- Better: consider difference between log-likelihoods, in each pixel. The result may be called deviance residuals (Clements et al. 2011), ~ resids from gen. linear models.

b) Superthinning (Clements et al., 2012) Choose some number c ~ mean( ). Superpose: where (t ,x,y) < c, add in points of a simulated Poisson process of rate c - (t , x, y) . Thin: where (ti,xi,yi) > c, keep each point (ti,xi,yi) with prob. c / (ti,xi,yi) .

c) Voronoi residuals (Bray et al. 2013) A Voronoi tessellation divides a space into cells Ci where Ci contains all locations closer to event i than any other observed event. Within each cell, calculate residuals ( (Tanemura 2003) spatially adaptive and nonparametric.

Conclusions. * Likelihood methods could facilitate evaluation of probabilistic spread models. * Deviance residuals are useful for comparing models on grid cells. * Superthinned residuals do not rely on a grid and can be useful to highlight where a model overpredicts or underpredicts. * Voronoi residuals use an automatically adaptive grid and are powerful for both comparison and to see where a particular model over or underpredicts.

Log-likelihood (see e.g. Daley and Vere-Jones, 2003) Given a model for (t,x,y), the estimated rate of events around time t and loc (x,y), the log-likelihood L = ∑i log (ti,xi,yi) - ∫ (t,x,y) dt dx dy.