Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Estimation of uncertainty in status class assessment for Wel waterbodies Jannicke.

Slides:



Advertisements
Similar presentations
Event, date: Reporting of SoE biology, Author: Jannicke Moe (NIVA) 1 Agenda item 2: Practical information for reporting of State-of-Environment.
Advertisements

Event, date: Reporting of SoE biology, Author: Jannicke Moe (NIVA) 1 Agenda item 4: Practical training for reporting of State-of-Environment.
Design of Experiments Lecture I
Development and validation of methods for integrated assessment of ecological status of rivers and lakes to support RBMP Lake macrophytes Warsaw, 9-10.
Segment 4 Sampling Distributions - or - Statistics is really just a guessing game George Howard.
Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund PALEOLIMNOLOGICAL STUDIES IN THE WEL LAKES Sigrid Haande, NIVA DeWELopment workshop,
Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Revised Project Implementation Plan after prolongation of the Project Hanna Soszka,
Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Integration of different metrics to a whole water body classification for all.
Development and validation of methods for integrated assessment of ecological status of rivers and lakes to support RBMP Lake macrophytes Oslo, June.
Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund WBs assessment AFTER DISCUSSION Expert views Warsaw,
Hypothesis Testing: One Sample Mean or Proportion
Statistical Concepts (continued) Concepts to cover or review today: –Population parameter –Sample statistics –Mean –Standard deviation –Coefficient of.
Understanding Research Results. Effect Size Effect Size – strength of relationship & magnitude of effect Effect size r = √ (t2/(t2+df))
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Introduction to the design (and analysis) of experiments James M. Curran Department of Statistics, University of Auckland
Chemometrics Method comparison
1 1 Slide Statistical Inference n We have used probability to model the uncertainty observed in real life situations. n We can also the tools of probability.
Phytoplankton – element in ecological status assessment for lakes of the Wel river catchment area - published 2009 A. Krzebietke, A. Pasztaleniec, A. Hutorowicz.
Virtual COMSATS Inferential Statistics Lecture-6
Anne Lyche Solheim, Norwegian Institute for Water Research, Oslo, Norway Workshop on ”In situ trialing for ecological and chemical studies in support of.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,
Hypothesis Testing Quantitative Methods in HPELS 440:210.
Value of information Marko Tainio Decision analysis and Risk Management course in Kuopio
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Benthic invertebrates in lakes Małgorzata Gołub (IOŚ) Jacek Koszałka (UWM) Paweł.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
 The point estimators of population parameters ( and in our case) are random variables and they follow a normal distribution. Their expected values are.
Statistics for Decision Making Basic Inference QM Fall 2003 Instructor: John Seydel, Ph.D.
Environmental Assessment and Sustainability CIV913 BIOLOGICAL ASSESSMENT of River Water Quality Assessing the biological quality of fresh waters : Wright,
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Geographic data validation. Index Basic concepts Why do we need validation? How to assess geographic data Initial checks Intermediate checks Advanced.
Eurostat Accuracy of Results of Statistical Matching Training Course «Statistical Matching» Rome, 6-8 November 2013 Marcello D’Orazio Dept. National Accounts.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Water Bodies in Europe: Integrated Systems to assess Ecological Status and Recovery Funded under FP7, Theme 6: Environment (including Climate Change) Contract.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
WG 2A ECOSTAT 7-8 July 2004 Task on Harmonisation of Freshwater Biological Methods Status Report AC Cardoso and A Solimini Harmonisation Task Team: JRC.
Test data exchange to support development of a biological indicators in rivers and lakes Anne Lyche Solheim and Jannicke Moe, NIVA EEA European Topic Centre.
From the population to the sample The sampling distribution FETP India.
1 Probability and Statistics Confidence Intervals.
Course: Research in Biomedicine and Health III Seminar 5: Critical assessment of evidence.
Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Pragmatic combination of BQE results into final WB assessment in Norway Anne Lyche.
Two Approaches to Estimation of Classification Accuracy Rate Under Item Response Theory Quinn N. Lathrop and Ying Cheng Assistant Professor Ph.D., University.
River Intercalibration Phase 2: Milestone 4 reports Presented by Wouter van de Bund Joint Research Centre Institute for Environment and Sustainability.
River Intercalibration Phase 2: Milestone 2 reports Presented by Wouter van de Bund Joint Research Centre Institute for Environment and Sustainability.
River Intercalibration Phase 2: Milestone 3 reports Presented by Wouter van de Bund Joint Research Centre Institute for Environment and Sustainability.
“ Building Strong “ Delivering Integrated, Sustainable, Water Resources Solutions Uncertainty & Variability Charles Yoe, Ph.D.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Comparison of freshwater nutrient boundary values Geoff Phillips 1 & Jo-Anne Pitt 2 1 University of Stirling & University College London 2 Environment.
1 A latent information function to extend domain attributes to improve the accuracy of small-data-set forecasting Reporter : Zhao-Wei Luo Che-Jung Chang,Der-Chiang.
WISER Water bodies in Europe: Integrative Systems to assess Ecological status and Recovery.
Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Third phase of deWELopment project Scope of the work Warsaw, 1st Feb
Two-Sample Hypothesis Testing
Agenda item 5: Discussion of next steps
Task on Harmonisation of Freshwater Biological Methods
Constructive Cost Model
Intercalibration progress: Central - Baltic GIG Rivers
What is Project Cost Management?
SoE Guidance – Biological reporting sheets
Setting Classboundaries
CIS Working Group 2A ECOSTAT SCG Meeting in Brussels
EU Water Framework Directive
IMPRESS Guidance and Policy Summary Water Directors Copenhagen, 21-22nd November 2002 Working Group leaders: Volker Mohaupt, Umwelt Bundes Amt Isobel.
Introduction to the design (and analysis) of experiments
FITTING THE ITALIAN METHOD FOR EVALUATING LAKE ECOLOGICAL QUALITY FROM BENTHIC DIATOMS (EPI-L) IN THE “PHYTOBENTHOS CROSS-GIG” INTERCALIBRATION EXERCISE.
Working Group D Reporting, Brussels 31 March – 1 April 2008
Mismatches between nutrients and BQEs: what does it tell us?
Presentation transcript:

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Estimation of uncertainty in status class assessment for Wel waterbodies Jannicke Moe (NIVA) deWELopment project meeting , Warszaw

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund From the deWELopment project description: Third phase: from single metrics to BQE-level assessment We will test different methods of combination of these single metric results to obtain a total result at the whole element level, taking into account the uncertainty in the different single metrics In our project we will test alternative approaches [ to the one-out-all-out principle] using different methods –simple averaging, weighted averaging, multimetric approach

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund From the deWELopment project description: Fourth phase: from BQE-level to waterbody-level assessment Testing different ways of combining the assessment results for different BQEs into one final result for the whole waterbody. –Here the recommended by one-out-all-out rule will be compared to other alternative methods. The risk of misclassification will be estimated –software STARBUGS (  WISERBUGS)

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Outline Uncertainty and risk of misclassification at BQE level Integration of uncertainty from BQE level to waterbody level WISERBUGS tool: examples with deWELopment results Next steps

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Uncertainty and risk of misclassification at BQE level

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Uncertainty: "accuracy" vs. "precision" High accuracy, but low precision –"roughly right" High accuracy and high precision –optimal result Low accuracy, but high precision –"precisely wrong" metric True value metric True value metric True value We can never know the ”true value”of a BQE metric – only the measured value Standard Deviation (SD) is a measure of precision, not accuracy

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Uncertainty in BQE level If we can never know the ”true value” of a BQE metric – how can we say something about uncertainty??? We must assume that the measured mean value represents the true value and the true status class We can assume that measured metric values follow normal distribution due to sampling uncertainty We can let measured SD represent sampling uncertainty Example: Measured metric values: 3, 5, 5, 6, 6 Mean: 5 Standard Deviation: 1.22

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Risk of misclassification - BQE level With the given mean and SD, we can test: if we re-sample the BQE many times, how often will we get the ”true” status class? Assuming that measured status class = true status class Risk of misclassification: = proportion of ”new samples” which result in wrong status class = 0.7% + 20% + 20% + 0.7% = 41.4%

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Risk of misclassifcation increases with SD -Higher SD gives flatter distribution => higher probability that new BQE values fall outside the true class Example: -SD increases from 1.0 to 1.5 -Probability of misclassification increases from 32% to 50% -”True class” is Moderate, but 25% probability that new samples will result in Good or High

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Risk of miscl. increases near class borders -If the measured mean BQE is close to a class border, then new BQE values are more likely to fall into a neighbour class Example: -Mean BQE decreases from 5 to 4.2 -Probability of misclassification increases from 32% to 46% -”True class” is moderate, but 42% probability that new samples will result in Good or High

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund How can we obtain SD for BQE level? Can we get reliable SD estimates from deWELopment data? –Many samples are needed per BQE and waterbody – we have ”only” 1-2 samples per BQE and waterbody (?) –Metric values are not necessarily normally distributed Other distributions can be considered What are the alternatives? –”Use best-available information from replicated sampling studies on environmentally similar waterbodies” (WISER data?) –Use best guesses –(Ask WISER WP6.1 for advice)

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Integration of uncertainty from BQE level to waterbody level

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Macroinvertebrates Phytobenthos Hydrology Acidification Organic Combining metrics and BQEs

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Two issues: 1)How to combine status classes for different metrics and BQEs Average; weighted average; all-out-one-out; etc. Will not be discussed here (see my presentation June 2010) 2)How to combine uncertainty from different metrics and BQEs Tool: WISERBUGS Combining metrics and BQEs

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund WISERBUGS – brief introduction WISER Bioassessment Uncertainty Guidance Software Excel-based tool developed within EU project WISER Purpose: assist in quantifying uncertainty in the assessment of ecological status of waterbodies Can be used for testing impact on classification of: –Combination rules for metrics and for BQEs –Class boundaries and reference conditions –Sampling uncertainty – SD (per metric) –Sorting/identification uncertainty – SD (per metric) –Uncertainty in reference condition – SD (per metric) Can not be used for –estimating SD for metrics (must be done separately) –estimating type I/II errors (because true status class is not known)

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund WISERBUGS – brief introduction WISERBUGS output: probability of each status class for... each waterbody (overall assessment) each BQE within a waterbody each metric within a BQE

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund WISERBUGS – brief introduction WISERBUGS input For each metric: –Measured value for each sample in each waterbody –SD representing sampling variation –Class boundaries (H/G, G/M, M/P, P/B) –”E1”: metric value for which EQR = 1 (Reference value) –”E0”: metric value for which EQR = 0 (bottom of metric scale) –(SD representing sorting/identification variation) –(SD for reference value) For overall assessment: –Combination rules for metrics within BQE –Combination rules for BQEs within waterbody –(Correlation between metrics)

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund WISERBUGS tool: examples with deWELopment results

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Data received from deWELopment Selected metrics: –Rivers: MP: River Macrophyte Index PB: Multimetric Diatom Index for rivers MI: Benthic Macroinvertebrate Index FI: European Fish Index + –Lakes: PP: Chlorophyll a; Phytoplankton Metric for Polish Lakes MP: Ecological State Macrophyte Index PB: Diatom Index for Lakes MI: Benthic Quality Index based on Chironomid Pupal Exuvial Techn. FI: Fish Index 'Summ Best’ (no class boundaries yet)

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Data received from deWELopment Metric values: –13 rivers + 10 lakes –Usually all BQEs for each waterbody Standard deviations per metric: not available –Randomised values used for this excercise –(To be discussed) Class boundaries and reference conditions (”E1”) per metric: –Sometimes waterbody-specific (OK for WISERBUGS) ”E0” – given in correct scale?

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Metric specification - 1 (Lakes) Class boundaries Metric names Other details

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Metric specification - 2 (Lakes) Other types of variation SD from sampling variation NB: SD values are made up!

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Metric specification - 3 (Lakes) Grouping of metrics by pressure within BQE Grouping of metrics by BQE

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Metric specification - 4 (Lakes) Weighting of each BQE Rule for combining BQEs within waterbody (here: one-out-all-out) Rule for combining metrics within BQE

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Results: assessment for individual metrics NB: Fake SD values - results must not be interpreted as real.

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Results: assessment combined per BQE and per waterbody Combination rule for total assessment: ”worst case” (all 4 BQEs)

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Results: assessment for BQEs and waterbody Combination rule for total assessment: ”average” (all 4 BQEs)

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Next steps

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Next steps for data analysis Try to obtain SD estimates / best guesses from BQE groups Quality-check of class boundaries, E0, E1 etc. –Confusion EQR scale vs. original metric scale? Explore impact on classification of... –different level of uncertainty (sampling SD) –different combination rules –etc. –what is most useful for deWELopment? Estimate risk of misclassification for selected cases –”True class” will be determined by the given metric values, and an agreed set of SD and combination rules Include physico-chemical parameters in assessment? Other suggestions?

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Publication 1 manuscript on exploring risk of misclassification using the metrics and classification system developed by deWELopment and the WISERBUGS tool Potential co-authors –BQE group leaders ? –Gosia ? –Coordinators: Hanna and Anne (or acknowledgement?) –WISERBUGS author: Ralph Clarke (or acknowledgement?)