Introduction to assessment performance Mikko Pohjola, THL.

Slides:



Advertisements
Similar presentations
Climate prediction: a limit to adaptation? Living with climate change: are there limits to adaptation? 7 & 8 February 2008, Royal Geographical Society,
Advertisements

Mywish K. Maredia Michigan State University
The design process IACT 403 IACT 931 CSCI 324 Human Computer Interface Lecturer:Gene Awyzio Room:3.117 Phone:
Chapter 4 Design Approaches and Methods
Determining CLIMASP Competencies Jerash University Development of Interdisciplinary Program on Climate Change and Sustainability Policy- CLIMASP Development.
Introduction to assessment performance Mikko Pohjola, THL.
Lecture 7 Evaluation. Purpose Assessment of the result Against requirements Qualitative Quantitative User trials Etc Assessment of and Reflection on process.
Title slide PIPELINE QRA SEMINAR. PIPELINE RISK ASSESSMENT INTRODUCTION TO GENERAL RISK MANAGEMENT 2.
IFSA 2004 Workshop 5 Combined micro-economic and ecological assessment tools for sustainable rural development in the context of Farming Systems Analysis.
Evaluation. Practical Evaluation Michael Quinn Patton.
Multi-Agency Radiological Laboratory Analytical Protocols Manual: MARLAP Presentation to the Radiation Advisory Committee/Science Advisory Board April.
Best-Fit Evaluation Strategies: Are They Possible? John Carlo Bertot, John T. Snead, & Charles R. McClure Information Use Management and Policy Institute.
What is Business Analysis Planning & Monitoring?
Risk management: State-of-the-art? Mikko Pohjola, THL.
Needs Assessment: Collecting information, making decisions, and accomplishing results Ryan Watkins, Ph.D. George Washington University Maurya West Meiers.
Performance Measurement and Analysis for Health Organizations
Sociology 3322a. “…the systematic assessment of the operation and/or outcomes of a program or policy, compared to a set of explicit or implicit standards.
Standards-Based Science Instruction. Ohio’s Science Cognitive Demands Science is more than a body of knowledge. It must not be misperceived as lists of.
Management & Development of Complex Projects Course Code - 706
Introduction to assessment performance Mikko Pohjola, THL.
06/10/2015 Presentation name / Author1 Evaluating assessment performance Mikko Pohjola, THL.
Management of assessments and decision making: execution, facilitation, evaluation Mikko V. Pohjola, Nordem Oy (THL)
Creating a Shared Vision Model. What is a Shared Vision Model? A “Shared Vision” model is a collective view of a water resources system developed by managers.
A COMPETENCY APPROACH TO HUMAN RESOURCE MANAGEMENT
HCI in Software Process Material from Authors of Human Computer Interaction Alan Dix, et al.
# 1 US Army Engineer Research and Development Center Multi-Criteria Decision Analysis and Environmental Risk Assessment for Nanomaterials Jeff Steevens.
Module 4: Systems Development Chapter 12: (IS) Project Management.
Shared understanding Jouni Tuomisto, THL. Outline What is shared understanding? Main properties Examples of use How does it make things different? Rules.
National Public Health Institute, Finland Open risk assessment Lecture 7: Evaluating assessment performance Mikko Pohjola KTL, Finland.
Introduction to the Research Framework Work-in-progress Conceptualizing the Criteria to assess ‘appropriateness’ of actions in given ‘national’ circumstances.
Introduction to assessment performance Mikko Pohjola, THL.
Integrated Risk Management Charles Yoe, PhD Institute for Water Resources 2009.
Decision analysis and risk management: Introduction to course Jouni Tuomisto, THL.
Max Booleman, Statistics Netherlands Antonio Baigorri, Eurostat The 10 commandments of process and product quality in official statistics.
Risk management: A social learning perspective? Mikko Pohjola, THL.
Chapter 14: Using the Scalable Decision Process on Large Projects The process outlined is meant to be scaleable. Individual steps can be removed, changed,
The Major Steps of a Public Health Evaluation 1. Engage Stakeholders 2. Describe the program 3. Focus on the evaluation design 4. Gather credible evidence.
1 V&V Needs for NextGen of 2025 and Beyond A JPDO Perspective Maureen Keegan JPDO Integration Manager October 13, 2010.
The new EC impact assessment: what for? EUROPEAN TRADE UNION CONFEDERATION Sophie Dupressoir.
The science-policy interface at MNP INTARESE training on uncertainty & quality, 16/17 October 2007 Arthur Petersen.
Shared understanding Jouni Tuomisto, THL. Outline What is shared understanding? Main properties Examples of use How does it make things different? Rules.
National Public Health Institute, Finland Open Risk Assessment Lecture 2: General assessment framework Mikko Pohjola KTL, Finland.
SOLUTION What kind of plan do we need? How will we know if the work is on track to be done? How quickly can we get this done? How long will this work take.
DARM 2013: Assessment and decision making Mikko V. Pohjola, Nordem Oy, (THL)
RLV Reliability Analysis Guidelines Terry Hardy AST-300/Systems Engineering and Training Division October 26, 2004.
Fundamentals of Governance: Parliament and Government Understanding and Demonstrating Assessment Criteria Facilitator: Tony Cash.
Guidance for Uncertainty Scanning and Assessment at RIVM Jeroen van der Sluijs, James Risbey, Penny Kloprogge (Copernicus Institute, Utrecht) Jerry Ravetz.
Designing New Programs Design & Chronological Perspectives (Presentation of Berk & Rossi’s Thinking About Program Evaluation, Sage Press, 1990)
National Public Health Institute, Finland Open Risk Assessment Lecture 2: General assessment framework Mikko Pohjola KTL, Finland.
Model validity, testing and analysis. Conceptual and Philosophical Foundations Model Validity and Types of Models –Statistical Forecasting models (black.
Copernicus Institute Interfaces between Science & Society, Milano, more info: Break-out session Uncertainty, assumptions and value.
Stream A LEGISLATION AND POLICY report back. Main issuess Formal Aspects Experience and lessons learned Plans and visions for the future Actions.
Management and evaluation of open policy processes Mikko V. Pohjola Nordem Oy, THL, Santasport Institute.
NMFS Use Case 1 review/ evaluation and next steps April 19, 2012 Woods Hole, MA Peter Fox (RPI* and WHOI**) and Andrew Maffei (WHOI) *Tetherless World.
Session 2: Developing a Comprehensive M&E Work Plan.
Risk management: Facilitation of (open) risk management Mikko Pohjola, THL.
Organizations of all types and sizes face a range of risks that can affect the achievement of their objectives. Organization's activities Strategic initiatives.
Decision analysis and risk management: Introduction to course Jouni Tuomisto, THL.
Verification vs. Validation Verification: "Are we building the product right?" The software should conform to its specification.The software should conform.
PRAGMATIC Study Designs: Elderly Cancer Trials
Chris Lintern Co-operative Financial Services
Background Non-Formal Education is recognized as an important sub-sector of the education system, providing learning opportunities to those who are not.
Software Verification and Validation
Application of toxicological risk assessment in the society
DARM 2013: Assessment and decision making
Introduction to risk management
Impact assessment and decision making
Frequently asked questions about software engineering
Strategic Environmental Assessment (SEA)
Workshop 1: PROJECT EVALUATION
Presentation transcript:

Introduction to assessment performance Mikko Pohjola, THL

Contents Setting & concepts Common perspectives (& examples) Quality assurance/quality control Uncertainty analysis Model performance Properties of good assessment Summary & discussion

Setting Decision making under uncertainty – Information inputs Assessment results News Hearsay, gossip – Decision making Background knowledge Values/emotions Interpretation,cognition, communication – Outputs Decision(s) -> action(s) -> outcome(s)

Setting Assessment performance is about evaluating – Information...in use making of... – In a situation of incomplete knowledge Are there actually any other situations? How good is it? – Why? – How to evaluate? – What for?

Concepts Some basic concepts: Performance = goodness! Assessment, Management Model Process (making/using), Product Output, Outcome Assessor, Decision/Policy maker, Stakeholder Participant, User

Rationale Why evaluation of assessment performance is important? Efficient use of resources? Value of work done? Importance/meaning of information? Implications of information? Actual impacts of information? … …because funder, customer, user, boss, peer, stakeholder etc. wants/needs to know!

Roles and interests ExpertsData quality, analysis procedure, coherence, comprehensiveness, … FundersRelevance, efficiency, timeliness, importance, … Users (DM)Understandability, reliability (of source), acceptance, practicality, … Interested (SH)(same as DM, but different perspective)

General RA/RM framework Process, product, use

Common perspectives & examples Quality assurance/quality control Focus on assessment process An “engineering” perspective Uncertainty Focus on assessment output (product) A scientists perspective??? Model performance Focus on modelling and model Combines QA/QC and uncertainty perspectives A modellers perspective

Quality assurance/quality control Principle: “Good process guarantees good outputs/outcomes!” Question: “How should an assessment be done?” Examples: Ten steps by Jakeman et al.(2006) IDEA framework (Briggs, 2008) (Over)appreciation of randomized controlled trials (RCT’s)

Ten iterative steps in development and evaluation of environmental models Jakeman et al.: Ten iterative steps in development and evaluation of environmental models. Environmental Modelling & Software Issue 5, May 2006, Pages

IDEA framework (INTARESE) Briggs: A framework for integrated environmental health impact assessment of systemic risks. Environmental Health 2008, 7:61.

RCT as the ultimate study type? RCT results often regarded as the best possible kind of information – Does RCT suit all situations? – DO RCT's provide answers to all kinds of questions? – Are RCT results always better than results from any other kind of study? After all, RCT is just a study procedure!

Uncertainty analysis Principles: “Performance is an intrinsic property of an information product!” “The more accurate, the better!” Question: How good are the assessment results?

Uncertainty analysis Examples: Statistical uncertainty Mean, variance, confidence limits, distributions, … Cf. D. Lindley: Philosophy of Statistics, 2000 Sources of uncertainty For example model, parameter & scenario uncertainty (as applied e.g. by the U.S.EPA) Extensive approaches E.g. inclusion of qualitative aspects, sources of uncertainty as in NUSAP (

NUSAP N: numeral U: unit S: spread A: assessment (qualitative judgment) P: pedigree (historical path leading to result)

NUSAP - pedigree Jeroen van der Sluijs: NUSAP- some examples. Presentation. Available:

Model performance Principle: The model is the essence of the assessment! Question: How good is the model? Examples: Verification, validation, (reliability, usability, …) Outcome-oriented approach by Matthews et al. 2011

Outcome-oriented modelling approach Matthews et al.: Raising the bar? – The challenges of evaluating the outcomes of environmental modelling and software. Environmental Modelling & Software, March 2011, Pages

Summary of common perspectives Assessment process and outputs are addressed in many ways Use of results mostly not considered The link between outputs and outcomes (cf. Matthews et al. 2011) Evaluation often a separate process Expert processes of making assessments and using their results Expert processes of evaluating performance Alternative perspectives?

Properties of good assessment

Ex post (after assessment) evaluation Ex ante (before/during assessment) evaluation Guidance of design and execution Links process and output with use Thereby also linking them to outcomes

Example: what makes a good hammer?

How is the hammer made? By whom? What properties does the hammer have? What do you want to do with the hammer? How does the hammer help you do it? What is it that really makes a difference?

Summary Consideration of (intended) use is essential Consideration of process and product in light of use In policy-support information is a tool (a means to an end) In policy support, information is a tool – Consider the instrumental value of information Cf. absolute value (a common science view) Cf. Ad hoc solutions (a common practice view) Contextuality, situatedness, practicality, … A model is a tool for producing information How does this relate to the previous lectures about DA and the DA study plan exercise?

Discussion example: swine flu vaccination Because of urgence, swine flu vaccination was bought in Finland without a thorough testing. When narcolepsy cases were identified, the decision made without testing was seen as a major mistake. Was it a mistake? – How should we evaluate the situation to find an answer? – How did the decision-maker assess the situation? – How should she have assessed the situation?

Swine flu example: issues in performance? What are the critical issues in the assessment performance? Possibilities include e.g. – The assessment truthfully estimates the total health impact of swine flu. – The assessment truthfully estimates the health impact of a vaccination campaign. – The only tested vaccines are assessed. – The assessment does not underestimate potential side effects of the vaccine, whether tested or not. – Something else, what?

Swine flu example: follow-up as a part of assessment performance? What are the methods to identify if something starts to go on after the decision? Should these be assessed already in the assessment before the decision? How can this be done? Does this improve the assessment performance?

Setting