Workflow Validation Kerstin Kleese van Dam Michela Taufer.

Slides:



Advertisements
Similar presentations
Project Based Learning
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Lecture 8: Testing, Verification and Validation
Science as a Process Chapter 1 Section 2.
Overview of IS Controls, Auditing, and Security Fall 2005.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Computer Engineering 203 R Smith Project Tracking 12/ Project Tracking Why do we want to track a project? What is the projects MOV? – Why is tracking.
CHAPTER NO 2.  Everything in history which has been planned was done in a highly predictable manner  Whatever you do whether launching a new product,
Workshop Charge J.C. Wells & M. Sato Park Vista Hotel Gatlinburg, Tennessee September 5-6, 2014.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
Object-Oriented Analysis and Design
Informatics 43 – May 12, Restatement of Goals for Testing Want to verify software’s correctness  Need to test  Need to decide on test cases 
Two main requirements: 1. Implementation Inspection policies (scheduling algorithms) that will extand the current AutoSched software : Taking to account.
1 Presenter: Chien-Chih Chen Proceedings of the 2002 workshop on Memory system performance.
Principle of Functional Verification Chapter 1~3 Presenter : Fu-Ching Yang.
Short Course on Introduction to Meteorological Instrumentation and Observations Techniques QA and QC Procedures Short Course on Introduction to Meteorological.
Handouts Software Testing and Quality Assurance Theory and Practice Chapter 11 System Test Design
Open loop vs closed loop By Norbert Benei ZI5A58.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Chapter : Software Process
Mining Large Data at SDSC Natasha Balac, Ph.D.. A Deluge of Data Astronomy Life Sciences Modeling and Simulation Data Management and Mining Geosciences.
Section 2: Science as a Process
It would be difficult to support any claim that problem solving is not an important goal in the teaching of science. Scientists by their very nature are.
® IBM Software Group © 2006 IBM Corporation Writing Good Use Cases Module 4: Detailing a Use Case.
CS 501: Software Engineering Fall 1999 Lecture 16 Verification and Validation.
CPIS 357 Software Quality & Testing
CMSC 345 Fall 2000 Unit Testing. The testing process.
Business Analysis and Essential Competencies
Introduction to Earth Science Section 1- What is Earth Science Section 2- Science as a Process.
Event Management & ITIL V3
What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.
1 Science as a Process Chapter 1 Section 2. 2 Objectives  Explain how science is different from other forms of human endeavor.  Identify the steps that.
What is a Business Analyst? A Business Analyst is someone who works as a liaison among stakeholders in order to elicit, analyze, communicate and validate.
A simple performance measurement framework A good performance measurement framework will focus on the customer and measure the right things. Performance.
1 Creation versus Evolution. 2 Creation versus Evolution The Theory of Intelligent Design holds that certain aspects of the Universe are best explained.
Application of Provenance for Automated and Research Driven Workflows Tara Gibson June 17, 2008.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
DAME: A Distributed Diagnostics Environment for Maintenance Duncan Russell University of Leeds.
WHAT IS THE NATURE OF SCIENCE?. SCIENTIFIC WORLD VIEW 1.The Universe Is Understandable. 2.The Universe Is a Vast Single System In Which the Basic Rules.
Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University.
Chap. 5 Building Valid, Credible, and Appropriately Detailed Simulation Models.
Chapter 3: Software Project Management Metrics
Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models.
UNIT-1 SOFTWARE PRODUCT AND PROCESS: Introduction – S/W Engineering paradigm – Verification – Validation – Life cycle models – System engineering –
16/11/ Semantic Web Services Language Requirements Presenter: Emilia Cimpian
Software Engineering for Capstone Courses Richard Anderson CSE 481b Winter 2007.
Chapter 1 Preview Objectives Physics The Scientific Method Models
The Implementation of BPR Pertemuan 9 Matakuliah: M0734-Business Process Reenginering Tahun: 2010.
Census Processing Baku Training Module.  Discuss:  Processing Strategies  Processing operations  Quality Assurance for processing  Technology Issues.
Mahmut Ali GÖKÇEIndustrial Systems IEU Introduction to System Engineering ISE 102 Spring 2007 Notes & Course Materials Asst. Prof. Dr. Mahmut.
© Fraunhofer IESE Fraunhofer IESE Multi-Language Development of Embedded Systems Dr. Thomas Kuhn Tel.:
Banaras Hindu University. A Course on Software Reuse by Design Patterns and Frameworks.
Physical Science and You Chapter One: Studying Physics and Chemistry Chapter Two: Experiments and Variables Chapter Three: Key Concepts in Physical Science.
Quality Is in the Eye of the Beholder: Meeting Users ’ Requirements for Internet Quality of Service Anna Bouch, Allan Kuchinsky, Nina Bhatti HP Labs Technical.
ERP and Related Technologies
Chapter 1 Section 2 Science as a Process Ms. Scerra.
Identify and Explain Effective Working Practices OCR Diploma Level 2.
Collection and storage of provenance data Jakub Wach Master of Science Thesis Faculty of Electrical Engineering, Automatics, Computer Science and Electronics.
RUP RATIONAL UNIFIED PROCESS Behnam Akbari 06 Oct
Name of Student : PATEL ARPITKUMAR RAJNIKANT Enrollment No
Electron Ion Collider New aspects of EIC experiment instrumentation and computing, as well as their possible impact on and context in society (B) COMPUTING.
CS4311 Spring 2011 Process Improvement Dr
WHAT IS THE NATURE OF SCIENCE?
Section 2: Science as a Process
Gabor Madl Ph.D. Candidate, UC Irvine Advisor: Nikil Dutt
Scientific Inquiry Unit 0.3.
Overview of Workflows: Why Use Them?
Process Wind Tunnel for Improving Business Processes
Day 1, Session 4 Building Your Service Catalog
Presentation transcript:

Workflow Validation Kerstin Kleese van Dam Michela Taufer

Discussion Topics Identify State of the Art, Gaps and Future Research Topics for: Validating Performance Validation and Reproducibility Validating Accuracy

Performance Validation (1) State of the Art: No tools, just one off studies. Single application performance tools are available, but not necessarily applicable to workflows as is. Different vision of performance in Cloud computing - i.e., latency driven. Cloud deploys the concept of enough accurate service within a certain time /amount of resources Gaps: Tools to model and predict workflow performance, tools to monitor performance to validate and improve future predictions and models (importance of factors) Information flow between system, application and workflow How to express and achieve performance goals - for a facility versus for a specific workflow Do we know what metrics we need to capture for different performance goals? Who is capturing them and which granularity

Performance Validation (2) Future Research Topics: Tools to monitor, model and predict performance Performance information capture at runtime - workflow and system, capture information about events, capture at different levels - provenance In-situ analysis of performance data Intelligent schedulers at different levels to optimize performance according to set goals

Reproducibility and Validation (1) State of the Art: Definition of “reproducibility” depends on community and single scientist; it ranges from stringent bit-to-bit reproducibility to reproducibility of the science with different methods. Some disagreements on who is responsible for reproducibility: the application, the workflow, the system? In the end in complex applications, the workflow has major role on domain decompositions. Gaps: Not clear role of the workflow in tracking aspects related to reproducibility. Hidden information that if revealed can be overwhelming. How can you track everything at exascale, especially when you have unexpected events? What is the responsibility of the workflow system and what it needs to track? Provenance should be viewed from the point of view of the consumer of the information The level of resolution to keep provenance need to be define Need to define the lifespam of validation/provenance data

Reproducibility and Validation (2) Future Research Topics: Integration of application, programming models, and run-time systems to pursue reproducibility Pursue reproducibility of results to identify incorrect science. When publishing results, we need to provide provenance, so that others can reproduce the results Automatic annotation embedded in multi-layer or modular workflows

Validating Accuracy (1) State of the Art: Who is responsible for which part of accuracy? Application responsible to guarantee accuracy No test suite to check for workflow and system accuracy regularly No one is watching the correctness of the workflow manager establish accuracy of components a priori, workflow system can monitor specific variables to see if application/workflow is progressing to plan and present to user. First research in using data mining and machine learning to identify deviation and correct for it - challenge its black box - so how to convey it to the user, trade offs of between workflow completion and accuracy observed Gaps: Link to reproducibility - can we reproduce to validate accuracy Can we use accuracy measure to determine what data to keep or through away, can accuracy metrics trigger capture of more system information to inform later debugging Investigate human in the loop role in determining accuracy (what deviation is acceptable), and decide on suitable actions.

Validating Accuracy (2) Future Research Topics: We cannot always get exact numbers, but the numbers need to be within a certain range For a given problem, can you capture what the best settings are?