A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.

Slides:



Advertisements
Similar presentations
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
Advertisements

Semantic Grid Tools for Rural Policy Development & Appraisal Department of Computing Science, University of Aberdeen Department of Geography & Environment,
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
Supporting Simulations on the Cloud using Workflows & Virtual Machines Gary Polhill Macaulay Land Use Research Institute Edoardo Pignotti Computing Science,
UCSD SAN DIEGO SUPERCOMPUTER CENTER Ilkay Altintas Scientific Workflow Automation Technologies Provenance Collection Support in the Kepler Scientific Workflow.
1 Semantic Grid Services for Video Analysis Gayathri Nadarajan, Yun-Heh Chen-Burger, James Malone Centre for Intelligent Systems and their Applications.
WS-VLAM Introduction presentation WS-VLAM Semantic tools Systems, Networking, and Engineering group Institute of informatics University of Amsterdam.
Connecting People With Information DoD Net-Centric Services Strategy Frank Petroski October 31, 2006.
DSM Workshop, October 22 OOPSLA 2006 Model-Based Workflows Leonardo Salayandía University of Texas at El Paso.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
Variability Oriented Programming – A programming abstraction for adaptive service orientation Prof. Umesh Bellur Dept. of Computer Science & Engg, IIT.
WebRatio BPM: a Tool for Design and Deployment of Business Processes on the Web Stefano Butti, Marco Brambilla, Piero Fraternali Web Models Srl, Italy.
UvA, Amsterdam June 2007WS-VLAM Introduction presentation WS-VLAM Requirements list known as the WS-VLAM wishlist System and Network Engineering group.
Cyberinfrastructure for Rapid Prototyping Capability Tomasz Haupt, Anand Kalyanasundaram, Igor Zhuk, Vamsi Goli Mississippi State University GeoResouces.
Evgeny Zolin, School of Computer Science, University of Manchester, UK, Andrey Bovykin, Department of Computer Science, University.
Provenance in my Grid Jun Zhao School of Computer Science The University of Manchester, U.K. 21 October, 2004.
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
1 Semantic-Based Workflow Composition for Video Processing in the Grid Gayathri Nadarajan, Yun-Heh Chen-Burger, James Malone Centre for Intelligent Systems.
Enriching the Ontology for Biomedical Investigations (OBI) to Improve Its Suitability for Web Service Annotations Chaitanya Guttula, Alok Dhamanaskar,
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Deciding Semantic Matching of Stateless Services Duncan Hull †, Evgeny Zolin †, Andrey Bovykin ‡, Ian Horrocks †, Ulrike Sattler † and Robert Stevens †
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
Taverna and my Grid Basic overview and Introduction Tom Oinn
Recording application executions enriched with domain semantics of computations and data Master of Science Thesis Michał Pelczar Krakow,
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
111 Notion of a Project Notes from OOSE Slides – a different textbook used in the past Read/review carefully and understand.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
Semantic Web Fred: Project Objectives & SWF Framework Michael Stollberg Reinhold Herzog Peter Zugmann - 07 April
Scientific Workflow Interchanging Through Patterns: Reversals and Lessons Learned Bruno Fernandes Bastos Regina Maria Maciel Braga Antônio Tadeu Azevedo.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
E-Science for the SKA WF4Ever: Supporting Reuse and Reproducibility in Experimental Science Lourdes Verdes-Montenegro* AMIGA and Wf4Ever teams Instituto.
Joint agINFRA & SCI-BUS workshop, 30/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA Joint agINFRA & SCI-BUS workshop agINFRA.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
Chad Berkley NCEAS National Center for Ecological Analysis and Synthesis (NCEAS), University of California Santa Barbara Long Term Ecological Research.
Verification and Validation in the Context of Domain-Specific Modelling Janne Merilinna.
Workflow Project Status Update Luciano Piccoli - Fermilab, IIT Nov
An Ontological Framework for Web Service Processes By Claus Pahl and Ronan Barrett.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
GEON Cyberinfrastructure Workshop Beijing, China, July 21-23, 2006 Workflow-Driven Ontologies for the Geosciences Leonardo Salayandía The University of.
GREGORY SILVER KUSHEL RIA BELLPADY JOHN MILLER KRYS KOCHUT WILLIAM YORK Supporting Interoperability Using the Discrete-event Modeling Ontology (DeMO)
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
Streamflow - Programming Model for Data Streaming in Scientific Workflows Chathura Herath.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Service Service metadata what Service is who responsible for service constraints service creation service maintenance service deployment rules rules processing.
Infrastructures for Social Simulation Rob Procter National e-Infrastructure for Social Simulation ISGC 2010 Social Simulation Tutorial.
Knowledge Representation Breakout KR: to create content (objects, reltnshps) for SMS (logic/inference) that will be useful for enhancing the discovery.
Page 1© Crown copyright 2004 FLUME Metadata Steve Mullerworth 3 rd -4 th October May 2006.
Portable Infrastructure for the Metafor Metadata System Charlotte Pascoe 1, Gerry Devine 2 1 NCAS-BADC, 2 NCAS-CMS University of Reading PIMMS provides.
Interoperability from the e-Science Perspective Yannis Ioannidis Univ. Of Athens and ATHENA Research Center
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
16/11/ Semantic Web Services Language Requirements Presenter: Emilia Cimpian
Recording Actor Provenance in Scientific Workflows Ian Wootten, Shrija Rajbhandari, Omer Rana Cardiff University, UK.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
Example projects using metadata and thesauri: the Biodiversity World Project Richard White Cardiff University, UK
A Collaborative e-Science Architecture towards a Virtual Research Environment Tran Vu Pham 1, Dr. Lydia MS Lau 1, Prof. Peter M Dew 2 & Prof. Michael J.
An Ontological Framework to Support Provenance in a Virtual Research Environment Edoardo Pignotti University of Aberdeen.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
1 The XMSF Profile Overlay to the FEDEP Dr. Katherine L. Morse, SAIC Mr. Robert Lutz, JHU APL
CCNT Lab of Zhejiang University
Web Ontology Language for Service (OWL-S)
Business Process Management and Semantic Technologies
Scientific Workflows Lecture 15
Presentation transcript:

A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School of Natural & Computing Sciences University of Aberdeen The Macaulay Institute, Aberdeen

Content  Background  Workflow Technologies  Upper Deeside Case Study  Research Challenges  Scientist’s Intent Framework  Semantic Workflow Architecture  Evaluation & Conclusions

Background  Semantic Grid Vision –Support for researchers to publish, share and re-use resources, integrate heterogeneous information and collaborate. (  Central to this view is the integration of computational Grid technologies with Semantic Web technologies.  Fearlus-G Project (Pignotti et. al 2005) –Deployed existing social simulation model of land-use change onto the Grid. –Created metadata tools to support annotation/sharing of social simulation resources.  Fearlus-G was not designed to be a flexible problem-solving environment.

Workflow in e-Science  Scientific workflows facilitate the creation and execution of experiments from a pool of available services.  Important part of in-silico experimentation to investigate or verify a hypothesis.  Taverna (myGrid, UK eScience) –Easy use of workflow and distributed compute technology. –Metadata to support the discovery of new services.  Kepler (University of California) –Provides Director component Controls the execution environment (i.e. the “flow”). –Import OWL ontologies. –Metadata to describe activities, inputs and outputs.

Evaluation of Workflow Technologies  Participants from several different disciplines.  Task  design an experiment using a Workflow tool;  Feedback via questionnaires, interviews and through direct observation.  Outcome  It is not possible to represent constraints (e.g. regarding data aggregation);  Contextual information about the experiment is missing;  At the moment, detailed technical documentation is needed in order to fully understand a workflow.

Upper Deeside Case Study Goal: Obtain at least one match where the real data falls within 95% confidence interval of the model value. Constraint: If in any of the 50 runs, one land manager owns more than half of the land, ignore this parameter-set. Constraint: Simulation needs to run on a platform compatible with IEEE 754 floating point standard.

Research Challenges  Need to go beyond low-level service composition by capturing higher-level descriptions of the scientific process.  Make the experimental conditions and goals of the experiment transparent.  Represent “scientist's intent” in such a way that: –it is meaningful to the researcher; –it can be reasoned about by a software application; –it can be re-used across different workflows; –it can be used as provenance (documenting the process that led to some result).

Scientist's Intent Framework  Capture the logic behind scientist’s intent  Model of scientist’s intent based upon rules: PreCondition/Post Action  Based on workflow metadata support and SWRL (Semantic Web Rule Language).  Example GOAL: obtain at least one match where the real data falls within 95% confidence interval of the model value. Pre Condition: ParameterSet( ?x1 ) ^ DataSet( ?x2 ) ^ ComparisonTest( ?x3 ) ^ compares( ?x3, ?x1 ) ^ compares( ?x3, ?x2 ) ^ similarity( ?x3, ?x4 ) ^ [more-than ( ?x4, 95%) = true

Scientist’s Intent for Monitoring Workflow  CONSTRAINT: check if the simulation is running on a platform compatible with the IEEE 754 floating point standard. PreCondition: GridTask( ?x1 ) ^ Simulation( ?x2 ) ^ runsSimulation( ?x1, ?x2 ) ^ neg runsOnPlatform( ?x1, `IEEE754' ) ^ hasResult( ?x2, ?x3) ^ PostCondition: hasInvalidResults( ?x2, ?x3 ) ^ ACTION:resubmitTask(?x1)  Actions performed depend on the ability of the workflow to process events.

Scientist’s Intent for Result Enrichment  CONSTRAINT: if the similarity between the results of a simulation run and an existing dataset is below 10%, it is interesting to investigate the behavior of the run. PreCondition ParameterSet( ?x1 ) ^ DataSet( ?x2 ) ^ Simulation( ?x3 ) ^ hasSimulationRun( ?x3, ?x4 ) ^ ComparisonTest( ?x5 ) ^ compares( ?x5, ?x4 ) ^ compares( ?x5, ?x2 ) ^ similarity( ?x5, ?x6 ) ^ [less-than ( ?x6, 10%) = true] PostAction: runToExplore(?x3, ?x4 )

Scientist’s Intent as Provenance  Scientist’s Intent Ontology  We have attempted to align out ontology with the core characteristics of the Open Provenance Model (OPM)

Workflow Metadata Support  Possible sources of metadata: –metadata about the result(s) generated upon completion of the workflow; –metadata about the data generated at the end of an activity within the workflow or sub-workflow; –metadata about the status of an activity over time, e.g. while the workflow is running; –metadata describing the activity itself (e.g. type of service, platform, etc.)  Ontologies –Scientist’s Intent –PolicyGrid provenance framework

Semantic Workflow Architecture

Scientist’s Intent Interface

Evaluation & Conclusions  Assessing the enhanced workflow representation –Expressiveness of the intent formalism –Reusability –Workflow execution  Creating and utilizing metadata is a non-trivial task  Scale issues with FEARLUS simulation metadata (~250,000,000 triples per experiment).  We aim to provide a closer connection between experimental workflows and the goals and constraints of the researcher, thus making experiments more transparent.

Questions?