A Semantic Type System and Propagation

Slides:



Advertisements
Similar presentations
Overview of the Science Environment for Ecological Knowledge (SEEK) Ricardo Scachetti Pereira.
Advertisements

UCSD SAN DIEGO SUPERCOMPUTER CENTER Ilkay Altintas Scientific Workflow Automation Technologies Provenance Collection Support in the Kepler Scientific Workflow.
WebRatio BPM: a Tool for Design and Deployment of Business Processes on the Web Stefano Butti, Marco Brambilla, Piero Fraternali Web Models Srl, Italy.
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
DSM Workshop, October 22 OOPSLA 2006 Model-Based Workflows Leonardo Salayandía University of Texas at El Paso.
A FRAMEWORK BASED ON WEB SERVICES ORCHESTRATION FOR BIOINFORMATICS WORKFLOW MANAGEMENT Laboratory for Bioinformatics (LBI), Institute of Computing (IC)
Hybrid-Type Extensions for Actor-Oriented Modeling (a.k.a. Semantic Data-types for Kepler) Shawn Bowers & Bertram Ludäscher University of California, Davis.
Chess Review May 8, 2003 Berkeley, CA Classes and Inheritance in Actor- Oriented Models Stephen Neuendorffer Edward Lee UC Berkeley.
February 11, 2010 Center for Hybrid and Embedded Software Systems Ptolemy II - Heterogeneous Concurrent Modeling and Design.
Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Flexible Scientific Workflows Using Dynamic Embedding Anne H.H. Ngu, Nicholas Haasch Terence.
February 12, 2009 Center for Hybrid and Embedded Software Systems Encapsulated Model Transformation Rule A transformation.
WebRatio BPM: a Tool for Design and Deployment of Business Processes on the Web Stefano Butti, Marco Brambilla, Piero Fraternali Web Models Srl, Italy.
About the Data-Flow Complexity of Web Processes Jorge Cardoso Department of Computer Science, University of Madeira Funchal Portugal.
February 12, 2009 Center for Hybrid and Embedded Software Systems Model Transformation Using ERG Controller Thomas H. Feng.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Biology.sdsc.edu CIPRes in Kepler: An integrative workflow package for streamlining phylogenetic data analyses Zhijie Guan 1, Alex Borchers 1, Timothy.
January, 23, 2006 Ilkay Altintas
Composing Models of Computation in Kepler/Ptolemy II
Data R&D Issues for GTL Data and Knowledge Systems San Diego Supercomputer Center University of California, San Diego Bertram Ludäscher
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 20 Object-Oriented.
Kepler/pPOD: Scientific Workflow and Provenance Support for Assembling the Tree of Life UC DAVIS Department of Computer Science The Kepler/pPOD Team Shawn.
SOFTWARE DESIGN.
Science Environment for Ecological Knowledge: EcoGrid Matthew B. Jones National Center for.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Semantic Mediation in SEEK/Kepler: Exploiting Semantic Annotation for Discovery, Analysis, and Integration of Scientific Data and Workflows Bertram Ludäscher.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Chad Berkley NCEAS National Center for Ecological Analysis and Synthesis (NCEAS), University of California Santa Barbara Long Term Ecological Research.
An Ontological Framework for Web Service Processes By Claus Pahl and Ronan Barrett.
Rule-Based Programming for VORBs Bertram Ludaescher Arcot Rajasekar Data and Knowledge Systems San Diego Supercomputer Center U.C. San Diego.
Using R in Kepler Dan Higgins – NCEAS Prepared for: Ecoinformatics Training for Ecologists LTER (Albuquerque) January 8-12, 2007
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Kepler includes contributors from GEON, SEEK, SDM Center and Ptolemy II, supported by NSF ITRs (SEEK), EAR (GEON), DOE DE-FC02-01ER25486.
Knowledge Representation Breakout KR: to create content (objects, reltnshps) for SMS (logic/inference) that will be useful for enhancing the discovery.
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
Ontologies Reasoning Components Agents Simulations An Overview of Model-Driven Engineering and Architecture Jacques Robin.
Workflow-Driven Science using Kepler Ilkay Altintas, PhD San Diego Supercomputer Center, UCSD words.sdsc.edu.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Mechanisms for Requirements Driven Component Selection and Design Automation 최경석.
Efrat Jaeger – SDSC Bertram Ludäscher – UC DAVIS Krishna Sinha – Virginia Tech Ashraf Memon – SDSC Ghulam Memon – SDSC Ilkay Altintas – SDSC Kai Lin –
EcoGrid in SEEK A Data Grid System for Ecology Bertram Ludaescher University of California, Davis Arcot Rajasekar San Diego Supercomputer Center, University.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Scientific workflow in Kepler – hands on tutorial
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
Complexity Time: 2 Hours.
Web Service Modeling Ontology (WSMO)
Data R&D Issues for GTL Bertram Ludäscher Data and Knowledge Systems
Software Quality Engineering
Web Engineering.
Web Ontology Language for Service (OWL-S)
Event Relation Graphs and Extensions in Ptolemy II
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
Software Connectors – A Taxonomy Approach
Service-centric Software Engineering
Retargetable Model-Based Code Generation in Ptolemy II
Service-centric Software Engineering 1
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
Chapter 20 Object-Oriented Analysis and Design
UML profiles.
Mega-modeling for Big Data Analytics
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
Dr. Bhavani Thuraisingham The University of Texas at Dallas
Ivan Kurtev, Klaas van den Berg Software Engineering Group
Software Analysis.
Model-Driven Semantic Web Application Development
Stumpf and Teague Object-Oriented Systems Analysis and Design with UML
Business Process Management and Semantic Technologies
Stumpf and Teague Object-Oriented Systems Analysis and Design with UML
Presentation transcript:

A Semantic Type System and Propagation CYBERINFRASTRUCTURE FOR THE GEOSCIENCES GEON Web Service Based Information Integration A Semantic Type System and Propagation Mechanism for Scientific Workflows Shawn Bowers2 and Bertram Ludäscher1,2,3 1Dept. of Computer Science, 2Genome Center, UC DAVIS 3San Diego Supercomputer Center, UC San Diego www.kepler-project.org The Problem: Design and Reuse of Scientific Workflows and their Components Scientific workflows are becoming increasingly important as a unifying mechanism for interlinking scientific data management, analysis, simulation, and visualization tasks. While current systems like Kepler permit the creation of executable workflows (e.g., from local components and web services), conceptual modeling and design of scientific workflows has been largely neglected so far. Thus design and resuse of (possibly thousands of legacy) components, actors, and workflows is difficult. Our Approach: We have developed a formal model for scientific workflows based on an actor-oriented modeling and design approach, originally developed for studying models of complex concurrent systems. Actor-oriented modeling separates two modeling concerns: component communication (dataflow) and overall workflow coordination (orchestration). Our framework includes a novel hybrid type system, separating further the concerns of conventional data modeling (structural data type) and conceptual modeling (semantic type). In our design methodology, semantic and structural mismatches can be handled independently or simultaneously via different types of adapters, giving rise to new methods of workflow design. The Benefits: Separation of modeling concerns: transport, structure, semantics port types “smart” discovery and linkage of components and data sets Workflow graph is an artifact that can be described, analyzed, shared More independently reusable components Mix of design strategies:step-wise refinement, bottom-up, top-down strategies, data-oriented, task-oriented, … Some costly semantic annotations can be automatically derived Annotation Propagation Problem: Given a structural schemas S (input) and S’ (ouput) and an ontology O, a semantic annotation α a query annotation q Goal: compute α’ Specific Challenges in Scientific Workflow Design: How to support ... (1) ... scientific workflow design process in general? (2) ... "smart" discovery of components (out of thousands ...) (3) ... "smart" linking of data to components (data binding) (4) ... "smart" linking of components to one another (service composition) (5) ... overall orchestration semantics (6) ... propagation of (semantic) type information Command line tools  can invoke R, GMT (1) Actor-oriented SWF Modeling & Design Approach: Separation of Concerns in SWF Modeling and Design: Data ports have ... - a transport type (move data via: object, reference, SRB, GridFTP, scp, ..) - a structural type (XML DTD-ish) - a semantic type (OWL-ish) - a token consumption type (in/out rates of tokens/actor firing) (1)  Design methodology based on an abstract model of SWFs; allows mixes of top-down (stepwise refinement), bottom-up, data-driven, task-driven, structure-driven, semantics-driven ... design (2)  concept/ontology-based actor discovery (3)  semantic annotations of data and actors (4)  use of both structural and semantic types to type-check desired connections and guide suitable pre-/post-actors; introduction of structural and/or semantic adapters ("shims"); basic idea: use logic constraints to express types (5)  employ Ptolemy's Models of Computation/Directors: Process Network, SDF, ... (6)  use query annotations of actors and a procedure similar to the "Chase" (resolution) Scientific Workflow with Semantic Query Annotations (1) Semantic Extensions for SWF Design W0 t W1 W2 Wm Wn … Workflow Design Implementation Top-Down Bottom-Up Input Driven Output Driven Structure Driven Semantic Driven Task Driven Data Driven Interplay between structural and semantic type information Future Plans: Workflow engineers evolve workflows by applying design primitives (left), shown as transformations t; certain primitives can be grouped to form design strategies (right), where each design strategy is shown as a distinct dimension of a design space. Kepler contributors include GEON, SEEK, SDM Center and Ptolemy II, supported by NSF ITRs 022567 (SEEK), EAR-0225673 (GEON), DOE DE-FC02-01ER25486 (SciDAC/SDM), and DARPA F33615-00-C-1703 (Ptolemy).