Hybrid-Type Extensions for Actor-Oriented Modeling (a.k.a. Semantic Data-types for Kepler) Shawn Bowers & Bertram Ludäscher University of California, Davis.

Slides:



Advertisements
Similar presentations
Profiles Construction Eclipse ECESIS Project Construction of Complex UML Profiles UPM ETSI Telecomunicación Ciudad Universitaria s/n Madrid 28040,
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Semantic annotation on the SONet and Semtools projects: Challenges for broad multidisciplinary exchange of observational data Mark Schildhauer, NCEAS/UCSB.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
DSM Workshop, October 22 OOPSLA 2006 Model-Based Workflows Leonardo Salayandía University of Texas at El Paso.
Object-Oriented Analysis and Design
Xyleme A Dynamic Warehouse for XML Data of the Web.
Workflow Exchange and Archival: The KSW File and the Kepler Object Manager Shawn Bowers (For Chad Berkley & Matt Jones) University of California, Davis.
Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Flexible Scientific Workflows Using Dynamic Embedding Anne H.H. Ngu, Nicholas Haasch Terence.
Brian A. Carlsen Apelon, Inc. Tools For Classification Integration Networked Knowledge Organization Systems/Services Workshop June 28, 2001.
February 12, 2009 Center for Hybrid and Embedded Software Systems Encapsulated Model Transformation Rule A transformation.
February 12, 2009 Center for Hybrid and Embedded Software Systems Model Transformation Using ERG Controller Thomas H. Feng.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
Course Instructor: Aisha Azeem
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
January, 23, 2006 Ilkay Altintas
A university for the world real R © 2009, Chapter 18 Process Configuration Florian Gottschalk Marcello La Rosa.
Scientific Workflows Scientific workflows describe structured activities arising in scientific problem-solving. Conducting experiments involve complex.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Slide 1 Wolfram Höpken RMSIG Reference Model Special Interest Group Second RMSIG Workshop Methodology and Process Wolfram Höpken.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
Conceptual Modeling Issues in Web Applications enhanced with Web services Sara Comai, Politecnico di Milano In collaboration with:
Practical RDF Chapter 1. RDF: An Introduction
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
Introduction to MDA (Model Driven Architecture) CYT.
Assessing the Suitability of UML for Modeling Software Architectures Nenad Medvidovic Computer Science Department University of Southern California Los.
Pipelines and Scientific Workflows with Ptolemy II Deana Pennington University of New Mexico LTER Network Office Shawn Bowers UCSD San Diego Supercomputer.
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
Workflow Topics for the Next- Generation SDM-Center Ilkay Altintas Bertram Ludäscher San Diego Supercomputer Center.
1 Software Design Reference: Software Engineering, by Ian Sommerville, Ch. 12 & 13, 5 th edition and Ch. 10, 6 th edition.
Scientific Workflow Interchanging Through Patterns: Reversals and Lessons Learned Bruno Fernandes Bastos Regina Maria Maciel Braga Antônio Tadeu Azevedo.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Semantic Mediation in SEEK/Kepler: Exploiting Semantic Annotation for Discovery, Analysis, and Integration of Scientific Data and Workflows Bertram Ludäscher.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Chad Berkley NCEAS National Center for Ecological Analysis and Synthesis (NCEAS), University of California Santa Barbara Long Term Ecological Research.
Dimitrios Skoutas Alkis Simitsis
Model Driven Development An introduction. Overview Using Models Using Models in Software Feasibility of MDA MDA Technologies The Unified Modeling Language.
1 5 Nov 2002 Risto Pohjonen, Juha-Pekka Tolvanen MetaCase Consulting AUTOMATED PRODUCTION OF FAMILY MEMBERS: LESSONS LEARNED.
The Volcano Optimizer Generator Extensibility and Efficient Search.
Sheet 1XML Technology in E-Commerce 2001Lecture 2 XML Technology in E-Commerce Lecture 2 Logical and Physical Structure, Validity, DTD, XML Schema.
Logical view –show classes and objects Process view –models the executables Implementation view –Files, configuration and versions Deployment view –Physical.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
An Ontology-Driven Framework for Data Transformation in Scientific Workflows Shawn Bowers Bertram Ludäscher San Diego Supercomputer Center University of.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Object Oriented Programming and Data Abstraction Earl Huff Rowan University.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Web Service Modeling Ontology (WSMO)
Web Ontology Language for Service (OWL-S)
Semantic Markup for Semantic Web Tools:
A Semantic Type System and Propagation
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Scientific Workflows Lecture 15
Presentation transcript:

Hybrid-Type Extensions for Actor-Oriented Modeling (a.k.a. Semantic Data-types for Kepler) Shawn Bowers & Bertram Ludäscher University of California, Davis Genome Center & CS Dept. May, 2005

Outline 1.Hybrid Types 2.Hybrid Types and Scientific Workflow Design 3.Super Rapid Prototyping: The “Sparrow Family of Languages” 4.Next Steps: Adding a Hybrid-Type System to Kepler

Hybrid Types

Hybrid Types: Superimposing Semantics Separation of Concerns: Conventional Data Modeling (Structural Data Types) E.g., XML Schema / DTD, etc. Conceptual Data Modeling (Semantic Types) Drawn from ontologies (expressed in Description Logic) Capturing domain knowledge (e.g., biodiversity, ecology) Explicit (external) linkages (Semantic Annotations) Simple links (one concept per item) Links expressed as constraints (logical mappings)

Hybrid Types: Superimposing Semantics table meas sitesppbm string double plot string T1 := * Datatypes a relational table of measurements list spp string T2 := * a list of strings

Hybrid Types: Superimposing Semantics table meas sitesppbm string double plot string T1 := * SpeciesBiomass ⊑ Measurement ⊓  item.Species ⊓  prop.Biomass ⊓  loc.Location “Species biomass is a measure of the amount of biomass of a particular species within a location.” SpeciesCommBiomass ⊑ SpeciesBiomass ⊓  loc.Community “Species community biomass is a species biomass within a community.” SpeciesCommBiomass Semtypes table/X:meas => X:SpeciesCommBiomass each table/meas instance is a measurement

Hybrid Types: Superimposing Semantics table meas sitesppbm string double plot string T1 := * SpeciesBiomass ⊑ Measurement ⊓  item.Species ⊓  prop.Biomass ⊓  loc.Location “Species biomass is a measure of the amount of biomass of a particular species within a location.” SpeciesCommBiomass ⊑ SpeciesBiomass ⊓  loc.Community “Species community biomass is a species biomass within a community.” SpeciesCommBiomass Semtypes table/X:meas, X/Y:site, X/Z:plot => X:SpeciesCommBiomass, C=f(Y,Z), (X,C):loc, C:Community, Community

Hybrid Types: Superimposing Semantics table meas sitesppbm string double plot string T1 := * SpeciesBiomass ⊑ Measurement ⊓  item.Species ⊓  prop.Biomass ⊓  loc.Location “Species biomass is a measure of the amount of biomass of a particular species within a location.” SpeciesCommBiomass ⊑ SpeciesBiomass ⊓  loc.Community “Species community biomass is a species biomass within a community.” SpeciesCommBiomass Semtypes table/X:meas, X/Y:site, X/Z:plot, X/U:spp => X:SpeciesCommBiomass, C=f(Y,Z), (X,C):loc, C:Community, (X,U):item, U:Species, Community Species

Hybrid Types: Superimposing Semantics table meas sitesppbm string double plot string T1 := * SpeciesBiomass ⊑ Measurement ⊓  item.Species ⊓  prop.Biomass ⊓  loc.Location “Species biomass is a measure of the amount of biomass of a particular species within a location.” SpeciesCommBiomass ⊑ SpeciesBiomass ⊓  loc.Community “Species community biomass is a species biomass within a community.” SpeciesCommBiomass Semtypes table/X:meas, X/Y:site, X/Z:plot, X/U:spp, X/B:bm => X:SpeciesCommBiomass, C=f(Y,Z), (X,C):loc, C:Community, (X,U):item, U:Species, (X,B):prop, B:Biomass. Community Species Biomass

Hybrid Types: Superimposing Semantics Searching –Concept-based, e.g., “find all datasets containing biomass measurements” Merging/Integrating –Combining heterogeneous sources based on annotations –Concatenate, Union (merge), Join, etc. Transforming –Construct mappings from schema S1 to S2 based on annotations Semantic Propagation –“Pushing” semantic annotations through transformations/queries

Semantic Annotation Propagation Capture I/O constraints –Similar to unit type constraints –Can enable automated metadata creation (annotation “propagation”) –Can help refine ontologies and existing annotations

Semantic Annotation Propagation table/X:meas, X/Y:site, X/Z:plot, X/U:spp, X/B:bm => X:SpeciesCommBiomass, C=f(Y,Z), (X,C):loc, C:Community, (X,U):item, U:Species, (X,B):prop, B:Biomass. table meas sitesppbm string double plot string T1 := * seasonal species p1::T1 p3::T3 p3.obs (site, plot, spp, bm) :- p1.meas (site, plot, spp, bm), p2.spp (spp). Port 1 Annotation Actor I/O constraint (approx.) table obs sitesppbm string double plot string T3 := * table/X:obs, X/Y:site, X/Z:plot, X/U:spp, X/B:bm => X:SpeciesCommBiomass, C=f(Y,Z), (X,C):loc, C:Community, (X,U):item, U:Species, (X,B):prop, B:Biomass. Port 2 “Chased” Annotation p2::T2

Hybrid Types and Scientific Workflow Design

Workflow Design Primitives End-to-End Workflow Design and Implementation –Viewed as a series of primitive “transformations” –Each takes a WF and produces a new WF –Can be combined to form design “strategies” W0W0 t W1W1 W2W2 WmWm WnWn … t t Workflow Design Workflow Implementation Top-Down Bottom-Up Input Driven Output Driven Structure Driven Semantic Driven Task Driven Data Driven

Workflow Design Primitives Semantic types and Actor Oriented Modeling: –Actors and Workflows: can have semantic types conceptually describing their “function” –Ports: can have semantic types conceptually describing what they consume and produce –I/O Constraints: a general form of constraint between input and output (e.g., like unit constraints) … approximating the function of an actor

Basic Design Primitives Inherited from Ptolemy Basic Transformations Starting WorkflowResulting Workflow t 1 : Entity Introduction (actor or data connection) t 2 : Port Introduction t 6 : Data Connection t 4 : Hierarchical Abstraction t 5 : Hierarchical Refinement t 3 : Datatype Refinement ( s’ s, t’ t ) s t 7 : Director Introduction Resulting Workflow sstt t

Additional (Planned) Design Primitives for Semantic Types Extended TransformationsStarting WorkflowResulting Workflow t 9 : Actor Semantic Type Refinement (T T) T t 12 : I/O Constraint Strengthening (    ) t 10 : Port Semantic Type Refinement (C C, D D) C t 14 : Adapter Insertion T t 11 : Annotation Constraint Refinement (    ) s C 11  t 15 : Actor Replacement f f t 16 : Workflow Combination (Map) t 13 : Data Connection Refinement … f1f1 f2f2 f1f1 … f2f2  Resulting Workflow D C DC D t D 22 11 t D 22 s C 11 t D 22 s C

Adapters for Semantic and Structural Incompatibility Adapters may: –be abstract (no impl.) –be concrete –bridge a semantic gap –fix a structural mismatch –be generated automatically (e.g., Taverna’s “list mismatch”) –be reused components (based on signatures) C1C1 C1C1 D1D1 C1C1 C2C2 CDC CD D DD C2C2 C2C2 D2D2 f2f2 f1f1 [S] S T f1f1 [T] f2f2 map f2f2 f1f1 [[S]] S T f1f1 [[T]] f2f2 map

Applying the Replacement Primitive C f D C f D general replacement C f D C f D unsafe replacement C f context-sensitive replacement (“wiggle room”) DC D C f D  C  D C D C,C overlap (e.g., C C) D,D overlap (e.g., D D) C C  (e.g., C C  ) D D  (e.g., D D  ) –General replacement doesn’t consider surrounding connections –Context-sensitive replacement gives more “wiggle room” by “tuning” the actors semtypes based on connections

Workflow Elaboration Adapter insertion, replacement, and search provide a powerful mechanism for workflow “elaboration”: 1.Given an initial, user specified set of connected “abstract” actors 2.Repeatedly search for replacement “concrete” actors (atomic and composite) 3.At each step, insert adapters when necessary 4.Allow user to select returned workflows to be combined

Super Rapid Prototyping: The Sparrow “Family of Languages”

The “Sparrow Family of Language” Basic Idea: Have both Machine and Human readable syntax Sparrow-DL Description logic Sparrow-DTD Datatypes, variant of XML DTDs Sparrow-Annotate Configuring concepts; linking datatypes and ontologies Sparrow-SWF KSW-Based MoML Metadata Sparrow-Rule Fancy stuff, like type constraints (a la unit types), function approximation, and misc. other constraints

The “Sparrow Family of Languages” Sparrow-SWF Sparrow-DL Sparrow-DTD

Sparrow-Toolkit Operations Sparrow-Toolkit (example) Operations –Is w1 semantically and/or structurally well typed? –What can be semantically connected to a3? –Insert “abstract” adapter between a3 and a4 –What can replace (e.g., implement) the adapter? –… a1 a3 a4 p1:: [string] p1:: string p2:: int p1:: string P2:: {price=string, cond=string, seller={…}} AuthorISBNAuthor ASINAMSQuote w1

Future Steps: Adding a Hybrid-Type System to Kepler

Concept-based Actor Search –Implemented as proof-of- concept About to undergo major revision Additional operations slated for next Kepler Release (data search, actor-based port search, etc.) Biggest Challenges –Building/searching a repository … –Making changes to MoML (see KSW) –GUI changes –Ontology management Current and Future Implementation Workflow Components (MoML) Ontologies (OWL) Default + Other Semantic Annotations urn ids instance expressions