Provenance-aware faceted search Peter Fox Stephan Zednik Patrick West Tetherless World Constellation, RPI EGU 2010.

Slides:



Advertisements
Similar presentations
Geoinformatics 2008 Fox Semantic Provenance 1 Semantic Provenance for Image Data Processing Peter Fox (HAO/ESSL/NCAR) Deborah McGuinness (RPI) Jose Garcia,
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
April 24, 2007McGuinness NIST Interoperability Week Ontology Summit Semantic Web Perspective Deborah L. McGuinness Acting Director & Senior Research Scientist.
August 6, 2009 Joint Ontolog-OOR Panel 1 Ontology Repository Research Issues Joint Ontolog-OOR Panel Discussion Ken Baclawski August 6, 2009.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Open Provenance Model Tutorial Session 2: OPM Overview and Semantics Luc Moreau University of Southampton.
ICS-FORTH May 23, An Ontological Approach to Digital Preservation Metadata Martin Doerr Foundation for Research and Technology - Hellas Institute.
Evolving the BCO-DMO search interface - experience with semantic and smart search Cyndy Chandler (WHOI) Peter Fox (RPI and WHOI) Robert Groman, Dicky Allison.
Presenting Provenance Based on User Roles Experiences with a Solar Physics Data Ingest System Patrick West, James Michaelis, Peter Fox, Stephan Zednik,
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Experiences Developing a User- centric Presentation of A Domain- enhanced Provenance Data Model Cynthia Chang 1, Stephan Zednik 1, Chris Lynnes 2, Peter.
Applying Semantics in Dataset Summarization for Solar Data Ingest Pipelines James Michaelis ( ), Deborah L. McGuinness
Linked Data and the Provenance Explosion Deborah L. McGuinness Tetherless World Constellation Chair Professor of Computer Science and Cognitive Science.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
Configurable User Interface Framework for Cross-Disciplinary and Citizen Science Presented by: Peter Fox Authors: Eric Rozell, Han Wang, Patrick West,
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
SOUPA: Standard Ontology for Ubiquitous and Pervasive Applications Harry Chen, Filip Perich, Tim Finin, Anupam Joshi Department of Computer Science & Electrical.
References: [1] [2] [3] Acknowledgments:
Understanding PML Paulo Pinheiro da Silva. PML PML is a provenance language (a language used to encode provenance knowledge) that has been proudly derived.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI Week 10, November.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Discovering accessibility, display, and manipulation of data in a data portal Nancy Hoebelheinrich Patrick West 2
A Semantically-Enabled Provenance- Aware Water Quality Portal Joint work with: Jin Guang Zheng, Ping Wang, Evan Patton, Timothy Lebo, Joanne Luciano Deborah.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
1 Semantic Provenance and Integration Peter Fox and Deborah L. McGuinness Joint work with Stephan Zednick, Patrick West, Li Ding, Cynthia Chang, … Tetherless.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
ToolMatch Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Products Patrick West 1 Nancy Hoebelheinrich.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Semantics and analytics = making the data and the decisions smarter? Digital Antiquity CI Feb 7-8, 2013, Arlington VA Peter Fox (RPI and WHOI)
Realities in Science Data and Information - Let's go for translucency AGU FM10 IN13B-02 Peter Fox (RPI) Tetherless World.
Information Modeling and Semantic Web Application For National Climate Assessment Jin Guang Zheng 1 Curt Tilmes 2
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
Explainable Adaptive Assistants Deborah L. McGuinness, Tetherless World Constellation, RPI Alyssa Glass, Stanford University Michael Wolverton, SRI International.
17 th October 2002Data Provenance Grid Data Requirements Scoping Metadata & Provenance Dave Pearson Oracle Corporation UK.
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
Information Model Driven Semantic Framework Architecture and Design for Distributed Data Repositories AGU 2011, IN51D-04 December 9, 2011 Peter Fox (RPI)
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
Annotating and Embedding Provenance in Science Data Repositories to Enable Next Generation Science Applications Deborah L. McGuinness.
Poster: EGU Glossary: USGCRP – United States Global Change Research Program NCA – National Climate Assessment GCIS – Global Change Information.
‘Ontology Management’ Peter Fox (Semantic Web Cluster lead)
improve the efficiency, collaborative potential, and
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Understanding PML A Proof Markup Language
Modeling Data Set Versioning Operations
Data Provenance.
Modeling Data Set Versioning Operations
Presentation transcript:

Provenance-aware faceted search Peter Fox Stephan Zednik Patrick West Tetherless World Constellation, RPI EGU 2010

Provenance def: A record of ownership of a work of art or an antique, used as a guide to authenticity or quality Documentation of processes in a digital object’s life cycle Origin or source from which something comes, manner of manufacture, production, or discovery, documented in sufficient detail to allow reproducibility or validation 2

Provenance-related Use Cases What calibrations have been applied to this image? 1 What were the cloud cover and seeing conditions during the observation period of this image? 1 Why does this image look bad? 1 What are the processing and parameter differences between the MODIS Daily AOT Data Product vs. the MODIS Monthly AOT Data Product? SPCDIS 2 MDSA

One* View of Provenance in e- Science Assume the provenance of objects is represented by an annotated causality graph For our purpose a provenance graph is a representation of a record of past execution Process Artifact 4 * One of MANY. This representation of provenance satisfies our Need to capture processing history and information dependency Artifact Process

Modeling a Provenance Use Case What calibrations have been applied to this image? Provenance concepts Solar Science concepts Data Product Data Processing concepts Data Filtering Process Data Filtering Process Raw Image Optics Calibration Process Data Calibration Process Flat-field Calibration Angle of Incidence Calibration Junk Data Filter 5

Old way Tetherless World Constellation 6

7

8 Resulting Image, no further information provided

Provenance aware faceted search Tetherless World Constellation 9

Knowledge Base with Provenance and Domain Models in Alignment 10 Data Capture Data Capture Instrument Justification Conclusion Source Rule CSR Image #MyImage #MyImage _justificatio n #He-1083 nm Continuum Image Capture T17:30:00- 08:00 #CHIP NodeSet SourceUsage xsd:DateTim e hasSourceUsage rdf:type rdf:datatype rdf:type hasInferenceRule

Foundation Ontologies/Vocabularies Ontologies designed with reuse and extension in mind – Some contain loosely-scoped high-level concepts – Some narrow focus with intent to be used in diverse domains Simplify information exchange and interoperability Examples – FOAF – SKOS – Dublin Core – OWL Time 11 OWL Time

Foundation Data Provenance Models Provenance Markup Language (PML) – OWL ontology – from automated intelligent systems community – Provenance as information justification – Additional Semantic support for explanation and trust Open Provenance Model (OPM) – Language-agnostic* model – from workflow community – Provenance as artifact creation – Additional support for modeling controlling processes and actors 12

Interoperability with Provenance Tools 13 Probe-It! Inference Web Browser

Proof Markup Language (PML) Justification – Explanation – Causality graph Provenance – Conclusion – Source – Engine – Rule Trust – Trust/Belief metrics NodeSet Justification Conclusion NodeSet Justification Conclusion NodeSet Justification Conclusion Engine Rule hasAntecedentList hasSourceUsage hasInferenceRule hasInferenceEngine SourceUsage Source DateTime 14

Open Provenance Model Agents – Catalyst and controlling entity of a process Processes – Action or Series of actions performed resulting in new artifacts Artifacts – Immutable piece of state Roles – Non-semantic flat tags used to provide context in relations Artifact Process wasGeneratedBy(Role) Agent Artifact used(Role) wasControlledBy(Role) Artifact wasDerivedFrom(Role) Process wasGeneratedBy(Role) wasTriggeredBy(Role) 15

SourceUsage Concept Alignment (PML) Data Capture Data Capture Instrument Data Product Data Calibration Data Calibration Raw Data NodeSet Justification Conclusion NodeSet Justification Conclusion hasAntecedentList Observation Period hasSourceUsage Source DateTime Rule Engine Rule Calibration 16

Concept Alignment (OPM) 17 Data Capture Data Capture Instrument Data Product Data Calibration Data Calibration Raw Data Observation Period Calibration Artifact Process wasGeneratedBy(DataCalibrationProcess) Agent Artifact wasControlledBy(Instrument) Process used(RawData) wasGeneratedBy(DataCaptureProcess) Artifact used(DataCalibration) Artifact used(Timestamp)

Alignment via Ontology Constructs Use ontology constructs to map a relationship between concepts in different domains Can be defined in a separate ontology than the models being mapped Does not require a change to the source models! OWL – owl:equivalentClass – owl:equivalentProperty – owl:sameAs RDFS – rdfs:subClassOf – Rdfs:subPropertyOf 18 Instrument Source rdfs:subClassOf Calibration Rule rdfs:subClassOf Conclusion Data Product rdfs:subClassOf

Direct Alignment using Rules* Rules provide conditional logic on semantic constructs outside application logic Rules can be updated or tweaked without requiring an application update. Easily shared and managed Provides for more complex mapping than ontology constructs 19 *Many rule systems exist, this slide uses the Semantic Web Rule Language (SWRL) ex:Instrument(?x)  pmlp:Sensor(?x) pmlp:Information(?x) ^ pmlp:hasURL(?x,?url) ^ swrlb:endsWith(?url, ”.hsh.fts ”)  Ex:CHIPIntensityImage(?x)

Querying/Interrogat ing the Knowledge Base Back to the use case: What calibrations have been applied to this image? We construct a query returns any individuals with type Calibration used as the InferenceRule in the justification from any artifact the current artifact was derived from. We assume that any calibration applied to an artifact the current artifact was derived from can also be considered as ‘applied’ to the current artifact, and that the wasDerivedFrom property is transitive 20 #Image #_A2 #Intermediate2 #_A1 #Intermediate1 #_A0 #RawImage #_A0 wasDerivedFrom #Angle of Incidence Calibration #Angle of Incidence Calibration #Flat Field Calibration #Flat Field Calibration rdf:type hasInferenceRule

Final Remarks/Discussion Provenance concepts describe how domain concepts are related Domain and provenance models should be independent, but aligned Aligning with a well-supported provenance model can enhance interoperability and tool support Aligned knowledge base supports complex multi-domain query and search 21

Links PML: OPM: SWRL: Inference Web Probe-It! SPCDIS MDSA Contacts: – – – 22

Acknowledgements NASA/GSFC – Gregory Lepkotuh – Chris Lynnes UTEP/CyberSHARE – Paulo Pinheiro da Silva – Nicholas del Rio RPI – Patrick West 23