1 Foundations VI: Provenance Deborah McGuinness and Peter Fox CSCI-6962-01 Week 12, November 30, 2009.

Slides:



Advertisements
Similar presentations
Geoinformatics 2008 Fox Semantic Provenance 1 Semantic Provenance for Image Data Processing Peter Fox (HAO/ESSL/NCAR) Deborah McGuinness (RPI) Jose Garcia,
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
April 24, 2007McGuinness NIST Interoperability Week Ontology Summit Semantic Web Perspective Deborah L. McGuinness Acting Director & Senior Research Scientist.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
K S L W i n e A g e n t : Testbed Application for Semantic Web Technologies Deborah McGuinness Eric Hsu Jessica Jenkins Rob McCool Sheila McIlraith Paulo.
Explanation in GILA 2 Stanford -> RPI McGuinness, Ding January 15, 2008.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Presenting Provenance Based on User Roles Experiences with a Solar Physics Data Ingest System Patrick West, James Michaelis, Peter Fox, Stephan Zednik,
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
Provenance-aware faceted search Peter Fox Stephan Zednik Patrick West Tetherless World Constellation, RPI EGU 2010.
Explaining Preference Learning Alyssa Glass CS229 Final Project Computer Science Department, Stanford University  Augment PLIANT to gather additional.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Knowledge Provenance in Semantic Wikis Li Ding, Jie Bao, and Deborah McGuinness Tetherless World Constellation, Rensselaer Polytechnic Institute Troy,
Explanation and Trust for Adaptive Systems Alyssa Glass (Stanford / SRI / Willow Garage) In collaboration with Deborah McGuinness (Stanford/RPI), Michael.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
January, 23, 2006 Ilkay Altintas
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
March 26, 2007 McGuinness et al Explaining Task Processing in Cognitive Assistants that Learn Deborah McGuinness 1, Alyssa Glass 1,2, Michael Wolverton.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
1 Class Exercise I: Use Cases Deborah McGuinness and Peter Fox (NCAR) CSCI Week 4 (part II), 2008.
Investigations into Trust for Collaborative Information Repositories: A Wikipedia Case Study Deborah L. McGuinnessDeborah L. McGuinness, Co-Director Knowledge.
Fox OOS meeting 1 Ontologies and Semantic Applications in Earth Sciences Peter Fox (TWC/RPI; formerly HAO/NCAR) Thanks to many. Projects funded.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Provenance Metadata for Shared Product Model Databases Etiel Petrinja, Vlado Stankovski & Žiga Turk University of Ljubljana Faculty of Civil and Geodetic.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Explanation: The Next Phase in Question Answering Deborah L. McGuinness Knowledge Systems Laboratory Stanford University
Understanding PML Paulo Pinheiro da Silva. PML PML is a provenance language (a language used to encode provenance knowledge) that has been proudly derived.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI Week 7, October 18, 2010.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI Week 10, November.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
1 Advanced Semantic Technologies Prof. Deborah McGuinness and Dr. Patrice Seyed CSCI CSCI ITWS ITWS TA: Justin.
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
Inference Web: Portable and Sharable Proofs for Hybrid Systems Deborah L. McGuinness, Paulo Pinheiro da Silva and Bill MacCartney with Richard Fikes, Gleb.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
1 Advanced Semantic Technologies Prof. Deborah McGuinness and Dr. Patrice Seyed CSCI CSCI ITWS ITWS TA: Justin.
Introduction to Tetherless World RPI by Jie Bao Slides will be available from:
1 Semantic Provenance and Integration Peter Fox and Deborah L. McGuinness Joint work with Stephan Zednick, Patrick West, Li Ding, Cynthia Chang, … Tetherless.
Explanation and Trust for Adaptive Agents Alyssa Glass Joint work with Deborah McGuinness (Stanford), Michael Wolverton (SRI), and.
The VIRTUAL SOLAR-TERRESTRIAL OBSERVATORY - Exploring paradigms for interdisciplinary data-driven science Peter Fox 1 Don Middleton 2,
123 Jiao Tao 1, Li Ding 2, Deborah L. McGuinness 3 Tetherless World Constellation Rensselaer Polytechnic Institute Troy, NY, USA 1 PhD Student 2 Postdoctoral.
Enabling Explanations: The Inference Web and PML Approach Deborah McGuinness, Paulo Pinheiro da Silva, Li Ding Knowledge Systems Laboratory Stanford University.
Realities in Science Data and Information - Let's go for translucency AGU FM10 IN13B-02 Peter Fox (RPI) Tetherless World.
1 Understanding Cataloging with DLESE Metadata Karon Kelly Katy Ginger Holly Devaul
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
Explainable Adaptive Assistants Deborah L. McGuinness, Tetherless World Constellation, RPI Alyssa Glass, Stanford University Michael Wolverton, SRI International.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Explainable Adaptive Assistants Deborah L. McGuinness, Tetherless World Constellation, RPI Alyssa Glass, Stanford University Michael Wolverton, SRI International.
Approach to building ontologies A high-level view Chris Wroe.
Explanation Infrastructure Supporting Transparency and Accountability Deborah L. McGuinness Co-Director and Senior Research Scientist Knowledge Systems,
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
Annotating and Embedding Provenance in Science Data Repositories to Enable Next Generation Science Applications Deborah L. McGuinness.
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 12: Artificial Intelligence and Expert Systems.
Encoding Extraction as Inferences
improve the efficiency, collaborative potential, and
Understanding PML A Proof Markup Language
Foundations VI: Provenance
Presentation transcript:

1 Foundations VI: Provenance Deborah McGuinness and Peter Fox CSCI Week 12, November 30, 2009

References PML -McGuinness, Ding, Pinheiro da Silva, Chang. PML 2: A Modular Explanation Interlingua. AAAI 2007 Workshop on Explanation-aware Computing, Vancouver, Can., 7/07. Stanford Tech report KSL Inference Web - McGuinness and Pinheiro da Silva. Explaining Answers from the Semantic Web: The Inference Web Approach. Web Semantics: Science, Services and Agents on the World Wide Web Special issue: International Semantic Web Conference Edited by K.Sycara and J.Mylopoulis. Volume 1, Issue 4. Journal published Fall, McGuinness, D.L.; Zeng, H.; Pinheiro da Silva, P.; Ding, L.; Narayanan, D.; Bhaowal, M. Investigations into Trust for Collaborative Information Repositories: A Wikipedia Case Study. The Workshop on the Models of Trust for the Web (MTW'06), Edinburgh, Scotland, May 22, More from 2

3 Semantic Web Methodology and Technology Development Process Establish and improve a well-defined methodology vision for Semantic Technology based application development Leverage controlled vocabularies, et c. Use Case Small Team, mixed skills Analysis Adopt Technology Approach Leverage Technology Infrastructure Rapid Prototype Open World: Evolve, Iterate, Redesign, Redeploy Use Tools Science/Expert Review & Iteration Develop model/ ontology Evaluation

4 Ingest/pipelines: problem definition Data is coming in faster, in greater volumes and outstripping our ability to perform adequate quality control Data is being used in new ways and we frequently do not have sufficient information on what happened to the data along the processing stages to determine if it is suitable for a use we did not envision We often fail to capture, represent and propagate manually generated information that need to go with the data flows Each time we develop a new instrument, we develop a new data ingest procedure and collect different metadata and organize it differently. It is then hard to use with previous projects The task of event determination and feature classification is onerous and we don't do it until after we get the data

Fox VSTO et al. 5

6 Who (person or program) added the comments to the science data file for the best vignetted, rectangular polarization brightness image from January, 26, :09UT taken by the ACOS Mark IV polarimeter? What was the cloud cover and atmospheric seeing conditions during the local morning of January 26, 2005 at MLSO? Find all good images on March 21, Why are the quick look images from March 21, 2008, 1900UT missing? Why does this image look bad? Use cases

Fox VSTO et al. 7

8

9 Provenance Origin or source from which something comes, intention for use, who/what generated for, manner of manufacture, history of subsequent owners, sense of place and time of manufacture, production or discovery, documented in detail sufficient to allow reproducibility Knowledge provenance; enrich with ontologies and ontology-aware tools

Semantic Technology Foundations PML – Proof Markup Language – used for knowledge provenance interlingua Inference Web Toolkit – used to manipulate and access knowledge provenance OWL-DL ontologies (including SWEET and VSTO ontologies) PML -McGuinness, Ding, Pinheiro da Silva, Chang. PML 2: A Modular Explanation Interlingua. AAAI 2007 Workshop on Explanation-aware Computing, Vancouver, Can., 7/07. Stanford Tech report KSL Inference Web - McGuinness and Pinheiro da Silva. Explaining Answers from the Semantic Web: The Inference Web Approach. Web Semantics: Science, Services and Agents on the World Wide Web Special issue: International Semantic Web Conference Edited by K.Sycara and J.Mylopoulis. Volume 1, Issue 4. Journal published Fall, 2004

WWW Toolkit Proof Markup Language (PML) Learners JTP/CWM SPARK UIMA IW Explainer/ Abstractor IWBase IWBrowser IWSearch Trust Justification Provenance * KIF/N3 SPARK-L Text Analytics IWTrust provenance registration search engine based publishing Expert friendly Visualization End-user friendly visualization Trust computation OWL-S/BPEL SDS Trace of web service discovery Learning Conclusions Trace of task execution Trace of information extraction Theorem prover/Rules Inference Web Explanation Architecture Semantic Web based infrastructure PML is an explanation interlingua –Represent knowledge provenance (who, where, when…) –Represent justifications and workflow traces across system boundaries Inference Web provides a toolkit for data management and visualization

Global View and More Explanation as a graph Customizable browser options –Proof style –Sentence format –Lens magnitude –Lens width More information –Provenance metadata –Source PML –Proof statistics –Variable bindings –Link to tabulator –… Views of Explanation Explanation (in PML) filteredfocusedglobal abstraction discourse provenance trust

Provenance View Source metadata: name, description, … Source-Usage metadata: which fragment of a source has been used when Views of Explanation Explanation (in PML) filteredfocusedglobal abstraction discourse provenance trust

Trust Tab Fragment colored by trust value Detailed trust explanation Trust View (preliminary) simple trust representation Provides colored (mouseable) view based on trust values Enables sharing and collaborative computation and propagation of trust values Views of Explanation Explanation (in PML) filteredfocusedglobal abstraction discourse provenance trust

Discourse View (Limited) natural language interface Mixed initiative dialogue Exemplified in CALO domain Explains task execution component powered by learned and human generated procedures Views of Explanation Explanation (in PML) filteredfocusedglobal abstraction discourse provenance trust

Selected IW and PML Applications Portable proofs across reasoners: JTP (with temporal and context reasoners (Stanford); CWM (W3C), SNARK(SRI), … Explaining web service composition and discovery (SNRC) Explaining information extraction (more emphasis on provenance – KANI, UIMA) Explaining intelligence analysts’ tools (NIMD/KANI) Explaining tasks processing (SPARK / CALO) Explaining learned procedures (TAILOR, LAPDOG, / CALO) Explaining privacy policy law validation (TAMI) Explaining decision making and machine learning (GILA) Explaining trust in social collaborative networks (TrustTab) Registered knowledge provenance: IW Registrar (Explainable Knowledge Aggregation) Explaining natural science provenance – VSTO, SPCDIS, …

PML1 vs. PML2 PML1 was introduced in 2002 –It has been used in multiple contexts ranging from explaining theorem provers to text analytics to machine learning. –It was specified as a single ontology PML2 improves PML1 by –Adopting a modular design: splitting the original ontology into three pieces: provenance, justification, and trust This improves reusability, particularly for applications that only need certain explanation aspects, such as provenance or trust. –Enhancing explanation vocabulary and structure Adding new concepts, e.g. information Refining explanation structure

PML Provenance Ontology Scope: annotating provenance metadata Highlights –Information –Source Hierarchy –Source Usage

Referencing, Encoding and Annotating a Piece of Information Referencing a piece of information –using URI Encoding the content of information –Complete Quote: (type TonysSpecialty SHELLFISH) –Obtained from URL: web.org/ksl/registry/storage/documents/tonys_fact.kif Annotations –For human consumption: Tonys’ Specialty is ShellFish –For machine consumption Language: Format:

Source Hierarchy Source is the container of information Our source hierarchy offers –Many well-known sources such as Sensor (e.g. geo-science) InferenceEngine (e.g. reasoner) WebService (e.g. workflow) –Finer granularity of source than just document DocumentFragment (for text analytics)

Source Usage –logs the action that accesses a source at a certain dateTime to retrieve information –is part of PML1 Example: Source #ST was accessed on certain date T10:30:00Z

PML Justification Ontology Scope: annotating justification process Highlights –Template for question- answer/justification –Four types of justification

Four Types of Justification Goalconclusion without justification Assumptionconclusion assumed (using Assumption Rule) asserted by an InferenceEngine, no antecedent Direct Assertionconclusion directly asserted (using DirectAssertion rule) by an InferenceEngine, no antecedent Regularconclusion derived from antecedent conclusions

PML Trust Ontology Scope: annotate trust and belief assertions Highlights –Extensible trust representation (user may plug in their quantitative metrics using OWL class inheritance feature) –Has been used to provide a trust tab filter for wikipedia – see McGuinness, Zeng, Pinheiro da Silva, Ding, Narayanan, and Bhaowal. Investigations into Trust for Collaborative Information Repositories: A Wikipedia Case Study. WWW2006 Workshop on the Models of Trust for the Web (MTW'06), Edinburgh, Scotland, May 22, 2006.

25

Fox VSTO et al. 26

Fox VSTO et al. 27 Quick look browse

28

29 Visual browse

30

31

Search and structured query 32 Search Structured Query

Fox VSTO et al. 33 Search

Next week Next class –Architecture and Middleware Questions? 34