Principles of High Quality Documentation for Provenance: A Philosophical Discussion Paul Groth, Simon Miles, Steve Munroe University of Southampton.

Presentation transcript:


Where’s the shirt from?
From Calvin Klein.
Heh! That’s not true. I gave that to you as a gift, and I bought it myself at Walmart for $19.99.

Some Background
- We consider software systems.
- Software components are called actors.
- Actors document “what they do”.
- Users can query this documentation (process documentation) to answer questions about the provenance of data items.


Why Principles for Provenance?
- Massively distributed applications
- Data lasts a long time
- Users may be different from those expected
What are the fundamental expectations that users can have about documentation when asking provenance questions?

What historians want
- Why historians? They ask a lot of provenance questions.
- Good sources (M. Stanford):
  – Sources are documentation or artifacts.
  – They are accurate: they represent what actually occurred in the world.
  – They have authors: the historian knows who was responsible for the information source, which makes the source easier to understand.

Recording source material
- Our systems should be good sources for historians and users.
- To ensure that our systems produce documentation that is accurate and has authorship, we introduce two principles:
  – Factuality
  – Attribution

Factuality
- As part of documentation of the past, actors must record only information that they can verify to be true, where truth is defined by the correspondence theory of truth.
- Simply put, actors should record what they observe, not what they guess or infer.

Attribution
- Each statement making up documentation of the past for a computer system must be attributable to a particular actor.
- Attribution allows users to interpret and judge documentation.
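The attribution principle can be illustrated with a minimal Python sketch. It is not the p-structure itself: the `Assertion` class, its fields, and the `record` and `statements_by` helpers are hypothetical names chosen for illustration. The only point shown is that a statement with no responsible actor is never stored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Assertion:
    """One statement in the documentation of the past (hypothetical shape)."""
    asserter: str  # identity of the actor responsible for the statement
    content: str   # what the actor reports having observed


def record(store: list, asserter: str, content: str) -> Assertion:
    """Append a statement to the store, rejecting any without an asserter."""
    if not asserter.strip():
        raise ValueError("every statement must be attributable to an actor")
    assertion = Assertion(asserter=asserter, content=content)
    store.append(assertion)
    return assertion


def statements_by(store: list, asserter: str) -> list:
    """Return the statements a given actor is responsible for."""
    return [a for a in store if a.asserter == asserter]
```

Because every stored statement names its asserter, a user can filter the documentation by actor and decide how much weight to give each source.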

High Quality Documentation
- High quality documentation is documentation that is factual and attributable.
- It provides a firm basis for answering provenance questions.
- Users know what to expect from such documentation.

Enforcement
- Our data model (the p-structure) for documentation specifically supports these principles.
- The protocol for recording process documentation (PReP) enforces the principles.
  – Attribution is supported through a requirement for signatures.
  – Types provide a framework that supports factuality.
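How a signature requirement can back attribution is sketched below in Python. This is not PReP: the slides do not specify the signature scheme, so an HMAC from the standard library stands in for what would more plausibly be a public-key signature, and the `sign` and `accept` functions and the `keys` registry are assumptions made for illustration.

```python
import hashlib
import hmac


def sign(actor_key: bytes, asserter: str, content: str) -> str:
    """Compute a keyed digest binding the content to the named asserter."""
    message = f"{asserter}\n{content}".encode("utf-8")
    return hmac.new(actor_key, message, hashlib.sha256).hexdigest()


def accept(keys: dict, asserter: str, content: str, signature: str) -> bool:
    """Accept a statement only if its signature matches the asserter's key."""
    key = keys.get(asserter)
    if key is None:
        return False  # unknown actor: the statement cannot be attributed
    expected = sign(key, asserter, content)
    return hmac.compare_digest(expected, signature)
```

A store that calls `accept` before recording would refuse statements from unknown actors and statements whose content was altered after signing, so every retained statement stays attributable.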

Conclusion
- The systems we have designed enforce the principles presented here.
- It is important to state to the user community the underlying principles that systems use to generate documentation of the past.
- Users then know how best to interpret and use the documentation to answer their provenance questions.

What do you think?
- Are these the right principles?
- Are there other principles?
- Is this useful at all?
Paul Groth