Provenance of scientific information as experienced in DRIVER 6th e-Infrastructure Concertation Event Lyon, 24 th November 2008 Wolfram Horstmann Bielefeld.

Slides:



Advertisements
Similar presentations
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Advertisements

Enhanced Publications Presentation for ODaF Europe 2009 Thomas Place 2 April 2009.
International Technology Alliance In Network & Information Sciences International Technology Alliance In Network & Information Sciences Paul Smart, Ali.
GUID-1 Workshop Welcome and Introduction Donald Hobern GBIF Program Officer for Data Access and Database Interoperability February 2006.
E-Infrastructures as Standardisation Drivers DATA TRACK Chair : Krystyna Marek Rapporteur: Wolfram Horstmann 5th e-Infrastructure Concertation Barcelona.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
Future Access to the Scientific and Cultural Heritage – A shared Responsibility Birte Christensen-Dalsgaard State and University Library.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Introducing Symposia : “ The digital repository that thinks like a librarian”
A centre of expertise in digital information management UKOLN is supported by: Signed metadata : method and application International Conference.
“provenance” DATA TRACK Chair : Krystyna Marek Rapporteur: Wolfram Horstmann 6th e-Infrastructure Concertation Lyon 24 Nov 2008.
A centre of expertise in digital information management UKOLN is supported by: Signed metadata : method and application International Conference.
19-NOV-2007 ERIH -- DRIVER ERIH & Large scale content networks Perspectives from DRIVER Wolfram Horstmann.
David Tarrant University of Southampton Applying Open Storage to Institutional Repositories.
Interoperability Scenario Producing summary versions of compound multimedia historical documents.
The OAI-ORE based data model of Europeana and the Digital Public Library of America: implications for educational publishing Dov Winer MAKASH – Advancing.
Logics for Data and Knowledge Representation
DDI-RDF Discovery Vocabulary A Metadata Vocabulary for Documenting Research and Survey Data Linked Data on the Web (LDOW 2013) Thomas Bosch.
A centre of expertise in digital information management UKOLN is supported by: Monica Duke Project.
A Perspective on Preservation of Linked Data Richard Cyganiak DERI, NUI Galway.
Metadata Modularization Concepts and Tools Carl Lagoze CS
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Ontology Repositories: Discussions and Perspectives Mathieu d’Aquin KMi, the Open University, UK
® GRDC Hydrologic Metadata - core concepts - 5 th, WMO/OGC Hydrology DWG New York, CCNY, August 11 – 15, 2014 Irina Dornblut, GRDC of WMO at BfG Copyright.
Scientific Data and Electronic Publishing Renze Brandsma, Head, Digital Production Centre University of Amsterdam Maarten Hoogerwerf, Project Manager,
Software Sustainability Institute Software Attribution can we improve the reusability and sustainability of scientific software?
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Scientific Data Management - From the Lab to the Web Semantic Data Management Dagstuhl Seminar April 2012 José Manuel Gómez Pérez, iSOCO
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Metadata and Technology/Architecture Working Groups DLF Aquifer Project DLF Fall Forum Providence, RI November 14, 2008.
ALA Metadata - Goals and Issues Donald Hobern, Director, Atlas of Living Australia 29 August 2008.
EUROPEANA DATA MODEL, short-term plans EDM worskhop 2015 Netherlands, Public Domain , Rijksmuseum Anonymous Arrival of a Portuguese ship.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
EDM Europeana Data Model Guus Schreiber with input from Carlo Meghini, Antoine Isaac, Stefan Gradmann, Maxx Dekkers et al. from Europeana V1.
National Library of Finland Strategic, Systematic and Holistic Approach in Digitisation Cultural unity and diversity of the Baltic Sea Region – common.
Publishing & Citing Research Data Arun Prakash. Agenda  Introduction  Why is Data publishing important ?  Ongoing Work  Role of Semantics.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
A centre of expertise in digital information management Shaping the e-future? Grids, Web Services and Digital Libraries Professor Tony.
RESEARCH METHODS IN TOURISM Nicos Rodosthenous PhD 07/03/ /3/2013Dr Nicos Rodosthenous1.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
Metadata-based Discovery: Experience in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK A centre of.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
ICSU-WDS & RDA Data Publication Services WG. 2 Linking Research Data and the Literature: why? Why link? 1.Increase visibility & discoverability of research.
Metayogi Increasing the Accessibility of the Semantic Web Karim Tharani Doug Macdonald Rachel Heidecker.
Course on persistent identifiers, Madrid (Spain) Information architecture and the benefits of persistent identifiers Greg Riccardi Director Institute for.
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Scientific Reproducibility using the Provenance for Healthcare and Clinical Research Framework Satya S. Sahoo Collaborators/Co-Authors: Joshua Valdez,
Middleware independent Information Service
Repository Software - Standards
Wheat Data Interoperability Esther DZALE YEUMO KABORE Richard FULSS
Outline Pursue Interoperability: Digital Libraries
OpenAIRE Services for Open Science
NSDL Data Repository (NDR)
Session 2: Metadata and Catalogues
Malte Dreyer – Matthias Razum
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
Presentation transcript:

Provenance of scientific information as experienced in DRIVER 6th e-Infrastructure Concertation Event Lyon, 24 th November 2008 Wolfram Horstmann Bielefeld University / DRIVER

Notions of Provenance Where do data objects* originate from? –Scientific Work -- examples Instrumentation techniques –Manufacturers of hard- and software Methodologies –Processes, e.g. gene sequencing –Technical/Local -- examples (web)-identifiers Database, repository name * Primary data, documents, metadata …

Why Provenance? Quoting / Citing / Referencing as global scientific principle –„Reproducible research“ Giving credits to authors / creators in distributed environments Original location / context has to be known Experienced in Grid-Environments [1]

Provenance & Interoperability Re-Use / Sharing: “Addressing/Accessing” –Common view, common use –Unidirectional: No change of data objects! Federation: “Discovering in Context” –Remote representation of distributed DOs Aggregation: “Contextualizing” –Add unchanged object in a context Processing/Annotation: “Changing” –Uni- vs. Bidirectional: Change of DOs and remote representation vs. back-storage (e.g. CVS)

Scenarios in DRIVER

Digital Scientific Data

Digital Object Collections ⊃ ⊃ ⊃⊃

Digital Object Repositories =

Digital Information Space

Conventional Web Data

„Simple“ Applications

Metadata Infrastructure

Basic Provenance Settings Indicate Production Situation –Metadata Author, Instrumentation etc. Remote Representation –Indicate place of origin in remote systems Metadata as digital objects / first order citizens –Allow lineage respresentation Credits in remote environments / versioning

Orders of Provenance 1st order: Metadata –Provenance attached to data –Minimal „knowledge“ required in application –Allow remote handling of data objects –Require metadata infrastructure –Metadata introduce 2 objects: requires linkage 2nd order: context / compounds –Express multiple relations between objects –May introduce semantic model

Provenance in DRIVER #1 Simple Objects: OAI-PMH [2] –1st order provenance Metadata: minimum OAI-DC –2nd order provenance DRIVER explicit identifiers for repositories OAI-PMH: inline representation („about“)

Semantic/Compound Data

„Semantic“ Applications

Provenance in DRIVER #2 „Enhanced Publications“ –Research project in DRIVER-II –Representation of data /document packages –Use of OAI-ORE

Provenance in OAI-ORE OAI-ORE: Object Re-Use and Exchange [4] –Uses Resource Maps < Named Graphs –Uses „lineage“ to represent expl. Provenance –Future: explicit provenance model [7] ?

Summary Provenance essential for … –Indicating origin in distributed data spaces Accessing / Addressing Federation / Aggregation Processing / Annotation –Document and data citation / trace-back –1st order: describing data > metadata –2nd order: describing context > semantic data

Lessons learnt in DRIVER Use web-enabled Identification (URI/UDDI etc.) –„Dark“ databases don‘t interoperate 1st order provenance at place of origin –Requires metadata to describe origin –Enables a metadata infrastructure –Introduces linkage problem 2nd order provenance in contexts –Requires data provider identification in federators / aggregators in order to link back –May require semantic model for context –Would benefit from a semantic infrastructure

Resources [1] On provenance in the eScience / grid-environment – –In GLITE [2] On provenance in OAI-PMH – [3] On provenance OAI-ORE (referred to as ore:lineage) – (general) – (definition) [4] Named Graphs, Provenance and Trust (Caroll et al. ) – [5] W3C: On provenance in RDF – [6] Open Provenance Model – [7] DRIVER: Digital Repository Infrastructure for European Research –