DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.

Slides:



Advertisements
Similar presentations
A Unified Approach to Combat Counterfeiting: Use of the Digital Object Architecture and ITU-T Recommendation X.1255 Robert E. Kahn President & CEO CNRI,
Advertisements

Health Ingenuity Exchange (HingX) Best Practices for User Groups and Resource Registration.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Welcome to the Conference !! Juan Bicarregui Chair, APA Executive.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
ONC Standards and Interoperability Framework Use Case Simplification Key Steps Forward 3 November 2011.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
The Software Product Life Cycle. Views of the Software Product Life Cycle  Management  Software engineering  Engineering design  Architectural design.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA Plenary San Diego, March 9, 2015 Gary Berg-Cross, Raphael Ritz, Co-Chairs DFT.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
Tobias Weigel (DKRZ) Tobias Weigel Deutsches Klimarechenzentrum (DKRZ) Persistent Identifiers Solving a number of problems through a simplistic mechanism.
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA Plenary San Diego, March 9, 2015 Gary Berg-Cross, Raphael Ritz, Co-Chairs DFT.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
NIST Special Publication Revision 1
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA 6 th Plenary Paris, Sept. 25, 2015 Gary Berg-Cross, Raphael Ritz Co-Chairs.
RDA’s Recently Endorsed Outputs September 16, 2015.
SOFTWARE DESIGN AND ARCHITECTURE LECTURE 09. Review Introduction to architectural styles Distributed architectures – Client Server Architecture – Multi-tier.
© 2008 IBM Corporation ® IBM Cognos Business Viewpoint Miguel Garcia - Solutions Architect.
Data Fabric IG Introduction. 2  about 50 interviews & about 75 community interactions  Data Management and Processing is too time consuming and costly.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
CLARIN work packages. Conference Place yyyy-mm-dd
Working Group Practical Policy based on slides and latest documents from the PP WG chaired by Reagan Moore, Rainer Stotzka presented by Johannes Reetz.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
EPA Enterprise Data Architecture Metadata Framework Assessment Kevin J. Kirby, Enterprise Data Architect EPA Enterprise Architecture Team
RDA Data Foundation and Terminology (DFT) WG: Overview  Prepared for Collab Chairs Meeting, NIST, Nov 13-14, 2014  Gary Berg-Cross, Raphael Ritz, Peter.
Hydro DWG at the RDA Plenary: BoF and Aligning HDWG work with WMO expectations and timeline Sylvain, Tony, Silvano, Ilya.
Health eDecisions Use Case 2: CDS Guidance Service Strawman of Core Concepts Use Case 2 1.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No The pan-European.
SDMX IT Tools Introduction
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Hydro DWG at the RDA Plenary BoF - Improve sharing of water resource data globally 24 September BREAKOUT :30-15:00.
The Importance of Standards in Digital Preservation Tina Norris Kayla Payne Jennifer
Discussion of Data Fabric Terms & Preparation for RDA P7 Virtual Meeting Monday, January 25, 2016 Organized by Gary Berg-Cross (DFT-IG) and Peter Wittenburg.
Data Foundation IG DF Organizing Chairs: Gary Berg-Cross & Peter Wittenburg.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Preservation e-Infrastructure IG Description: help ensure preservation of needed data succeeds Goals: foster worldwide collaboration; ensure consistency.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
GEO ADC Co-Chair meeting USGS, Reston, Dec Slide # 1 AR Interface implementations for GEOSS George Percivall
Data Fabric IG From Testing to Recommendations Beth Plale.
Draft Data Foundation and Terminology (DFT) Vocabulary Development Process Prepared for WG-Core meeting 24/25.2 Munich/Garching Gary Berg-Cross Co-Chair.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Research Data Repository Interoperability Thomas Jejkal.
Data Foundations And Terminology (DFT) IG Virtual Meeting July 6 th 2016 Co-Chairs DFT IG :Gary Berg-Cross & Raphael Ritz P8 Sessions DFT IG Breakout Session.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
1 The XMSF Profile Overlay to the FEDEP Dr. Katherine L. Morse, SAIC Mr. Robert Lutz, JHU APL
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Intentions and Goals Comparison of core documents from DFIG and Publishing Workflow IG show that there is much overlap despite different starting points.
Update from the Faster Payments Task Force
RDA Data Foundation and Terminology (DFT) WG
RDA Data Fabric (DF) Interest Group Peter Wittenburg & Gary Berg-Cross
DataNet Collaboration
Donatella Castelli CNR-ISTI
Data Type Registries #2 12 Month Status Larry Lannom, Tobias Weigel Date Location TBD? CC BY-SA 4.0.
Data Type Registries Breakout
Data Foundation and Terminology (DFT) Vocabulary Development Session
Agenda Welcome and overview (Peter)
C2CAMP (A Working Title)
Alignment of Part 4B with ISAE 3000
Health Ingenuity Exchange - HingX
From Observational Data to Information (OD2I IG )
WG Research Data Collections Draft outputs of a RDA bottom-up effort P9 - April 2017 Co-chairs: Bridget Almas, Frederik Baumgardt, Tobias Weigel, Thomas.
IS-ENES Cases Seven use cases are listed as data lifecycle steps A B C
HingX Project Overview
Bird of Feather Session
Leveraging PIDs for object management in data infrastructures RDA UK Node Workshop, July Tobias Weigel (DKRZ)
Presentation transcript:

DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders to better synchronize conceptualization To enable better understanding within and between communities and finally To stimulate tool building, such as for data services, supportive of the basic model's use OUTCOMES "Model overview" document summarizing about 20 models for data organization in use today "Model analysis & synthesis" document including an abstract data organization model derived from the above A "core set of terms and definitions" describing the abstract data model (snapshot of the ongoing discussion) Use Cases illustrating the applicability of the terminology to real-world situations TeD-T: Term Definition Tool at enabling and documenting the evolution and discussion of the terminology WG Chairs: Peter Wittenburg, Gary Berg-Cross, Raphael Ritz | Over 40 members of the WG. More information : Reaping the Fruits, Sept ‘14 The DFT WG task is to describe a basic, abstract data organization model which can be used to derive a reference data terminology.

DATA TYPE REGISTRIES WG 4 th Plenary Update THE PLUM Data sharing requires that data can be parsed, understood, and reused by people and applications other than those that created the data. The Data Type Registries Working Group is evaluating approaches based on use cases to define and allow registration of data types and subsequently enable resolution of those data types for consumption by humans and computer applications. Data types may be thought of as, for instance, a flexible extension to the notion of MIME types. GOALS To build a Data Type Registry that enables:  data producers and data managers to easily and consistently describe data structures, assumptions, and usage conventions in the form of data type records.  data consumers, including both humans and automated systems, to process and understand data with the help of registered data types. OUTCOMES  Design, implementation and release an open source version of the Registry software by CNRI. A few prototypes have been released to date (  Multiple groups including the EUDAT project in Europe, NIST in the US, and the International DOI Foundation to put significant efforts into exercising the prototype or a variation of it.  The Registry designed as low-level infrastructure with wide applicability.  Potential establishment of multiple type registries. The current prototype is stand-alone, but the basic design allows for federation of distributed and separately administered registries.  Contribution to an evolving infrastructure connecting various RDA WG outcomes, e.g., it is a key component in the PID Information Types WG, as will be discussed in the Data Fabric IG. The working group, consisting of around 50 members, is nearing the end of its approved RDA timeline. The Data Type Registry, while it will specifically benefit data consumers, requires significant input from data production and management groups, e.g., those who create and manage specific data structures. Thus, participation from those people and organizations, in the form of data type registration, is needed to propel adoption and integration of the Data Type Registry concept into operational and research environments. The working group, consisting of around 50 members, is nearing the end of its approved RDA timeline. The Data Type Registry, while it will specifically benefit data consumers, requires significant input from data production and management groups, e.g., those who create and manage specific data structures. Thus, participation from those people and organizations, in the form of data type registration, is needed to propel adoption and integration of the Data Type Registry concept into operational and research environments. Reaping the RDA Fruits, September 10, 2015 Use cases include human interpretation of data, machine processing, finding data by type, connecting data to available processing services, characterizing the types of data in a given repository, assigning an existing and well- known type to newly collected data, and evaluating ingest data according to a type specification.

PID INFORMATION TYPES WG Reaping the RDA Fruits, September 10, th Plenary Update THE PLUM It is becoming increasingly necessary to automate tasks associated with research and scholarly communication infrastructures. Identifiers (PIDs) already provide key linkages among the individual entities managed within these infrastructures, but consistent methods of defining and exposing the properties associated with these entities have not yet emerged. The PID Information Types WG has addressed these challenges and produced practical outcomes ready for adoption. Chairs : Tobias Weigel, Tim DiLauro It is becoming increasingly necessary to automate tasks associated with research and scholarly communication infrastructures. Identifiers (PIDs) already provide key linkages among the individual entities managed within these infrastructures, but consistent methods of defining and exposing the properties associated with these entities have not yet emerged. The PID Information Types WG has addressed these challenges and produced practical outcomes ready for adoption. Chairs : Tobias Weigel, Tim DiLauro Goals:  Define information types; how they relate to digital objects and their PIDs and metadata; and how they can be stored, accessed and managed.  Determine how information types can be used in typical e-infrastructure workflows.  Develop the necessary tools to deal with the associated extra information on PIDs: an API and a conceptual framework giving structure to types, while providing scaffolding for future types as they emerge.  Present some first exemplary types that highlight practical usage scenarios for types in data management OUTCOMES The PIT WG has developed essential core types for information associated with PIDs, an accompanying conceptual architecture to organize these types and an application programming interface (API) to access and manage them.  The API is agnostic towards particular use cases; a higher level service, for example, might employ the API to determine whether it can provide its services for a given entity.  Providing such capabilities through the PID infrastructure would enable more performant, more comprehensive, and more manageable services, while reducing the overall cost of providing them. The outcomes support the future development of a coherent data fabric, in which PIDs can form an elemental access and management interface for digital objects independent from particular PID systems. PID Information Types aggregate essential state information for data objects and data collections and the harmonization effort will help to streamline development towards common interfaces. Using our results The PIT WG encourages RDA members interested in adopting the outputs and in future development to collaborate with PIT and its members. The outcomes are particularly relevant to PID providers and institutions providing e-infrastructure, archiving, and citation services.

PRACTICAL POLICY WG Reaping the RDA Fruits, September 10, th Plenary Update THE PLUM Computer actionable policies are used to enforce management, automate administrative tasks, validate assessment criteria, and automate scientific analyses. The Practical Policy Working Group assembles a collection of production and research policies, analyze the submitted policies to identify best practices, and promote the formation of policy-based data management systems. GOALS :  To bring together practitioners in policy making and policy implementation  To identify typical application scenarios for policies such as replication, preservation etc.  To collect and to register practical policies  To enable sharing, revising, adapting, and re-using of computer actionable policies OUTCOMES  a survey identified eleven generic policy areas of interest for the majority of the institutions and in common to almost all data management systems  templates for the production policies to be reported in the “Outcomes Policy Templates: Practical Policy Working Group, September 2014” document  a collection of example policies, their description (in English) to be included in the document “Implementations: Practical Policy Working Group, September 2014” Who ? Adopters and reviewers of the policy sets:  practitioners in policy making and policy implementation  data centers  communities building up data collections Who ? Adopters and reviewers of the policy sets:  practitioners in policy making and policy implementation  data centers  communities building up data collections