Agenda (AM) 9:30-10:15 Introduction to RDA

Slides:



Advertisements
Similar presentations
IATI Technical Advisory Group Technical Proposals Simon Parrish IATI Technical Advisory Group, DIPR March 2010.
Advertisements

An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA Plenary San Diego, March 9, 2015 Gary Berg-Cross, Raphael Ritz, Co-Chairs DFT.
The NSDL Registry Jon Phipps Stuart Sutton Diane Hillmann Ryan Laundry Cornell U. U. of Washington.
Tobias Weigel (DKRZ) Tobias Weigel Deutsches Klimarechenzentrum (DKRZ) Persistent Identifiers Solving a number of problems through a simplistic mechanism.
Presented by DOI Create: TERN as a use-case Siddeswara Guru
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Open Data Protocol * Han Wang 11/30/2012 *
RDA Terminology: Data Management and Data Fabric Prepared for RDA 6 th Plenary Paris, Sept. 23, 2015 Gary Berg-Cross Co-Chair DFT IG, Co-organizing Chair.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
VIVO and Scholarly Repositories: Synergistic Opportunities.
Common Terminology Services 2 CTS 2 Submission Team Status Update HL7 Vocabulary Working Group May 17, 2011.
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
Discussion Issues for IIB Presented by Steve Browdy.
Data Type Registries (DTR) RDA 4th WG/IG Collab Meeting NIST: Dec 2015 Larry Lannom CNRI.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No The Data Type.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
ICSU-WDS & RDA Data Publication Services WG. 2 Linking Research Data and the Literature: why? Why link? 1.Increase visibility & discoverability of research.
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
Approaches to Making Data Citeable Recommendations of the RDA Working Group Andreas Rauber, Ari Asmi, Dieter van Uytvanck Stefan Pröll.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Data Foundations And Terminology (DFT) IG Virtual Meeting July 6 th 2016 Co-Chairs DFT IG :Gary Berg-Cross & Raphael Ritz P8 Sessions DFT IG Breakout Session.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Bringing visibility to food security data results: harvests of PRAGMA and RDA Quan (Gabriel) Zhou, Venice Juanillas Ramil Mauleon, Jason Haga, Inna Kouper,
Evaluating Barriers to Output Adoption in the Digital Humanities Lindsay Poirier RDA Data Share Fellow, Co-Chair Empirical Humanities Metadata WG Plenary.
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Intentions and Goals Comparison of core documents from DFIG and Publishing Workflow IG show that there is much overlap despite different starting points.
Data Type Registries #2 Co-Chairs: RDA Chairs’ Mtg Gothenburg
Workshop on Brokering in Data Fabrics - community perspectives -
RDA WG on Dynamic Data Citation
Sabri Kızanlık Ural Emekçi
Making Sense of the Alphabet Soup of Standards
WG Research Data Collections RDA P10 Montréal – September 2017
Data Type Registries #2 12 Month Status Larry Lannom, Tobias Weigel Date Location TBD? CC BY-SA 4.0.
Data Ingestion in ENES and collaboration with RDA
Data Type Registries Breakout
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Accessing a national digital library: an architecture for the UK DNER
RDA Plenary 9 Breakout Session
PID centric fabric constructed piece by piece
An Architecture for Complex Objects and their Relationships
Introducing the Publishing Data Services WG
The Re3gistry software and the INSPIRE Registry
Metadata for research outputs management Part 2
Relevance of RDA Outputs in the Humanities
Metadata for research outputs management
New input for CEOS Persistent Identifier Best Practices
Brief WG/IG reporting Tobias Weigel on behalf of co-chairs
From Observational Data to Information (OD2I IG )
WG Research Data Collections Draft outputs of a RDA bottom-up effort P9 - April 2017 Co-chairs: Bridget Almas, Frederik Baumgardt, Tobias Weigel, Thomas.
WG Research Data Collections An overview of the recommendation
Using the RDA Collections API to Shape Humanities Data
Tech introduction.
Data types and persistent identifiers in
Metadata in Digital Preservation: Setting the Scene
LOD reference architecture
Introduction to the MIABIS SOP Working Group
The Anatomy and The Physiology of the Grid
Bird of Feather Session
Leveraging PIDs for object management in data infrastructures RDA UK Node Workshop, July Tobias Weigel (DKRZ)
QoS Metadata Status 106th OGC Technical Committee Orléans, France
1st Call for Collaboration Projects
Presentation transcript:

Agenda (AM) 9:30-10:15 Introduction to RDA 10:15-10:30 Participant Feedback Exercise 10:30-10:50 Coffee Break 10:50-11:50 RDA Output Deep Dive 11:50-1:00 Participant Introductions 1:00-2:30 Lunch

Agenda (PM) 2:30-3:15 Use Case Walk Through - Applying an output 3:15-4:00 Discussion What are barriers to participation and adoption? What role will/should RDA play in the future of the humanities?

Adopting RDA Outputs in the Humanities Bridget Almas RDA/ADHO Workshop DH2016 Krakow, July 12, 2016 @BridgetAlmas

All of us are... Creating data of various types and uses Text, Images, XML, RDF, JSON, JSON-LD, Tabular, ... Manuscripts, Bibliographic, Prosopographic, Geographic, ... Assigning stable (if not “persistent”) identifiers to our data URNs, URLs, DOIs/Handles, ARKs,... Reusing data from other sources @BridgetAlmas

Most or many of us... Want others to be able to reuse our data Want our data to be machine-actionable Have copyright/access requirements for our data @BridgetAlmas

Very few of us... Publish formal, machine-actionable descriptions of our data Use automated systems for assignment of identifiers to data Have standard practices which we apply across projects Use identifiers for our data which guarantee persistence, advertise what they are capable of, and can be reliably resolved by anyone anywhere @BridgetAlmas

How do we make data sharing a reality? A first step is to publish data with stable identifiers, but it isn’t enough. We need to know What sort of data does it identify? How can we get it in a format that we can process? What is its provenance and how to cite it? Are there newer/older versions of it? Is it a part of a collection? … ? Right now there is no consistent way to get answers to these questions across different projects, providers, domains We’re all figuring it out and building ad-hoc solutions that work in some cases for some data and not others @BridgetAlmas

RDA Data Fabric + Tools, Services, Manual processes,... Diagram Source: Peter Wittenberg @BridgetAlmas

RDA DTR, PIT and Collections Data Types Registry: provides a recommendation for formalizing, registering and communicating definitions of machine-actionable data types PID Types: provides a recommendation for standard approach to coupling metadata with persistent identifiers to enable services that support discovery, access, verification of integrity and authenticity and a variety of other use cases. Collections WG: will provide recommendations for common collection models and an multidisciplinary API for building, sharing and expanding collections of data objects @BridgetAlmas

RDA Output: Data Types Registry Defines “Data Types” as characterizations of data at any level of granularity which are identified, defined and registered Proposes a Data Model and JSON Schema Defines an API for Creating, Reading, Updating, Deleting and Querying Data Type Records Defines Requirements for Registry Implementation and Federation @BridgetAlmas

RDA Output: DTR Proposed Data Model Identifier Type Name Human Readable Description Provenance (including contributors/source, creation date,modification date) Related Standards and Recommendations Expected Uses Representations and Semantics Properties Specific to this Type Relationships to Other Types @BridgetAlmas

RDA Output: Data Type Registries

RDA Output: PID Types Provides: a conceptual model for a PID record An API for Creating, Reading and Querying PID records Can work on top of existing PID systems in a brokering model, and/or be provided directly by the PID system Depends upon the Data Types Registry @BridgetAlmas

RDA Output: PID Types Source: dx.doi.org/10.15497/FDAA09D5-5ED0-403D-B97A-2675E1EBE786 @BridgetAlmas

PID Record Consists of a number of properties Each property itself has a value and bears a PID, pointing to a property definition with a name and range A PID record type is a specific aggregation of properties, mandatory and optional A PID record profile is a specific aggregation of types, mandatory and optional All properties, types, and profiles have PIDs and are registered in the Data Types Registry Source: dx.doi.org/10.15497/FDAA09D5-5ED0-403D-B97A-2675E1EBE786 @BridgetAlmas

PID Record properties for a CTS URN Type? urn:cts:greekLit:tlg0012.tlg001.perseus-grc2 Property ID (Property Name) Property Value 11314.2/31810b2c24913929bb5e0d4d949de9f7 License CC-BY-SA 11314.2/467d9ba30e2d9879fd9d483f319e462c Predecessor identifier urn:cts:greekLit:tlg0012.tlg001.perseus-grc1 11314.2/5546b0166091d9ae869f081f5548f3fc Repository of Record http://data.perseus.org …. CTS API Endpoint http://cts.perseids.org/api ...

PID Record properties for a LOD/URL Record Type? https://pleiades.stoa.org/places/530809 Property ID (Property Name) Property Value 11314.2/31810b2c24913929bb5e0d4d949de9f7 License CC-BY ... Available Formats JSON,CSV,HTML,RDF,KML …. Format Specifier HTTP Header Accepts @BridgetAlmas

RDA WIP: Collections WG Formalization of Collections Models API for Create/Read/Update/List/Query operations on Collections Use cases include virtual, local and mixed collections, collections with open and access protected data, heterogeneous and homogenous data types,... Operations will include basic CRUD/L, but also query and set operations Builds upon the PID Types and DTR components Collections will be identified by Data Types and have typed Capabilities Must be implementable by existing collection solutions @BridgetAlmas

RDA WIP: Collections WG (Modeling proposal) Diagram Source: Tobias Weigel, DKRZ

Simple (Re)Use Case A service wants to analyze data referenced in scholarly publications by PID, such as a text passage referenced by CTS URN and a place identified by Gazetteer URL. A PID Types broker service provides the PIT API. CTS text and Gazetteer data providers register their URNs and URLs with the PID Types broker service (via HTTP calls to the PTI API). The analysis service can query the PIT broker to find out if the PIDs in a publication are registered, retrieve properties that tell it where to resolve the URN to the text, the formats available for the Gazetteer URL and how to specify them to get data it can use. The underlying data is then available for reuse by the service. @BridgetAlmas

More Complex Data Management Use Case @BridgetAlmas

Our data types @BridgetAlmas Text (Structured, unstructured, digitized books) Persistent identifiers Bibliographic Geographic/Map Tiles Prosopographic Ethnographic/Fieldwork (Traditional and virtual) Museum data Images Historical attributes, relationships Text alignments Treebanks …. @BridgetAlmas

Our unmet infrastructure needs Institutional service for assigning persistent, nationally or internationally recognized identifiers for our digital publications and datasets Data curation systems (that are) integrated with the active research phase Authentication services Tools for converting data outputs from different sources and formats Data visualization services Data mining tools and services Data storage services Pre-made secure endpoints for managing ontological models Narratives about how to choose an appropriate tool and how to get started with research data Storage for datasets during the course of my research, as opposed to finalised datasets. Services for hosting URI-based gazetteers of specific regions, periods, etc Registry for hosting data about collections ... @BridgetAlmas

What’s next? Do the RDA outputs provide value and a means to begin addressing some of our unmet needs? If not, why? If so, what do we need to do start taking advantage of them? Identify our core Data Types (primitives and derived types) Identify our core PID record types/profiles Evaluate their use with a test DTR and PIT API Begin the work of implementing @BridgetAlmas