Linked Data Initiatives at NLM

Slides:



Advertisements
Similar presentations
Technical Highlights 25th August 2011 Sebastian Peters German National Library of Science and Technology.
Advertisements

Making and Moving Metadata: Two Library of Congress Initiatives Sally McCallum NDMSO, Library of Congress NISO/BISG Forum - June 22, 2012.
Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
Bibliographic Framework Initiative Approach for MARC Data as Linked Data Sally McCallum Library of Congress.
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
Progress Update Semantic Web, Ontology Integration, and Web Query Seminar Department of Computing David George.
Nancy Fallgren Technical Services Division National Library of Medicine National Institutes of Health U.S. Department of Health and Human Services Presentation.
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
Cambridge University Library RDA - Hugh Taylor, 7 Jan 09 RDA: Past, Present, Future Hugh Taylor CILIP Representative, Joint Steering Committee for Development.
The Librarian Infobutton Tailoring Environment (LITE) James J. Cimino National Institutes of Health and Columbia University.
A Registry for controlled vocabularies at the Library of Congress
Barbara Bushman & Nancy Fallgren Technical Services Division National Library of Medicine National Institutes of Health U.S. Department of Health and Human.
AgriDrupal - a “suite of solutions” for agricultural information management and dissemination, built on the Drupal CMS; - the community of practice around.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Mobile Framework Lorna Schmid, AEI Tim Kern, Fort Collins Science Center.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
NLM-Semantic Medline Data Science Data Publication Commons Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
ISO 9001:2015 Revision overview - General users
Sara Kim, PhD, Director, Associate Professor Instructional Design and Technology Unit, UCLA David Geffen School of Medicine Katherine Wigan, BS, MBA, Senior.
Hydra: future development A Hydra roadmap… Hydra Europe Symposium – Dublin – 7/8 April 2014 Richard Green.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
SPARQL All slides are adapted from the W3C Recommendation SPARQL Query Language for RDF Web link:
Multilingual Issues in the Representation of International Bibliographic Standards for the Semantic Web Gordon Dunsire Independent Consultant; Chair of.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Continuous Improvement Monitoring (CIM) Collaborative Partner Forum Awareness Session June 2015.
Nancy Fallgren Metadata Librarian Cataloging and Metadata Management Section, TSD National Library of Medicine National Institutes of Health U.S. Department.
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
The MMI Tools Carlos Rueda Monterey Bay Aquarium Research Institute OOS Semantic Interoperability Workshop Marine Metadata Interoperability Project Boulder,
Betsy L. Humphreys Betsy L. Humphreys ~ National Library of Medicine National Institutes of.
The Legislative Library of Ontario’s Ontario Documents Repository Road to Partnership.
Taking Action: Linked Data for Digital Library Managers Silvia Southwick and Cory Lampert UNLV Digital Collections American Library Association Annual.
WVU Libraries Intranet  internet | intranet | extranet  library intranet | extranet ideals  plans  issues.
IBIS-Admin New Mexico’s Web-based, Public Health Indicator, Content Management System.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
The Ripple Effect: The Benefits of Focused Resource Initiatives Susan Swogger, MLIS Collections Development Librarian Barbara Rochen.
DukeWeb Enterprise CMS Update for Web Community 2/10/2004 Cheryl Crupi Senior Manager, Duke OIT Office of Web Services.
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
IBIS-Admin New Mexico’s Web-based, Public Health Indicator, Content Management System.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
11 ALCTS RDA Forum American Library Association Annual Conference Anaheim, California, June 23, 2012 U.S. RDA Test Coordinating Committee Update Beacher.
User Working Group 2013 Data Access Mechanisms – Status 12 March 2013
Global Digital Format Registry Progress Andrea Goethals, Harvard University Library NDIIPP Digital Preservation Partners’ Meeting Arlington, VA July 9,
Welcome to the Minnesota SharePoint User Group. Introductions / Overview SharePoint 101 High level overview of SharePoint Differences between SharePoint.
MAC/MLA 2001 Eyes on NLM October 19, 2001 Ocean City, Maryland.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Oregon Spatial Data Library Enhancements BIENNIUM FIT PROPOSAL.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
Prizms for Data Publication and Management Katie Chastain May 9, 2014.
1 Overview of the U.S. RDA Test by Tina Shrader Cataloging Section Head and CONSER Coordinator National Agricultural Library June 28, 2010.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
A Framework for Assessing Needs Across Multiple States, Stakeholders, and Topic Areas Stephanie Wilkerson & Mary Styers REL Appalachia American Evaluation.
CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.
SysML v2 Model Interoperability & Standard API Requirements Axel Reichwein Consultant, Koneksys December 10, 2015.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Linked Data Competency Index
LOINC – SNOMED CT Cooperation on Content
Building A Repository for Digital Objects
Data.gov: Web, Data Web, Social Data Web 7/22/2010 #health2stat.
Development of Assessment Literacy Knowledge Base
News in schedule and agendas for 2015
PREMIS Tools and Services
LOD reference architecture
RDA in a non-MARC environment
Stakeholder Update Building A New Trauma Registry
Presentation transcript:

Linked Data Initiatives at NLM Barbara Bushman & Nancy Fallgren Technical Services Division National Library of Medicine National Institutes of Health U.S. Department of Health and Human Services CNI Membership Meeting December 8-9, 2014

Agenda Background NLM Linked Data Infrastructure Working Group MeSH (Medical Subject Headings) RDF Pilot Next Steps Lessons Learned

Background Replace MARC format with a web-based standard 2009 Working Group on the Future of Bibliographic Control 2011 U.S. RDA Test Coordinating Committee 2012 Bibliographic Framework Initiative 2013 internal report “Linked Data at NLM: Environmental Scan, NLM Data Survey and Next Steps” 3rd party RDF versions of NLM data RDF data published by other national libraries RDF data published by health information organizations

Background Existing NLM Linked Data Initiatives PubChem RDF BIBFRAME MESH RDF Prototype

NLM Linked Data Infrastructure Working Group Broad collaboration across NLM divisions Develop and build infrastructure for transforming, storing and publishing NLM linked data Research best practices in publishing linked data Recommend NLM-wide policies and guidelines for linked data publishing Document guidance for maintaining the established linked data infrastructure Recommend processes for future data linking projects Prioritize NLM datasets for publication as linked data

NLM Linked Data WG Process Shared working environment SharePoint for administrative documentation GitHub private site for development Develop a common level of understanding Review existing linked data initiatives PubChem RDF MeSH RDF prototype

Pilot Project: MeSH RDF Community impact Widely used in the health and medical community Ability to relate many disparate health and medical resources Community interest evidenced by Multiple 3rd party versions published Requests stemming from BIBFRAME experimentation Research version of MeSH RDF already developed for internal use at NLM

MeSH RDF Pilot Goals Provide authoritative MeSH RDF and ensure its maintenance and preservation Develop an infrastructure for publishing NLM linked data Increase our knowledge of MeSH use cases

Decisions URI (id.nlm.nih.gov) Predicates (create our own vs. existing vocabularies) License Consultants

How to Provide the Linked Data FTP XML, XSLT, RDF SPARQL endpoint MeSH RDF files loaded into a graph Stored in Virtuoso triple store Accessible via Lodestar interface

Creating MeSH RDF

Transformation of MeSH XML to MeSH RDF Creating MeSH RDF Transformation of MeSH XML to MeSH RDF USERS NLM PUBLIC NLM INTERNAL

Anti-Bacterial Agents MeSH in RDF meshv:D015242 meshv:D015242 meshv:Q000009 meshv:allowableQualifier meshv:D015242 meshv:D000900 meshv:pharmacologicalAction meshv:D000900 Anti-Bacterial Agents label

Anti-Bacterial Agents MeSH Triples Graph Ofloxacin label meshv:D000900 meshv:pharmacologicalAction Anti-Bacterial Agents meshv:Q000009 meshv:allowableQualifier meshv:D015242 mesh:D015242 mesh:Q000009 meshv:allowableQualifier mesh:D015242 mesh:D000900 meshv:pharm.Action mesh:D000900 Anti-Bacterial Agent label

XML2RDF Modeling Issues Descriptor/Qualifier pairs Not exposed in MeSH XML How to handle ‘illegal’ descriptor/qualifier combinations Some XML elements only used internally Tree nodes Logic for hierarchical inheritance is inferred

MeSH Trees for Eye

Ontological Modeling Issues The arrows represent broader relationships, but are eyebrows really a narrower term for sense organs?

Ontological Modeling Issues Face D005145 Sense Organs D012679 meshv:treeNumber meshv:treeNumber A01.456.505 A09 meshv:broader meshv:broader meshv: broaderTransitive meshv: broaderTransitive Eye D005123 A01.456.505.420 A09.371 meshv:treeNumber meshv:treeNumber meshv: broaderTransitive meshv: broaderTransitive meshv:broader meshv:broader A09.371.613 A01.456.505.420.338 Oculomotor Muscles D009801 Eyebrows D005138 meshv:treeNumber meshv:treeNumber

(Soft) Beta Launch http://id.nlm.nih.gov Work in progress Launched Nov. 17, 2014 Work in progress Still tweaking model and documentation No public news announcements/press release No links on website

MeSH RDF Beta Demo Landing page Technical documentation GitHub Sample SPARQL query

Beta Evaluation Feedback from partners and others Public GitHub site https://github.com/HHS/meshrdf Customer service http://apps2.nlm.nih.gov/mainweb/siebel/nlm/index.cfm/ Social media Analytics Log files

MeSH RDF Next Steps Next release of MeSH RDF ca. May 2015 Update to 2015 MeSH Resolve outstanding issues raised during beta Updating/versioning Review MeSH XML elements

Using MeSH RDF at NLM Integrate with existing Linked Data Initiatives PubChem BIBFRAME Future linked data projects Research project to develop MEDLINE RDF

NLM Linked Data WG Next Steps Internal report and recommendations on the future of linked data at NLM Documentation of best practices Recommendations on infrastructure and resources needed Guidelines and prioritization for future projects

Lessons Learned Have a flexible timeframe Collaborate broadly within your institution Document everything Ask for help Understand expectations and anticipated outcomes Create an evaluation plan Value community collaboration

Questions/Comments Barbara Bushman Nancy Fallgren Beta MeSH RDF bushmanb@mail.nlm.nih.gov Nancy Fallgren fallgrennj@mail.nlm.nih.gov Beta MeSH RDF http://id.nlm.nih.gov/mesh/