LINKED DATA what you need to know to understand, produce, and work with Linked Data Robert Chavez, PhD. Senior Content Solutions Architect, NEJMGroup NETSL.

Presentation transcript:

LINKED DATA what you need to know to understand, produce, and work with Linked Data Robert Chavez, PhD. Senior Content Solutions Architect, NEJMGroup NETSL 2016

Relational Data
- prevalent since the 1970s
- uses defined data schemas
- organizes records into tables
- record attributes and fields organized into columns
- standard query language: SQL
- intuitive: spreadsheets, anyone?

Document Data
- prevalent with the advent of the internet
- many diverse ‘document’ models (images, unstructured text, XML, JSON, etc.)
- can have a schema or not: no pre-defined data model
- very easy to scale
- no single standard query language (although, XQuery)
- works well with REST services

Graph Data
- a relatively recent occurrence: 2000s
- schema-less, simple data model
- allows dynamic properties
- allows nodes to be arbitrarily linked
- not strictly built for the Semantic Web
- RDF datastores are a type of graph database

Why graph data? … evolution
Relational model shortcomings:
- identifiers internal (relational)
- can be difficult to work with complex data (relational)
- little schema flexibility (relational)
Document model shortcomings:
- poor for interconnected data (relational + document)
- queries mainly limited to keys and indexed values (document)

Infrastructure Evolution
System and Web infrastructure has evolved along with our needs and expectations:
- Software as a Service (SaaS)
- Cloud computing
- Application Programming Interfaces (APIs)
- Service and application focused
- Modular architectures and micro-services replace monoliths
- More and more internet-centric

Information Evolution
The way we think about information (data), and the way we find and use that information, has evolved:
- The Web: a place for exploration
- Web Standards: protocols, methods and ways to explore data and reference formats
- Interconnectivity: we expect it
- Information: active use and re-use of data
- Realization: different users (working with the same data) have different needs

Wait. What about Linked Data?
- styled after graph data
- Resource Description Framework (RDF) = Semantic Data
- describes and models information about resources, as granular as you want to be
- describes complex relationships in a way that you, your query language, and other technologies can easily understand
- human and machine readable

Linked Data? Semantic Web?
- Linked Data is Semantic Data
- organizes information into three-part chunks of data (triples), each with a subject, a predicate, and an object
- built on the architecture of the Web (facilitates sharing of data on a global scale)
- standard query and access protocol (SPARQL Protocol)

The Four Principles
1. Use URIs as names for things
2. Use HTTP URIs so that people can look up those names
3. When someone looks up a URI, provide useful information using the standards (RDF, SPARQL)
   - RDF: making statements and forming sentences
   - SPARQL: querying data and discovering relationships
4. Include links to other URIs, so that they can discover more things
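Principles 2 and 3 amount to an ordinary HTTP lookup that asks for an RDF serialization. The sketch below only prepares such a request and does not send it. The resource IRI is a made-up placeholder, and `text/turtle` is one common RDF media type chosen for illustration, not something the slides prescribe.

```python
from urllib.request import Request

# Hypothetical resource IRI, used only for illustration.
RESOURCE_IRI = "http://example.org/id/article/123"


def build_lookup_request(iri: str, media_type: str = "text/turtle") -> Request:
    """Prepare (but do not send) an HTTP GET that asks for RDF about `iri`."""
    # Principle 2: names should be HTTP URIs so that they can be looked up.
    if not iri.startswith(("http://", "https://")):
        raise ValueError("expected an HTTP(S) URI that can be looked up")
    # Principle 3: ask the server for a standard RDF serialization.
    return Request(iri, headers={"Accept": media_type}, method="GET")


request = build_lookup_request(RESOURCE_IRI)
print(request.get_method(), request.full_url, request.get_header("Accept"))
```

Passing the prepared request to `urllib.request.urlopen` would perform the actual lookup against a real server.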

RDF in 2 minutes (or maybe 4)
Subject | Predicate | Object
Michelle Mello | Contributed to | Prevalence and Characteristics of Physicians Prone to Malpractice Claims

RDF in 2 minutes (or maybe 4)
Subject (IRI) | Predicate (IRI) | Object (IRI/Literal)
/contributor Prevalence and Characteristics of Physicians Prone to Malpractice Claims and Characteristics of Physicians Prone to Malpractice Claims sa

Linked Data in 2 minutes
Subject (IRI) | Predicate (IRI) | Object (IRI/Literal)
sa
Subject (IRI) | Predicate (IRI) | Object (IRI/Literal)
sa | dc:title | Prevalence and Characteristics of Physicians Prone to Malpractice Claims
Subject (IRI) | Predicate (IRI) | Object (IRI/Literal)
sa | schema:hasPart | https://doi.org/ /NEJM sa
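A minimal sketch of how subject/predicate/object rows like these might be held in code and written out as N-Triples. The `example.org` IRIs are placeholders, not the IRIs from the slides. The namespace expansions for `dc:` and `schema:` are the conventional ones and are assumed here, since the slides do not spell them out.

```python
from typing import NamedTuple, Union


class IRI(str):
    """A term serialized as <...> in N-Triples."""


class Literal(str):
    """A term serialized as a quoted string in N-Triples."""


class Triple(NamedTuple):
    subject: IRI
    predicate: IRI
    obj: Union[IRI, Literal]


def format_term(term: Union[IRI, Literal]) -> str:
    """Render one term using N-Triples syntax."""
    if isinstance(term, Literal):
        escaped = (
            term.replace("\\", "\\\\")
            .replace('"', '\\"')
            .replace("\n", "\\n")
            .replace("\r", "\\r")
        )
        return f'"{escaped}"'
    return f"<{term}>"


def to_ntriples(triple: Triple) -> str:
    """Render a triple as a single N-Triples line."""
    return " ".join(format_term(t) for t in triple) + " ."


# Assumed expansions of the dc: and schema: prefixes.
DC_TITLE = IRI("http://purl.org/dc/elements/1.1/title")
SCHEMA_HAS_PART = IRI("http://schema.org/hasPart")

# Hypothetical identifiers, for illustration only.
ISSUE = IRI("http://example.org/issue/1")
ARTICLE = IRI("http://example.org/article/1")

triples = [
    Triple(ISSUE, SCHEMA_HAS_PART, ARTICLE),
    Triple(
        ARTICLE,
        DC_TITLE,
        Literal("Prevalence and Characteristics of Physicians Prone to Malpractice Claims"),
    ),
]

for triple in triples:
    print(to_ntriples(triple))
```

Because the object of the first triple is itself an IRI that appears as the subject of the second, the two statements link into a small graph.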

How do I create triples? What do I need?
RDF data: can be created in multiple ways (manual and automated methods)
- aggregation from other sources (DBpedia, Getty, Library of Congress, British Library, Europeana, National Library of Medicine, Linked Jazz, OCLC -- WorldCat, Dewey Decimal Classification, etc.)
- conversion of local data
- newly minted data
RDF tools: RDF Converters, OpenRefine, LODRefine, Catmandu, TopBraid
Linked University: converting legacy data to RDF
See: RDF-converters

How do I create triples? What do I need?
Web server: to handle HTTP services, triplestore, SPARQL endpoints, gateways, APIs, etc.
- Linux/Windows server
- AWS, Azure
- Hosted solutions: Open Knowledge Systems DataHub
- See:
A triplestore: for triple storage and management
- open source and paid options (including platform and integration)
- Apache Jena/TDB, Apache Marmotta, MarkLogic, Ontotext, Sesame, Virtuoso
- See:
SPARQL: for querying your (and other) triplestores
- open source and paid toolkits, clients, etc.
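Querying a SPARQL endpoint comes down to an HTTP request carrying the query text. The sketch below prepares a GET request in the style of the SPARQL Protocol (`query` parameter, JSON results media type) without sending it. The endpoint URL and the query itself are hypothetical examples.

```python
from urllib.parse import parse_qs, urlencode, urlsplit
from urllib.request import Request

# Hypothetical endpoint; substitute the address of a real triplestore.
ENDPOINT = "http://example.org/sparql"

# Illustrative query: list up to ten resources that have a dc:title.
QUERY = """
PREFIX dc: <http://purl.org/dc/elements/1.1/>
SELECT ?article ?title
WHERE { ?article dc:title ?title }
LIMIT 10
""".strip()


def build_sparql_request(endpoint: str, query: str) -> Request:
    """Prepare (but do not send) a SPARQL Protocol query via HTTP GET."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def extract_query(request: Request) -> str:
    """Recover the query text from the prepared URL (handy for checking)."""
    return parse_qs(urlsplit(request.full_url).query)["query"][0]


sparql_request = build_sparql_request(ENDPOINT, QUERY)
print(sparql_request.full_url)
```

Sending the request with `urllib.request.urlopen` and decoding the body with `json.loads` would give the result bindings, assuming the endpoint supports the JSON results format.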

Fine. But, why bother?
Problem 1: disambiguate and unify identification schemas
- Search: (not an Alfred Hitchcock problem)
- VIAF Record:
- Library of Congress Record:
Problem 2: enrich metadata, enhance discoverability
- MeSH:

Solving problems with LD: example 1

VIAF:

Solving problems with LD: example 1

VIAF Triples:... " ".. "Michelle M. "Warning: skos:prefLabels are not ensured against "Michelle\n M. "Mello, Michelle M.".. NEJM Triple:

Solving problems with LD: example 2

"Zika Virus " "^^.. " "^^. "D ".

Silos: connect, don’t break
This is the proverbial data silo
Datasets = catalogs of things:
- of collections
- of articles
- of rights
- of formats
- of contributors
- of subjects
- of types
We can categorize all of these by using controlled vocabularies and taxonomies (i.e. create domain models)
We can establish relationships between all of these (i.e. create ontologies)

Silos: connect, don’t break
- How we store and organize our data and define our data models matters
- Linking data allows us and our audience to access and query our data from any single point
- Because these datasets are linked, a single query can retrieve articles in a given journal, by a given contributor, on a given subject
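The "single query" idea can be sketched with a tiny in-memory set of triples. This is a toy stand-in for a triplestore and SPARQL, and the `ex:` identifiers and predicate names are made up for the example.

```python
# Toy triples linking articles to a journal, a contributor, and a subject.
# All ex: names are hypothetical.
TRIPLES = {
    ("ex:article1", "ex:inJournal", "ex:journalA"),
    ("ex:article1", "ex:contributor", "ex:person1"),
    ("ex:article1", "ex:subject", "ex:topicX"),
    ("ex:article2", "ex:inJournal", "ex:journalA"),
    ("ex:article2", "ex:contributor", "ex:person2"),
    ("ex:article2", "ex:subject", "ex:topicX"),
    ("ex:article3", "ex:inJournal", "ex:journalB"),
    ("ex:article3", "ex:contributor", "ex:person1"),
    ("ex:article3", "ex:subject", "ex:topicX"),
}


def subjects_with(triples, predicate, obj):
    """All subjects that have the given predicate/object pair."""
    return {s for s, p, o in triples if p == predicate and o == obj}


def find_articles(triples, journal, contributor, subject):
    """Articles matching all three conditions at once (a simple join)."""
    return (
        subjects_with(triples, "ex:inJournal", journal)
        & subjects_with(triples, "ex:contributor", contributor)
        & subjects_with(triples, "ex:subject", subject)
    )


matches = find_articles(TRIPLES, "ex:journalA", "ex:person1", "ex:topicX")
print(sorted(matches))
```

In a real deployment the same question would be one SPARQL query with three triple patterns sharing the `?article` variable.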

Connect to (and share with) the wider world
- Solid, well-defined data in our silo
- Modeled as Linked Data
- Enables connectivity to other datasets and data models on the Web
Graphic from Nature.com

Further Reading…
- Linked Data for Libraries (LD4L)
- Common Ground: Exploring Compatibilities Between the Linked Data Models of the Library of Congress and OCLC: linked-data-2015.html
- Linked Data in Libraries: Status and Future Direction: Libraries.shtml
- A Linked Data Landscape: landscape/