Publishing to the Semantic Web Dr Owen Conlan Dr Alexander O’Connor.

Slides:



Advertisements
Similar presentations
Creating Linked Data Juan F. Sequeda Semantic Technology Conference June 2011.
Advertisements

Semantic Web Thanks to folks at LAIT lab Sources include :
OCLC Research TAI CHI Webinar 5/27/2010 A Gentle Introduction to Linked Data Ralph LeVan Sr. Research Scientist OCLC Research.
Semantic Web Introduction
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
The Web of data with meaning... By Michael Griffiths.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Linked Data Practices for the Geospatial Community Talk subtitle Presented at GEOSS Workshop on Climate Boulder Colorado, 23 September 2011 Stephan Zednik,
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
© 1Spatial All rights reserved. An Internet of Places Making Location Data Pervasive Paul Watson Giuseppe Conti* Federico Prandi*
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
JOSH FLECK Semantic Web. What is Semantic Web? Movement led by W3C that promotes common formats for data on the web Describes things in a way that computer.
Linked Data The Short Version. Linked Data is a set of best practices for publishing and deploying instance and class data using the RDF data model, naming.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
Linking Open Data Linking the world of data from LOD mailinglist Acknowledgement for Tom Heath (Talis) Ying Ding
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Shared innovation How to Publish Linked Data on the Web Dr. Tom Heath Platform Division Talis Information Ltd
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
Linking Open Data Linking the world of data from LOD mailinglist Acknowledgement for Tom Heath (Talis) Ying Ding
Semantic Web Technologies ufiekg-20-2 | data, schemas & applications | lecture 21 original presentation by: Dr Rob Stephens
2013Dr. Ali Rodan 1 Handout 1 Fundamentals of the Internet.
Practical RDF Chapter 1. RDF: An Introduction
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
The Semantic Web Web Science Systems Development Spring 2015.
Shared innovation An Introduction to Linked Data Dr Tom Heath Platform Division Talis Information Ltd 13/14.
The INTERNET how it works. the internet: defined So, what is it?
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
© 2012 IBM Corporation Best Practices for Publishing RDF Vocabularies Arthur Ryman,
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
Linked Data: Emblematic applications on Legacy Data in Libraries.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Dr. Lowell Vizenor Ontology and Semantic Technology Practice Lead Alion Science and Technology Semantic Technology: A Basic Introduction.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lotzi Bölöni.
DBpedia - A Crystallization Point
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Chapter 5 The Semantic Web 1. The Semantic Web  Initiated by Tim Berners-Lee, the inventor of the World Wide Web.  A common framework that allows data.
Extending Karkinos for Linked Data Digital Innovation Group - Fall 2015 Tushar Jain
CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.
Linked Data Publishing on the Semantic Web Dr Nicholas Gibbins
Linked Data Publishing on the Semantic Web Dr Nicholas Gibbins
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Linked Data & Semantic Web Technology The Semantic Web Part 3. URI for the Semantic Web Dr. Myungjin Lee.
Tutorial 1 Getting Started with Adobe Dreamweaver CS5.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Linked Data Web that can be processed by machines
Building the Semantic Web
Jenn Riley Metadata Librarian Digital Library Program
HTTP: the hypertext transfer protocol
Cataloging the Internet
PREMIS Tools and Services
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
Introduction to World Wide Web
Jenn Riley Metadata Librarian Digital Library Program
Presentation transcript:

Publishing to the Semantic Web Dr Owen Conlan Dr Alexander O’Connor

The World Wide Web HTTP HTML The Semantic Web Vision RDF Semantic Web Stack Web 2.0 / The Social Web Tagging Crowd Sourcing The Web of Data (Linked Data) VoID DBpedia Semantic Web Reasoning Logic, Rules Trust The Long Road for Semantic Web / ????

Tim Berners Lee driven “Linked Data uses a small slice of the technologies that make up the Semantic Web” Treat Schemas as Vocabularies Reuse existing schemas The Linked Data Movement

Community project with W3C support started in early 2007 Idea: take existing (open) data sets and make them available on the Web in RDF Interlink them with other data sets Linking Open Data Project

A Pretty (Scary) Diagram

DBpedia Transforming Wikipedia into a knowledge base Structure from Infoboxes HTML (titles) Categories Links other languages, redirects, disambiguations, etc Uses: as a controlled vocab, as an ontology Check out : ege,_Dublin ege,_Dublin (or Google “Trinity College Dublin dbpedia”)

Publish structured data in RDF on the web using URIs and shared vocabularies rather than the traditional Semantic Web focus on ontologies and inference Lowers barriers to entry Fosters widespread adoption Mature tools, techniques, patterns Linking Data

Formulated by Tim Berners-Lee (2006): 1.Use URIs as names for things 2.Use HTTP URIs so that people/apps can lookup these names 3.When someone/an app looks up a URI, provide useful information 4.Include links to other URIs so that they can discover more things This not an unambiguous specification, just a set of principles. Linked Data Principles

The Hypertext Transfer Protocol (HTTP) is an application-level protocol for distributed, collaborative, hypermedia information systems. It is a generic, stateless, protocol which can be used for many tasks beyond its use for hypertext, such as name servers and distributed object management systems, through extension of its request methods, error codes and headers. A feature of HTTP is the typing and negotiation of data representation, allowing systems to be built independently of the data being transferred. [RFC2616]

A Uniform Resource Identifier (URI) is a compact sequence of characters that identifies an abstract or physical resource [RFC3986] Syntax: URI = scheme “:” hier-part [“?” query] [“#” fragment] Example Note: scheme not the same as protocol URIs

Linked data needs dereferenceable URIs (ones we can use HTTP to retrieve a description of that resource) But we cannot serialise people, things over the internet (yet?) => we publish RDF documents on the web that describe them A real-word object != a document about that object e.g. creation-date for you != creation-date for your web- page Identifying Linked Data Resources

URI that identifies a real-word object != URI that identifies a document about that object Can make statements about object and can make statements about the document describing it How do we link these 2 URIs together? Identifying Linked Data Resources

303 Redirect (e.g. smith) Used for large, dynamic data sets Flexible because redirection can be separately configured for each resource e.g. can store data in multiple files or DB. Can change this at deployment/run-time. Typically used for resource descriptions in large data-sets URI styles for Linked Data

Fragment (e.g. smith) Used for small, static data sets Reduced number of HTTP round-trips => reduced latency A single HTTP request retrieves the entire document May transmit unnecessary data across the web Used for RDFa (defined via RDFa “about=” attribute) Typically used for vocabulary definitions URI styles for Linked Data

1.Create URIs for concept/thing and documents e.g. (URI identifying the person Dave Smith) (URI for RDF/XML document describingDave Smith) for HTML document describing Dave Smith) 2.Use HTTP redirects/content negotiation to access the desired resource description for the specific user agent 1.Client HTTP GET request on a URI identifying a object 2.Server recognizes URI, it answers using the HTTP 303 to send the URI of a description of the object 3.Client HTTP GET request on new URI 4.Server sends document from new URI 303 Redirect Approach

The picture below shows how dereferencing a HTTP URI identifying a non-information resource plays together with content negotiation: Simples… Huh?

1.Assign a URI to the RDF document defining the concepts e.g. (document URI) 2.Assign fragment identifiers to concepts within the document e.g Use HTTP requests to get the description 1.Client truncates a fragment URI to just refer to the document 2.Send HTTP GET to request the document 3.Server sends back the full document 4.Linked data application now inspects triples to find fragment Fragment Approach

Class <!--  Property Now we can refer to stuff

5 Steps to Publishing Linked Data

1.Understand the Principles 2.Understand your Data 3.Choose URIs for Things in your Data 4.Set up Your Infrastructure 5.Link to other Data Sets

A.Use URIs as names for things Anything, not just documents You are not your homepage Information resources (can be transmitted electronically) and non- information resources (cannot be transmitted electronically, e.g. a person!) B.Use HTTP URIs Globally unique names, distributed ownership Allows people to lookup those names Step 1: Understanding the Principles

C.Provide useful information in RDF when someone looks up a URI We can include RDF triple statements! D.Include RDF links to other URIs To enable discovery of related information e.g. via “follow your nose” browsing Relationship Links – to add context Identity Links – for URI aliases in other sources Vocabulary Links – to enable self-description Step 1: Understanding the Principles (cont.)

What are the key things in your data? People Places Events Book Flims Musician … This why domain expertise are critically important Step 2: Understand your Data

What vocabularies can be used to describe these? Principles: Reuse, don’t reinvent Mix liberally Examples: foaf -- Friend-of-a-Friend ontology geonames -- GeoNames ontology skos -- Simple Knowledge Organization System ckan.net Step 2: Understand your Data (cont.)

Step 2: Common Vocabularies bibo -- Bibilographic ontology cc -- Creative Commons ontology damltime -- Time Zone ontology doap -- Description of a Project ontology event -- Event ontolog foaf -- Friend-of-a-Friend ontology frbr -- Functional Requirements for Bibliographic Records geo -- Geo wgs84 ontology geonames -- GeoNames ontology mo -- Music Ontology opencyc -- OpenCyc knowledge base owl -- Web Ontology Language pim_contact -- PIM (personal information management) Contacts ontology po -- Programmes Ontology (BBC) rss -- Really Simple Syndicate (1.0) ontology sioc -- Socially Interlinked Online Communities ontolog sioc_types -- SIOC extension skos -- Simple Knowledge Organization System umbel -- Upper Mapping and Binding Exchange Layer ontology wordnet -- WordNet lexical ontology yandex_foaf -- FOAF (Friend-of-a- Friend) Yandex extension ontology

Use HTTP URIs Keep out of other people’s namespaces Create own URI and include alias information Abstract away from implementation details: Is better than this: bin/resource.php?id=/Berlin Use Natural Keys within URIs: Need to ensure the uniqueness of URIs Useful to base them on some existing primary key Whenever possible, use a key that is meaningful within the domain of the data set. e.g. use the ISBN as part of the URI of a book Step 3: Choosing URIs

Common patterns for URIs:  Thing  RDF  HTML Or use the file name extension: Step 3: Choosing URIs (cont.)

Describe the Data-set! e.g. dataset name, authorship, updates, licensing terms, crawler support, SPARQL endpoint location,... Vocabulary of Interlinked Datasets (VoID) A little later… Pick a Publication Pattern Is your input data: queryable, structured or text? What is the data volume? Is it static or dynamic? Test it Step 4: Set up Your Infrastructure

Step 4: Set up Your Infrastructure (cont.)

Popular predicates for linking owl:sameAs Foaf:depection Foaf:homepage Foaf:topic Foaf:based_near Foaf:maker/foaf:made Foaf:page Foaf:primaryTopic Rdfs:seeAlso Step 5: Linking

VoID (from "Vocabulary of Interlinked Datasets") is an RDF based schema to describe linked datasets A dataset is a collection of data, published and maintained by a single provider, available as RDF, and accessible, for example, through dereferenceable HTTP URIs or a SPARQL endpoint Step 5: Linking (cont.)

1.Understand the Principles 2.Understand your Data 3.Choose URIs for Things in your Data 4.Set up Your Infrastructure 5.Link to other Data Sets

Thank you!

1. 2.Debugging Semantic Web sites with cURL, sites-with- curl 3.Linked Data Tutorial, michael- hausenblas Linked Data Applications, M Hausenblas, DERI Technical Report Linked Data: Evolving the Web into a Global Data Space, Tom Heath, Christian Bizer References