I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people.


"I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people and computers. A "Semantic Web", which makes this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The "intelligent agents" people have touted for ages will finally materialize." Tim Berners-Lee, 1999. Image by Paul Clarke, Wikimedia Commons, CC-BY.

"It is not easy to build a robot, and only very clever boys should try it." Carol Ryrie Brink (1966) Andy Buckram's Tin Men. How cool would it be to make the intelligent agents come into existence ?!

The Baskauf Rule for Technology: Adopting new technology requires that it do something better than the old technology. W3C Semantic Web Activity logos used according to usage guidelines. W3C Resource Description Framework

What do RDF and SPARQL do better than traditional databases and SQL? If the answer is "nothing", then we shouldn't waste our time using them!

RDF is an abstract, graph-based model. Triples are represented in text as serializations. Several serializations are W3C Recommendations: XML (media type: application/rdf+xml); Turtle (media type: text/turtle); also RDFa and JSON-LD (which we won't talk about today). RDF/XML plays well with XML tools like XSLT and XQuery, but isn't very readable. RDF/Turtle is easier for humans to read. SPARQL is based on Turtle syntax.

W3C RDF/XML Validation/visualization Service Load RDF/XML file from IRIs=ovals, literals=rectangles, predicates=arrows Graph model of data in van-gogh.rdf

Serializations of the data Namespace abbreviations abbreviated IRIs type blank (anonymous) node XML Turtle

Database table:
painting          painter            year
The Starry Night  Vincent van Gogh   1889
Birth of Venus    Sandro Botticelli  1485
XML: The Starry Night Vincent van Gogh 1889 Birth of Venus Sandro Botticelli 1485
RDF (Turtle serialization): dbres:The_Starry_Night dcterms:creator viaf: ; dcterms:created "1889"^^xsd:gYear. dcterms:creator viaf: ; dcterms:created "1485"^^xsd:gYear.
IRIs denote resources. The resource that is denoted is the referent.
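To make the step from table to graph concrete, here is a minimal Python sketch that turns each row of the database table into two triples. It is only an illustration: the helper to_curie, the ex: prefix for painters, and the generated prefixed names are assumptions made for this example and are not guaranteed to match real DBpedia or VIAF IRIs.

```python
# Rows copied from the database table on the slide.
rows = [
    {"painting": "The Starry Night", "painter": "Vincent van Gogh", "year": "1889"},
    {"painting": "Birth of Venus", "painter": "Sandro Botticelli", "year": "1485"},
]


def to_curie(prefix, text):
    """Hypothetical helper: build a prefixed name by replacing spaces with underscores."""
    return f"{prefix}:{text.replace(' ', '_')}"


def rows_to_triples(table_rows):
    """Return a set of (subject, predicate, object) tuples, two per row."""
    triples = set()
    for row in table_rows:
        subject = to_curie("dbres", row["painting"])
        # ex: is an assumed placeholder namespace standing in for the painter IRIs.
        triples.add((subject, "dcterms:creator", to_curie("ex", row["painter"])))
        # A datatyped literal is modelled here as a (lexical form, datatype) pair.
        triples.add((subject, "dcterms:created", (row["year"], "xsd:gYear")))
    return triples


triples = rows_to_triples(rows)
for triple in sorted(triples, key=str):
    print(triple)
```

Because the result is a set, the order of the rows and columns no longer matters, which is one way to picture the "pot full of triples" idea.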

RDF "means" something. Triple: dbres:The_Starry_Night dcterms:creator viaf: In this triple, dbres:The_Starry_Night denotes the actual painting entitled "The Starry Night", viaf: denotes the actual person whose name was "Vincent van Gogh", and dcterms:creator denotes the relationship of a subject resource having a maker who is the object agent.

information resource (web page; deliverable via Internet) non-information resource (a painting; not deliverable via Internet) simple literal (denotes a string of characters with NO meaning) IRI (denotes the person, Vincent van Gogh)

Datatyped literals "mean" something. Triple: dbres:The_Starry_Night dcterms:created "1889"^^xsd:gYear. Here dbres:The_Starry_Night denotes the actual painting entitled "The Starry Night", "1889"^^xsd:gYear denotes the actual year of 1889 CE, and dcterms:created denotes the relationship of a subject resource being made in the object time period. Compare: dbres:The_Starry_Night dcterms:created "1889". With a simple literal as its object, the triple does not actually mean anything that makes sense.
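The difference between a simple literal and a datatyped literal can be sketched in a few lines of Python. This is a simplification under stated assumptions: the Literal class and value_of function are hypothetical names, and mapping xsd:gYear to a plain integer ignores details such as time zones.

```python
from typing import NamedTuple, Optional


class Literal(NamedTuple):
    """A literal as a lexical form plus an optional datatype (None = simple literal)."""
    lexical: str
    datatype: Optional[str] = None


def value_of(literal):
    """Return the value a client could work with for this literal."""
    if literal.datatype == "xsd:gYear":
        # Simplified: treat a gYear as an integer year.
        return int(literal.lexical)
    # A simple literal is only a string of characters.
    return literal.lexical


typed = Literal("1889", "xsd:gYear")
plain = Literal("1889")

print(value_of(typed) < 1900)   # a year can be compared with other years
print(repr(value_of(plain)))    # a simple literal stays a string
```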

What does RDF do better? RDF "means" something. Great if you care about imparting meaning. Really annoying if you don't care about the complications and just want to do string searching.

What is the Semantic Web? "The Semantic Web is about two things. It is about common formats for integration and combination of data drawn from diverse sources … It is also about language for recording how the data relates to real world objects."

Let's play with van Gogh and The Starry Night graph! It's loaded in the Heard Library triplestore as the graph: Important note: the graph does NOT live in the triplestore as any particular serialization! It's just a pot full of triples.

PREFIX rdf: PREFIX rdfs: PREFIX xsd: PREFIX foaf: PREFIX schema: PREFIX dc: PREFIX dcterms: PREFIX dbres: PREFIX viaf: PREFIX orcid: PREFIX owl: PREFIX dbp: PREFIX prov: PREFIX dbo: These are all of the namespace prefixes we will be using in the rest of the examples (see Gist).

This is the skeleton SPARQL query that we will use (see Gist). SELECT DISTINCT ?label FROM WHERE { dbres:The_Starry_Night rdfs:label ?label. } Replace stuff in orange text with your experimentation. DISTINCT keyword prevents repetition if the same triple is found multiple times.

What kinds of classes of things are present in this graph? (rdf:type or "a") SELECT DISTINCT ?resource ?class FROM WHERE { ?resource a ?class. } Notes: The foaf:Document is represented by a blank node. There is no limit to the number of classes a resource can be an instance of.
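A rough way to picture what a query like this does is to match a triple pattern against every triple in the graph and collect the variable bindings. The Python sketch below is a toy matcher, not how a real triplestore is implemented; the match function and the sample graph (including the ex: names) are made up for illustration.

```python
def match(graph, pattern):
    """Return one binding dict per triple in the graph that fits the pattern.

    Pattern terms starting with "?" are variables; other terms must match exactly.
    """
    results = []
    for triple in graph:
        binding = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                # A variable used twice must bind to the same value both times.
                if binding.get(term, value) != value:
                    break
                binding[term] = value
            elif term != value:
                break
        else:
            results.append(binding)
    return results


# Made-up sample data; ex: is an assumed placeholder namespace.
graph = {
    ("dbres:The_Starry_Night", "rdf:type", "ex:Painting"),
    ("ex:Birth_of_Venus", "rdf:type", "ex:Painting"),
    ("ex:van_gogh", "rdf:type", "foaf:Person"),
    ("dbres:The_Starry_Night", "rdfs:label", "The Starry Night"),
}

rows = match(graph, ("?resource", "rdf:type", "?class"))
classes = [row["?class"] for row in rows]   # like SELECT ?class (may repeat)
distinct_classes = sorted(set(classes))     # like SELECT DISTINCT ?class
print(distinct_classes)
```

Projecting only ?class yields ex:Painting twice, once per painting, which is the kind of repetition that DISTINCT removes.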

Human-friendly labels for referents. SELECT DISTINCT ?label FROM WHERE { viaf: rdfs:label ?label. } Replace stuff in orange text with your experimentation. Try schema:name, schema:familyName, and schema:givenName. rdfs:label is the most generic (built-in property) but more specific properties give more precise information. Schema.org is run by Google, Microsoft, Yahoo, with contributions by Dan Brickley (of FOAF fame).

Find human-friendly labels for The Starry Night. SELECT DISTINCT ?label FROM WHERE { dbres:The_Starry_Night rdfs:label ?label. } Replace stuff in orange text with the more specific Dublin Core term dcterms:title. Dublin Core is the most commonly used vocabulary for metadata.

What is the Semantic Web? "The Semantic Web is about two things. It is about common formats for integration and combination of data drawn from diverse sources … It is also about language for recording how the data relates to real world objects."

Linked Data Tim Berners-Lee expressed the "Linked Data Principles" in 2006: 1. Use URIs as names for things. 2. Use HTTP URIs, so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL). 4. Include links to other URIs, so that they can discover more things. "Linked Data" is a similar idea to "the Semantic Web" but focused on HTTP URIs as identifiers and more on data discovery than reasoning.

Dereference the HTTP URI and ask for RDF. dbres:The_Starry_Night is an abbreviation for

Can we get more information by merging data from DBpedia? Look for more labels: SELECT DISTINCT ?label FROM WHERE { dbres:The_Starry_Night rdfs:label ?label. } Look for more properties (try first without DBpedia data): SELECT DISTINCT ?property FROM WHERE { dbres:The_Starry_Night ?property ?value. } Yay! We have "learned" more about The Starry Night by adding triples to our graph via Linked Data!
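Merging two graphs amounts to taking the union of their triples, which a short Python sketch can show. The sample triples below are invented for illustration (they are not the actual contents of the library graph or of DBpedia), and properties_of is a hypothetical helper.

```python
S = "dbres:The_Starry_Night"

# Made-up stand-ins for the local graph and for triples retrieved as Linked Data.
local_graph = {
    (S, "rdfs:label", "The Starry Night"),
    (S, "dcterms:creator", "ex:van_gogh"),
}
remote_graph = {
    (S, "rdfs:label", "The Starry Night"),   # same triple as in the local graph
    (S, "rdfs:label", "De sterrennacht"),
    (S, "dbo:museum", "ex:MoMA"),
}


def properties_of(graph, subject):
    """Return the distinct predicates used with the given subject."""
    return sorted({p for s, p, o in graph if s == subject})


merged = local_graph | remote_graph   # set union; identical triples collapse

print(properties_of(local_graph, S))
print(properties_of(merged, S))
```

The triple present in both graphs appears only once after the merge, and the query over the merged graph sees properties that were absent from the local data.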

What does RDF do better? Expressiveness is great. If there isn't a property that you need, you can make one up! Consistency is terrible. Resources described with ad hoc properties are not likely to be usefully combined with other people's data. AAA principle: Anyone can say Anything about Anything.

Can we get more information from DBpedia about van Gogh? Look for more properties (try first without DBpedia data): SELECT DISTINCT ?name FROM WHERE { viaf: schema:name ?name. }

فينسنت فان غوخ 文森特 · 梵高 Vincent van Gogh Ван Гог, Винсент Vincent van Gogh Vincent van Gogh Vincent van Gogh Vincent van Gogh Vincent van Gogh フィンセント・ファン・ゴッホ Vincent van Gogh Vincent van Gogh Vincent van Gogh Gogh, Vincent van Vincent van Gogh Gogh, Vincent van Why didn't it work? Grrrrrrr. They didn't use schema:name like we did!

Try something more complicated: SELECT DISTINCT ?name FROM WHERE { {viaf: schema:name ?name.} UNION {viaf: rdfs:label ?name.} UNION {viaf: foaf:name ?name.} UNION {viaf: dbp:name ?name.} } Grrrrrrr. They didn't use viaf: as an IRI for van Gogh as we did:

Here's the solution!

Try something more complicated: SELECT DISTINCT ?name FROM WHERE { {viaf: schema:name ?name.} UNION { ?person owl:sameAs viaf: . ?person rdfs:label ?name. } UNION { ?person owl:sameAs viaf: . ?person foaf:name ?name. } UNION { ?person owl:sameAs viaf: . ?person dbp:name ?name. } }
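The idea behind the UNION query, following owl:sameAs links to find names attached to other IRIs for the same person, can be sketched in Python. This is a loose analogue rather than a translation of the SPARQL: names_for is a hypothetical function, it checks all four name properties on every alias, and the IRIs in the sample graph are placeholders.

```python
NAME_PROPERTIES = ("schema:name", "rdfs:label", "foaf:name", "dbp:name")


def names_for(graph, iri):
    """Collect names stated for iri or for any resource declared owl:sameAs iri."""
    aliases = {iri} | {s for s, p, o in graph if p == "owl:sameAs" and o == iri}
    return sorted({o for s, p, o in graph if s in aliases and p in NAME_PROPERTIES})


# Made-up sample data; ex:viaf_van_gogh stands in for the elided VIAF IRI.
graph = {
    ("ex:viaf_van_gogh", "schema:name", "Vincent van Gogh"),
    ("dbres:Vincent_van_Gogh", "owl:sameAs", "ex:viaf_van_gogh"),
    ("dbres:Vincent_van_Gogh", "foaf:name", "Gogh, Vincent van"),
    ("dbres:Vincent_van_Gogh", "rdfs:label", "Vincent van Gogh"),
}

print(names_for(graph, "ex:viaf_van_gogh"))
```

Like the query, the sketch only follows owl:sameAs in one direction (from the other resource to our IRI), so a link stated the other way round would be missed.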

References: Harry Halpin, Patrick J. Hayes, James P. McCusker, Deborah L. McGuinness, and Henry S. Thompson. When owl:sameAs isn’t the Same: An Analysis of Identity in Linked Data. International Semantic Web Conference (ISWC). Also, a blog post on "bloating" caused by owl:sameAs: of-rdf-agnostic-part-5.html

What does RDF do better? With RDF, you can discover other people's triples (Linked Data). Great if they used standard properties to link and standard IRIs to identify. Really annoying if they made up their own properties and IRIs. So the examples in the book where you make up your own vocabulary don't really leverage the power of Linked Data. You're not much better off than if you used standard database and querying techniques.

One can infer previously unstated facts based on logic (entailment). This is a key benefit of having RDF "mean" something rather than just making it be a transfer mechanism or database system.

"The chief utility of a formal semantic theory is not to provide any deep analysis of the nature of the things being described by the language or to suggest any particular processing model, but rather to provide a technical way to determine when inference processes are valid, i.e. when they preserve truth." RDF Semantics A semantic client does not “know” what the URIs and literals “mean” dwc:decimalLatitude has no more meaning to a machine than: xq:p2-glwsopgn_2q4as " " is just a string of Unicode characters

"The chief utility of a formal semantic theory is not to provide any deep analysis of the nature of the things being described by the language or to suggest any particular processing model, but rather to provide a technical way to determine when inference processes are valid, i.e. when they preserve truth." RDF Semantics. But a semantic client can follow rules about what can be inferred to be true. If: aaa rdfs:range XXX. and uuu aaa vvv. then: vvv rdf:type XXX.

Application of an entailment rule. The FOAF vocabulary asserts: foaf:depiction rdfs:range foaf:Image. This does NOT mean that the object of a triple containing foaf:depiction must be an image. The AAA Principle allows the predicate foaf:depiction to be used with any kind of object. The entailment rule rdfs3 means that a semantic client can materialize an entailed triple stating that the rdf:type of the object is foaf:Image. FOAF = Friend of a Friend vocabulary

Entailment rule example The AAA Principle allows me to assert that: foaf:depiction. In English we would say: {The person Vincent van Gogh} has a depiction {a certain jpeg image} From the range of foaf:depiction, a client can infer that: rdf:type foaf:Image.

RDF also allows me to assert that: foaf:depiction. In English we would say: {The name Physeter macrocephalus Linnaeus, 1758} has a depiction {the novel Moby Dick} DBpedia declares rdf:type bibo:Book But a semantic client infers rdf:type foaf:Image. based on the range declaration of foaf:depiction A novel is an image !!! Oops. We must be more careful with foaf:depiction because of its range declaration. Image by Randy Son of Robert Wikimedia Commons cc-by-2.0
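The range rule (rdfs3) is mechanical enough to sketch in a few lines of Python. This is an illustrative toy, not a full RDFS reasoner: apply_rdfs3 is a hypothetical function name, and the ex: IRIs are placeholders for the resources whose IRIs are not shown on the slides.

```python
def apply_rdfs3(graph):
    """Return triples entailed by rule rdfs3 that are not already in the graph.

    rdfs3: if  aaa rdfs:range XXX .  and  uuu aaa vvv .  then  vvv rdf:type XXX .
    """
    ranges = {(s, o) for s, p, o in graph if p == "rdfs:range"}
    inferred = set()
    for prop, cls in ranges:
        for s, p, o in graph:
            if p == prop:
                inferred.add((o, "rdf:type", cls))
    return inferred - graph


# Placeholder IRIs (ex:) stand in for the person, the image, the name, and the novel.
graph = {
    ("foaf:depiction", "rdfs:range", "foaf:Image"),
    ("ex:van_gogh", "foaf:depiction", "ex:self_portrait_jpg"),
    ("ex:sperm_whale_name", "foaf:depiction", "ex:Moby_Dick"),
}

for triple in sorted(apply_rdfs3(graph)):
    print(triple)
```

The rule never checks whether the object really is an image; it simply adds the type triple, which is how the novel ends up typed as a foaf:Image.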

Aside on inconsistencies. The Open World assumption assumes that we cannot infer anything from triples that are unstated (i.e. not making a statement does not imply that the statement is false). Stating more triples restricts the possible states of the "world" of discourse described by the graph. It is possible to make statements which entail that there is no possible "world" that the graph describes, e.g. ex:steve my:age "18.5"^^xsd:integer. Careless use of terms with strong entailments increases the likelihood of rendering a graph inconsistent. See rdf-agnostic-part-4.html for examples and more on this.
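The "18.5"^^xsd:integer example is a problem because "18.5" is not in the lexical space of xsd:integer (an optional sign followed by decimal digits). A minimal Python sketch of that check follows; the function name is_valid_integer_literal is an assumption for this example, and a real client would handle many more datatypes.

```python
import re

# Lexical space of xsd:integer: an optional sign followed by one or more digits.
INTEGER_LEXICAL = re.compile(r"[+-]?[0-9]+")


def is_valid_integer_literal(lexical):
    """Return True if the lexical form is a well-formed xsd:integer."""
    return INTEGER_LEXICAL.fullmatch(lexical) is not None


for lexical in ("18", "-3", "18.5"):
    print(lexical, is_valid_integer_literal(lexical))
```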

Entailment summary. Entailment rules do NOT enforce conditions. Entailment rules imply that other unstated triples exist. Inferred triples are true to the extent that the statements which entail them are also true. This introduces a requirement for an element of trust. A client is not required to apply all possible entailment rules. A client is not required to apply rules to any particular set of triples. Quote from section 3 of OWL 2 Primer (primer/#Modeling_Knowledge:_Basic_Notions): "a set of statements A entails a statement a if in any state of affairs wherein all statements from A are true, also a is true."

"… the vocabulary of the graph may be interpreted relative to a stronger notion of vocabulary entailment, i.e. with a larger set of semantic conditions understood to be imposed on the interpretations. … [This] can be thought of as an addition of information, and may make more entailments hold than held before the change." Section 6 of RDF Semantics, W3C Recommendation.
Vocabulary trends, moving from vocabulary-interpretation through rdf-interpretation and rdfs-interpretation to owl-interpretation:
entailment: weaker to stronger
semantic conditions imposed: fewer to more
information: less to more
likelihood of inconsistency: less to more

What does RDF do better? With RDF, you can infer entailed triples that nobody has explicitly stated. Great if consistent use of terms entails triples that make sense. Really annoying if careless use of terms entails triples that are nonsensical or that generate inconsistencies.