Detailed introduction into RDF and the Semantic Web Search & Find Workshop Ghent, Belgium, 2008-08-22 Ivan Herman, W3C.

Slides:



Advertisements
Similar presentations
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Advertisements

Semantic Web Motivating Example. A Motivating example Here’s a motivating example, adapted from a presentation by Ivan Herman It introduces semantic web.
By Ahmet Can Babaoğlu Abdurrahman Beşinci.  Suppose you want to buy a Star wars DVD having such properties;  wide-screen ( not full-screen )  the extra.
RDF Tutorial.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Dr. Alexandra I. Cristea RDF.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Practical RDF Chapter 1. RDF: An Introduction
Logics for Data and Knowledge Representation
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
RDF and XML 인공지능 연구실 한기덕. 2 개요  1. Basic of RDF  2. Example of RDF  3. How XML Namespaces Work  4. The Abbreviated RDF Syntax  5. RDF Resource Collections.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
RELATORS, ROLES AND DATA… … similarities and differences.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
1 Tutorial on the Semantic Web (Last update: 26 May 2009) adapted from (C) Ivan Herman, W3C Given at WE course by Peter Dolog Adapted: October 2010.
1 Introduction and Applications of the Semantic Web Ivan Herman, W3C May 2009.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
1 Tutorial on the Semantic Web cntd. (Last update: 26 May 2009) adapted from (C) Ivan Herman, W3C Given at WE course by Peter Dolog Adapted: October.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Semantic Web Presented by Xia Li. 2 Outline Introduction Examples Semantic Web technologies Applications Concerns.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
© Copyright 2015 STI INNSBRUCK PlanetData D2.7 Recommendations for contextual data publishing Ioan Toma.
1 Semantic Web, Linked Data, and Semantic 3D Media Ivan Herman, W3C FOCUS 3D Conference, Sophia Antipolis, France
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Semantic Web in Depth RDFa, GRDDL and POWDER Dr Nicholas Gibbins
What is the Semantic Web? 17 th XBRL International Conference Eindhoven, the Netherlands 5 st May, 2008 Ivan Herman, W3C.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Introduction to the Semantic Web (through an Example…) Ivan Herman, W3C (Last updated: 11 November 2009)
An Introduction to Semantic Web (Tutorial) 17 th International World Wide Web Conference Beijing, China, 21 st April, 2008 Ivan Herman ( 郝易文 ), W3C.
Introduction to the Semantic Web Ivan Herman, W3C ESA, Noordwijk, the Netherlands 16 th April, 2007 Ivan Herman.
What is the Semantic Web? (In 15 minutes…) ISOC Nieuwjaarsreceptie , Amsterdam, The Netherlands Ivan Herman, W3C.
Semantics of RDF(S) and OWL Ivan Herman, W3C (Last update: 28 Sep 2007) Ivan Herman.
Introduction to the Semantic Web 2 nd European Semantic Technology Conference Vienna, Austria September 24, 2008 Ivan Herman, W3C.
Overview of the Semantic Web Ralph R. Swick World Wide Web Consortium (W3C) 17 October 2009.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Tutorial on Semantic Web
Charlie Abela Department of Intelligent Computer Systems
The Semantic Web By: Maulik Parikh.
Linked Data Web that can be processed by machines
Tutorial on Semantic Web
RDFa How and Why Ralph R. Swick World Wide Web Consortium
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Materi Minggu ke 11 Introduction to the Semantic Web
Introduction to the Semantic Web (tutorial) 2009 Semantic Technology Conference San Jose, California, USA June 15, 2009 Ivan Herman, W3C
Tutorial on Semantic Web
How does the Semantic Web Work?
Overview of the Semantic Web Ralph R
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
Analyzing and Securing Social Networks
ece 720 intelligent web: ontology and beyond
Triple Stores.
PREMIS Tools and Services
Linking Guide Michel Böhms.
Semantic Web Basics (cont.)
Semantic-Web, Triple-Strores, and SPARQL
Presentation transcript:

Detailed introduction into RDF and the Semantic Web Search & Find Workshop Ghent, Belgium, Ivan Herman, W3C

Copyright © 2008, W3C ( 2 )‏ (2)(2) Towards a Semantic Web Tasks often require to combine data on the Web: hotel and travel information may come from different sites searches in different digital libraries etc. Humans combine these information easily even if different terminologies are used the information is incomplete, or buried in images, videos, …

Copyright © 2008, W3C ( 3 )‏ (3)(3) However… However: machines are ignorant! partial information is unusable difficult to make sense from, e.g., an image drawing analogies automatically is difficult difficult to combine information automatically is same as ? …

Copyright © 2008, W3C ( 4 )‏ (4)(4) Example: automatic airline reservation Your automatic airline reservation knows about your preferences builds up knowledge base using your past can combine the local knowledge with remote services: airline preferences dietary requirements calendaring etc It communicates with remote information (i.e., on the Web!) (M. Dertouzos: The Unfinished Revolution)

Copyright © 2008, W3C ( 5 )‏ (5)(5) Example: data(base) integration Databases are very different in structure, in content Lots of applications require managing several databases after company mergers combination of administrative data for e-Government biochemical, genetic, pharmaceutical research combination of online library data etc. Most of these data are accessible from the Web (though not necessarily public yet)

Copyright © 2008, W3C ( 6 )‏ (6)(6) And the problem is real…

Copyright © 2008, W3C ( 7 )‏ (7)(7) Example: social networks Social sites are everywhere these days (LinkedIn, Facebook, Dopplr, Digg, Plexo, Zyb, …) Data is not interchangeable: how many times did you have to add your contacts? Applications should be able to get to those data via standard means there are, of course, privacy issues…

Copyright © 2008, W3C ( 8 )‏ (8)(8) Example: digital libraries Sort of catalogues on the Web librarians have known how to do that for centuries goal is to have this on the Web, World-wide extend it to multimedia data, too But it is more: software agents should also be librarians! e.g., help you in finding the right publications

Copyright © 2008, W3C ( 9 )‏ (9)(9) What is needed? (Some) data should be available for machines for further processing Data should be possibly combined, merged on a Web scale Machines may also need to reason about that data Create a Web of Data (beyond the Web of Documents)

Copyright © 2008, W3C ( 10 )‏ ( 10 ) Find the right experts at NASA Expertise locater for nearly 70,000 NASA civil servants, integrating 6 or 7 geographically distributed databases, data sources, and web services… Michael Grove, Clark & Parsia, LLC, and Andrew Schain, NASA, (SWEO Case Study)(SWEO Case Study)

Copyright © 2008, W3C ( 11 )‏ ( 11 ) Integration of “social” software data Internal usage of wikis, blogs, RSS, etc, at EDF uses: public ontologies public datasets for tagging Details are hidden from end users (via plugins, extra layers, etc) Courtesy of Alexandre Passant, EDF R&D and LaLIC, Université Paris-Sorbonne, (SWEO Case Study)(SWEO Case Study)

Copyright © 2008, W3C ( 12 )‏ ( 12 ) Eli Lilly’s Target Assessment Tool Prioritization of drug target, integrating data from different sources and formats Integration, search via ontologies (proprietary and public) Courtesy of Susie Stephens, Eli Lilly (SWEO Case Study)(SWEO Case Study)

Copyright © 2008, W3C ( 13 )‏ ( 13 ) In what follows… We will use a simplistic example to introduce the main Semantic Web concepts We take, as an example area, data integration

Copyright © 2008, W3C ( 14 )‏ ( 14 ) The rough structure of data integration 1. Map the various data onto an abstract data representation make the data independent of its internal representation… 2. Merge the resulting representations 3. Start making queries on the whole! queries that could not have been done on the individual data sets

Copyright © 2008, W3C ( 15 )‏ ( 15 ) A simplified bookstore data (dataset “A”)

Copyright © 2008, W3C ( 16 )‏ ( 16 ) 1 st : export your data as a set of relations

Copyright © 2008, W3C ( 17 )‏ ( 17 ) Some notes on the exporting the data Relations form a graph the nodes refer to the “real” data or contain some literal how the graph is represented in machine is immaterial for now Data export does not necessarily mean physical conversion of the data relations can be generated on-the-fly at query time via SQL “bridges” scraping HTML pages extracting data from Excel sheets etc. One can export part of the data

Copyright © 2008, W3C ( 18 )‏ ( 18 ) Another bookstore data (dataset “F”)

Copyright © 2008, W3C ( 19 )‏ ( 19 ) 2 nd : export your second set of data

Copyright © 2008, W3C ( 20 )‏ ( 20 ) 3 rd : start merging your data

Copyright © 2008, W3C ( 21 )‏ ( 21 ) 3 rd : start merging your data (cont.)

Copyright © 2008, W3C ( 22 )‏ ( 22 ) 3 rd : merge identical resources

Copyright © 2008, W3C ( 23 )‏ ( 23 ) Start making queries… User of data “F” can now ask queries like: “give me the title of the original” This information is not in the dataset “F”… …but can be retrieved by merging with dataset “A”!

Copyright © 2008, W3C ( 24 )‏ ( 24 ) However, more can be achieved… We “feel” that a:author and f:auteur should be the same But an automatic merge does not know that! Let us add some extra information to the merged data: a:author same as f:auteur both identify a “Person” a term that a community may have already defined: a “Person” is uniquely identified by his/her name and, say, homepage it can be used as a “category” for certain type of resources

Copyright © 2008, W3C ( 25 )‏ ( 25 ) 3 rd revisited: use the extra knowledge

Copyright © 2008, W3C ( 26 )‏ ( 26 ) Start making richer queries! User of dataset “F” can now query: “give me the home page of the original’s author” The information is not in datasets “F” or “A”… …but was made available by: merging datasets “A” and datasets “F” adding three simple extra statements as an extra “glue”

Copyright © 2008, W3C ( 27 )‏ ( 27 ) Combine with different datasets Using, e.g., the “Person”, the dataset can be combined with other sources For example, data in Wikipedia could be extracted using dedicated tools

Copyright © 2008, W3C ( 28 )‏ ( 28 ) Merge with Wikipedia data

Copyright © 2008, W3C ( 29 )‏ ( 29 ) Merge with Wikipedia data

Copyright © 2008, W3C ( 30 )‏ ( 30 ) Merge with Wikipedia data

Copyright © 2008, W3C ( 31 )‏ ( 31 ) Is that surprising? Maybe but, in fact, no… What happened via automatic means is done all the time, every day by the users of the Web! The difference: a bit of extra rigour (e.g., naming the relationships) is necessary so that machines could do this, too

Copyright © 2008, W3C ( 32 )‏ ( 32 ) What did we do? We combined different datasets that are somewhere on the web are of different formats (mysql, excel sheet, XHTML, etc) have different names for relations We could combine the data because some URI-s were identical (the ISBN-s in this case) We could add some simple additional information (the “glue”), including using common terminologies that a community has produced As a result, new relations could be found and retrieved

Copyright © 2008, W3C ( 33 )‏ ( 33 ) It could become even more powerful We could add extra knowledge to the merged datasets e.g., a full classification of various types of library data geographical information etc. This is where ontologies, extra rules, etc, come in ontologies/rule sets can be relatively simple and small, or huge, or anything in between… Even more powerful queries can be asked as a result

Copyright © 2008, W3C ( 34 )‏ ( 34 ) What did we do? (cont)

Copyright © 2008, W3C ( 35 )‏ ( 35 ) So where is the Semantic Web? The Semantic Web provides technologies to make such integration possible! Hopefully you get a full picture at the end of the tutorial…

Copyright © 2008, W3C ( 36 )‏ ( 36 ) Basic RDF

Copyright © 2008, W3C ( 37 )‏ ( 37 ) RDF triples Let us begin to formalize what we did! we “connected” the data… but a simple connection is not enough… it should be named somehow hence the RDF Triples: a labelled connection between two resources

Copyright © 2008, W3C ( 38 )‏ ( 38 ) RDF triples (cont.) An RDF Triple (s,p,o) is such that: “ s ”, “ p ” are URI-s, ie, resources on the Web; “ o ” is a URI or a literal “ s ”, “ p ”, and “ o ” stand for “subject”, “predicate”, and “object” conceptually: “ p ” connects, or relates the “ s ” and the “ o ” note that we use URI-s for naming: i.e., we can use here is the complete triple: RDF is a general model for such triples with machine readable formats like RDF/XML, Turtle, N3, RXR, RDFa, … (,, )

Copyright © 2008, W3C ( 39 )‏ ( 39 ) A simple RDF example (in RDF/XML) Le palais des mirroirs Le palais des mirroirs (Note: namespaces are used to simplify the URI-s)

Copyright © 2008, W3C ( 40 )‏ ( 40 ) A simple RDF example (in Turtle) f:titre "Le palais des ; f:original. f:titre "Le palais des ; f:original.

Copyright © 2008, W3C ( 41 )‏ ( 41 ) URI-s play a fundamental role URI-s made the merge possible Anybody can create (meta)data on any resource on the Web e.g., the same XHTML file could be annotated through other terms semantics is added to existing Web resources via URI-s URI-s make it possible to link (via properties) data with one another URI-s ground RDF into the Web information can be retrieved using existing tools this makes the “Semantic Web”, well… “Semantic Web”

Copyright © 2008, W3C ( 42 )‏ ( 42 ) RDF in programming practice For example, using Java+Jena (HP’s Bristol Lab): a “Model” object is created the RDF file is parsed and results stored in the Model the Model offers methods to retrieve: triples (property,object) pairs for a specific subject (subject,property) pairs for specific object etc. the rest is conventional programming… Similar tools exist in Python, PHP, etc.

Copyright © 2008, W3C ( 43 )‏ ( 43 ) Vocabularies

Copyright © 2008, W3C ( 44 )‏ ( 44 ) Need for vocabulary specifications We saw the need and the power of vocabulary specification in our example “Person” “author” vs. “auteur” etc Several technologies are provided for that purpose: RDFS, OWL, SKOS, … trade-off between expressibility and simplicity (eg, for implementations)

Copyright © 2008, W3C ( 45 )‏ ( 45 ) RDFS: the basics Think of well known traditional ontologies or taxonomies: use the term “novel” “every novel is a fiction” “«The Glass Palace» is a novel” etc. RDFS defines resources and classes: everything in RDF is a “resource” “classes” are also resources, but… …they are also a collection of possible resources (i.e., “individuals”) “fiction”, “novel”, …

Copyright © 2008, W3C ( 46 )‏ ( 46 ) Classes, resources, … (cont.) Relationships are defined among classes/resources: “typing”: an individual belongs to a specific class (“«The Glass Palace» is a novel”) to be more precise: “« X » is a novel” “subclassing”: all instances of one are also the instances of the other (“every novel is a fiction”) RDFS formalizes these notions in RDF

Copyright © 2008, W3C ( 47 )‏ ( 47 ) Classes, resources in RDF(S) RDFS defines rdfs:Resource, rdfs:Class as nodes; rdf:type, rdfs:subClassOf as properties (these are all special URI-s, we just use the namespace abbreviation)

Copyright © 2008, W3C ( 48 )‏ ( 48 ) Inferred properties is not in the original RDF data… …but can be inferred from the RDFS rules “RDFS aware” RDF environments return that triple, too ( rdf:type #Fiction)

Copyright © 2008, W3C ( 49 )‏ ( 49 ) Inference: let us be formal… The RDF Semantics document has a list of (44) entailment rules:RDF Semantics “if such and such triples are in the graph, add this and this triple” do that recursively until the graph does not change The relevant rule for our example: If: uuu rdfs:subClassOf xxx. vvv rdf:type uuu. Then add: vvv rdf:type xxx. If: uuu rdfs:subClassOf xxx. vvv rdf:type uuu. Then add: vvv rdf:type xxx.

Copyright © 2008, W3C ( 50 )‏ ( 50 ) Properties Properties’ range and domain can be specified i.e., what type of resources can serve as object and subject There is also a possibility for a “sub-property” all resources bound by the “sub” are also bound by the other

Copyright © 2008, W3C ( 51 )‏ ( 51 ) Property specification example In RDF/XML: In Turtle: :title rdf:type rdf:Property; rdfs:domain :Fiction; rdfs:range rdfs:Literal. :title rdf:type rdf:Property; rdfs:domain :Fiction; rdfs:range rdfs:Literal.

Copyright © 2008, W3C ( 52 )‏ ( 52 ) What does this mean? Again, new relations can be deduced. Indeed, if :title rdf:type rdf:Property; rdfs:domain :Fiction; rdfs:range rdfs:Literal. :title “The Glass Palace”. :title rdf:type rdf:Property; rdfs:domain :Fiction; rdfs:range rdfs:Literal. :title “The Glass Palace”. then the system can infer that: rdf:type :Fiction.

Copyright © 2008, W3C ( 53 )‏ ( 53 ) A bit of RDFS can take you far… Remember the power of merge? We could have used, in our example: f:auteur is a sub-property of a:author and vice versa (although we will see other ways to do that…) Of course, in some cases, more complex knowledge is necessary

Copyright © 2008, W3C ( 54 )‏ ( 54 ) Ontologies Complex applications may want more possibilities: identification of objects with different URI-s disjointness or equivalence of classes construct classes, not only name them more complex classification schemes can a program reason about some terms? E.g.: “if «Person» resources «A» and «B» have the same « foaf: » property, then «A» and «B» are identical” etc. This is where OWL comes in

Copyright © 2008, W3C ( 55 )‏ ( 55 ) Web Ontology Language = OWL OWL is an extra layer, a bit like RDF Schemas own namespace, own terms it relies on RDF Schemas Needs extra tools, inference engines, etc, to implement it that is why it is a separate recommendation use it only if you need it…

Copyright © 2008, W3C ( 56 )‏ ( 56 ) OWL examples: term equivalence For classes: owl:equivalentClass : two classes have the same individuals owl:disjointWith : no individuals in common For properties: owl:equivalentProperty remember the a:author vs. f:auteur ? For individuals: owl:sameAs : two URIs refer to the same concept (“individual”) owl:differentFrom : negation of owl:sameAs

Copyright © 2008, W3C ( 57 )‏ ( 57 ) Example: connecting to French

Copyright © 2008, W3C ( 58 )‏ ( 58 ) Property characterization Characterize the behaviour of properties: symmetric, transitive, functional, inverse functional…

Copyright © 2008, W3C ( 59 )‏ ( 59 ) Characterization example “ foaf: ” is inverse functional (i.e., two different subjects cannot have identical objects)

Copyright © 2008, W3C ( 60 )‏ ( 60 ) What this means is… If the following holds in our triples: then the following holds, too: : rdf:type owl:InverseFunctionalProperty. : rdf:type owl:InverseFunctionalProperty. : I.e., new relationships were discovered again (beyond what RDFS could do) owl:sameAs.

Copyright © 2008, W3C ( 61 )‏ ( 61 ) Classes in OWL In RDFS, you can subclass existing classes… that’s all In OWL, you can construct classes from existing ones: enumerate its content through intersection, union, complement etc

Copyright © 2008, W3C ( 62 )‏ ( 62 ) What we have so far… The OWL features listed so far are already fairly powerful E.g., various databases can be linked via owl:sameAs, functional or inverse functional properties, etc. It is still possible to find inferred relationships using a traditional rule engine

Copyright © 2008, W3C ( 63 )‏ ( 63 ) However… even that may not be enough Very large vocabularies (ontologies) might require even more complex features one major issue is the way classes (i.e., “concepts”) are defined. for example, define a class by characterizing the value a property can have on its instances eg: allowed prices should have their currency set to €… OWL includes those extra features but… the inference engines become (much) more complex

Copyright © 2008, W3C ( 64 )‏ ( 64 ) OWL has “subspecies” Some features of OWL require complex programs OWL has defined “subspecies”: OWL Lite, OWL DL, OWL Full applications choose a level that is good enough for them current work on OWL may define some more

Copyright © 2008, W3C ( 65 )‏ ( 65 ) Ontologies examples Large ontologies are being developed eClassOwleClassOwl: eBusiness ontology for products and services, 75,000 classes and 5,500 properties National Cancer Institute’s ontologyNational Cancer Institute’s ontology: about 58,000 classes Open Biomedical Ontologies FoundryOpen Biomedical Ontologies Foundry: a collection of ontologies, including the Gene Ontology to describe gene and gene product attributes in any organism or protein sequence and annotation terminology and data (UniProt)Gene OntologyUniProt BioPAXBioPAX: for biological pathway data

Copyright © 2008, W3C ( 66 )‏ ( 66 ) Another vocabulary specification tool: SKOS (Simple Knowledge Organization System) Goal: represent and share classifications, glossaries, thesauri, etc, as developed in the “Print World”. for example: Dewey Decimal Classification, Art and Architecture Thesaurus, ACM classification of keywords and terms… Dewey Decimal Classification US Library of Congress subject headings Allow for a quick port of this traditional data, combine it with other data No need for complex inferencing

Copyright © 2008, W3C ( 67 )‏ ( 67 ) Example: Library of Congress subject headings

Copyright © 2008, W3C ( 68 )‏ ( 68 ) The structure can be translated into RDF

Copyright © 2008, W3C ( 69 )‏ ( 69 ) SKOS and OWL SKOS is geared towards some specific (though large) use cases, like taxonomies, glossaries, … annotations of complex structures SKOS is a based on a very simple usage of OWL roughly on the rule based level no complex inference possibilities Ideal tool to create “bridges” between the library world and the (Semantic) Web eg, by providing standard “terms” (ie, their URI-s)

Copyright © 2008, W3C ( 70 )‏ ( 70 ) How to get to RDF(S) data?

Copyright © 2008, W3C ( 71 )‏ ( 71 ) How to get to RDF(S) data? Write RDF/XML or Turtle “manually” in some cases that is necessary, but it really does not scale… Embed RDF data into HTML (using, eg, RDFa) Extract RDF from XML data (using, eg, GRDDL) … But this does not scale

Copyright © 2008, W3C ( 72 )‏ ( 72 ) Access to databases “Bridges” are defined to get to RDBS content the same database system can be used in a traditional environment but its content can be “viewed” as RDF Specialized RDF database systems also exist to store large amount of data

Copyright © 2008, W3C ( 73 )‏ ( 73 ) Example: Linking Open Data Project Goal: “expose” open datasets in RDF via various means Set RDF links among the data items from different datasets Billions triples, millions of “links”

Copyright © 2008, W3C ( 74 )‏ ( 74 ) DBpedia: Extracting structured data from Wikipedia dbpedia:native_name “Kolkata dbpedia:altitude “9”; dbpedia:populationTotal “ ”; dbpedia:population_metro “ ”; geo:lat “ ”^^xsd:float;... dbpedia:native_name “Kolkata dbpedia:altitude “9”; dbpedia:populationTotal “ ”; dbpedia:population_metro “ ”; geo:lat “ ”^^xsd:float;...

Copyright © 2008, W3C ( 75 )‏ ( 75 ) Automatic links among open datasets owl:sameAs ;... owl:sameAs ;... owl:sameAs wgs84_pos:lat “ ”; wgs84_pos:long “ ”; sws:population “ ”... owl:sameAs wgs84_pos:lat “ ”; wgs84_pos:long “ ”; sws:population “ ”... DBpedia Geonames Processors can switch automatically from one to the other…

Copyright © 2008, W3C ( 76 )‏ ( 76 ) The importance of the LOD project It creates a basis for large scale, Web-based data integration Applications begin to appear that use this dataset Other large datasets are continuously added. Some possible candidates: data on movies, published music various library catalogues BBC programmes etc.

Copyright © 2008, W3C ( 77 )‏ ( 77 ) “Review Anything” data in RDF links to, eg, (DB/Wiki)Pedia enhance output with linked data

Copyright © 2008, W3C ( 78 )‏ ( 78 ) Faviki: social bookmarking with Wiki tagging Tag bookmarks via Wikipedia terms Use DBpedia URI-s and extra information Extra info from DBpedia

Copyright © 2008, W3C ( 79 )‏ ( 79 ) What have we achieved?

Copyright © 2008, W3C ( 80 )‏ ( 80 ) Remember the integration example?

Copyright © 2008, W3C ( 81 )‏ ( 81 ) Same with what we learned

Copyright © 2008, W3C ( 82 )‏ ( 82 ) Available documents, tools

Copyright © 2008, W3C ( 83 )‏ ( 83 ) “Core” vocabularies There are also a number “core vocabularies” (not necessarily OWL based) Dublin CoreDublin Core: about information resources, digital libraries, with extensions for rights, permissions, digital right management FOAFFOAF: about people and their organizations DOAPDOAP: on the descriptions of software projects SIOCSIOC: Semantically-Interlinked Online Communities vCard in RDF … One should never forget: ontologies/vocabularies must be shared and reused!

Copyright © 2008, W3C ( 84 )‏ ( 84 ) Some books G. Antoniu and F. van Harmelen: Semantic Web Primer, 2 nd edition in 2008 D. Allemang and J. Hendler: Semantic Web for the Working Ontologist, 2008 … See the separate Wiki page collecting book referencesWiki page

Copyright © 2008, W3C ( 85 )‏ ( 85 ) Further information Dave Beckett’s ResourcesDave Beckett’s Resources at Bristol University huge list of documents, publications, tools, … Planet RDFPlanet RDF aggregates a number of SW blogs Semantic Web Interest Group a forum developers with archived (and public) mailing list, and a constant IRC presence on freenode.net#swig anybody can sign up on the list

Copyright © 2008, W3C ( 86 )‏ ( 86 ) Lots of Tools (not an exhaustive list!) Categories: Triple Stores Inference engines Converters Search engines Middleware CMS Semantic Web browsers Development environments Semantic Wikis … Some names: Jena, AllegroGraph, Mulgara, Sesame, flickurl, … TopBraid Suite, Virtuoso environment, Falcon, Drupal 7, Redland, Pellet, … Disco, Oracle 11g, RacerPro, IODT, Ontobroker, OWLIM, Tallis Platform, … RDF Gateway, RDFLib, Open Anzo, DartGrid, Zitgist, Ontotext, Protégé, … Thetus publisher, SemanticWorks, SWI-Prolog, RDFStore… …

Copyright © 2008, W3C ( 87 )‏ ( 87 ) Tools Worth noting: major companies offer (or will offer) Semantic Web tools or systems using Semantic Web: Adobe, Oracle, IBM, HP, Software AG, webMethods, Northrop Gruman, Altova, Dow Jones, BBN, … See also the W3C Wiki page on toolsW3C Wiki page on tools

Copyright © 2008, W3C ( 88 )‏ ( 88 ) Applications We have no time to look at more applications… There is a collection of Use Cases and Case Studies on the W3C site: examples coming from: BT, Vodafone, Agfa, Sun, UK Ordnance Survey, NASA, Eli Lilly, Tata, Elsevier, UN FAO, Oracle, … see

Copyright © 2008, W3C ( 89 )‏ ( 89 ) Thank you for your attention! These slides are publicly available on: