1 What is new in W3C land? 2009 Semantic Technology Conference San Jose, California, USA June 15, 2009 Ivan Herman, W3C

Slides:



Advertisements
Similar presentations
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
RDF Tutorial.
Semantic Web Introduction
Ivan Herman W3C. (2)  Current RDF has been published in 2004  Significant deployment since then ◦ implementation experiences ◦ users’ experiences 
(1) Standardizing for Open Data Ivan Herman, W3C Open Data Week Marseille, France, June Slides at:
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Ontology Notes are from:
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
A year on the Semantic W3C (or: what is happening these days?) Semantic Web Meetup, Seattle, Ivan Herman, W3C.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
A Registry for controlled vocabularies at the Library of Congress
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
1 DCS861A-2007 Emerging IT II Rinaldo Di Giorgio Andres Nieto Chris Nwosisi Richard Washington March 17, 2007.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Vocabulary Services “Huuh - what is it good for…” (in WDTS anyway…) 4 th September 2009 Jonathan Yu CSIRO Land and Water.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Publishing data on the Web (with.
Semantic Web Series 1 Mohammad M. R. Cowdhury UniK, Kjeller.
Thesaurusmanagement Quickstart Introduction. What are controlled vocabularies? organized arrangement of words and phrases used to index content and/or.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Practical RDF Chapter 1. RDF: An Introduction
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. Collaborative Building of Controlled Vocabularies Crosswalks Mateusz.
Semantic Publishing Update Second TUC meeting Munich 22/23 April 2013 Barry Bishop, Ontotext.
Deploying Trust Policies on the Semantic Web Brian Matthews and Theo Dimitrakos.
Logics for Data and Knowledge Representation
Relational Databases to RDF (a.k.a RDB2RDF) Juan F. Sequeda Dept of Computer Science University of Texas at Austin.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Master Informatique 1 Semantic Technologies Part 11Direct Mapping Werner Nutt.
MD9.6 Release: Highlights Increased the character limit for all URL resources to 600 characters. Data_Center/Service_Provider Data_Set_Citation/Service_Citation.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Semantic Publishing Benchmark Task Force Fourth TUC Meeting, Amsterdam, 03 April 2014.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Charnelle Bacon & Brandon Carr. Benefits of a Social Web Share Create Connect  The social web is a place that one can share a multiplex of information,
RDF and Relational Databases
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
Semantic Web unleashes your data! The Semantic Web will transform the use of content. Semantic Web – is an extension of the current web. Semantic Web.
Semantic Web in Depth RDFa, GRDDL and POWDER Dr Nicholas Gibbins
What is the Semantic Web? 17 th XBRL International Conference Eindhoven, the Netherlands 5 st May, 2008 Ivan Herman, W3C.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
1 State of the Semantic Web (at least as seen from W3C…) 3rd Chinese Semantic Web Symposium Nanjing, China August 30, 2009 Ivan Herman, W3C
1 W3C Current Activities Dagstuhl Perspective Workshop: Semantic Web Reflections and Future Directions Dagstuhl, Germany, June 30, 2009 Ivan Herman
Overview of the Semantic Web Ralph R. Swick World Wide Web Consortium (W3C) 17 October 2009.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
RDFa How and Why Ralph R. Swick World Wide Web Consortium
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Introduction to the Semantic Web (tutorial) 2009 Semantic Technology Conference San Jose, California, USA June 15, 2009 Ivan Herman, W3C
Overview of the Semantic Web Ralph R
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Analyzing and Securing Social Networks
PREMIS Tools and Services
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Semantic Web Update W3C RDF, OWL Standards, Development and Applications Dave Beckett.
Graph Data on the Web: extend the pivot, don’t reinvent the wheel
Presentation transcript:

1 What is new in W3C land? 2009 Semantic Technology Conference San Jose, California, USA June 15, 2009 Ivan Herman, W3C

2  Lots of things are happening at W3C:  technology work  POWDER, OWL 2, RIF, SPARQL, RDFa, SKOS, media (primarily video) annotations, Linked Open Data community  thematic Interest Groups (acting or planned)  Health Care and Life Sciences, XBRL, eGovernment  incubator groups  Semantic Sensor Webs, Social Spaces, Product Modelling

3 What I will do…  Highlight some technologies that have been under development  … and may not have received all the attention they deserve (yet!)  Unfortunately, there is no time to go into the details of what the thematic groups do

4  Some of the topics are part of the conference program directly:  RDFa (Tuesday morning, Mark Birbeck)  Linked Data (Tuesday morning, Paul Miller et al)  OWL 2 (Tuesday afternoon panel, Mike Smith et al.)  XBRL (Tuesday afternoon, Dianne Mueller and Dave Raggett)  RDF Redux (Tuesday afternoon, Pat Hayes)  OWL 2 RL Reasoning (Thursday afternoon, Chris Matheus)  Many of the technologies will be addressed, some way or other, in most of the talks, too

5 But first: do you remember this image from last year?

6  The Semantic Web has many facets, many different communities, many possible emphasis  I will take a particular route to link things together  But this is not a “canonical” one, there might be many different ways of looking at the family of technologies

7 A thread that binds lots of of the issues together…

8 Linking Open Data Project  I am cheating here a bit…  W3C was, sort of, a midwife assisting at the birth of the LOD project it was initiated as a “Community Project” by the SWEO IG  but the child has grown since, became successful, and is running fairly independently of W3C  That being said…

9 Linking Open Data Project  Goal:  “expose” open datasets via RDF  set RDF links among the data items from different datasets  a typical example is to set an owl:sameAs between two items in different datasets that refer to the same “thing”  set up query endpoints (usually SPARQL)  Altogether billions of triples, millions of links…  The “seed” for a general Web of Data

10 The LOD “cloud”, March 2008

11 The LOD “cloud”, September 2008

12 The LOD “cloud”, March 2009

13 Applications using the cloud begin to emerge  Bookmarking systems, exploration of social graphs, financial reporting  LOD nodes (eg, DBPedia) provide a set of referenceable URI-s for many things  Worth looking at the proceedings of the latest workshop, for example  April 2009, at WWW2009 

14 Example: integration of “social” data  Internal usage of wikis, blogs, RSS, etc, at EDF  goal is to manage the flow of information better  Items are integrated via  RDF as a unifying format  simple vocabularies like SIOC, FOAF, MOAT (all public)  internal data is combined with linked open data like Geonames  SPARQL is used for internal queries  Details are hidden from end users (via plugins, extra layers, etc) Courtesy of A. Passant, EDF R&D and LaLIC, Université Paris-Sorbonne, (SWEO Case Study)(SWEO Case Study)

15 Example: integration of “social” data Courtesy of A. Passant, EDF R&D and LaLIC, Université Paris-Sorbonne, (SWEO Case Study)(SWEO Case Study)

16 Example: integration of “social” data Courtesy of A. Passant, EDF R&D and LaLIC, Université Paris-Sorbonne, (SWEO Case Study)(SWEO Case Study)

17 Application areas add their own “sub-cloud”  Some “bio” related blobs were added by W3C’s HCLS IG  The eGovernment IG plans to do the same

18 Challenge: get the data out there!

19 How to access a database  Many of the LOD blobs come from relational databases  Issue: how to “map” a relational database content to RDF  different tools exist (Virtuoso’s RDF view, D2RQ, Triplify, R2O, Dartgrid toolkit, Asio, RDBToOnto)  the W3C RDB2RDF Incubator Group published a survey:  Report.pdf

20 How to access a database (cont.)  A new RDB2RDF Working Group is planned  Goal:  “standardize a language for mapping relational data and relational database schemas into RDF and OWL”  how to assign public identifiers to database entries  group should start in July/August, watch the news and join!

21 Data in other formats  But many data are in XML, HTML and not in databases  Fortunately, GRDDL and RDFa are already around to easily produce (RDF) data  the usual tools begin to adopt GRDDL and RDFa to retrieve RDF automatically  These data can be added to the cloud easily

22 Publication of data: SlideShare

23 Publication of data: SlideShare

24 Publication of SVG metadata: RDF/XML and/or RDFa

25 Publication of SVG metadata: RDF/XML and/or RDFa

26 How to “assign” RDF data to a collection of resources?  Instead of spelling out information for each resource, is it possible to “generate” those?  Some examples:  copyright information for all of your photographs  is a Web page collection usable on a mobile phone and how?  bibliographical data for a series of publications  provenance data for a collection of resources  annotation of the data resulting from a scientific experiment  etc

27 How to “assign” RDF data to a collection of resources?  The issue:  given the URI of the resource (photograph, publication, etc), how do I find the relevant RDF data?

28 POWDER (Protocol for Web Description Resources)  Lets you define predicates that can be automatically assigned to a set of resources

29 POWDER scenario: copyright for photos

30 The technical details…  The “description resource” is an XML file: <powder xmlns=" xmlns:cc=" /img/ <powder xmlns=" xmlns:cc=" /img/

31 The technical details…  Powder processors may then return  direct RDF triples, eg:  or can transform this XML file into an OWL for more generic processors  a canonical processing of the XML file is defined by the POWDER specification cc:license.

32 POWDER Service  Online POWDER service can also be set up:  a Web service with  submit a URI and a resource description file  return the RDF statements for that URI  such service should be set up, eg, at W3C

33 Challenge: get the data organized

34  Just getting the data out there is not enough  Some sort of organization, categorization of data is necessary  Ie, the LOD needs various sorts of vocabularies to rely on

35 SKOS (Simple Knowledge Organization System)  Represent and share classifications, glossaries, thesauri, etc  for example:  Dewey Decimal Classification, Art and Architecture Thesaurus, ACM classification of keywords and terms…  classification of Web 2.0 type tags  Define classes and properties to add those structures to an RDF universe  allow for a quick port of this traditional data, combine it with other data

36 SKOS  SKOS is based on a simple structure  the central concept is, well, a “SKOS concept”  concepts can have preferred and alternate labels  a concept may be narrower or broader then another one  concepts may be related to one another  concepts can be collected in “concept schemes”  and that is it (well, almost...)  Other resources can then refer to these concepts as, eg, their subject

37 Typical example: LC Subject Headings

38 Typical example: LC Subject Headings skos:prefLabel skos:broader

39 Typical example: LC Subject Headings

40 Using the LCSH terms…

41 Using the LCSH terms… dc:title "The Glass dc:subject ;... a skos:Concept; skos:prefLabel "Historical dc:created " T00:00:00-04:00"^^xsd:dateTime; skos:broader ;... a skos:Concept; skos:prefLabel skos:narrower ;... dc:title "The Glass dc:subject ;... a skos:Concept; skos:prefLabel "Historical dc:created " T00:00:00-04:00"^^xsd:dateTime; skos:broader ;... a skos:Concept; skos:prefLabel skos:narrower ;...

42 Another Thesaurus example Courtesy of Timo Borst and Joachim Neubert, German Nat. Libr. of Economics, (SWEO Case Study)(SWEO Case Study)

43 Vocabularies on the LOD (and elsewhere)  The LOD blobs include a number of vocabularies  UMBEL, OpenCyc, Yago, …  The level of formality may be different  in some cases a simple structure is way enough  in some cases a much more formal system is necessary

44 SKOS and OWL  SKOS is geared towards specific (though large) use cases, like  taxonomies, glossaries, …  annotations of complex structures (including ontologies)  SKOS is a based on a very simple usage of OWL  using some simple OWL Full constructions  the emphasis is on organization and not on logical inferences  “OWL is a Harley-Davison, SKOS is a mountain bike” — (Tom Baker, co-chair of the relevant WG)

45 But, of course, there is also OWL  The LOD does create new challenges due to the amount of data  running a full and complete OWL DL inference might be a challenge, to say the least...  OWL 2 introduces “profiles” that might be better fit for many applications  restrictions on which OWL term can be used in under what circumstances

46 OWL 2 profiles  Three profiles have been defined  Classification and instance queries in polynomial time: OWL-EL  Implementable on top of conventional relational database engines: OWL-QL  Implementable on top of traditional rule engines: OWL-RL  Come to the OWL 2 panel if you want to hear more…

47 Rules  OWL 2 RL shows the importance of rule engines  W3C's Rule work (RIF) is getting to completion  b.t.w., OWL 2 RL can be expressed in RIF  I have no time to go into details here...

48 Query the data

49 Querying the data: SPARQL  Is a W3C Standard since January 2008  it has already become one of the absolutely essential technologies on the SW  all LOD blobs offer a “SPARQL endpoint”  there is even a SPARQL endpoint for the whole LOD

50 SPARQL as a unifying point!

51 New SPARQL WG: Goals  To define a small set of extensions to SPARQL  No complex change, backward compatibility  Listen to user and implementation experiences of the past few years  Group started in February 2009

52 Planned features  Update, ie, ability to change the RDF store  Service description framework  what type of extensions, inference possibilities, etc, are available at the endpoint  Addition to the query language  aggregate functions  subqueries  negation  project expressions

53 Planned features (tentative syntax examples)  Aggregate functions and project expressions:  Subqueries:  Negation: SELECT AVG(?age) AS average_age WHERE {.... } SELECT (?age < 18) AS minor WHERE {... } SELECT AVG(?age) AS average_age WHERE {.... } SELECT (?age < 18) AS minor WHERE {... } SELECT ?person (SELECT ?n WHERE { ?person foaf:name ?n } LIMIT 1) WHERE { foaf:knows ?person. } SELECT ?person (SELECT ?n WHERE { ?person foaf:name ?n } LIMIT 1) WHERE { foaf:knows ?person. } SELECT * WHERE { ?x :p ?v. UNSAID { ?x :q ?v. } } SELECT * WHERE { ?x :p ?v. UNSAID { ?x :q ?v. } }

54 Possible features (time permitting)  Definition of “entailment regimes”  RDFS, OWL Profiles, RIF  Property paths  Commonly used functions (eg, string manipulation)  Basic control for federated queries  Additional query language syntax  commas in select lists, some operators in filters

55 Are we done?

56 Certainly not…  There are a number of issues, problems  missing functionalities, unsolved problems  misconceptions, messaging problems  need for more applications, deployment, acceptance  we need more experts  etc

57  Description of datasets  Semi-automatic generation of links among datasets  it is a bit like ontology alignment but on instances  The “ID Jungle”  what to do when the same concept (like a specific person) has a large number of URI-s  owl:sameAs may not be the best choice in all cases Number of issues raised by the LOD…

58 Other items…  Security, trust, provenance  combining cryptographic techniques with the RDF model, sign a portion of the graph, etc  trust models  What does reasoning mean on billions of triples?  incomplete reasoning: “give me what you can in 2 minutes, even if it is not complete”

59 Some future items: uncertainty  Fuzzy logic  look at alternatives of DL based on fuzzy logic  alternatively, extend RDF(S) with fuzzy notions  Probabilistic statements  have an OWL class membership with a specific probability  combine reasoners with Bayesian networks  A W3C Incubator Group issued a report on the current status, possibilities, directions, etc  report published in April 2008

60 Revision of the RDF model?  Some restrictions in RDF may be unnecessary (bNodes as predicates, literals as subject, …)‏  Semantics of bNodes  Named graphs  Syntax issues in RDF/XML  …

61 Conclusions  Many things are happening at W3C to evolve the Semantic Web  Many more issues are still to be done…  So join the club! After all, this is really a community effort…

62 Just some announcements…  POWDER Proposed Recommendation published on June 5  SKOS Proposed Recommendation published… today  OWL 2 Candidate Recommendation published… today  RIF 2 nd Last Call published later this week

63 Thank you for your attention! And enjoy the conference! These slides are also available on the Web: